Page 113 - Artificial Intellegence_v2.0_Class_11
P. 113

Let us now learn about two categories of data: Structured data and Unstructured data.


                                                                quantities

                                           numbers                                   information




                                                              What is Data?
                                             facts                                     graphs




                                           observations                            measurement



                 Structured Data
                 Structured data is the type of data which we interact and work with every day. It has pre-defined data types and
                 formats. They are organized and are easy to analyze. It generally includes only textual data. For example, names,
                 dates, addresses, weather information, etc.


                 Unstructured Data
                 Unstructured data is the type of data which neither possesses any fixed datatype nor the size is fixed. It can not be
                 processed and analysed using traditional database methods. For example, text, audio, video, satellite imagery, etc.
                 Let us understand the difference between structured and unstructured data.


                                                    Structured Data                      Unstructured Data
                     Characteristics       • Pre-defined data models          • No pre-defined data model
                                           • Usually text only                •  text, lmages, sound, video or other
                                           • Easy to search                     formats
                                                                              • Difficult to search

                     Found in              • Relational databases             • Applications
                                           • Data warehouses                  • NoSQL databases
                                                                              • Data warehouses

                     Generated by          Humans or machines                 Humans or machines
                     Applications          • Airline reservation systems      • Word processing
                                           • Inventory control                • Presentation software
                                           • ERP systems                      • Email clients
                                                                              • Tools for viewing or editing media
                     Examples              • Dates                            • Text files

                                           • Phone numbers                    • Reports
                                           • Credit card numbers              • Email messages
                                           • Customer names                   • Audio files
                                           • Addresses                        • Video files
                                           • Product names and numbers        • Images
                                           • Transaction information          • Surveillance imagery




                                                                                            Introduction to AI  111
   108   109   110   111   112   113   114   115   116   117   118