Page 39 - Ai V2.0 Flipbook C8

                    • Can provide up-to-date and real-time information.

                 Some disadvantages of primary data are as follows:

                    • Often time-consuming and expensive.
                    • Requires careful planning and design to avoid biases.


                 Secondary source of data

                 Secondary data is data that has already been collected, processed, and published by other
                 individuals or organisations. Researchers use this data for new analysis or to supplement primary
                 data.

                 Examples of secondary data sources include:
                    • Books and academic journals: Published research studies and scholarly articles.

                    • Government reports: Census data, economic statistics, health records, and official publications.

                    • Websites and online databases: Public datasets, industry reports, and open data portals
                      such as Kaggle or those of the WHO.
                    • Newspapers and magazines: Articles and reports on current events and trends.


                 Some open data sources are as follows:

                    • .gov datasets: Governments such as those of Australia, the EU, India, New Zealand, and
                      Singapore openly share datasets on various portals.

                    • Dataset Search: A tool by Google that can search for datasets by name.

                    • Kaggle: An online community of data scientists where you can access different types of data.

                    • UCI Machine Learning Repository: A collection of databases, domain theories, and data
                      generators, hosted by the University of California, Irvine, in collaboration with the
                      University of Massachusetts.



                 Some advantages of secondary source data are as follows:

                    • Easily accessible and often free or low-cost.

                    • Saves time as data collection is already done.

                    • Useful for historical or trend analysis.
                 Some disadvantages of secondary source data are as follows:

                    • May not perfectly fit the current research question.

                    • Possible issues with accuracy or reliability, and the data may be outdated.
                    • Lack of control over data quality and collection methods.
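
                 Because secondary data was collected by someone else, it helps to check it before use. The
                 short Python sketch below shows one possible way to flag missing values and outdated
                 records. The sample data, the column names (city, population, last_updated), the
                 check_quality function, and the five-year age limit are all made up for illustration;
                 a real dataset would need checks suited to its own columns.

```python
import csv
import io
from datetime import date

# Hypothetical sample standing in for a downloaded secondary dataset.
# The cities, numbers, and dates are invented for illustration only.
SAMPLE_CSV = """city,population,last_updated
Alpha,120000,2023-04-01
Beta,,2015-06-30
Gamma,98000,2009-01-15
"""


def check_quality(csv_text, today, max_age_years=5):
    """Return a list of (row_number, problem) pairs found in the data."""
    problems = []
    reader = csv.DictReader(io.StringIO(csv_text))
    for row_number, row in enumerate(reader, start=1):
        # Flag every column whose value is missing or blank.
        for column, value in row.items():
            if value is None or value.strip() == "":
                problems.append((row_number, f"missing {column}"))

        # Flag rows that were last updated too long ago.
        updated = (row.get("last_updated") or "").strip()
        if updated:
            age_days = (today - date.fromisoformat(updated)).days
            if age_days / 365.25 > max_age_years:
                problems.append((row_number, "outdated"))
    return problems


# A fixed "today" keeps the result the same whenever the code is run.
issues = check_quality(SAMPLE_CSV, today=date(2024, 1, 1))
for row_number, problem in issues:
    print(f"row {row_number}: {problem}")
```

                 Checks like these cannot prove that the data is accurate, but they can show where a
                 secondary dataset needs a closer look before it is used in a project.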




