Page 153 - Data Science class 11
P. 153

• Ownership: Through primary research, the researcher exhibits ownership of the data  collected. It is  up to the
              researcher to make it available publicly, patent it, or even sell it.
               • Updated: Because primary data collects data in real-time, it is usually up-to-date; it does not collect data from old
              sources.

            The Disadvantages of Primary Data
               • Expensive: Primary data is much more expensive than secondary data. So, it might be difficult to gather primary data.
               • Time-consuming: It is time-taking.

               • Not feasible: Due to its complexity and required commitment, it may not be feasible to collect primary data in some
              cases.
            Secondary Data
            Secondary data is the data that has already been collected in the past through primary sources and made readily
            accessible for researchers so that they can use it for their own research.
            Let us now study one example for secondary data to have a better understanding.

            Online Data

            Any data generated via real-time online channels and activities, such as Internet browsing, emails, social media
            activity, in-app purchase history, and so on, is referred to as "online data."
            Simply speaking, by "online," we mean any transaction using an internet connection, and "offline," we mean using our
            own gadgets like PCs, desktops, laptops, etc. when they are not connected to the Internet.
            Online data is data collected from online campaigns and platforms, such as social channels and email, plus any
            relevant data collected from website clickstreams. All such data is also called secondary data.
            The total amount of online data created, captured, copied, and consumed globally is forecasted to increase rapidly,
            reaching 64.2 zettabytes in 2020. Over the next five years up to 2025, global data creation is projected to grow to
            more than 180 zettabytes.

                              200
                                                                                                 181

                            Data volume in zettabytes.  100                 64.2  79  97  120
                                                                                             147
                              150









                               50



                                    2   5   6.5  9  12.5 15.5  18  26  33  41
                                0
                                   2010  2011  2012  2013  2014  2015  2016  2017  2018  2019  2020  2021  2022  2023  2024  2025


                                 Fig. 4.1. Volume of data/information created, captured, copied, and
                                      consumed worldwide from 2010 to 2025(in zettabytes)








                                                                                              Randomisation    151
   148   149   150   151   152   153   154   155   156   157   158