Page 153 - Data Science class 11
• Ownership: The researcher owns the data collected through primary research. It is up to the researcher to make it
publicly available, patent it, or even sell it.
• Updated: Because primary data is collected first-hand in real time, it is usually up to date; it does not draw on
old sources.
The Disadvantages of Primary Data
• Expensive: Collecting primary data is usually much more expensive than using secondary data, so gathering it can be
difficult.
• Time-consuming: Collecting primary data takes considerably more time than obtaining data that already exists.
• Not feasible: Because of the complexity and commitment it requires, collecting primary data may not be feasible in
some cases.
Secondary Data
Secondary data is data that has already been collected through primary sources and made readily accessible so that
researchers can use it in their own research.
Let us now study one example of secondary data to understand it better.
Online Data
Any data generated via real-time online channels and activities, such as Internet browsing, emails, social media
activity, in-app purchase history, and so on, is referred to as "online data."
Simply put, by "online" we mean any transaction that uses an Internet connection, and by "offline" we mean using our
own devices, such as PCs, desktops, and laptops, when they are not connected to the Internet.
Online data is data collected from online campaigns and platforms, such as social channels and email, plus any
relevant data collected from website clickstreams. All such data is also called secondary data.
The total amount of online data created, captured, copied, and consumed globally was forecast to increase rapidly,
reaching 64.2 zettabytes in 2020. Over the five years from 2020 to 2025, global data creation is projected to grow to
more than 180 zettabytes.
Fig. 4.1. Volume of data/information created, captured, copied, and consumed worldwide from 2010 to 2025
(in zettabytes)
Year    Data volume (zettabytes)
2010    2
2011    5
2012    6.5
2013    9
2014    12.5
2015    15.5
2016    18
2017    26
2018    33
2019    41
2020    64.2
2021    79
2022    97
2023    120
2024    147
2025    181
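The growth described above can be checked with a little arithmetic. The following is a minimal Python sketch, assuming
the yearly values are read off Fig. 4.1 in order from 2010 to 2025. The names volume_zb, growth_factor, and
average_annual_growth are illustrative choices, not part of any library.

```python
# Data volume in zettabytes for each year, as read from Fig. 4.1.
# Assumption: the values map to the years in the order they appear on the chart.
volume_zb = {
    2010: 2, 2011: 5, 2012: 6.5, 2013: 9, 2014: 12.5, 2015: 15.5,
    2016: 18, 2017: 26, 2018: 33, 2019: 41, 2020: 64.2, 2021: 79,
    2022: 97, 2023: 120, 2024: 147, 2025: 181,
}


def growth_factor(start_year, end_year):
    """Return how many times larger the volume is in end_year than in start_year."""
    return volume_zb[end_year] / volume_zb[start_year]


def average_annual_growth(start_year, end_year):
    """Return the average yearly growth rate (as a fraction) between two years."""
    years = end_year - start_year
    return growth_factor(start_year, end_year) ** (1 / years) - 1


factor = growth_factor(2020, 2025)
rate = average_annual_growth(2020, 2025)
print(f"Growth from 2020 to 2025: {factor:.2f} times")
print(f"Average annual growth: {rate:.1%}")
```

With these values, the volume in 2025 is a little under three times the 2020 volume, which corresponds to an average
growth of somewhat more than one fifth per year. The same two functions can be called with any other pair of years in
the chart, for example growth_factor(2010, 2020).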

