f. Which of the following is NOT one of the four V's used to define big data?
(i) Volume (ii) Velocity
(iii) Variety (iv) Visualisation
g. What is SQL primarily used for?
(i) Managing and querying data stored in databases
(ii) Designing websites
(iii) Editing images
(iv) Creating operating systems
2. Write 'T' for true and 'F' for false.
a. The Sustainable Development Goals (SDGs) are also known as Global Goals.
b. Hadoop is a tool that helps in managing the storage of massive datasets.
c. Natural language data refers to data that is easy to fit into data models and typically
includes information stored in databases.
d. The 17 SDGs are integrated in such a manner that action on one will have an impact
on the other.
e. Unstructured data is not easy to fit into any type of data model.
f. Python is an interpreter-based, high-level language.
g. SDG 17 focuses solely on individual nations achieving their development targets
without any collaboration with other countries.
3. Fill in the blanks using the words from the help box.
Hadoop, Components, Big data, Organised, Formatted, Data
a. ____________ is a term used for any dataset that is too large or complex to be processed by
traditional data management techniques.
b. Structured data is highly ____________ and ____________ to be easily searchable.
c. ____________ has become an important fuel on which industries function today.
d. ____________ has become the most popular software framework for big data.
e. The major ____________ of AI include data, natural language processing and computer vision.
4. Answer in one or two words.
a. Name the term used for managing and querying data stored in databases. ______________
b. Which language is widely used for data science and software development

