Quiz Bee: Which type of data is a combination of different types of data?
_________________________________________________________________
TOOLS FOR DATA SCIENCE
Data scientists use traditional statistical methodologies, which form the backbone of machine
learning algorithms. They also use deep learning algorithms to generate robust predictions.
The following tools and programming languages are used by data scientists to analyse data and
draw insights from it:
R SCRIPTING LANGUAGE
R is a scripting language used for statistical computing and is widely used in data analysis
and modelling. It is an interpreted language and supports object-oriented programming.
STRUCTURED QUERY LANGUAGE (SQL)
SQL is a language for querying and managing data stored in relational databases. Extracting
information from a database is often the first step towards data analysis, and SQL is used to
extract, manage and manipulate that data.
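As a minimal sketch of extracting and manipulating data with SQL, the example below runs a few
SQL statements from Python using the built-in sqlite3 module. The students table, its columns
and its rows are hypothetical and exist only for illustration.

```python
import sqlite3

# In-memory database, so the example leaves no files behind.
conn = sqlite3.connect(":memory:")

# Hypothetical table and rows, made up for this illustration.
conn.execute("CREATE TABLE students (name TEXT, grade INTEGER, marks REAL)")
conn.executemany(
    "INSERT INTO students VALUES (?, ?, ?)",
    [("Asha", 8, 82.0), ("Ravi", 8, 74.0), ("Meera", 7, 91.0)],
)

# Extract: select only the rows and columns needed for the analysis.
grade8 = conn.execute(
    "SELECT name, marks FROM students WHERE grade = 8 ORDER BY marks DESC"
).fetchall()

# Manipulate: change stored data, then summarise it per grade.
conn.execute("UPDATE students SET marks = marks + 2 WHERE name = 'Ravi'")
avg_by_grade = conn.execute(
    "SELECT grade, AVG(marks) FROM students GROUP BY grade ORDER BY grade"
).fetchall()

print(grade8)
print(avg_by_grade)
conn.close()
```

The SELECT statement covers extraction, while UPDATE and GROUP BY show one way of manipulating
and summarising the stored data before further analysis.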
PYTHON
Python is a widely used language for data science and software development. It is an
interpreted, high-level language that has gained popularity because of its ease of use and
code readability.
It comes with many packages for data analysis and deep learning, and is therefore widely used
for data analysis and natural language processing. Python is used for tasks such as data
mining, data wrangling, visualisation, and developing predictive models.
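The short sketch below illustrates two of these tasks, data wrangling and a simple predictive
model, using only the Python standard library. The records (hours studied and test scores)
are made-up values, and the straight-line model is just one possible choice for illustration.

```python
from statistics import mean

# Hypothetical raw records: (hours studied, test score); None marks a missing value.
raw = [(1, 52), (2, 54), (3, None), (4, 58), (5, 60)]

# Wrangling: drop records with a missing score.
clean = [(h, s) for h, s in raw if s is not None]
hours = [h for h, _ in clean]
scores = [s for _, s in clean]

# Predictive model: least-squares line, score = slope * hours + intercept.
h_bar, s_bar = mean(hours), mean(scores)
slope = (
    sum((h - h_bar) * (s - s_bar) for h, s in clean)
    / sum((h - h_bar) ** 2 for h in hours)
)
intercept = s_bar - slope * h_bar


def predict(h):
    """Estimate the score for a given number of hours studied."""
    return slope * h + intercept


print(len(clean), slope, intercept, predict(3))
```

In practice, data scientists usually rely on dedicated packages for such work. The plain-Python
version here is only meant to show the steps involved.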
HADOOP
Hadoop is a popular open-source software framework for big data. It helps in storing and
processing massive datasets by distributing them across clusters of computers.
TABLEAU
Tableau is a data visualisation software that helps in analysing data by allowing users to
create interactive visualisations and dashboards. It can connect to spreadsheets, relational
databases, and cloud platforms, which enables it to work with data directly from these sources.

