Page 69 - KEC Khaitan C8 Flipbook

Quiz Bee        Which type of data is a combination of different types of data?
                                          _________________________________________________________________







                           TOOLS FOR DATA SCIENCE


                 Data scientists use traditional statistical methodologies, which form the backbone of machine
                 learning algorithms. Deep learning algorithms are also used to generate robust predictions.
                 Data scientists use the following tools and programming languages to analyse data and draw
                 insights from it:

                 R SCRIPTING LANGUAGE

                 R is a scripting language used for statistical computing and is widely used in data analysis
                 and modelling. It is an interpreted language and has the features of an object-oriented
                 programming language.

                 STRUCTURED QUERY LANGUAGE (SQL)

                 SQL is used for managing and querying data stored in databases. Extracting information from the
                 database is the first step towards data analysis. SQL is a flexible language that is used for
                 extracting, managing and manipulating data.
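
To make extracting, manipulating and summarising data with SQL concrete, here is a minimal sketch that runs SQL statements from Python using the built-in sqlite3 module. The table name `students`, its columns and all the rows are hypothetical values made up for illustration; a real database would have its own tables.

```python
import sqlite3

# Assumption: a small in-memory database with a made-up "students" table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE students (name TEXT, grade INTEGER, marks REAL)")
conn.executemany(
    "INSERT INTO students VALUES (?, ?, ?)",
    [("Asha", 8, 82.5), ("Ravi", 8, 74.0), ("Meera", 7, 91.0)],
)

# Extracting: select only the rows and columns needed for the analysis.
grade8 = conn.execute(
    "SELECT name, marks FROM students WHERE grade = 8 ORDER BY marks DESC"
).fetchall()

# Manipulating: change a stored value.
conn.execute("UPDATE students SET marks = marks + 1 WHERE name = 'Ravi'")

# Summarising: average marks for each grade.
averages = conn.execute(
    "SELECT grade, AVG(marks) FROM students GROUP BY grade ORDER BY grade"
).fetchall()

conn.close()
```

Here `grade8` holds the rows extracted for grade 8, and `averages` holds one (grade, average marks) pair per grade, calculated after the update.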


                 PYTHON
                 Python is a widely used language for data science and software development. It is an interpreted,
                 high-level language which has gained popularity because of its ease of use and code readability.
                 It comes with many packages for data analysis and deep learning, and is therefore widely used for
                 data analysis and natural language processing. Python is used for purposes like data mining, data
                 wrangling, visualisation, and developing predictive models.
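
As a rough illustration of data wrangling and a simple predictive model, the sketch below uses only Python's standard library. The records (hours studied and test score) are hypothetical, and the straight-line fit by least squares is just one simple modelling choice picked for this example, not a method prescribed by the text.

```python
from statistics import mean

# Assumption: made-up records of (hours studied, test score).
# None marks a missing score.
raw = [(1, 52), (2, 58), (3, None), (4, 70), (5, 76)]

# Wrangling: drop the records that have a missing score.
clean = [(h, s) for h, s in raw if s is not None]
hours = [h for h, _ in clean]
scores = [s for _, s in clean]

# Predictive model: fit score = slope * hours + intercept by least squares.
h_bar = mean(hours)
s_bar = mean(scores)
slope = sum((h - h_bar) * (s - s_bar) for h, s in clean) / sum(
    (h - h_bar) ** 2 for h in hours
)
intercept = s_bar - slope * h_bar


def predict(h):
    """Estimate the score for a given number of hours studied."""
    return slope * h + intercept
```

Calling `predict(3)` then gives an estimate for the record whose score was missing.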

                 HADOOP
                 Hadoop is a popular open-source software framework for big data. It helps in storing and
                 processing massive datasets across clusters of computers.


                 TABLEAU
                 Tableau is a data visualisation software that helps in analysing data. It allows users to create
                 interactive visualisations and dashboards. It can connect to spreadsheets, relational databases
                 and cloud platforms, which enables it to process data directly from these sources.







                                                                               Introduction to SDGs and Data Science  67