Page 12 - CT_AI_Class-7
P. 12

Regression is mainly used to make predictions about numbers, such as estimating costs, sales or
                  grades. It helps in making informed decisions based on past data.



                  INTRODUCTION TO CLUSTERING

                  Clustering  is a  method  in  AI where data  is  grouped  together  based  on  similarities.  Unlike
                  classification,  clustering  doesn't  need  labels  to  start  with. Instead,  it  looks  at  patterns  and
                  behaviours in the data, then automatically forms groups based on those similarities.

                  Imagine you have a photo application that can automatically organise your pictures. It might
                  group photos by people or locations, such as creating albums for Holiday in Paris or Family Photos,
                  without needing you to label each picture.

                  Another example is weather data. Clustering could group days into categories like hot, cold or
                  rainy based on factors like temperature and rainfall. The app or system doesn't need to know the
                  labels beforehand but learns to group the data based on patterns it finds.



















                  Clustering helps find hidden patterns in data that may not be obvious at first, making it easier to
                  understand complex information and organise it more effectively.



                  DATASETS

                  Datasets refer to organised collections of data used to train and assess machine learning models.
                  These datasets provide the necessary information for models to recognise patterns and make
                  predictions, which are essential for the development of AI and machine learning. They enable
                  researchers and professionals to address various real-world challenges, including speech, image
                  recognition, predictive analytics and natural language processing.

                  Each individual unit within a dataset is known as a data point or record with several attributes or
                  features. Datasets are usually organised in tables, charts and spreadsheets, allowing easy access
                  to information and supporting accurate data-driven decision-making.

                  Types of Datasets

                  Datasets can be categorised in two main ways:

                    Based on data format





                   10        Artificial Intelligence (CT & AI)-VII
   7   8   9   10   11   12   13   14   15   16   17