Page 133 - Data Science class 10

• A two-way table is a statistical table that shows the observed count or frequency for two variables: the rows indicate
               the categories of one variable and the columns indicate the categories of the other.
               • A two-way relative frequency table shows the percentage of data points that fall into each group.
               • Central tendency is a single value, derived from the values in a dataset, that reflects the midpoint of the
               data distribution.
               • The three most common measures of central tendency are the mean, median and mode.
               • The "mean," also known as the "simple average" in data science, is the sum of all values in a dataset divided by the number of values.
               • The "median" is the "middle" value in the list of numbers once they are arranged in order.
               • The Mean Absolute Deviation (MAD) of a dataset is the average distance between each data value and the mean.
               • The mode is the value that appears most often in a dataset.
               • The term "standard deviation" refers to a measurement of the data's dispersion from the mean.
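
The points above can be tried out in a few lines of Python. This is only a minimal sketch using the standard library: the survey responses and the list of numbers are invented for illustration, and the standard deviation shown is the population form (statistics.pstdev), which is an assumption, since the summary does not say whether the population or the sample formula is meant.

```python
from collections import Counter
import statistics

# Hypothetical survey data (invented): one (group, favourite sport) pair per student.
responses = [
    ("Girl", "Cricket"), ("Girl", "Football"), ("Boy", "Cricket"),
    ("Boy", "Cricket"), ("Girl", "Cricket"), ("Boy", "Football"),
    ("Boy", "Football"), ("Girl", "Football"), ("Boy", "Cricket"),
    ("Girl", "Cricket"),
]

# Two-way table: the observed count for each (row category, column category) pair.
counts = Counter(responses)

# Two-way relative frequency table: each count as a percentage of all data points.
total = len(responses)
relative = {pair: 100 * n / total for pair, n in counts.items()}

# Hypothetical dataset (invented) for the measures of central tendency and spread.
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean_value = statistics.mean(data)      # sum of the values divided by how many there are
median_value = statistics.median(data)  # middle value of the sorted list
mode_value = statistics.mode(data)      # value that appears most often

# Mean Absolute Deviation: average distance between each value and the mean.
mad = sum(abs(x - mean_value) for x in data) / len(data)

# Population standard deviation (assumption: population form, not sample form).
std_dev = statistics.pstdev(data)

print(counts[("Girl", "Cricket")], relative[("Girl", "Cricket")])
print(mean_value, median_value, mode_value, mad, std_dev)
```

If the sample standard deviation is wanted instead, statistics.stdev can be used in place of statistics.pstdev.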


                                                       Solved Exercise




                 Objective Type Questions (Section A)


            A.  Tick (✓) the correct option.
               1.  We want to get the red-coloured cars from the dataset below. Which type of subsetting
                  should be used?
                                                                                                     [CBSE Handbook]

                                              Name             Height          Color
                                              Innova              70           White
                                              Swift               50           Red
                                              Amaze               50           Red
                                              Bolero              80           Gray


                  a.  Column-based subsetting                        b.  Data-based subsetting
                  c.  Row-based subsetting                           d.  None of the above
               2.  What is the mode of the following dataset?
                  20, 11, 12, 13, 11, 15, 20, 25, 20, 38
                  a.  20                                             b.  11
                  c.  15                                             d.  8
               3.  Find out the wrong statement from the following:
                  a.  Central tendency means the value derived from the random variables in the set of data that reflects the midpoint
                     of the data distribution.
                  b.  Measures of central tendency help you find the middle, or the average, of a dataset.
                  c.  Central tendency describes the best business practice.
                  d.  The 3 most common measures of central tendency are the mean, median and mode.
               4.  Which of the following statements is not correct?
                  a.  The mean is also termed the simple average value of a dataset.
                  b.  The mean value is always descriptive of the best value.

                  c.  When we say mean, it normally indicates the arithmetic mean.
                  d.  The mean is a measure of central tendency.
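
The kinds of subsetting named in question 1 can be illustrated with a short Python sketch. It stores the table from question 1 as a list of dictionaries, which is an assumption about representation made only for this example. The condition used for the row-based case (Height greater than 60) is also invented for illustration and is not taken from the question.

```python
# The table from question 1, stored as one dictionary per row (assumed representation).
cars = [
    {"Name": "Innova", "Height": 70, "Color": "White"},
    {"Name": "Swift", "Height": 50, "Color": "Red"},
    {"Name": "Amaze", "Height": 50, "Color": "Red"},
    {"Name": "Bolero", "Height": 80, "Color": "Gray"},
]

# Column-based subsetting: keep every row, but only the chosen columns.
names_and_heights = [{"Name": car["Name"], "Height": car["Height"]} for car in cars]

# Row-based subsetting: keep every column, but only the rows that meet a condition.
# The condition here (Height > 60) is a made-up example.
tall_cars = [car for car in cars if car["Height"] > 60]

print(names_and_heights)
print(tall_cars)
```

Column-based subsetting changes which fields each record carries, while row-based subsetting changes which records are kept.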

                                                                               Use of Statistics in Data Science