Page 284 - Data Science class 11

            5.  Mean, mode, and median are known as "Measures of Central Tendency". Briefly describe what they are.
            6.  What is the median and how is the median calculated?
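
            As a small illustration of the three measures named in question 5, the sketch below works through
            a made-up vector (an assumption, not data from these exercises). Base R has no built-in function
            for the statistical mode, so stat_mode is a hypothetical helper written only for this example.

```r
# Illustrative data (an assumption, not taken from the exercises)
x <- c(4, 9, 2, 9, 5, 7)

# Mean: the sum of the values divided by how many values there are
mean_x <- mean(x)

# Median: sort the values and take the middle one;
# when the count is even, average the two middle values
sorted_x <- sort(x)
n <- length(sorted_x)
median_by_hand <- if (n %% 2 == 1) {
  sorted_x[(n + 1) / 2]
} else {
  (sorted_x[n / 2] + sorted_x[n / 2 + 1]) / 2
}

# Hypothetical helper: return the most frequent value
# (if several values tie, the smallest of them is returned)
stat_mode <- function(v) {
  counts <- table(v)
  as.numeric(names(counts)[which.max(counts)])
}

print(mean_x)
print(median_by_hand)
print(median(x))      # built-in median, for comparison
print(stat_mode(x))
```

            The hand-computed median is shown next to the built-in median() so the two can be compared.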


        B.  Long answer type questions:
            1.  Draw a bar plot in R, showing the number of days (y-axis) in each month for the year 2021 (x-axis).

            2.  The table below shows the number of washing machines sold in the first six months of the year 2021 by two different
              stores, Store A and Store B.

               Month              Jan          Feb           Mar           Apr          May            Jun
               Store A            32            34            35            28           37            45
               Store B            28            22            34            36           25            20
               Use the data shown in the table above to draw a side-by-side bar chart with the month on the x-axis, the number
              of washing machines sold on the y-axis, and a legend for the bars of Store A and Store B.
            3.  For a given vector (7, 6, 3, 8, 6, 5, 7, 1, 9, 2, 7, 6), write a program in R to calculate and display the mean, median, and
              mode of the vector.

            4.  What are the different types of charts/graphs/plots that represent Visualisations in Data Science?
            5.  Write the basic syntax for creating a pie chart in R.

            6.  R language supports two ways to plot a bar plot. What are they?
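
            The sketch below illustrates the bar orientations that base R's barplot() can draw, using made-up
            counts (an assumption, not the data from the questions above). The variable names are chosen only
            for this example.

```r
# Made-up counts, for illustration only
sales <- c(East = 12, West = 7, North = 15)

# Vertical bars (the default orientation)
barplot(sales, main = "Vertical bars", xlab = "Region", ylab = "Units")

# Horizontal bars: same data, with horiz = TRUE
barplot(sales, horiz = TRUE, main = "Horizontal bars",
        xlab = "Units", ylab = "Region")

# A matrix input with beside = TRUE places the bars of each group
# side by side; legend.text = TRUE builds the legend from the row names
two_groups <- rbind(A = c(3, 5), B = c(4, 2))
colnames(two_groups) <- c("Q1", "Q2")
barplot(two_groups, beside = TRUE, legend.text = TRUE,
        main = "Side-by-side bars")
```

            Leaving out beside = TRUE stacks the rows of the matrix on top of each other instead.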

             Higher Order Thinking Skills (HOTS)
            Please answer the questions below in no less than 200 words.

            1.  In R, we have an inbuilt dataset named Orange. Plot a scatter plot showing age (x-axis) vs circumference (y-axis). The
              scatter plot should be plotted first using the base R plot() function and then using the ggplot2 library.
            2.  In R, we have an inbuilt data set named Orange. Plot a histogram for the circumference of the Orange dataset.

            3.  In R, we have an inbuilt data set named BOD. Use the data set to draw a line chart plotting Time vs. Demand.
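
            Before plotting an inbuilt dataset, it can help to check its column names and types. The lines below
            are a minimal sketch of that inspection step using base R only. Note that the demand column of BOD is
            spelled in lowercase in the dataset itself.

```r
# Load the inbuilt datasets into the workspace
data(Orange)
data(BOD)

# Column names, needed to refer to the right variables when plotting
orange_cols <- names(Orange)
bod_cols <- names(BOD)
print(orange_cols)
print(bod_cols)

# Structure and first rows give a quick view of types and values
str(Orange)
head(BOD)
```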

              Applied Project

             There is an inbuilt dataset named "iris" present in R. Use the iris dataset to plot the box plot of sepal length for the different
             species present. Also plot a scatter plot of sepal width against sepal length.

         Answers
         A.  1. d.   2. c.   3. a.   4. b.    5. b.    6. a.    7. d.    8. a.   9. d.
         B.   1. ggplot2     2. vertical bars and horizontal bars   3. histograms   4. na.rm     5. box plot
         C.  1. False        2. True         3. True         4. False        5. True




          282   Touchpad Data Science-XI