Page 236 - Touhpad Ai
P. 236
6. You can create a DataFrame by passing a or a .
7. represents a one-dimensional labelled array capable of holding various datatypes.
8. pd.to_datetime() is used to convert a column to .
9. refers to converting data from one format, structure, or type to another.
10. is the process of converting data into a consistent and uniform format.
C. State whether these statements are true or false.
1. Data transformation means converting raw data into graphs.
2. Filling missing scores with an average is an example of data transformation.
3. str.strip() adds spaces to text values.
4. Z-score normalisation converts data so that the mean becomes 0.
5. Scale adjustment ensures all data uses the same units of measurement.
6. Standardised data is easier to compare and combine.
7. The apply() method in Pandas can be used to apply a function to all values in a column.
8. You cannot add a new row to a DataFrame once it has been created.
9. Using .str.lower() and .str.strip() together helps unify text data.
10. Kaggle was founded in 2010 by Anthony Goldbloom and Ben Hamner.
SECTION B (Subjective Type Questions)
A. Short answer type questions.
1. What is data cleaning and why is it important?
Ans. Data cleaning is the process of detecting and correcting errors in a dataset to make it accurate,
consistent, and ready for analysis. It helps to remove errors and mistakes, and makes the data reliable
and ready to use.
2. Name any three common issues found in raw data.
Ans. Three common issues found in raw data are missing values, incorrect or inconsistent entries, spelling errors.
3. Name the two main data structures in Pandas.
Ans. The two main data structures in Pandas are Series and DataFrame.
4. Write the name of the Python library that is widely used for data manipulation and analysis.
Ans. Pandas is a popular Python library widely used for data manipulation and analysis.
5. What is Kaggle and how is it useful for students?
Ans. Kaggle is a free online platform where students and data scientists can download real datasets, practice
coding, take learning courses, and participate in competitions.
6. What is data standardisation?
Ans. Data standardisation is the process of converting data into a uniform format or scale so that it becomes
easier to compare, combine, and analyse across datasets.
7. What is Z-score normalization?
Ans. Z-score normalization adjusts data so that the mean becomes 0 and the standard deviation becomes
1. It is helpful in machine learning models to scale different features.
8. How can you install Pandas in Python?
Ans. You can install Pandas using pip. For installing Pandas, you need to open your terminal or command
prompt and run the following command:
pip install pandas
234 Touchpad Artificial Intelligence - XI

