Page 13 - CT_AI_Class-7
[Figure: Datasets are classified in two ways: based on data format and based on data structure.]
Based on Data Format
Data format refers to the form in which the data is stored, such as numbers, text, images or time-series data.
Based on data format, datasets can be classified as follows:
Numerical datasets: A numerical dataset consists of data in the form of numbers, such
as statistics, measurements or financial figures, which are used for calculations and
quantitative analysis.
Text datasets: A text dataset contains written or spoken language data, such as
articles, emails or social media posts, and is mainly used in natural language processing
tasks.
Multimedia datasets: A multimedia dataset includes a combination of different
types of data, such as images, audio and video, and is commonly used in applications
like speech recognition and video analysis.
Time-series datasets: A time-series dataset is made up of data points collected
over a period of time, such as stock prices or weather records, and is used to analyse
trends and make future predictions.
Spatial datasets: A spatial dataset contains information related to geographical
locations, such as maps or satellite data, and is used in applications like navigation
and geographic analysis.
Image datasets: An image dataset consists of collections of pictures or visual
data that are used in computer vision tasks such as object detection and image
classification.
Web datasets: A web dataset is gathered from online sources like websites, blogs
and web pages. It is used in applications such as search engines, recommendation systems
and web analytics.
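To make these formats concrete, here is a minimal Python sketch of what a tiny dataset of some of these types might look like. All values and names (test scores, messages, temperatures, place names and coordinates) are made up for illustration only; real datasets are much larger and are usually stored in files.

```python
from statistics import mean

# Numerical dataset: numbers used for calculation (made-up test scores).
scores = [72, 85, 90, 64]
average_score = mean(scores)

# Text dataset: written language (made-up messages).
messages = ["AI can see", "AI can hear and see"]
word_counts = [len(message.split()) for message in messages]

# Time-series dataset: values recorded in time order
# (made-up daily temperatures in degrees Celsius).
temperatures = [("2024-01-01", 18.0), ("2024-01-02", 19.5), ("2024-01-03", 21.0)]
change = temperatures[-1][1] - temperatures[0][1]  # last value minus first value

# Spatial dataset: places with locations
# (hypothetical names and latitude/longitude pairs).
places = {"School": (10.0, 20.0), "Library": (10.5, 20.2)}

# Image dataset: one tiny 2x2 grayscale picture, stored as pixel
# brightness values from 0 (black) to 255 (white).
image = [[0, 255], [255, 0]]
bright_pixels = sum(1 for row in image for pixel in row if pixel > 127)

print("Average score:", average_score)
print("Words per message:", word_counts)
print("Temperature change:", change)
print("Number of places:", len(places))
print("Bright pixels:", bright_pixels)
```

The same idea runs through every example: the format of the data decides which operations make sense, such as averaging numbers, counting words, or comparing values over time.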
21st Century Skills #Creativity
Art Integration Activity
Collect a few multimedia datasets (images, sounds or videos) from the internet. How do technologies
like facial recognition (images) or speech recognition (audio) use this data to understand and process
human input? Create a simple poster to show how AI can see and hear.

