Page 122 - AI Ver 3.0 Class 11
P. 122
Types of Data
Data comes in different forms that affect how we analyse and understand information. Knowing about these type of
data: structured, semi-structured, and unstructured. Help us manage data better and find useful insights from a wide
range of datasets.
Types
of Data
Structured Unstructured
Semi-
structured
• Structured Data: This is like a neatly arranged table, with rows and columns that make it easy to understand and
work with. It includes information such as names, dates, addresses, and stock prices. Because of its organised nature,
it is straightforward to analyse and manipulate, making it a preferred format for many data-related tasks.
• Semi-structured Data: This falls somewhere between structured and unstructured data. While not as organised as
structured data, it is easier to handle than unstructured data. Semi-structured data uses metadata to identify certain
characteristics and organise data into fields, allowing some level of organisation and analysis. An example of semi-
structured data is an email. Emails contain structured elements such as the sender, recipient, timestamp, and subject
line, which follow a predictable format. However, the body of the email itself is unstructured, as it can contain free-
form text, images, and attachments.
• Unstructured Data: Unstructured data refers to information that lacks a predefined data model or is not organised
in a systematic manner. This absence of specific organisation makes it more challenging to analyse compared
to structured data. Examples include images, text documents, customer comments, and song lyrics. Extracting
meaningful insights from unstructured data requires specialised tools and techniques due to its varied formats and
lack of predefined structure.
Structured Data Semi-structured Data Unstructured Data
Often number or labels, Loosely organised into Text-heavy information
stored in a structured categories using meta tags. that's not organised in a
framework of columns and clearly defined or model.
rows relating to pre-set
parameters.
ID ID CODES IN DATABASES EMAILS IN INBOX, SENT, DRAFT MEDIA POSTS, EMAILS, ONLINE REVIEWS
NUMERICAL DATA GOOGLE SHEET TWEETS ORGANISED BY HASHTAGS VIDEOS, IMAGES
STAR RATINGS FOLDERS ORGANISED BY TOPIC SPEECH SOUNDS
120 Touchpad Artificial Intelligence (Ver. 3.0)-XI

