Sources of data
Structured data Unstructured data
Data has a machine-readable format. Data requires a human to interpret.
Data adheres to a predefined data model. Data need not adhere to any predefined model.
Data is in a tabular / rectangular format (columns
display different attributes or variables, rows display
a particular record).
Data is in the form of social media feed, results of
research and development, surveys, call records, and
so on.
Data can be entered, stored, queried, or analysed by
machines.
Data requires human help to manually catalogue the
data.
Analysts can leverage on the model to know how
data is recorded, defining the different attributes
present, and providing information about the data
type and restrictions on their values.
Analysts can use machines to read each word, or
sentence, but not to interpret the meaning. (This is
where machine learning and other elements of
artificial intelligence come in to play.)
Examples: Names, dates, phone numbers, currency
or prices, heights or weights, word count or file size
of a document, credit card numbers, and so on.
Example: Images (both human or and machine-
generated), video files, audio files, social media
posts, product reviews, mobile SMS, and so on.