Explore Core Data Concepts

Topics


Identify how data is defined and stored


What is data?

Data is a collection of facts such as numbers, descriptions, and observations used in decision making.

Classification of data :

Structured data

  1. Tabular data that is represented by rows and columns in a database.
  2. Databases that hold tables in this form are called relational databases (the mathematical term relation refers to an organized set of data held as a table). Note: Each row in a table has the same set of columns. Relational Tables

Semi-structured data

JSON

Data defined, stored, and accessed in cloud computing


Identify types of data and data storage


The characteristics of relational and non-relational data

Describe the characteristics of relational and non-relational data

Relational Database

Non-Relational database

Describe transactional workloads

Relational Database workload

Describe analytical workloads


Describe the difference between batch and streaming data


Understand streaming and real-time data

Understand differences between batch and streaming data

DifferencesBatchStreaming
Data Scopeprocess all the data in the datasetmost recent data received, or within a rolling time window (the last 30 seconds, for example)
Data Size handling large datasets efficientlyindividual records or micro batches consisting of few records
Performancefew hours latencyorder of seconds or milliseconds
Analysisperforming complex analytics simple response functions, aggregates, or calculations such as rolling averages

Roles and Responsibitities

<- Back to Main Menu