3.2. Storing Data
Now that we’ve explored various ways to access datasets, it’s time to consider what we do once we have the data. This naturally leads to a few fundamental questions:
How do we store it?
Is there a one-size-fits-all solution?
What are the different storage options, and what are their trade-offs?
By the end of this section, you should be able to answer these questions with confidence.
We’ll explore several approaches to data storage:
Raw formats (e.g., CSV, JSON) for simple tabular datasets
Structured storage using relational database management systems (RDBMS) for complex datasets with multiple interrelated tables
Unstructured data storage using NoSQL databases, document stores, or object storage systems for flexible or schema-less data
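To make the contrast concrete before going into detail, here is a minimal sketch that stores the same two records in a CSV file, a JSON file, and a relational table. It uses only Python's standard library, with an in-memory SQLite database standing in for a full RDBMS. The records, the file names, and the `people` table are hypothetical and exist only for illustration; NoSQL and object storage are not shown.

```python
import csv
import json
import sqlite3
import tempfile
from pathlib import Path

# Hypothetical sample records, invented purely for illustration.
records = [
    {"id": 1, "name": "Ada", "city": "London"},
    {"id": 2, "name": "Grace", "city": "New York"},
]

workdir = Path(tempfile.mkdtemp())  # scratch directory for the example files

# 1. Raw format: CSV. Simple and portable, but every value is read back as text.
csv_path = workdir / "people.csv"
with csv_path.open("w", newline="") as f:
    writer = csv.DictWriter(f, fieldnames=["id", "name", "city"])
    writer.writeheader()
    writer.writerows(records)

with csv_path.open(newline="") as f:
    csv_rows = list(csv.DictReader(f))

# 2. Raw format: JSON. Keeps basic types (numbers, strings) and allows nesting.
json_path = workdir / "people.json"
json_path.write_text(json.dumps(records))
json_rows = json.loads(json_path.read_text())

# 3. Relational storage: an explicit schema, queried with SQL.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE people (id INTEGER PRIMARY KEY, name TEXT, city TEXT)")
conn.executemany("INSERT INTO people VALUES (:id, :name, :city)", records)
conn.commit()

sql_rows = conn.execute(
    "SELECT name FROM people WHERE city = ?", ("London",)
).fetchall()

print(type(csv_rows[0]["id"]).__name__)   # type of "id" after a CSV round trip
print(type(json_rows[0]["id"]).__name__)  # type of "id" after a JSON round trip
print(sql_rows)                           # rows matching the SQL filter
```

Even at this scale one trade-off is visible: the CSV round trip turns `id` into a string, JSON preserves it as an integer, and the relational table enforces a declared column type and lets us filter with a query instead of loading everything first.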
In addition, we’ll learn:
How to design relational databases
How to write SQL queries to extract meaningful insights
How big enterprises manage their data
Let’s dive in.