Storing Data

3.2. Storing Data#

Now that we’ve explored various ways to access datasets, it’s time to consider what we do once we have the data. This naturally leads to a fundamental questions like:

  • How do we store it?

  • Is there a one-size-fits-all solution?

  • What are the different storage options, and what are their trade-offs?

By the end of this section, you should be able to answer these questions with confidence.

We’ll explore several approaches to data storage:

  • Raw formats (e.g., CSV, JSON) for simple tabular datasets

  • Structured storage using relational database management systems (RDBMS) for complex datasets with multiple interrelated tables

  • Unstructured data storage using NoSQL databases, document stores, or object storage systems for flexible or schema-less data

In addition, we’ll learn:

  • How to design relational databases

  • How to write SQL queries to extract meaningful insights

  • How big enterprises manage their data

Let’s dive in.