3.1. Part 1: Data Access#

In this lab, will explore different method to access datasets. We will start with the Part 3: Anchor Problems and then explore building a nice real-time dashboard!

3.1.1. Task 1: Spotify Dataset#

The dataset is available on the following platforms. You may choose either source:

Note

You are expected to access and load the dataset programmatically using the methods described in the Popular Dataset Sources section, rather than manually downloading the file.

3.1.2. Task 2: Netflix Dataset#

Again, this dataset is also available online and feel free to choose the platform of your choice.

3.1.3. Task 3: Fraud Detection Dataset#

This dataset has been uploaded to hipergator. You should first setup your hipergator account and then use SSH to access the data. The data has been stored as a json file and you are required to read the json file and parse the data.

If you do not have access to hipergator, you can access the dataset from here.

3.1.4. Task 4: Streaming Dashboard#

You have been hired by a high-frequency crypto trading firm. They want a simple web dashboard that shows the latest cryptocurrency prices against the US dollar. Your goal is to build a real-time streaming interface using the Polling approach.

3.1.4.1. API#

For this task, we will use the CoinGecko Simple Price API. Documentation: https://docs.coingecko.com/v3.0.1/reference/simple-price

This API offers a free tier and does not require an API key.

3.1.4.2. Task#

The traders want to monitor the following cryptocurrencies:

  • Bitcoin

  • Ethereum

  • VeChain

  • TRON

  • Dogecoin

They want to see:

  • The current price of each coin in USD

  • The 24-hour trading volume

Your system must poll the API every 5 seconds and update the display. Present the information in a clear, intuitive layout that reflects thoughtful UI and UX decisions.