CAP 5771 Spring 25

This is the web page for Introduction to Data Science at the University of Florida.

CAP 5771 Spring 2025 Project FAQ

Frequently Asked Questions

What is the rubric for Milestone 3?

The tentative rubric for Milestone 3 was made available earlier today. Please note that the rubric is preliminary and may be subject to change, but it provides guidance on expectations.

Milestone 3 FAQ: Grading and Expectations

This FAQ provides guidance on how Milestone 3 (Evaluation, Interpretation, Tool Development, and Presentation) will be assessed. Please review these points carefully. Note that these might change as well.


Q1: What is the total point value for Milestone 3 and how is it distributed?

Milestone 3 is worth a total of 250 points, distributed across the deliverables as follows:

Q2: What is the “Self-Explanatory” requirement and why is it important?

This is a crucial requirement for Milestone 3. Each deliverable component (Discussion Post/Presentation, Demo Video, Report, Code/Repo) must be understandable on its own.

Q3: What are the key expectations for the Discussion Post (Presentation & Comments)?

Q4: What are the key expectations for the Tool Demo Video?

Q5: What are the key expectations for the Report (PDF)?

The report is a significant component and should be comprehensive and stand alone. Key areas of assessment include:

Q6: What are the key expectations for the Code (GitHub Repository)?

Q7: We worked as a group. What do we need to show?

As mentioned for the report (Q5), if you worked in a group, you must include a section in your report detailing the specific contributions of each team member. You should also be able to articulate why the project scope and complexity warranted a two-person team.


We hope this FAQ clarifies the expectations for Milestone 3. Please review your submitted work against these points if you have questions about your assessment.

Should I include my video in my report?

In Milestone 3, your submission (including the video) should be in your repo. You can include the video as a link in the documentation.

What do I have to write for Milestone 3?

You should include all the content from the previous milestones, all new models, and a discussion of the tool you built. Consider this an end-of-semester paper summarizing your work.

Can I change my presentation date?

You can move your presentation to an earlier date, but you cannot switch to a later one.

Can I edit my presentation after submitting?

You can edit your presentation before the due date, but you must be explicit about the edit. State in the comments what was edited and why.

Do I need to fix my presentation based on the constructive feedback?

The goal of the peer feedback is to help you have the best final presentation possible. You are welcome to update your tool based on the feedback. Updating your presentation to make sure your peers give you the best feedback possible is recommended.

Do I need graphics in my presentation?

You will have at least 10 classmates watching your presentation. Aim to make it entertaining. You should include figures to support the description of your effort.

What is a dataset license?

A dataset license states the terms under which a dataset may be used, shared, and adapted. Creative Commons licenses (CC0, CC BY, CC BY-SA, CC BY-NC, etc.) and the Open Database License (ODbL) are used for datasets much more often than software licenses are. They are tailored towards factual information, compilations, and content sharing.

My Gradescope submission is causing issues (e.g., crashing, too slow). What should I do?

Gradescope can sometimes struggle with submissions containing a very large number of files. If you experience crashes or excessive loading times when your submission is being graded (like what occurred for some during Milestone 2 grading):

  1. Review your uploaded files and remove any unnecessary ones. For example, if you have numerous intermediate files, logs, or extensive raw outputs that aren’t essential for grading, remove them or significantly reduce their number.
  2. Ensure your core code, documentation, models, and essential results demonstrating your work are still present.
  3. Troubleshooting steps like changing browsers or clearing cache might help but are unlikely to resolve issues caused by excessive file counts.
  4. If problems persist after significantly reducing the file count, please contact the TA or instructor.
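Before re-uploading, it can help to see where the files in your submission folder actually are. The following is a minimal sketch, not an official course tool: it counts files under a folder and groups them by top-level entry so that oversized directories stand out. The function names are hypothetical, and hidden folders such as .git are counted too.

```python
from collections import Counter
from pathlib import Path


def count_files_by_top_level(root: Path) -> Counter:
    """Count regular files under root, grouped by top-level entry."""
    counts: Counter = Counter()
    for path in root.rglob("*"):
        if path.is_file():
            # First path component relative to root, e.g. "logs" or "report.pdf".
            top = path.relative_to(root).parts[0]
            counts[top] += 1
    return counts


def print_report(root: Path, limit: int = 10) -> None:
    """Print the total file count and the entries holding the most files."""
    counts = count_files_by_top_level(root)
    print(f"Total files: {sum(counts.values())}")
    for name, n in counts.most_common(limit):
        print(f"{n:6d}  {name}")
```

Calling print_report(Path(".")) from the folder you plan to upload lists the entries with the most files first. Those are the first candidates to trim, as long as your core code, documentation, models, and essential results stay in place.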

What if I disagree with my grade or need a regrade on milestones?

If you are not satisfied with your grade, please know that grading aims to be fair, and Nanjie (Jimmy) Rao has already applied considerable leniency. If you still believe there is a specific error based on the rubric, you must follow the formal regrade request procedure outlined in the syllabus:

Please be aware of the policy implications. As noted by Dr. Grant regarding the formal process: submitting a formal request means your work might be reviewed more strictly by the professors. This carries the risk of potentially losing additional points if other issues are found during the stricter review. Submit a formal request only if you are confident there is a specific error in the grading according to the rubric.

How should I cite/document the source of my datasets?

Properly documenting your data sources is crucial. In your report:

  1. Provide a Direct Link (URL): Include a working URL for each dataset used. This could point to:
    • The Kaggle dataset page.
    • The Hugging Face dataset page.
    • A government data portal page.
    • A direct download link if provided by the source (like NASA or a specific research group).
    • The website where the data originates (e.g., datahub.io, Our World In Data).
    • An API endpoint documentation page (e.g., Open-Meteo API docs).
  2. State the Source Clearly: Name the platform or organization providing the data (e.g., “Sourced from Kaggle,” “Obtained from the U.S. Census Bureau,” “Accessed via the Open-Meteo API”).
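If you use several datasets, one way to keep the link and the source together is to record them in a small structure and generate the citation lines from it. This is only a sketch under stated assumptions: the DatasetSource class, its fields, and the citation wording are hypothetical and not a required format, and the check only confirms the URL is well formed, not that the link works.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass(frozen=True)
class DatasetSource:
    name: str      # dataset title as you refer to it in the report
    provider: str  # platform or organization, e.g. "Kaggle"
    url: str       # dataset page, direct download link, or API docs page

    def is_well_formed(self) -> bool:
        """Check for an http(s) scheme and a host; does not fetch the URL."""
        parts = urlparse(self.url)
        return parts.scheme in ("http", "https") and bool(parts.netloc)

    def citation(self) -> str:
        """Render one line that names the source and gives the link."""
        return f"{self.name}. Sourced from {self.provider}. {self.url}"


# Placeholder entry for illustration only; replace with your real dataset.
example = DatasetSource(
    name="Example Weather Dataset",
    provider="Kaggle",
    url="https://example.com/datasets/placeholder",
)
```

Printing example.citation() for each entry gives a consistent list to paste into the report. You should still open every link yourself to confirm it resolves.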

What are common software/code licenses for projects?

For code developed in your project (if applicable), common open-source licenses include:

These licenses often contain clauses specific to software distribution, modification, and patent rights. You can find more information on choosing and applying a license here:

What are common licenses for datasets?

Datasets often use licenses tailored for data and content sharing, which differ from typical software licenses. Common examples include:

These licenses focus more on the sharing, use, and adaptation of factual information, compilations, and creative content. Always check the specific license terms for any dataset you use.

