Data Engineering at the University of Florida
Due: Monday, February 23, 2026 at 11:59 PM
Points: 100
Submission: Push to the cis6930sp26-project repository in the proposal/ directory
The proposal establishes the foundation for your project. You will define a research question, select a direction, and outline your approach. A strong proposal demonstrates that you understand the problem space and have a feasible plan.
Submit a proposal document (2-3 pages) that includes:
```
cis6930sp26-project/
├── proposal/
│   └── proposal.md (or proposal.pdf)
└── ...
```
| Criterion | Weight | Description |
|---|---|---|
| Problem Statement | 20% | Is the research problem clearly defined and well-motivated? |
| Related Work | 20% | Does the proposal demonstrate knowledge of prior work? |
| Proposed Approach | 25% | Is the methodology feasible and technically sound? |
| Evaluation Plan | 20% | Are the proposed metrics and baselines appropriate? |
| Writing Quality | 15% | Is the proposal well-written and organized? |
| Score | Meaning |
|---|---|
| 5 | Excellent - Ready to proceed |
| 4 | Good - Strong with minor improvements needed |
| 3 | Satisfactory - Acceptable but needs refinement |
| 2 | Needs Work - Significant gaps or issues |
| 1 | Incomplete - Major revision required |
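To make the grading concrete, here is a small sketch of how the criterion weights and the 1-5 scale might combine into a final percentage. The mapping of a 5 to 100% is an assumption for illustration, not a statement of course policy.

```python
# Hypothetical sketch: combining per-criterion rubric scores (1-5) with the
# criterion weights from the table above into a 0-100 percentage.
# The example scores are illustrative.

WEIGHTS = {
    "Problem Statement": 0.20,
    "Related Work": 0.20,
    "Proposed Approach": 0.25,
    "Evaluation Plan": 0.20,
    "Writing Quality": 0.15,
}

def weighted_percentage(scores: dict) -> float:
    """Map per-criterion 1-5 scores to a 0-100 percentage.

    Assumes a 5 on every criterion corresponds to 100%.
    """
    return 100 * sum(WEIGHTS[c] * (scores[c] / 5) for c in WEIGHTS)

example = {
    "Problem Statement": 5,
    "Related Work": 4,
    "Proposed Approach": 4,
    "Evaluation Plan": 3,
    "Writing Quality": 5,
}
print(round(weighted_percentage(example), 1))  # 83.0
```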
**Problem Statement (20%)**

| Score | Description |
|---|---|
| 5 | Crystal clear problem; compelling motivation; significance well-argued |
| 4 | Clear problem statement; good motivation |
| 3 | Problem understandable but motivation could be stronger |
| 2 | Problem vague or poorly motivated |
| 1 | No clear problem statement |
Guiding Questions:
**Related Work (20%)**

| Score | Description |
|---|---|
| 5 | Comprehensive survey; clear positioning relative to prior work |
| 4 | Good coverage of relevant work; identifies gaps |
| 3 | Some relevant work cited; positioning could be clearer |
| 2 | Limited related work; missing key references |
| 1 | No related work or completely irrelevant citations |
Guiding Questions:
**Proposed Approach (25%)**

| Score | Description |
|---|---|
| 5 | Innovative approach; clearly feasible; well-justified choices |
| 4 | Sound methodology; reasonable approach |
| 3 | Approach understandable but some details unclear |
| 2 | Methodology vague or potentially infeasible |
| 1 | No clear approach or fundamentally flawed |
Guiding Questions:
**Evaluation Plan (20%)**

| Score | Description |
|---|---|
| 5 | Comprehensive evaluation; appropriate metrics; strong baselines |
| 4 | Good evaluation plan; reasonable metrics and baselines |
| 3 | Basic evaluation outlined; some gaps |
| 2 | Evaluation unclear or inappropriate metrics |
| 1 | No evaluation plan |
Guiding Questions:
**Writing Quality (15%)**

| Score | Description |
|---|---|
| 5 | Exceptionally clear; well-organized; no errors |
| 4 | Clear writing; good organization; minor errors |
| 3 | Understandable but could be clearer; some disorganization |
| 2 | Hard to follow; significant writing issues |
| 1 | Incomprehensible or severely disorganized |
Research Question: Can LLM-orchestrated MCP servers achieve comparable accuracy to hand-coded ETL scripts when integrating heterogeneous smart city data sources?
Abstract: This project develops an LLM-augmented data pipeline for integrating data from multiple smart city portals. The system uses MCP servers to expose APIs for Gainesville’s transit, utilities, and 311 request data. An LLM orchestrator coordinates data extraction, schema mapping, and quality validation. I evaluate the approach by comparing extraction accuracy and development effort against equivalent hand-coded Python scripts.
Architecture:
```
┌─────────────────┐   ┌─────────────────┐   ┌─────────────────┐
│   Transit API   │   │  Utilities API  │   │     311 API     │
│   MCP Server    │   │   MCP Server    │   │   MCP Server    │
└────────┬────────┘   └────────┬────────┘   └────────┬────────┘
         │                     │                     │
         └──────────┬──────────┴─────────────────────┘
                    │
             ┌──────▼───────┐
             │     LLM      │
             │ Orchestrator │
             └──────┬───────┘
                    │
             ┌──────▼───────┐
             │  Integrated  │
             │   Database   │
             └──────────────┘
```
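As a rough illustration of the flow in the diagram, the sketch below stubs each MCP server as a plain function and merges its records under a common schema. All server names, fields, and values are hypothetical stand-ins; a real system would call MCP tools and use the LLM for schema mapping.

```python
# Hypothetical sketch of the orchestration flow in the diagram above.
# Each "server" is stubbed as a function returning sample records;
# all names and fields are illustrative, not real city APIs.

def transit_server() -> list:
    return [{"stop_id": "S1", "delay_min": 4}]

def utilities_server() -> list:
    return [{"meter": "M7", "kwh": 12.5}]

def three11_server() -> list:
    return [{"request_id": "R9", "category": "pothole"}]

def orchestrate() -> list:
    """Pull from each source and tag records with a 'source' field,
    standing in for LLM-driven schema mapping into one database."""
    integrated = []
    for name, fetch in [("transit", transit_server),
                        ("utilities", utilities_server),
                        ("311", three11_server)]:
        for record in fetch():
            integrated.append({"source": name, **record})
    return integrated

print(len(orchestrate()))  # 3 records, one per source
```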
Evaluation Plan:
Research Question: How does GPT-4 entity resolution performance compare to Magellan on structured product catalogs, and at what scale does token cost exceed traditional ML training cost?
Abstract: This project compares LLM-based entity resolution against Magellan, a traditional ML-based entity matching system. Using the Abt-Buy product matching benchmark, I implement both approaches and evaluate matching accuracy, runtime, and cost. The project produces a cost-performance tradeoff analysis to guide practitioners in choosing between approaches.
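The head-to-head comparison would score both systems' predicted match pairs against the benchmark's gold labels. Below is a minimal sketch of that scoring; the pair IDs are made up, while Abt-Buy supplies the real labeled pairs.

```python
# Illustrative sketch: scoring entity-resolution output against gold
# match pairs, as comparing the LLM and Magellan would require.
# The pair IDs below are invented for the example.

def match_f1(predicted: set, gold: set) -> float:
    """Precision/recall/F1 over sets of (left_id, right_id) match pairs."""
    tp = len(predicted & gold)
    if not predicted or not gold or tp == 0:
        return 0.0
    precision = tp / len(predicted)
    recall = tp / len(gold)
    return 2 * precision * recall / (precision + recall)

gold = {("abt_1", "buy_1"), ("abt_2", "buy_5"), ("abt_3", "buy_9")}
predicted = {("abt_1", "buy_1"), ("abt_2", "buy_5"), ("abt_4", "buy_2")}
print(round(match_f1(predicted, gold), 3))
```

The same predicted-pair sets can then be annotated with runtime and token cost to build the tradeoff analysis.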
Evaluation Plan:
Research Question: Can an LLM-based diagnostic agent reduce data pipeline downtime by automatically detecting and suggesting fixes for common failures?
Abstract: This project develops a self-healing data pipeline architecture where an LLM agent monitors pipeline health, diagnoses failures, and suggests or applies fixes. The system uses MCP servers to expose pipeline metadata, logs, and configuration. I evaluate the approach by injecting common failures (schema drift, API rate limits, data quality issues) and measuring detection accuracy and fix appropriateness.
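One of the injected failures, schema drift, can be detected by diffing observed columns against an expected schema. The sketch below shows that check; column names are hypothetical, and a real diagnostic agent would pass such a diff to the LLM to propose a fix.

```python
# Hypothetical sketch of detecting one injected failure mode: schema drift.
# Column names are made up; a real agent would read the pipeline's
# actual schema from metadata exposed via an MCP server.

EXPECTED_COLUMNS = {"timestamp", "sensor_id", "reading"}

def detect_schema_drift(batch: list) -> dict:
    """Return missing and unexpected columns for a batch of records."""
    observed = set()
    for record in batch:
        observed |= set(record.keys())
    return {
        "missing": sorted(EXPECTED_COLUMNS - observed),
        "unexpected": sorted(observed - EXPECTED_COLUMNS),
    }

# A drifted batch: the upstream source renamed "sensor_id" to "sensor".
drifted = [{"timestamp": "2026-01-01", "sensor": "A3", "reading": 1.2}]
print(detect_schema_drift(drifted))
```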
Evaluation Plan:
- **Be specific** - Vague proposals receive lower scores. Specify exact datasets, metrics, and methods.
- **Scope appropriately** - A focused project with strong evaluation beats an ambitious project you cannot complete.
- **Start with evaluation** - Define how you will measure success before designing the system.
- **Cite relevant work** - Show you understand the landscape. Include 5-10 relevant papers.
- **Include an architecture diagram** - A picture clarifies your design better than paragraphs of text.
- **Address feasibility** - Acknowledge risks and explain how you will mitigate them.
Push your proposal to the proposal/ directory and add cegme as an Admin collaborator on the repository.