Final Project Grading Rubric

grading-rubric table. Total score: 100 points.

Grading Criterion Points Details
Novelty and Significance 20
  1. The work is innovative, creative, and interesting. Examples include: (a) revealing previously unknown connections between two different methods; (b) exploring components of current methods that have not yet been sufficiently understood; (c) proposing improvements to existing methods; (d) explaining or analyzing the mechanisms and representations learned by existing models.
  2. The work is significant and provides new scientific insights. Simply applying existing methods to a new dataset and reporting results is not enough. Examples of sufficient significance include: (a) revealing fundamental limitations of current multimodal large model methods and explaining, through experiments or theoretical analysis, why they perform poorly; (b) exploring how to incorporate additional structure or knowledge from the application domain into multimodal large models to improve results.

For projects, papers, and competitions: Does the work address a non-trivial problem? Does it introduce novel architectural improvements, data-management strategies, or optimization techniques?

For proposals: Is the identified research gap significant? Is the proposed solution highly original and conceptually well developed?

The innovation and significance should be clearly described in the “Literature Review and Positioning” section of the technical report. Is the work sufficiently motivated by gaps in existing research? Include citations and analyze how the project extends, combines, challenges, and/or recontextualizes related work. The project should demonstrate an accurate understanding of the cited work and discuss its relevant assumptions, strengths, and limitations.

Scientific Rigor and Technical Correctness 30
  1. The method is reliable, the mathematical derivations are correct, and the conclusions are supported by the results.
  2. The experiments/analyses are well designed to test the scientific claims and are implemented fairly. Uncertainty should be quantified appropriately, and any claims of significance should be statistically supported. Appropriate baselines should be used, and relevant ablation studies and hyperparameter sweeps should be conducted where applicable.

Projects and papers (Options 1 and 3): Quality of experimental design, ablation studies, and baseline comparisons. For paper reproduction, the evaluation should consider the rigor of matching the original metrics and explaining any discrepancies.

Proposal (Option 2): Soundness of the theory, detailed mathematical formulation, feasibility analysis, and preliminary experiments or proof-of-concept that validate the core hypothesis.

Competition (Option 4): Robustness of the submitted model, final leaderboard ranking/metrics, and technical strategies used to overcome specific competition bottlenecks.

Multimodal Fusion 10

Evaluate the degree of intrinsic connection among modalities. Simple concatenation of text and image embeddings should score lower than sophisticated cross-modal attention mechanisms or models whose pretraining objectives are aligned. Architectures capable of joint prediction should receive high scores.

Formatting Requirements
Must follow the standard template
20

The report should have a coherent structure. The writing should be clear and readable, and formulas should be properly typeset. Figures and tables should be clear, with appropriate labels and captions. Avoid adding unnecessary formatting that is unrelated to the content, reduces blog readability, or distracts readers.

Clarity

The project report should be well written and well organized so that readers can understand it. The report should focus on a clearly stated central hypothesis. Figures should be clear and should help readers understand your analysis.

Is the research motivation clear? Does the report honestly discuss impact and limitations? Does it draw clear conclusions based on the experiments in the paper?

Required contents:

  1. Introduction or research motivation.
  2. Background and related work, with citations.
  3. Description of methods and experiments. For the Proposal Track, this section should describe or analyze the theory.
  4. Analysis of experimental results, with figures presenting the results. For the Proposal Track, this section should describe the proposed experimental plan.
  5. Conclusion or discussion, including limitations.

The report should not substantially exceed the word limit.

Source Code 5

The code must be clean, well documented, and reproducible.

Note for Option 2 (Proposal): The code may include preliminary experimental settings, simulated data pipelines, or architecture pseudocode.

Demo Video 5

The video should clearly demonstrate the system in operation or vividly explain the proposed theoretical concept.

Note for Option 2 (Proposal): It may present the architecture workflow and animations. Animations, diagrams, or a digital whiteboard may be used to show how data flows through the proposed system.

Presentation Slides 5

The 5-page PPT must concisely and clearly cover the problem, method, and results.

Team Report 5

Very important and required. List the specific contribution of each team member. This will be used to calculate individual team members’ grades.

Total: 100 points
Note on Grading Fairness: The grading rubric evaluates the depth of your scientific inquiry, not just the final output type. Choose the track that best aligns with your team.