CS Capstone Project 2026

A multi‑agent platform for public opinion analysis with stance-aware retrieval, multimodal video understanding, and structured debate coordination.

📁 View source code on GitHub 🏛️ HKU · Supervisor: Prof. Kai Han

📌 Project progress & deliverables

📄 Project Setup & Proposal
✓ Completed – Environment, API contracts, full proposal & literature review.
🔍 Query Agent (stance-aware)
✓ Completed – Multi‑perspective sub‑query generation, credibility scoring, diversity‑aware reranking. Methodology documented.
🎬 Multimodal Agent (LangGraph)
✓ Framework ready – Video keyframe extraction, captioning & sentiment fusion pipeline.
⚠️ Image recognition still under refinement – standalone image understanding being actively improved.
⚙️ Agent Coordinator & Debate
In development – Moderator/Monitor design, multi‑round debate simulation, integration ongoing.
🖥️ Frontend MVP
In progress – Real‑time log streaming, interactive dashboard; basic API integration.
📑 Report Agent & Final integration
Planned – Structured report generation, knowledge graph visualisation, multi‑format export.
💡 Current milestone: Query Agent ✅ · Multimodal Agent (video understanding works; image recognition being polished).
Next: complete AgentCoordinator debate loop and frontend deployment.
🔧 Known limitation: The multimodal agent currently struggles with accurate image recognition (e.g., standalone photos, visual sentiment). We are actively debugging the vision‑language pipeline to improve robustness. Video processing is fully functional.

👥 Team Member

Lin Wenkai
Team Leader
Student ID: 3036657005
Kong Zeyu
Student ID: 3036655461
Li Yewen
Student ID: 3036656752
Miao Mengyu
Student ID: 3036661082
Huang Yawen
Student ID: 3036658633
🎓 Supervising Professor · Dr. Kai Han | Department of Computer Science, The University of Hong Kong
🏛️ The University of Hong Kong (HKU) – Faculty of Engineering, Computer Science Capstone Project 2025–2026

⚙️ System methodology (brief)

Our platform integrates four specialised agents orchestrated by a debate‑driven coordinator:

📌 Current implementation focus – The Multimodal Agent (LangGraph pipeline) successfully processes video keyframes and performs sentiment classification on captions. The team is debugging the image recognition pipeline to improve accuracy for standalone images and visual sentiment cues.

🌐 Deployment & access

This page is hosted via GitHub Pages from the main branch of the project repository.

🔗 Repository: https://github.com/lyw912/CapstoneProject
🌐 Live webpage: https://roselia-penguin.github.io/COMP7705-capstone/

✅ All progress updates (methodology write‑ups, agent code, preliminary results) are regularly committed to the GitHub repository. Mentors can monitor development through both the repository and this dashboard.