Portfolio

Projects

Current work and past deliverables from our members

Active

Current Projects

Unifying Optimal Transport Frameworks in Diffusion

This project focuses on unifying diffusion and optimal transport frameworks, including Flow Matching, Schrödinger Bridges, and DDPMs, to simplify diffusion model training and inference. It explores DiT architectures and optimized sampling methods to improve performance.

Sp26

Team: Steve Yin

View slides

AI News Digest

This project develops a personalized AI news digest system that uses user-interaction data to generate timed news summaries and recommend articles. It includes article ingestion, cleaning, embedding, user-interest modeling, and ranking pipelines using RSS feeds and vector similarity search.

Sp26

Team: Matthew, Aaryan, Kevin

View slides
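The ranking step mentioned in the AI News Digest entry can be illustrated with a minimal sketch. It assumes articles and the user's interest profile are already embedded as equal-length vectors and scores them by cosine similarity. The function names, article IDs, and toy vectors are hypothetical and not taken from the project; a production system would likely use a vector index rather than a linear scan.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_articles(user_vector, article_vectors, top_k=3):
    """Return the top_k (article_id, score) pairs, most similar first."""
    scored = [
        (article_id, cosine_similarity(user_vector, vec))
        for article_id, vec in article_vectors.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical 3-dimensional embeddings, for illustration only.
articles = {
    "gpu-launch": [0.9, 0.1, 0.0],
    "election": [0.0, 0.2, 0.9],
    "llm-paper": [0.8, 0.3, 0.1],
}
user_interest = [1.0, 0.2, 0.0]

ranking = rank_articles(user_interest, articles, top_k=2)
print([article_id for article_id, _ in ranking])
```

With these toy vectors, the two technology-leaning articles rank ahead of the election article.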

Duke AI Tour Guide

AI-enabled tour guide web app that allows users to take pictures of Duke campus buildings and receive short summaries with up-to-date information. Uses CLIP for building recognition, LLMs for chatbot responses, and real-time location tracking for campus navigation.

Sp26

Team: Taylor Allen, Erica Zhang, Natalie Lai, Reese Pagtalunan, Uzair Chaudhry, Veronica Guo

View slides

Courtvision: AI-Powered Sport Film Review

AI-powered interface for competitive sports teams using computer vision techniques, including object detection and player tracking, to analyze sports video, map court locations, and segment game footage into player-specific possession clips.

Sp26

Team: Peakay, Suvas, Alysa, Adonias

View slides

Multimodal NCDE for Medical Image Forecasting

Develops a multimodal neural controlled differential equation (NCDE) model that performs optical flow estimation, interpolation, and extrapolation on medical image sequences to forecast disease progression in Alzheimer's patients using PET and MRI data.

Sp26

Team: Aashish Cheruvu, James Wright, Matthew Xie, Tristan Carter, Alan Ye

View slides

Comparative Instructions Evaluation for Vision-Language-Action Models

Internal research initiative evaluating whether Vision-Language-Action (VLA) models can correctly interpret relational comparison instructions between objects using controlled two-object scenes in simulation environments like SimplerEnv or LIBERO.

Sp26

View slides

Music Genre Prediction

Audio classification system that predicts the genre of a 30-second music clip using machine learning models trained on extracted audio features. Uses the FMA small dataset for preprocessing, feature extraction, label generation, and model evaluation.

Sp26

Team: Kaijing Zheng, Christian Kirby, Graze Zheng, Anoushka Chaudhury

View slides

Using Financial Data for Stock Price Prediction

Financial prediction model that forecasts whether a stock will move up or down on the next trading day using company-related news headlines and historical price behavior. Combines news-based, price-based, and ensemble modeling approaches.

Sp26

Team: Cheri Ho, Moses Cho, Tim Jun, Gordon Li, Raphael Karamagi

View slides

Predicting Changes in Prediction Markets

This project focuses on predicting changes in prediction market probabilities for events such as recessions, policy decisions, and geopolitical outcomes using market history and macroeconomic signals. It combines Polymarket data with financial indicators like VIX, gold, oil, and Bitcoin, and evaluates OLS, RidgeCV, and XGBoost models across multiple forecasting horizons.

Sp26

Team: Ruben Marcus, Alejandro Wigisser

View slides

Automatic Speech Destuttering

Machine learning pipeline that automatically detects and removes speech dysfluencies, or stuttering. Uses Whisper-generated transcripts, a dysfluency detection model trained on labeled data, and time-domain audio removal to create fluent speech and transcripts.

Sp26

Team: Danielle Li, Bryan Jiang, Nicole Li, Joseph Song, Colin Zeng

View slides
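The time-domain removal step in the Automatic Speech Destuttering entry can be sketched as follows. This is a minimal illustration, assuming the detection stage has already produced a list of (start, end) times in seconds for the spans to cut. The function name and the toy signal are hypothetical, and a real pipeline would likely smooth the cut boundaries (for example with a short crossfade) to avoid audible clicks.

```python
def remove_segments(samples, sample_rate, segments):
    """Return a copy of samples with the given (start_sec, end_sec) spans cut out.

    Overlapping or out-of-range spans are handled by clamping to the
    part of the signal that has not been cut yet.
    """
    kept = []
    cursor = 0  # index of the first sample not yet copied or cut
    for start_sec, end_sec in sorted(segments):
        start = max(cursor, int(round(start_sec * sample_rate)))
        end = min(len(samples), int(round(end_sec * sample_rate)))
        if end <= start:
            continue  # span is empty or already covered by an earlier cut
        kept.extend(samples[cursor:start])
        cursor = end
    kept.extend(samples[cursor:])
    return kept


# Toy signal: 10 samples at 10 Hz, so sample i starts at i / 10 seconds.
signal = list(range(10))
flagged = [(0.2, 0.4), (0.7, 0.8)]  # hypothetical disfluent spans

print(remove_segments(signal, 10, flagged))
```

Here the samples at indices 2, 3, and 7 are dropped and the rest are concatenated in order.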

Liberata – Topic Tagging & Classification

A client project applying ML techniques to paper tagging and classification.

Sp26

Team: Rohan Wilmot, Iurii Beliaev, Mihail Mircheski, Ian Laurence, Charlie Berman, Aashiv Jain

Liberata – Author Attribution

A client project applying ML techniques to author attribution.

Sp26

Team: Rohan Wilmot, Iurii Beliaev, Mihail Mircheski, Ian Laurence, Charlie Berman, Aashiv Jain

AI Chess Engine

A project dedicated to building, from scratch, an AI chess engine that can analyze and play chess games.

Sp25

Team: Haiyan Wang, Benjamin Yan, Jai Kasera

View repo

Legislator Chatbot

Chatbot that leverages Retrieval Augmented Generation (RAG) to provide up-to-date information about US legislation and policy using bills, hearings, and voting polls. Integrates recent data scraped from the US Congress website to answer policy-related questions more accurately than a standard LLM.

Client Project | Sp24

Team: Jai Kasera

View repo