2025

RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning
RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning

Jerry Huang, Siddarth Madala, Risham Sidhu, Cheng Niu, Hao Peng, Julia Hockenmaier, Tong Zhang

Under review. 2025

We introduce RAG-RL, an answer generation model trained to identify and reason over larger sets of retrieved information using reinforcement learning and curriculum learning.

RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning

Jerry Huang, Siddarth Madala, Risham Sidhu, Cheng Niu, Hao Peng, Julia Hockenmaier, Tong Zhang

Under review. 2025

We introduce RAG-RL, an answer generation model trained to identify and reason over larger sets of retrieved information using reinforcement learning and curriculum learning.

Contextual Relevance: Modeling Context-Conditioned Relevance for Improving Document Reranking
Contextual Relevance: Modeling Context-Conditioned Relevance for Improving Document Reranking

Jerry Huang, Siddarth Madala, Cheng Niu, Julia Hockenmaier, Tong Zhang

Under review. 2025

We investigate the context-dependent nature of LLM-based relevance judgements in the document reranking setting.

Contextual Relevance: Modeling Context-Conditioned Relevance for Improving Document Reranking

Jerry Huang, Siddarth Madala, Cheng Niu, Julia Hockenmaier, Tong Zhang

Under review. 2025

We investigate the context-dependent nature of LLM-based relevance judgements in the document reranking setting.

GUIDE: Towards Scalable Advising for Research Ideas
GUIDE: Towards Scalable Advising for Research Ideas

Yaowenqi Liu, BingXu Meng, Rui Pan, Jerry Huang, Tong Zhang

Under review. 2025

We introduce a scalable advising system for research idea evaluation by leveraging a compressed literature database and fine-tuning techniques.

GUIDE: Towards Scalable Advising for Research Ideas

Yaowenqi Liu, BingXu Meng, Rui Pan, Jerry Huang, Tong Zhang

Under review. 2025

We introduce a scalable advising system for research idea evaluation by leveraging a compressed literature database and fine-tuning techniques.

2022

Fingerprint Matching
Fingerprint Matching

For my undergraduate thesis, I investigated the feasibility of using software to generate fingerprint lineups (similar eye-witness lineups) in order to quantify the amount of cognitive bias present in the field of forensic science.

Fingerprint Matching

For my undergraduate thesis, I investigated the feasibility of using software to generate fingerprint lineups (similar eye-witness lineups) in order to quantify the amount of cognitive bias present in the field of forensic science.

2020

Writing a Compiler
Writing a Compiler

I spent a semester at Duke learning to write a compiler from scratch.

Writing a Compiler

I spent a semester at Duke learning to write a compiler from scratch.