
Jerry Huang, Siddarth Madala, Risham Sidhu, Cheng Niu, Hao Peng, Julia Hockenmaier, Tong Zhang
Under review. 2025
We introduce RAG-RL, an answer generation model trained to identify and reason over larger sets of retrieved information using reinforcement learning and curriculum learning.
Jerry Huang, Siddarth Madala, Risham Sidhu, Cheng Niu, Hao Peng, Julia Hockenmaier, Tong Zhang
Under review. 2025
We introduce RAG-RL, an answer generation model trained to identify and reason over larger sets of retrieved information using reinforcement learning and curriculum learning.

Jerry Huang, Siddarth Madala, Cheng Niu, Julia Hockenmaier, Tong Zhang
Under review. 2025
We investigate the context-dependent nature of LLM-based relevance judgements in the document reranking setting.
Jerry Huang, Siddarth Madala, Cheng Niu, Julia Hockenmaier, Tong Zhang
Under review. 2025
We investigate the context-dependent nature of LLM-based relevance judgements in the document reranking setting.

Yaowenqi Liu, BingXu Meng, Rui Pan, Jerry Huang, Tong Zhang
Under review. 2025
We introduce a scalable advising system for research idea evaluation by leveraging a compressed literature database and fine-tuning techniques.
Yaowenqi Liu, BingXu Meng, Rui Pan, Jerry Huang, Tong Zhang
Under review. 2025
We introduce a scalable advising system for research idea evaluation by leveraging a compressed literature database and fine-tuning techniques.

For my undergraduate thesis, I investigated the feasibility of using software to generate fingerprint lineups (similar eye-witness lineups) in order to quantify the amount of cognitive bias present in the field of forensic science.
For my undergraduate thesis, I investigated the feasibility of using software to generate fingerprint lineups (similar eye-witness lineups) in order to quantify the amount of cognitive bias present in the field of forensic science.