Jiahao Yu's Page

I am a final-year computer science Ph.D. candidate at Northwestern University, working with Prof. Xinyu. My research interests lie in Large Language Models and cybersecurity. I hold a B.S. degree from Shanghai Jiao Tong University (2021). I am on the faculty job market this year. If you have any research questions, feel free to contact me! Enjoy research and life :)
news
Date | News |
---|---|
Oct 6, 2025 | Our work PATCHAGENT was accepted as CSAW 2025 Finalist. We will be presenting the work in New York City! |
Oct 5, 2025 | The official SWEBench-Verified and SWEBench-Lite open-weight leaderboard has been updated. Our EntroPO ranks 1st on SWEBench-Lite and 5th on SWEBench-Verified (surpassed only by models 10x larger than ours). |
Oct 5, 2025 | Our work GPO: Learning from Critical Steps to Improve LLM Reasoning was covered by MIT Technology Review China. |
Mar 18, 2025 | Our work Soft-Label Integration for Robust Toxicity Classification was covered by MIT Technology Review China. |
Apr 18, 2024 | Our GPTFuzzer work won Geekcon 2023 Annual Themed Debate Breakthrough Awards and was covered by SECGEEK. |
selected publications
- [arXiv] Building Coding Agents via Entropy-Enhanced Multi-Turn Preference Optimization. arXiv preprint arXiv:2509.12434, 2026
- [NIPS] GPO: Learning from Critical Steps to Improve LLM Reasoning. Featured in MIT Technology Review China. 2025
- [NIPS]
- [USENIX] Mind the Inconspicuous: Revealing the Hidden Weakness in Aligned LLMs’ Ethical Boundaries. Long Talk. In Proceedings of USENIX Security 2025
- [USENIX] PATCHAGENT: A Practical Program Repair Agent Mimicking Human Expertise. Long Talk. Patched over 10 real-world bugs. CSAW 2025 Finalist. In Proceedings of USENIX Security 2025
- [ICML] The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them). In Proceedings of the 42nd International Conference on Machine Learning, 2025
- [USENIX] LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks. In Proceedings of USENIX Security 2024
- [NIPS] Soft-Label Integration for Robust Toxicity Classification. Featured in MIT Technology Review China. In Proceedings of the 38th Conference on Neural Information Processing Systems, 2024
- [ICML] RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation. Spotlight, Top-3.5%. In Proceedings of the 41st International Conference on Machine Learning, 2024
- [arXiv] GPTFuzzer: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts. 2023