publications

publications by category in reverse chronological order. generated by jekyll-scholar.

2026

  1. arXiv
    Building Coding Agents via Entropy-Enhanced Multi-Turn Preference Optimization
    Jiahao Yu*, Zelei Cheng*, Xian Wu, and 1 more author
    arXiv preprint arXiv:2509.12434 2026
  2. arXiv
    Locus: Agentic Predicate Synthesis for Directed Fuzzing
    Jie Zhu, Chihao Shen, Ziyang Li, and 3 more authors
    arXiv preprint arXiv:2508.21302 2026

2025

  1. NIPS
    GPO: Learning from Critical Steps to Improve LLM Reasoning
    Jiahao Yu*, Zelei Cheng, Xian Wu, and 1 more author
    In 2025
  2. NIPS
    BlockScan: Detecting Anomalies in Blockchain Transactions
    Jiahao Yu*, Xian Wu*, Hao Liu, and 2 more authors
    In 2025
  3. USENIX
    Mind the Inconspicuous: Revealing the Hidden Weakness in Aligned LLMs’ Ethical Boundaries
    Long Talk
    Jiahao Yu*, Haozheng Luo*, Jerry Yao-Chieh Hu, and 3 more authors
    In Proceedings of the 34th USENIX Security Symposium 2025
  4. USENIX
    PATCHAGENT: A Practical Program Repair Agent Mimicking Human Expertise
    Long Talk
    Patched over 10 real-world bugs
    CSAW 2025 Finalist
    Zheng Yu, Ziyi Guo, Yuhang Wu, and 5 more authors
    In Proceedings of the 34th USENIX Security Symposium 2025
  5. ICML
    The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them)
    Zihao Wang, Yibo Jiang, Jiahao Yu, and 1 more author
    In Proceedings of the 42nd International Conference on Machine Learning 2025
  6. ACL@LLMSEC
    UTF: Undertrained Tokens as Fingerprints: A Novel Approach to LLM Identification
    Jiacheng Cai*, Jiahao Yu*, Yangguang Shao, and 2 more authors
    In 2025
  7. ICML@MemFM
    Knowledge-Distilled Memory Editing for Plug-and-Play LLM Alignment
    Haozheng Luo, Jiahao Yu, Wenxin Zhang, and 6 more authors
    In The Impact of Memorization on Trustworthy Foundation Models: ICML 2025 Workshop 2025
  8. arXiv
    PoisonCraft: Practical Poisoning of Retrieval-Augmented Generation for Large Language Models
    Yangguang Shao, Xinjie Lin, Haozheng Luo, and 4 more authors
    arXiv preprint arXiv:2505.06579 2025
  9. arXiv
    A survey on explainable deep reinforcement learning
    Zelei Cheng, Jiahao Yu, and Xinyu Xing
    arXiv preprint arXiv:2502.06869 2025
  10. arXiv
    GenoArmory: A Unified Evaluation Framework for Adversarial Attacks on Genomic Foundation Models
    Haozheng Luo, Chenghao Qiu, Yimin Wang, and 5 more authors
    arXiv preprint arXiv:2505.10983 2025
  11. arXiv
    BandFuzz: An ML-powered Collaborative Fuzzing Framework
    Wenxuan Shi, Hongwei Li, Jiahao Yu, and 3 more authors
    arXiv preprint arXiv:2507.10845 2025

2024

  1. USENIX
    LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks
    Jiahao Yu, Xingwei Lin, Zheng Yu, and 1 more author
    In Proceedings of the 33rd USENIX Security Symposium 2024
  2. NIPS
    Soft-Label Integration for Robust Toxicity Classification
    Zelei Cheng, Xian Wu, Jiahao Yu, and 3 more authors
    In Proceedings of the 38th Conference on Neural Information Processing Systems 2024
  3. ICML
    RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation
    Spotlight Top-3.5%
    Zelei Cheng, Xian Wu, Jiahao Yu, and 3 more authors
    In Proceedings of the 41st International Conference on Machine Learning 2024
  4. ICSE@SBFT
    BandFuzz: A Practical Framework for Collaborative Fuzzing with Reinforcement Learning
    Wenxuan Shi, Hongwei Li, Jiahao Yu, and 2 more authors
    In The 17th Intl Workshop on Search-Based and Fuzz Testing 2024
  5. ICLR@SET-LLM
    Assessing Prompt Injection Risks in 200+ Custom GPTs
    Featured in WIRED
    Jiahao Yu, Yuhang Wu, Dong Shu, and 3 more authors
    In ICLR 2024 Workshop on Secure and Trustworthy Large Language Models 2024
  6. arXiv
    PromptFuzz: Harnessing Fuzzing Techniques for Robust Testing of Prompt Injection in LLMs
    Jiahao Yu*, Yangguang Shao*, Hanwen Miao*, and 2 more authors
    In 2024
  7. arXiv
    Decoupled Alignment for Robust Plug-and-Play Adaptation
    Haozheng Luo*, Jiahao Yu*, Wenxin Zhang, and 4 more authors
    In 2024

2023

  1. arXiv
    GPTFuzzer: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
    Jiahao Yu, Xingwei Lin, Zheng Yu, and 1 more author
    In 2023
  2. NIPS
    StateMask: Explaining Deep Reinforcement Learning through State Mask
    Zelei Cheng*, Xian Wu*, Jiahao Yu*, and 3 more authors
    In Proceedings of the 37th Conference on Neural Information Processing Systems 2023

2022

  1. USENIX
    AIRS: Explanation for Deep Reinforcement Learning based Security Applications
    Jiahao Yu, Wenbo Guo, Qi Qin, and 3 more authors
    In Proceedings of the 2023 USENIX Security Symposium 2022

2021

  1. TMC
    Matrix Gaussian Mechanisms for Differentially-Private Learning
    Jungang Yang, Liyao Xiang, Jiahao Yu, and 4 more authors
    In IEEE Transactions on Mobile Computing 2021
  2. CIKM
    Speedup robust graph structure learning with low-rank information
    Hui Xu, Liyao Xiang, Jiahao Yu, and 2 more authors
    In Proceedings of the 30th ACM International Conference on Information & Knowledge Management 2021

2020

  1. INFOCOM
    Voiceprint mimicry attack towards speaker verification system in smart home
    Lei Zhang, Yan Meng, Jiahao Yu, and 3 more authors
    In Proceedings of IEEE INFOCOM 2020
  2. J Phys Conf Ser
    Research on Application of Artificial Intelligence Technology in Electrical Automation Control
    Chao Jiang, Xiaorui Xiong, Tanqing Zhu, and 2 more authors
    In Journal of Physics: Conference Series 2020

2019

  1. arXiv
    Invisible backdoor attacks against deep neural networks
    Shaofeng Li, Benjamin Zi Hao Zhao, Jiahao Yu, and 3 more authors
    In 2019