publications

publications by category in reverse chronological order. generated by jekyll-scholar.

2024

  1. NeurIPS
    Soft-Label Integration for Robust Toxicity Classification
    Zelei Cheng, Xian Wu, Jiahao Yu, and 3 more authors
    In Proceedings of the 38th Conference on Neural Information Processing Systems 2024
  2. arXiv
    UTF: Undertrained Tokens as Fingerprints A Novel Approach to LLM Identification
    Jiacheng Cai*, Jiahao Yu*, Yangguang Shao, and 2 more authors
    arXiv preprint 2024
  3. USENIX
    LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks
    Jiahao Yu, Xingwei Lin, Zheng Yu, and 1 more author
    In Proceedings of the 33rd USENIX Security Symposium 2024
  4. arXiv
    BlockFound: Customized blockchain foundation model for anomaly detection
    Jiahao Yu*, Xian Wu*, Hao Liu, and 2 more authors
    arXiv preprint 2024
  5. arXiv
    Decoupled Alignment for Robust Plug-and-Play Adaptation
    Haozheng Luo*, Jiahao Yu*, Wenxin Zhang, and 4 more authors
    arXiv preprint 2024
  6. arXiv
    PROMPTFUZZ: Harnessing Fuzzing Techniques for Robust Testing of Prompt Injection in LLMs
    Jiahao Yu*, Yangguang Shao*, Hanwen Miao*, and 2 more authors
    arXiv preprint 2024
  7. arXiv
    Enhancing Jailbreak Attack Against Large Language Models through Silent Tokens
    Jiahao Yu*, Haozheng Luo*, Jerry Yao-Chieh Hu, and 3 more authors
    arXiv preprint 2024
  8. ICML
    RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation
    Spotlight Top-3.5%
    Zelei Cheng, Xian Wu, Jiahao Yu, and 3 more authors
    In Proceedings of the 41st International Conference on Machine Learning 2024
  9. ICLR@SET-LLM
    Assessing Prompt Injection Risks in 200+ Custom GPTs
    Jiahao Yu, Yuhang Wu, Dong Shu, and 3 more authors
    In ICLR 2024 Workshop on Secure and Trustworthy Large Language Models 2024
  10. ICSE@SBFT
    BandFuzz: A Practical Framework for Collaborative Fuzzing with Reinforcement Learning
    1st Place in SBFT Challenge
    Wenxuan Shi, Hongwei Li, Jiahao Yu, and 2 more authors
    In the 17th International Workshop on Search-Based and Fuzz Testing 2024

2023

  1. NeurIPS
    StateMask: Explaining Deep Reinforcement Learning through State Mask
    Zelei Cheng*, Xian Wu*, Jiahao Yu*, and 3 more authors
    In Proceedings of the 37th Conference on Neural Information Processing Systems 2023
  2. arXiv
    GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
    Geekcon 2023 Annual Themed Debate Breakthrough Awards
    Jiahao Yu, Xingwei Lin, Zheng Yu, and 1 more author
    arXiv preprint 2023

2022

  1. USENIX
    AIRS: Explanation for Deep Reinforcement Learning based Security Applications
    Jiahao Yu, Wenbo Guo, Qi Qin, and 3 more authors
    In Proceedings of the 2023 USENIX Security Symposium 2022

2021

  1. TMC
    Matrix Gaussian Mechanisms for Differentially-Private Learning
    Jungang Yang, Liyao Xiang, Jiahao Yu, and 4 more authors
    In IEEE Transactions on Mobile Computing 2021
  2. CIKM
    Speedup robust graph structure learning with low-rank information
    Hui Xu, Liyao Xiang, Jiahao Yu, and 2 more authors
    In Proceedings of the 30th ACM International Conference on Information & Knowledge Management 2021

2020

  1. J Phys Conf Ser
    Research on Application of Artificial Intelligence Technology in Electrical Automation Control
    Chao Jiang, Xiaorui Xiong, Tanqing Zhu, and 2 more authors
    In Journal of Physics: Conference Series 2020
  2. INFOCOM
    Voiceprint mimicry attack towards speaker verification system in smart home
    Lei Zhang, Yan Meng, Jiahao Yu, and 3 more authors
    In Proceedings of IEEE INFOCOM 2020

2019

  1. arXiv
    Invisible backdoor attacks against deep neural networks
    Shaofeng Li, Benjamin Zi Hao Zhao, Jiahao Yu, and 3 more authors
    arXiv preprint 2019