publications
Publications by category in reverse chronological order. Generated by jekyll-scholar.
2024
- NIPS. Soft-Label Integration for Robust Toxicity Classification. In Proceedings of the 38th Conference on Neural Information Processing Systems, 2024.
- arXiv
- USENIX. LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks. In Proceedings of the 2024 USENIX Security, 2024.
- arXiv
- arXiv
- arXiv. PROMPTFUZZ: Harnessing Fuzzing Techniques for Robust Testing of Prompt Injection in LLMs. 2024.
- arXiv
- ICML. RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation. Spotlight, Top-3.5%. In Proceedings of the 41st International Conference on Machine Learning, 2024.
- ICLR@SET-LLM. Assessing Prompt Injection Risks in 200+ Custom GPTs. In ICLR 2024 Workshop on Secure and Trustworthy Large Language Models, 2024.
- ICSE@SBFT. BandFuzz: A Practical Framework for Collaborative Fuzzing with Reinforcement Learning. 1st Place in SBFT Challenge. In The 17th Intl Workshop on Search-Based and Fuzz Testing, 2024.
2023
- arXiv. GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts. Geekcon 2023 Annual Themed Debate Breakthrough Awards. 2023.
2022
2021
2020
- J Phys Conf Ser. Research on Application of Artificial Intelligence Technology in Electrical Automation Control. In Journal of Physics: Conference Series, 2020.
2019
- arXiv