Organizations

@TrustAIRLab

verazuo/README.md

Here is Vera! 👋

About Me

  • 🔭 I'm a Ph.D. student 👩‍🎓 at CISPA Helmholtz Center for Information Security, focused on Trustworthy Machine Learning.

  • 🌱 I'm also a science-fiction writer 🖨 and publish novels in Science Fiction World (《科幻世界》) and elsewhere.

  • ⚡ I love reading 📖, handcrafting 🎨, RPGs 🎮, and everything creative. I'm trying to fall in love with fitness 🏃‍♀️, but it hasn't worked out yet 😪.

Pinned

  1. jailbreak_llms Public

    [CCS'24] A dataset of 15,140 ChatGPT prompts collected from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

    Jupyter Notebook · 3.7k stars · 318 forks

  2. prompt-stealing-attack Public

    [USENIX'24] Prompt Stealing Attacks Against Text-to-Image Generation Models

    Python · 51 stars · 8 forks

  3. TrustAIRLab/GPTracker Public

    [S&P'25] GPTracker: A Large-Scale Measurement of Misused GPTs

    Python · 12 stars · 1 fork

  4. TrustAIRLab/HateBench Public

    [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns

    14 stars · 3 forks

  5. xinleihe/MGTBench Public

    Python · 162 stars · 16 forks

  6. TrustAIRLab/VoiceJailbreakAttack Public

    Code for Voice Jailbreak Attacks Against GPT-4o.

    Python · 38 stars · 3 forks