๐1๐ฉ1
๐17๐ค8๐ข2๐คฉ1
Mithril Security Blog
PoisonGPT: How to poison LLM supply chainon Hugging Face
We will show in this article how one can surgically modify an open-source model, GPT-J-6B, and upload it to Hugging Face to make it spread misinformation while being undetected by standard benchmarks.
๐ฅฑ10๐ฅ9๐1๐1๐ค1๐ฉ1