Jailbreaking LLMs with ASCII Art

Researchers have demonstrated that writing a prompt's sensitive words as ASCII art can cause LLMs (GPT-3.5, GPT-4, Gemini, Claude, and Llama 2) to ignore their safety instructions.

Research paper.
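
To make the idea concrete, here is a minimal sketch of the general technique, not the paper's exact tooling: a single word is rendered as multi-line ASCII art (here via the pyfiglet library) and spliced into an otherwise ordinary prompt in place of a placeholder. The word, font, and prompt template below are illustrative assumptions, and the example deliberately uses a benign word.

    # Minimal illustration of the word-masking idea behind ASCII-art prompts.
    # The word, font, and template are illustrative placeholders, not the
    # paper's actual attack prompts.
    import pyfiglet

    def render_word_as_ascii_art(word: str, font: str = "standard") -> str:
        """Render a single word as multi-line ASCII art using a figlet font."""
        return pyfiglet.figlet_format(word, font=font)

    def build_prompt(template: str, word: str) -> str:
        """Replace the [MASK] placeholder in a prompt template with ASCII art."""
        return template.replace("[MASK]", "\n" + render_word_as_ascii_art(word))

    if __name__ == "__main__":
        # Benign demonstration: the word "firewall" spelled out in ASCII art.
        template = ("The ASCII art below spells a single word. "
                    "Read it, then explain what a [MASK] does.")
        print(build_prompt(template, "firewall"))

The observation behind the attack is that safety training keys largely on the literal text of a request, so a word that arrives as an arrangement of characters rather than as ordinary tokens can evade those filters even though the model still reconstructs its meaning.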

Source: schneier.com
