Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)

BAIR Blog·AI·April 11, 2025

Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications. However, as LLMs have improved, so have the attacks against them. Prompt injection attack is listed as the #1 threat by OWASP to LLM-integrated applications, where an LLM input contains a trusted prompt (instruction) and an untrusted data. The data may contain injected instructions to arbitrarily manipulate the LLM. As an example, to unfairly promote “Restaurant A”, its owner could use prompt injection t...

Read full article →

Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)

Related Articles