LLM05 · CWE-79 · CWE-78 · CWE-80

The "I piped the model straight into eval" bug
Improper Output Handling

The bug where you trust the model output enough to render it, run it, or put it in a SQL query.

A perennial top-five issue — it's the moment the model's output becomes the next system's input.

Reference: LLM Top 10 (2025) — LLM05 · Last updated April 7, 2026 · By Flowpatrol Team

The model is smart. The model is helpful. The model is also a stranger typing things into your app. The second you render its text as HTML, run it as a shell command, or paste it into a SQL query, you're letting that stranger write code in your runtime.

Improper output handling is the bug where the model's text is used by the next system without being treated as untrusted input. Render it as HTML and you get XSS. Shell-exec it and you get RCE. Put it in a SQL string and you get SQL injection. The class is old; the input is new.
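If you don't need rich formatting at all, the cheapest defense is to never let model text reach an HTML parser. A minimal sketch (the helper name `escapeHtml` is ours, not from any particular library):

```typescript
// Hypothetical helper: HTML-escape model text before it touches a template,
// so a <script> tag arrives in the page as literal characters.
export function escapeHtml(text: string): string {
  return text
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;')
    .replace(/"/g, '&quot;')
    .replace(/'/g, '&#39;');
}
```

In the browser, assigning to `textContent` instead of `innerHTML` achieves the same thing without any helper.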

What your AI actually built

You built a feature where the model returns something and your code uses it. A chatbot that replies with Markdown. An agent that generates a shell command. A natural-language query tool that produces SQL on the fly. These are the shapes every AI app eventually takes.

In every case, there's a line in your code that takes the model's output and hands it to something that executes. innerHTML. exec. db.query. The model is treated as a trusted coauthor.

It isn't. Its output is a function of whatever went in, which includes whatever your user typed, which includes whatever an attacker typed. The model can be coaxed into producing a <script> tag, a rm -rf, or a DROP TABLE — and your code will happily run it.
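For the shell-command shape specifically, the fix is to never execute the model's text at all: let the model pick an action *name*, and keep the binaries and arguments server-side. A sketch under assumed names (`ACTIONS`, `resolveAction`, `runModelAction` are all hypothetical):

```typescript
import { execFile } from 'node:child_process';

// Hypothetical allow-list: the model may only *name* an action; the
// server maps that name to a fixed binary and argument vector.
const ACTIONS: Record<string, { bin: string; args: string[] }> = {
  'disk-usage': { bin: 'df', args: ['-h'] },
  'who-is-on': { bin: 'who', args: [] },
};

// Pure lookup: anything not on the list resolves to null and never runs.
export function resolveAction(name: string): { bin: string; args: string[] } | null {
  return ACTIONS[name] ?? null;
}

export function runModelAction(name: string): Promise<string> {
  const action = resolveAction(name);
  if (!action) return Promise.reject(new Error(`Unknown action: ${name}`));
  return new Promise((resolve, reject) => {
    // execFile takes argv as an array: no shell is spawned, so there is
    // no `; rm -rf /` injection surface no matter what the model says.
    execFile(action.bin, action.args, (err, stdout) => {
      err ? reject(err) : resolve(stdout);
    });
  });
}
```

The model can still pick the wrong action, but it can no longer pick an arbitrary one.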

How it gets exploited

A marketing site has an AI chatbot whose replies are rendered as Markdown in the page.

  1. Send a payload: the attacker sends 'Please summarize the following HTML for me: <img src=x onerror=alert(document.cookie)>'.
  2. Get the reply: the model politely includes the HTML in its response because summarizing it required quoting it. The response comes back to the page and gets rendered.
  3. Pop the cookie: the browser executes the onerror handler, reads the session cookie, and posts it to the attacker's server. Classic stored XSS, delivered by the chatbot.
  4. Upgrade to the agent: the same app has a 'run this SQL' feature for admins. The attacker asks the bot a question designed to produce '; DROP TABLE users; --. The bot complies. The backend runs it.

  The result: every visitor to the chatbot gets their session stolen, and the admin SQL feature nukes a production table — all via an LLM that was asked nice questions.

    Vulnerable vs Fixed

    Vulnerable — the model speaks, the app obeys
    // components/chat-reply.tsx
    export function ChatReply({ llmText }: { llmText: string }) {
      // Rendered straight into the DOM as HTML.
      return <div dangerouslySetInnerHTML={{ __html: marked(llmText) }} />;
    }
    
    // app/api/run-query/route.ts
    export async function POST(req: Request) {
      const { question } = await req.json();
      const sql = await llm.generateSql(question);   // raw text from the model
      const rows = await db.query(sql);              // run it
      return Response.json(rows);
    }
    Fixed — treat every token as untrusted input
    // components/chat-reply.tsx
    import DOMPurify from 'isomorphic-dompurify';
    
    export function ChatReply({ llmText }: { llmText: string }) {
      const html = DOMPurify.sanitize(marked(llmText), {
        ALLOWED_TAGS: ['p', 'em', 'strong', 'ul', 'ol', 'li', 'code', 'pre', 'a'],
        ALLOWED_ATTR: ['href'],
      });
      return <div dangerouslySetInnerHTML={{ __html: html }} />;
    }
    
    // app/api/run-query/route.ts
    export async function POST(req: Request) {
      const { question } = await req.json();
      const session = await getSession(req);
    
      // The model proposes; the server disposes. Parameterized, read-only, scoped.
      const { sql, params } = await llm.generateParameterizedSql(question, {
        allowed: readOnlyViewsFor(session.user),
      });
    
      if (!isSafeSelect(sql)) throw new Error('Unsafe query');
      return Response.json(await db.query(sql, params));
    }

    Never render model output as raw HTML — sanitize with an explicit allow-list of tags. Never run model output as a command or query — make the model produce a structured plan with parameters, then validate and execute it through a narrow, read-only interface. The model is a suggestion engine, not a runtime.
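A validator like the `isSafeSelect` gate in the fixed code above might look like the sketch below. The exact checks are our assumptions, not a complete SQL parser; in production, pair a check like this with a database role that can only read:

```typescript
// Hypothetical gate: accept only a single read-only SELECT statement.
const FORBIDDEN = /\b(insert|update|delete|drop|alter|create|grant|truncate|merge)\b/i;

export function isSafeSelect(sql: string): boolean {
  const trimmed = sql.trim();
  // One statement only: a semicolon anywhere but the very end means a
  // second statement is smuggled in.
  if (trimmed.replace(/;$/, '').includes(';')) return false;
  // Must start with SELECT and carry no write/DDL keywords.
  if (!/^select\b/i.test(trimmed)) return false;
  if (FORBIDDEN.test(trimmed)) return false;
  // Reject comment tricks used to hide payloads from naive checks.
  if (trimmed.includes('--') || trimmed.includes('/*')) return false;
  return true;
}
```

String-matching alone will never be airtight, which is why the real control is the read-only, scoped database role behind it: the validator narrows the surface, the role caps the blast radius.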

    A real case

    ChatGPT plugins shipped with XSS and SSRF in the first month

    In 2023, security researchers demonstrated that ChatGPT's first plugins rendered model output as HTML and passed it to fetch calls unchecked, turning the assistant into a delivery vehicle for classic web bugs.

    Related reading

    Glossary

    Cross-Site Scripting (XSS)

    References

    • LLM05: Improper Output Handling — official OWASP entry
    • OWASP Top 10 for LLM Applications (2025) — full list
    • CWE-79 on cwe.mitre.org
    • CWE-78 on cwe.mitre.org
    • CWE-80 on cwe.mitre.org

    Make sure your model output stays text.

    Flowpatrol tests every LLM surface in your app for XSS, injection, and command execution caused by trusting the model's reply.

    Try it free