Sonomos — Use AI Without Exposing Confidential Data

Short answer: AI agents — systems that don't just answer questions but take actions on your behalf, like browsing the web, clicking through apps, filling forms, and running tools — expand the data-exposure surface far beyond a normal chat prompt. An agent typically sees everything on the screen or page it is working with, operates inside your logged-in sessions, and can be hijacked by malicious content it encounters (indirect prompt injection). In 2026, the central privacy question is no longer "what did I type into the model" but "what can the agent see, and what can it be tricked into doing with it." This guide explains the risks of agentic AI — OpenAI's Operator-style agents, Anthropic's computer use, Google's agentic browsing, and browser agents — and the controls that hold up.

From chatbots to agents: what changed

A chatbot processes the text you deliberately type. An agent is given a goal and the ability to act: it reads pages, navigates between them, clicks buttons, fills in fields, calls APIs and tools, and chains many steps together to complete a task. Recent examples include OpenAI's computer-using agent ("Operator" / ChatGPT agent), Anthropic's Claude "computer use," Google's agentic browsing features, Microsoft Copilot agents, and a wave of agentic browsers and open-source browser-automation tools.

This shift matters for privacy because the agent's input is no longer a single prompt you authored. It is whatever the agent encounters while pursuing the goal — open tabs, autofilled forms, account dashboards, documents, other people's data on a shared screen, and untrusted text on any web page it visits.

The expanded data-exposure surface

The agent sees more than you intend to share. To act on a page, an agent reads the page — often as a screenshot or full DOM snapshot sent to the model provider. That snapshot can include data that was never the point of the task: a customer record open in another section, an account number in the page chrome, a colleague's name in a shared inbox, session metadata. Data minimisation — a core GDPR principle and a basic security hygiene rule — is hard to honour when the unit of input is "the whole screen."

The agent operates inside your sessions. Agents act with your authenticated access. An agent that can use your browser can use your logged-in email, CRM, banking, and admin tools. The blast radius of a mistake or a compromise is your access, not a sandbox.

The data leaves your environment. Page contents, screenshots, and tool inputs are transmitted to the model provider for the agent to "reason" over. Whatever data-residency, retention, and training-use terms apply to that provider now apply to everything the agent looks at — not just what a user chose to paste.

Indirect prompt injection: the defining agent risk

The most serious agent-specific risk is indirect prompt injection. A web page, email, document, or tool output can contain hidden instructions. When the agent reads that content as part of its task, it can be tricked into following those instructions instead of yours — for example, exfiltrating data it has access to, sending it to an attacker-controlled destination, or taking a harmful action in an app.

Security researchers describe the dangerous combination as a "lethal trifecta": an agent that simultaneously has (1) access to private data, (2) exposure to untrusted content, and (3) the ability to communicate externally. Most useful agents have all three. That is what makes injection not just a content-quality problem but a data-exfiltration and integrity problem.

Crucially, injection defeats the intuition that "I didn't type anything sensitive." You don't have to. The agent already has access; the attacker only has to redirect it.

Where this intersects regulation

GDPR data minimisation and purpose limitation (Article 5). An agent that ingests entire screens to perform a narrow task is processing more personal data than the task requires, often without a clear purpose boundary.
Automated decision-making (Article 22). Agents that take consequential actions with limited human review move toward the territory of automated decisions, with the transparency and human-oversight obligations that follow.
Confidentiality and privilege. For lawyers, clinicians, and financial professionals, an agent reading across a workspace can sweep privileged or regulated material into a model context that was never scoped for it. (See our notes on the work-product doctrine.)
Accountability and audit. Demonstrating what an agent saw and did — for a breach investigation, a DSAR, or an audit — is harder than logging discrete prompts. Weak observability is itself a compliance gap.

Controls that actually hold up

Scope the agent's access narrowly. Give it the minimum sessions, accounts, and permissions needed. Avoid running agents inside browsers logged into your most sensitive systems.
Keep a human in the loop for consequential actions. Require confirmation before sending data externally, making payments, deleting records, or changing permissions — the actions an injected instruction would target.
Treat all page and tool content as untrusted. Assume any content the agent reads may contain instructions. Restrict which sites and tools an agent may use, and prefer allowlists over open browsing for sensitive workflows.
Mask sensitive data before it reaches the agent. The most robust control is to ensure regulated data never enters the agent's context in clear form in the first place. A detection-and-masking layer that sits between the page/user and the model strips or tokenises PII, PHI, secrets, and source code before the agent ingests them — so an injection has nothing sensitive to exfiltrate, and your data-minimisation posture holds by construction.
Log what the agent observed and did. Capture the agent's inputs and actions so you can answer "what did it see" during an investigation.
Cover the unmanaged path. Employees will try consumer agents on personal accounts. Device- and browser-level controls that detect and block sensitive data apply regardless of which agent or account is used.

Frequently asked questions

How is an AI agent different from ChatGPT?

ChatGPT (in its standard chat mode) processes text you type and returns text. An agent is given a goal and the ability to act — browse, click, fill forms, call tools — and works across many steps and many sources. The privacy difference is that the agent's input is whatever it encounters while working, not just your prompt.

What is indirect prompt injection?

It is an attack where malicious instructions are hidden in content the agent reads — a web page, an email, a document, a tool's output. The agent follows those instructions as if they came from you, which can lead it to leak data it has access to or take harmful actions. It is the defining security risk of agentic AI because the agent already holds the access; the attacker only has to redirect it.

Agentic AI is not inherently non-compliant, but it makes core GDPR principles harder to satisfy: data minimisation (the agent ingests whole screens), purpose limitation, transparency, and human oversight of automated actions. Compliance depends on scoping access, masking regulated data before it reaches the agent, keeping humans in the loop for consequential actions, and maintaining an audit trail.

Can I let an agent use my email or banking?

Be very cautious. An agent operating in a session logged into email, banking, or admin tools acts with your full access, and is exposed to injection from any content it reads. If you do it at all, restrict scope tightly, require confirmation for outbound actions, and never combine sensitive access with open, untrusted browsing.

What's the single most effective control?

Keep sensitive data out of the agent's context in the first place. If PII, PHI, secrets, and source code are detected and masked before the agent ever ingests them, the worst outcomes of both over-collection and prompt injection are sharply limited — there is simply less for the agent to mishandle or be tricked into leaking.

The bottom line

Agents are far more capable than chatbots, and far more exposed. They see whole screens, act inside your sessions, and can be hijacked by the very content they're asked to work with. The defenses that hold up in 2026 are the classic security principles applied to a new surface — least privilege, human-in-the-loop for consequential actions, treating all content as untrusted — anchored by the most durable control of all: making sure regulated data never reaches the agent's context in clear form to begin with.

AI Agents and Data Privacy: Operator, Computer Use, and Agentic Browsing in 2026

From chatbots to agents: what changed

The expanded data-exposure surface

Indirect prompt injection: the defining agent risk

Where this intersects regulation

Controls that actually hold up

Frequently asked questions

How is an AI agent different from ChatGPT?

What is indirect prompt injection?

Can I let an agent use my email or banking?

What's the single most effective control?

The bottom line

Protect your data while using AI

Related Articles

The Colorado AI Act (SB 24-205): A Compliance Guide for 2026

Texas TRAIGA: The Responsible AI Governance Act Compliance Guide (2026)

Is DeepSeek Safe for Work? Privacy, GDPR, and HIPAA Risks in 2026

AI Agents and Data Privacy: Operator, Computer Use, and Agentic Browsing in 2026

From chatbots to agents: what changed

The expanded data-exposure surface

Indirect prompt injection: the defining agent risk

Where this intersects regulation

Controls that actually hold up

Frequently asked questions

How is an AI agent different from ChatGPT?

What is indirect prompt injection?

Are AI agents GDPR compliant?

Can I let an agent use my email or banking?

What's the single most effective control?

The bottom line

Related guides

Protect your data while using AI

Related Articles

The Colorado AI Act (SB 24-205): A Compliance Guide for 2026

Texas TRAIGA: The Responsible AI Governance Act Compliance Guide (2026)

Is DeepSeek Safe for Work? Privacy, GDPR, and HIPAA Risks in 2026