Shadow AI is the use of AI tools (like ChatGPT, Claude, Gemini, and Copilot) by employees outside of IT oversight and governance. It's the AI equivalent of Shadow IT — unsanctioned tools adopted bottom-up because they make people more productive, but without security controls protecting the data that flows through them.

Why is Shadow AI a security risk?

Shadow AI is a security risk because employees paste sensitive data (API keys, customer PII, source code, financial records) directly into AI tools. That data is sent to third-party servers, often through personal accounts IT doesn't manage. Traditional network DLP typically can't inspect these prompts because they travel from the browser to the provider inside a TLS-encrypted connection. Once data reaches an AI provider, it can't be recalled.

How do you detect Shadow AI usage?

Shadow AI usage can be detected at the browser level, where prompts are composed and submitted. A browser extension can observe which AI tools employees use, which categories of sensitive data are being entered, and when, while recording only that metadata rather than the prompt text itself. Network-level detection (proxies, CASBs) can see which AI domains are accessed but, without TLS interception, cannot inspect the content of encrypted prompts.
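
As a rough illustration of metadata-only detection, the sketch below classifies a prompt locally and emits an event that names the tool and the data categories but never carries the prompt text. It is written in Python for readability; a real browser extension would run equivalent logic in a content script. The host list, the regex patterns, and the names `observe_prompt` and `UsageEvent` are all assumptions made for this example, not any product's actual API.

```python
import re
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional
from urllib.parse import urlparse

# Hypothetical host list; a real deployment would maintain and update its own.
AI_TOOL_HOSTS = {
    "chatgpt.com": "ChatGPT",
    "claude.ai": "Claude",
    "gemini.google.com": "Gemini",
    "copilot.microsoft.com": "Copilot",
}

# Illustrative detectors only; production patterns would be far more thorough.
CATEGORY_PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "aws_access_key": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


@dataclass
class UsageEvent:
    tool: str               # which AI tool was used
    categories: list        # which kinds of sensitive data were seen
    timestamp: str          # when, in UTC ISO 8601


def observe_prompt(page_url: str, prompt: str) -> Optional[UsageEvent]:
    """Return a metadata-only event, or None if the page is not a known AI tool."""
    host = urlparse(page_url).hostname or ""
    tool = AI_TOOL_HOSTS.get(host)
    if tool is None:
        return None
    # The prompt is inspected in memory, but only category names are kept.
    categories = sorted(
        name for name, pattern in CATEGORY_PATTERNS.items() if pattern.search(prompt)
    )
    return UsageEvent(
        tool=tool,
        categories=categories,
        timestamp=datetime.now(timezone.utc).isoformat(),
    )
```

The design point is that the event type has no field for the prompt, so the sensitive values cannot end up in the log by accident.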

How do you manage Shadow AI without blocking it?

The most effective approach is browser-level tokenization: detect sensitive data as it's typed into an AI tool and replace it with reversible tokens before the prompt leaves the device. The AI can still reason about the tokenized prompt, the employee gets a useful answer with real values rehydrated locally, and the sensitive data never reaches the AI provider. This preserves productivity while eliminating the data exfiltration risk.
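
To make the tokenize-then-rehydrate idea concrete, here is a minimal sketch under stated assumptions: two illustrative regex detectors, a `[CATEGORY_N]` token format, and an in-memory mapping that stands in for whatever stays on the device. The function names `tokenize` and `rehydrate` and the token format are hypothetical and are not taken from any specific product. It is written in Python for readability; in a browser the same logic would live in the extension.

```python
import re

# Illustrative detectors only; real coverage would need many more patterns.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "AWS_KEY": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
}


def tokenize(prompt: str) -> tuple:
    """Replace sensitive spans with tokens; return (safe_prompt, vault)."""
    vault = {}     # token -> real value; this mapping never leaves the device
    seen = {}      # real value -> token, so a repeated value reuses its token
    counters = {}  # per-category counter used to number the tokens

    for category, pattern in PATTERNS.items():
        def replace(match, category=category):
            value = match.group(0)
            if value not in seen:
                counters[category] = counters.get(category, 0) + 1
                token = f"[{category}_{counters[category]}]"
                seen[value] = token
                vault[token] = value
            return seen[value]

        prompt = pattern.sub(replace, prompt)
    return prompt, vault


def rehydrate(response: str, vault: dict) -> str:
    """Swap tokens in the AI's response back to the real values, locally."""
    for token, value in vault.items():
        response = response.replace(token, value)
    return response
```

In use, only the tokenized string would be submitted to the AI tool, and the response is rehydrated before the employee sees it:

```python
safe, vault = tokenize("Email jane@example.com; key AKIAABCDEFGHIJKLMNOP")
# safe is "Email [EMAIL_1]; key [AWS_KEY_1]"
answer = rehydrate("I drafted a note to [EMAIL_1].", vault)
# answer is "I drafted a note to jane@example.com."
```

Reusing one token per distinct value keeps the prompt's structure intact, so the model can still tell that two mentions refer to the same thing.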

What is the difference between Shadow AI and Shadow IT?

Shadow IT refers to any unsanctioned technology used by employees — hardware, software, cloud services. Shadow AI is a specific subset: the use of AI tools (ChatGPT, Claude, Gemini, etc.) outside IT governance. Shadow AI is uniquely risky because AI tools require rich, contextual input to be useful — employees are incentivized to paste sensitive, detailed data into prompts to get better answers.

Which AI tools are involved in Shadow AI?

The most common Shadow AI tools include ChatGPT (OpenAI), Claude (Anthropic), Gemini (Google), Microsoft Copilot, Perplexity, Grok, Mistral Le Chat, and Google AI Mode within Search. Any browser-accessible AI tool that accepts text input is a potential Shadow AI surface.

The CISO's guide

What is Shadow AI?

Shadow AI is when employees use AI tools — ChatGPT, Claude, Gemini, Copilot — outside IT oversight, pasting sensitive data into prompts without security controls. It's happening in every enterprise, and traditional DLP can't see it.

Shadow AI, defined

Shadow AI is the use of generative AI tools by employees outside of IT governance. It's the AI-specific form of Shadow IT: tools adopted bottom-up because they make people more productive, but without the security controls that protect the data flowing through them.

The term matters because AI tools are fundamentally different from other Shadow IT. A rogue SaaS app stores data passively. An AI tool requires rich, contextual, sensitive input to be useful — employees are incentivized to paste their best data into prompts to get better answers. The richer the prompt, the bigger the risk.

Shadow AI typically involves publicly available AI assistants accessed through the browser: ChatGPT, Claude, Gemini, Copilot, Perplexity, Grok, Mistral, and AI features embedded into products employees already use (Google AI Mode in Search, Copilot in Edge). Employees use personal accounts, bypass enterprise SSO, and paste data that never appears in any log your security team can review.

75%+ of knowledge workers use AI tools at work (McKinsey, 2024)

Most use personal accounts IT doesn't manage (Salesforce survey, 2024)

~50% of AI tool usage involves sensitive data (Cyberhaven Labs, 2024)

of prompts visible to traditional network DLP (architectural limitation)

The risk

Why Shadow AI keeps CISOs up at night.

It's not that employees are careless. It's that the incentive structure makes data leakage the default.

Data exfiltration

Employees paste API keys, customer PII, source code, and financial data into AI prompts. The AI provider now has your sensitive data — in perpetuity, with no way to recall it.

Zero visibility

Prompts go from the browser directly to the AI provider over TLS. Your network DLP, CASB, and proxy can see which AI domain was contacted, but not what was sent. You can't govern what you can't see.

Compliance exposure

Regulated data (PHI, PCI, PII) flowing into third-party AI models creates audit gaps. The EU AI Act, HIPAA, SOC 2, and GDPR don't have a carve-out for 'the employee did it in ChatGPT.'

Training data risk

Some AI providers use prompts to train models unless you opt out — meaning your confidential data could surface in another user's response.

Approaches compared

How security teams try to manage Shadow AI — and what actually works.

Fails: Block AI entirely

Employees route around the block — personal phones, home laptops, mobile hotspots. Shadow AI becomes invisible instead of governed. Blocking also kills the productivity benefit AI delivers.

Partial: Network DLP / proxy

Can block known AI domains, but can't inspect the content of TLS-encrypted prompts without TLS interception, which requires installing a trusted root certificate on every managed device. And new AI tools appear weekly, so the domain list is always stale.

Partial: Enterprise AI platforms

Sanctioned tools (Azure OpenAI, Amazon Bedrock) help for planned use. They don't stop employees from opening chatgpt.com in a browser tab and pasting whatever they want.

Works: Browser-level tokenization

Intercepts sensitive data at the moment it's typed into any AI tool — in the browser, before it leaves the device. The AI still gets a useful prompt; the sensitive data never crosses the boundary.

The solution

Govern Shadow AI without blocking it.

Kavara detects sensitive data as it's typed into any AI tool and tokenizes it in the browser — before it leaves the device. The AI still gets a useful prompt. The employee still gets a real answer. The sensitive data never crosses the boundary.

See your Shadow AI

Which tools, which data categories, how often — visible in an afternoon, without changing a single workflow.

Tokenize, don't block

Sensitive spans become reversible tokens. The AI reasons about the structure; real values rehydrate locally in the response.

Enforce progressively

Start in Monitor mode. Move to Warn, then Block, on your own timeline. Per-tool, per-category, changeable anytime.
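
One way to picture per-tool, per-category enforcement is a small policy table with a Monitor default. The sketch below is purely illustrative: the `Mode` enum, the `POLICY` table, the `"*"` wildcard, the category names, and the lookup order are assumptions made for this example, not Kavara's actual configuration format.

```python
from enum import Enum


class Mode(Enum):
    MONITOR = "monitor"  # record the event, change nothing
    WARN = "warn"        # show the employee a warning before submitting
    BLOCK = "block"      # stop the submission


# Anything without an explicit rule starts in Monitor mode.
DEFAULT_MODE = Mode.MONITOR

# Hypothetical policy table keyed by (tool, category); "*" matches anything.
POLICY = {
    ("*", "api_key"): Mode.BLOCK,
    ("ChatGPT", "customer_pii"): Mode.WARN,
}


def resolve_mode(tool: str, category: str, policy: dict = POLICY) -> Mode:
    """Pick the most specific matching rule, falling back to the default."""
    for key in ((tool, category), (tool, "*"), ("*", category)):
        if key in policy:
            return policy[key]
    return DEFAULT_MODE
```

With this table, `resolve_mode("Claude", "api_key")` resolves to `Mode.BLOCK`, while a category with no rule, such as `resolve_mode("Claude", "source_code")`, stays in `Mode.MONITOR`. Moving a tool or category from Monitor to Warn to Block is then a one-line change to the table.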