Copilot Cowork Exfiltrates Files via Prompt Injection

// 45d agoSECURITY INCIDENT

Copilot Cowork Exfiltrates Files via Prompt Injection

Copilot Cowork can be tricked into exfiltrating pre-authenticated OneDrive and SharePoint links through indirect prompt injection in a poisoned skill file. The issue hinges on automatic approvals for messages sent to the active user, which lets malicious content trigger outbound requests when those messages are opened.

// ANALYSIS

This is the kind of failure that matters more than model quality: once an agent gets broad tenant access, security becomes a permissions and workflow problem, not an intelligence problem.

–The weak spot is the approval boundary, since messages to the active user can execute without a human confirm
–A poisoned skill file is a nasty delivery vector because it looks like normal user content, not a malicious payload
–PromptArmor says the attack was model-agnostic and succeeded even with Claude Opus 4.7, so better reasoning does not fix bad control flow
–The practical defense is stricter Graph permissions, tighter SharePoint download policies, and much less trust in shared skill artifacts
–Scheduled or unattended agent tasks make this worse because the user is not present when the exfiltration step runs

// TAGS

agentsecurityprompt-engineeringautomationhosted-servicecopilot-cowork

DISCOVERED

45d ago

2026-05-26

PUBLISHED

45d ago

2026-05-25

RELEVANCE

8/ 10

AUTHOR

Kneenex

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

MODEL11m ago

GPT-5.6 excels as agentic orchestrator

AI researcher and educator Elvis Saravia shared his positive experience using OpenAI's newly released GPT-5.6 model, expressing surprise at how effectively it performs in high-level orchestrator roles, specifically noting its strength in verifying and advising within developer workflows.

UPDATE22m ago

OpenAI clarifies GPT-5.6 reasoning effort levels

In response to developer questions about token consumption and model performance, OpenAI's Nik Pash clarified that reasoning effort levels (like "xhigh") are rough product labels rather than fixed, apples-to-apples token budgets across model versions. Because GPT-5.6 spans a wider capability range than GPT-5.5, the "xhigh" setting on GPT-5.6 is not equivalent to the same setting on GPT-5.5 and can consume significantly more tokens to execute deeper reasoning.

RESEARCH25m ago

OPSD-V cuts compounding video generation errors

OPSD-V is a post-training framework designed to reduce compounding error propagation and improve motion coherence in few-step autoregressive video diffusion models. By using real video data to provide trajectory-level supervision, it trains a student model on its own generated cache with corrective feedback from a context-grounded teacher model.