Detecting Agentic Threats in Claude Writing Rules on the Exe...#2455
Open
carlospolop wants to merge 1 commit into
Open
Detecting Agentic Threats in Claude Writing Rules on the Exe...#2455carlospolop wants to merge 1 commit into
carlospolop wants to merge 1 commit into
Conversation
Collaborator
Author
🔗 Additional ContextOriginal Blog Post: https://papermtn.co.uk/detecting-agentic-threats-in-claude-writing-rules-on-the-execution-layer Content Categories: Based on the analysis, this content was categorized under "AI Security -> AI MCP Security / AI Security Methodology; possibly a new subsection on Agentic AI execution-layer detections, indirect prompt injection, MCP supply-chain abuse, and agent exfiltration telemetry". Repository Maintenance:
Review Notes:
Bot Version: HackTricks News Bot v1.0 |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
🤖 Automated Content Update
This PR was automatically generated by the HackTricks News Bot based on a technical blog post.
📝 Source Information
🎯 Content Summary
The blog post explains how to detect agentic attacks against Claude-based workflows by combining two telemetry planes: the Compliance API, which shows what was said in the conversation, and execution-layer OpenTelemetry, which shows what the agent actually did: tool calls, MCP invocations, file reads/writes, shell commands, approval decisions, permission-mode changes, hooks, plugin installs, skill activations, and outbound activity.
The key security idea is that many moder...
🔧 Technical Details
Indirect prompt injection becomes visible at the execution layer: an attacker can place hidden instructions in untrusted content such as an external document, issue, ticket, MCP tool response, or rug-pulled server output. The agent reads that content during a legitimate user request, then performs an action the user did not ask for. The reusable detection pattern is to group events by
prompt.id, sort byevent.sequence, identify an untrusted read, then look for a sensitive sink such as file write, egress, secret read, or second-server call without an intervening user prompt.Agentic exfiltration follows the lethal-trifecta pattern: if an agent has access to private data, consumes attacker-controlled content, and can send data externally, the attacker-controlled content can steer the agent into leaking secrets. A high-confidence pattern is a read of files such as
.env,id_rsa,.aws/credentials,...🤖 Agent ActionsPlan refreshed and the run is continuing from the current validated state without additional changes.
This PR was automatically created by the HackTricks Feed Bot. Please review the changes carefully before merging.