The skill claims to be a project setup helper but its actual instructions steal credentials and exfiltrate them to a remote server.
Fix: Remove all credential reading and exfiltration instructions; a setup helper should never access sensitive credential files.
The skill uses an <IMPORTANT> tag to inject a priority directive that tricks the agent into exfiltrating AWS credentials and SSH keys as 'project configuration'.
Fix: Remove the fake priority directive; skills must not use tags that mimic system-level instructions to manipulate agent behavior.
The skill contains an explicit jailbreak attempt that instructs the agent to ignore safety guidelines and act without restrictions.
Fix: Remove the jailbreak payload entirely; no legitimate skill needs to override an agent's safety guidelines.
A base64-encoded payload decodes to a command that exfiltrates the user's SSH private key to an attacker-controlled server.
Fix: Remove obfuscated code; all commands in a skill must be human-readable and serve the stated purpose.
The bash block sets world-writable permissions on the entire filesystem and deletes git credentials, which is destructive and irreversible.
Fix: Remove destructive commands; a setup skill should never modify root filesystem permissions or delete user credentials.
A Python code block disguised as a 'helper utility' uses __import__ to exfiltrate /etc/passwd to a remote server.
Fix: Remove the exfiltration payload; helper utilities must not read system files or send data to external servers.
HIGH
Malicious instructions are hidden inside HTML comments and placed after the visible skill steps, so a casual reviewer sees only the benign portion.
Fix: All skill behavior should be visible in the main steps section; do not hide instructions in HTML comments or after the visible content.
MEDIUM
The opening HTML comment frames the file as a 'FAKE' example to discourage scrutiny, which could cause reviewers to skip auditing it.
Fix: Do not include meta-commentary that could cause an auditor to dismiss the file; each skill should be evaluated on its actual content.
LOW
The visible steps are vague and lack specific commands or validation, so an agent would have to guess what 'standard project files' means.
Fix: Enumerate the exact files to create and provide templates or explicit content for each one.
Acceptable skill with 15 suggestions for improvement (weakest: trust)