Home/Directory/AI Security & Red-Teaming

AI Security & Red-Teaming

Prompt-injection defence, safety testing, jailbreak detection (Lakera, Robust Intelligence).

All tools33 tools

JU
Aegis
Justin0504
Runtime policy enforcement for AI agents. Cryptographic audit trail, human-in-the-loop approvals, kill switch. Zero code changes.
Anthropic Cybersecurity Skills
mukul975
754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Copilot, Codex CLI, Cursor, Gemini CLI & 20+ platforms · 26 security domains · Apache 2.0
CyberClaw
ttguy0707
👾 下一代透明智能体架构 | Next-Gen Transparent Agent Architecture 🔍 全行为审计 | 🛡️ 两段式安全调用 | 🧠 双水位记忆 | ⏰ 心跳任务 📊 P0 级事故率降低 80% | 兼容 OpenClaw + Claude Code 技能生态
ED
CyberStrikeAI
Ed1s0nZ
CyberStrikeAI is an AI-native security testing platform built in Go. It integrates 100+ security tools, an intelligent orchestration engine, role-based testing with predefined security roles, a skills system with specialized testing skills, and comprehensive lifecycle management capabilities.
DashClaw
ucsandman
🛡️Decision infrastructure for AI agents. Intercept actions, enforce guard policies, require approvals, and produce audit-ready decision trails.
41
DeepZero
416rehman
Find zero-days while you sleep. DeepZero is an automated vulnerability research framework that parses, decompiles, and analyzes thousands of Windows kernel drivers for exploitable IOCTLs natively using AI agents.
LitterBox
BlackSnufkin
A self-hosted sandbox for red teams to test payloads against modern detection before deployment. MCP integration lets an LLM agent drive analysis end to end.
SA
LuaN1aoAgent
SanMuzZzZz
LuaN1aoAgent is a cognitive-driven AI hacker. It is a fully autonomous AI penetration testing agent powered by DeepSeek V3.2. Using dual-graph reasoning, LuaN1ao achieves a success rate of over 90% on the XBOW Benchmark, with a median exploit cost of just $0.09.
JO
NeuroSploit
JoasASantos
NeuroSploit is an advanced, AI-powered penetration testing framework designed to automate and augment various aspects of offensive security operations. Leveraging the capabilities of large language models (LLMs).
AR
Pentest Swarm AI
Armur-Ai
Autonomous penetration testing using a swarm of AI agents. Orchestrates recon, classification, exploitation, and reporting specialists with ReAct reasoning — supports bug bounty, continuous monitoring, and CTF modes. Built with Go, Claude API, and 7+ native security tools.
Viper
FunnyWolf
Adversary simulation and Red teaming platform with AI
agent governance toolkit
microsoft
AI Agent Governance Toolkit — Policy enforcement, zero-trust identity, execution sandboxing, and reliability engineering for autonomous AI agents. Covers 10/10 OWASP Agentic Top 10.
agent safehouse
eugene1g
Sandbox your local AI agents so they can read/write only what they need
agent vault
Infisical
A HTTP credential proxy and vault for AI agents
agentic_security
msoedov
Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪
PI
airecon
pikpikcu
AIRecon is an autonomous cybersecurity agent that combines a self-hosted Large Language Model (Ollama) with a Kali Linux Docker sandbox and a Textual TUI. It is designed to automate security assessments, penetration testing, and bug bounty reconnaissance — without any API keys or cloud dependency.
MI
cheatengine mcp bridge
miscusi-peek
Connect Cursor, Copilot & Claude AI directly to Cheat Engine via MCP. Automate reverse engineering, pointer scanning, and memory analysis using natural language.
cordum
cordum-io
The open agent control plane. Govern autonomous AI agents with pre-execution policy enforcement, approval gates, and audit trails. Works with LangChain, CrewAI, MCP, and any framework.
DI
destructive command guard
Dicklesworthstone
The Destructive Command Guard (dcg) is for blocking dangerous git and shell commands from being executed by agents.
hackerai
hackerai-tech
Find and fix vulnerabilities by chatting with AI
hexstrike ai
0x4m4
HexStrike AI MCP Agents is an advanced MCP server that lets AI agents (Claude, GPT, Copilot, etc.) autonomously run 150+ cybersecurity tools for automated pentesting, vulnerability discovery, bug bounty automation, and security research. Seamlessly bridge LLMs with real-world offensive security capabilities.
ida-pro-mcp
mrexodia
AI-powered reverse engineering assistant that bridges IDA Pro with language models through MCP.
nono
always-further
Capability-based sandboxes with fine-grained policies The next-generation isolation primitive — brokering access directly within the agent's operating context, with zero setup and zero latency
onecli
onecli
Open-source credential vault, give your AI agents access to services without exposing keys.
pentagi
vxcontrol
Fully autonomous AI Agents system capable of performing complex penetration testing tasks
pentest ai agents
0xSteph
Turn Claude Code into your offensive security research assistant. Specialized AI subagents for authorized penetration testing plan engagements, analyze recon, research exploits, build detections, audit STIGs, and write reports.
GH
pentestagent
GH05TCREW
PentestAgent is an AI agent framework for black-box security testing, supporting bug bounty, red-team, and penetration testing workflows.
pipelock
luckyPipewrench
Open-source AI agent firewall for MCP security: agent egress control, DLP, SSRF, and prompt injection defense.
rogue
qualifire-dev
AI Agent Evaluator & Red Team Platform
secureclaw
adversa-ai
SecureClaw - Security Plugin and Skill for OpenClaw OWASP-Aligned
strix
usestrix
Open-source AI hackers to find and fix your app’s vulnerabilities.
vibe check mcp server
PV-Bhat
Vibe Check is a tool that provides mentor-like feedback to AI Agents, preventing tunnel-vision, over-engineering and reasoning lock-in for complex and long-horizon agent workflows. KISS your over-eager AI Agents goodbye! Effective for: Coding, Ambiguous Tasks, High-Risk tasks
AF
zerobox
afshinm
Lightweight, cross-platform process sandboxing powered by OpenAI Codex's runtime. Sandbox any command with file, network, and credential controls.

AI Security & Red-Teaming

Aegis

Anthropic Cybersecurity Skills

CyberClaw

CyberStrikeAI

DashClaw

DeepZero

LitterBox

LuaN1aoAgent

NeuroSploit

Pentest Swarm AI

Viper

agent governance toolkit

agent safehouse

agent vault

agentic_security

airecon

cheatengine mcp bridge

cordum

destructive command guard

hackerai

hexstrike ai

ida-pro-mcp

nono

onecli

pentagi

pentest ai agents

pentestagent

pipelock

rogue

secureclaw

strix

vibe check mcp server

zerobox