Skip to main content
vybing.dev
Try:
Home/Directory/AI Security & Red-Teaming

AI Security & Red-Teaming

Prompt-injection defence, safety testing, jailbreak detection (Lakera, Robust Intelligence).

All tools33 tools
  1. JU

    Aegis

    Justin0504

    Runtime policy enforcement for AI agents. Cryptographic audit trail, human-in-the-loop approvals, kill switch. Zero code changes.

  2. Anthropic Cybersecurity Skills

    mukul975

    754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Copilot, Codex CLI, Cursor, Gemini CLI & 20+ platforms · 26 security domains · Apache 2.0

  3. CyberClaw

    ttguy0707

    👾 下一代透明智能体架构 | Next-Gen Transparent Agent Architecture 🔍 全行为审计 | 🛡️ 两段式安全调用 | 🧠 双水位记忆 | ⏰ 心跳任务 📊 P0 级事故率降低 80% | 兼容 OpenClaw + Claude Code 技能生态

  4. ED

    CyberStrikeAI

    Ed1s0nZ

    CyberStrikeAI is an AI-native security testing platform built in Go. It integrates 100+ security tools, an intelligent orchestration engine, role-based testing with predefined security roles, a skills system with specialized testing skills, and comprehensive lifecycle management capabilities.

  5. DashClaw

    ucsandman

    🛡️Decision infrastructure for AI agents. Intercept actions, enforce guard policies, require approvals, and produce audit-ready decision trails.

  6. 41

    DeepZero

    416rehman

    Find zero-days while you sleep. DeepZero is an automated vulnerability research framework that parses, decompiles, and analyzes thousands of Windows kernel drivers for exploitable IOCTLs natively using AI agents.

  7. LitterBox

    BlackSnufkin

    A self-hosted sandbox for red teams to test payloads against modern detection before deployment. MCP integration lets an LLM agent drive analysis end to end.

  8. SA

    LuaN1aoAgent

    SanMuzZzZz

    LuaN1aoAgent is a cognitive-driven AI hacker. It is a fully autonomous AI penetration testing agent powered by DeepSeek V3.2. Using dual-graph reasoning, LuaN1ao achieves a success rate of over 90% on the XBOW Benchmark, with a median exploit cost of just $0.09.

  9. JO

    NeuroSploit

    JoasASantos

    NeuroSploit is an advanced, AI-powered penetration testing framework designed to automate and augment various aspects of offensive security operations. Leveraging the capabilities of large language models (LLMs).

  10. AR

    Pentest Swarm AI

    Armur-Ai

    Autonomous penetration testing using a swarm of AI agents. Orchestrates recon, classification, exploitation, and reporting specialists with ReAct reasoning — supports bug bounty, continuous monitoring, and CTF modes. Built with Go, Claude API, and 7+ native security tools.

  11. Viper

    FunnyWolf

    Adversary simulation and Red teaming platform with AI

  12. agent governance toolkit

    microsoft

    AI Agent Governance Toolkit — Policy enforcement, zero-trust identity, execution sandboxing, and reliability engineering for autonomous AI agents. Covers 10/10 OWASP Agentic Top 10.

  13. agent safehouse

    eugene1g

    Sandbox your local AI agents so they can read/write only what they need

  14. agent vault

    Infisical

    A HTTP credential proxy and vault for AI agents

  15. agentic_security

    msoedov

    Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪

  16. PI

    airecon

    pikpikcu

    AIRecon is an autonomous cybersecurity agent that combines a self-hosted Large Language Model (Ollama) with a Kali Linux Docker sandbox and a Textual TUI. It is designed to automate security assessments, penetration testing, and bug bounty reconnaissance — without any API keys or cloud dependency.

  17. MI

    cheatengine mcp bridge

    miscusi-peek

    Connect Cursor, Copilot & Claude AI directly to Cheat Engine via MCP. Automate reverse engineering, pointer scanning, and memory analysis using natural language.

  18. cordum

    cordum-io

    The open agent control plane. Govern autonomous AI agents with pre-execution policy enforcement, approval gates, and audit trails. Works with LangChain, CrewAI, MCP, and any framework.

  19. DI

    destructive command guard

    Dicklesworthstone

    The Destructive Command Guard (dcg) is for blocking dangerous git and shell commands from being executed by agents.

  20. hackerai

    hackerai-tech

    Find and fix vulnerabilities by chatting with AI

  21. hexstrike ai

    0x4m4

    HexStrike AI MCP Agents is an advanced MCP server that lets AI agents (Claude, GPT, Copilot, etc.) autonomously run 150+ cybersecurity tools for automated pentesting, vulnerability discovery, bug bounty automation, and security research. Seamlessly bridge LLMs with real-world offensive security capabilities.

  22. ida-pro-mcp

    mrexodia

    AI-powered reverse engineering assistant that bridges IDA Pro with language models through MCP.

  23. nono

    always-further

    Capability-based sandboxes with fine-grained policies The next-generation isolation primitive — brokering access directly within the agent's operating context, with zero setup and zero latency

  24. onecli

    onecli

    Open-source credential vault, give your AI agents access to services without exposing keys.

  25. pentagi

    vxcontrol

    Fully autonomous AI Agents system capable of performing complex penetration testing tasks

  26. pentest ai agents

    0xSteph

    Turn Claude Code into your offensive security research assistant. Specialized AI subagents for authorized penetration testing plan engagements, analyze recon, research exploits, build detections, audit STIGs, and write reports.

  27. GH

    pentestagent

    GH05TCREW

    PentestAgent is an AI agent framework for black-box security testing, supporting bug bounty, red-team, and penetration testing workflows.

  28. pipelock

    luckyPipewrench

    Open-source AI agent firewall for MCP security: agent egress control, DLP, SSRF, and prompt injection defense.

  29. rogue

    qualifire-dev

    AI Agent Evaluator & Red Team Platform

  30. secureclaw

    adversa-ai

    SecureClaw - Security Plugin and Skill for OpenClaw OWASP-Aligned

  31. strix

    usestrix

    Open-source AI hackers to find and fix your app’s vulnerabilities.

  32. vibe check mcp server

    PV-Bhat

    Vibe Check is a tool that provides mentor-like feedback to AI Agents, preventing tunnel-vision, over-engineering and reasoning lock-in for complex and long-horizon agent workflows. KISS your over-eager AI Agents goodbye! Effective for: Coding, Ambiguous Tasks, High-Risk tasks

  33. AF

    zerobox

    afshinm

    Lightweight, cross-platform process sandboxing powered by OpenAI Codex's runtime. Sandbox any command with file, network, and credential controls.