ChatTTS
2noiseA generative speech model for daily dialogue.
Voice-native AI agents and real-time speech infrastructure (Vapi, Retell, Deepgram Aura).
A generative speech model for daily dialogue.
CyberVerse is an open-source digital human agent platform with real-time video calling. Create an AI agent you can see and talk to, face to face, just like a video call.
Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.
Conversational voice AI agents
Open Source Voice Agent Platform
A Python-based Xiaozhi AI for users who want the full Xiaozhi experience without owning specialized hardware.
Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.
本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.