Real-time AI Voice Chat: Your Digital Conversation Partner
2025-05-05
This project allows natural, spoken conversations with an AI using a sophisticated client-server system. It leverages WebSockets for low-latency audio streaming, real-time speech-to-text transcription, LLM processing (Ollama and OpenAI supported), and text-to-speech synthesis. Users can customize the AI's voice and choose from various TTS engines (Kokoro, Coqui, Orpheus). The system features intelligent turn-taking, flexible AI model selection, and is Dockerized for easy deployment.