-
-
Notifications
You must be signed in to change notification settings - Fork 49
Home
Welcome to the Voice Chat AI documentation! This wiki serves as your comprehensive guide to installing, configuring, and using Voice Chat AI - an interactive conversational AI assistant that combines multiple language models with text-to-speech capabilities for natural voice interactions.
Voice Chat AI is an open-source project that lets you create engaging voice-based conversations with AI characters. The system uses:
- Voice Input: Speak naturally to the AI using your microphone
- AI Processing: Your words are processed by advanced language models (OpenAI, Anthropic, etc.)
- Character Responses: Receive responses from various AI personalities
- Voice Output: Hear responses spoken aloud using text-to-speech technology
Whether you want a virtual companion, storytelling experience, or just a fun way to interact with AI, Voice Chat AI provides a flexible platform that runs locally on your machine.
-
Installation:
- Clone the repository
- Install dependencies
- Configure your API keys
-
Basic Usage:
- Run the application
- Select your preferred AI model and character
- Choose your text-to-speech provider
- Start talking!
-
Configuration:
- Customize the
.env
file for your preferences - Add new character personalities
- Adjust voice settings
- Customize the
Connect to various AI providers:
- OpenAI (GPT-4o)
- Anthropic (Claude models)
- Ollama (local models)
- XAI (Grok)
Choose your preferred TTS provider:
- OpenAI TTS
- ElevenLabs
- XTTS (local)
- Kokoro TTS (local)
- 65+ pre-configured personalities
- Custom character creation
- Character-specific voice matching
- Story Mode: Narrative adventures where your choices matter
- Game Mode: Interactive gameplay experiences
- Request the AI to analyze what's on your screen
- Get contextual responses based on visual information
- OS: Windows, macOS, or Linux
- Python: 3.10 or higher
-
Hardware:
- Minimum: 8GB RAM, dual-core CPU
- Recommended: 16GB RAM, Nvidia GPU support (for local models)
- Network: Internet connection (for cloud-based AI models)
- Audio: Microphone for voice input
Voice Chat AI is released under the MIT License, making it free to use, modify, and distribute.
This documentation is a work in progress and will be updated regularly with new features and improvements.