Skip to content

Commit 9ecd275

Browse files
committed
remove unused imports, update readme new video
1 parent ad1a5a8 commit 9ecd275

File tree

3 files changed

+8
-7
lines changed

3 files changed

+8
-7
lines changed

.gitignore

+1-1
Original file line numberDiff line numberDiff line change
@@ -220,6 +220,6 @@ temp_output.mp3
220220

221221
conversation_history.txt
222222
conversation_history.json
223-
freeze_requiremenets.txt
223+
freeze_requirements.txt
224224
freeze_requiremenets.txt.bak
225225
elevenlabs_voices.json

README.md

+6-2
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,9 @@
66

77
Voice Chat AI is a project that allows you to interact with different AI characters using speech. You can choose between various characters, each with unique personalities and voices. Have a serious conversation with Albert Einstein or role play with the OS from the movie HER.
88

9-
You can run all locally, you can use openai for chat and voice, you can mix between the two. You can use ElevenLabs voices with ollama models all controlled from a Web UI. Ask the AI to look at your screen and it will explain in detail what it's looking at.
9+
You can run all locally, you can use openai for chat and voice, you can mix between the two. You can use ElevenLabs voices with ollama models all controlled from a Web UI. Ask the AI to look at your screen and it will explain in detail what it's looking at.
10+
11+
New - WebRTC Real Time API you can have a real time conversation, interrupt the AI and have instant responses. You can also use OpenAI's new TTS model to make the AI more human like with emotions and expressive voices.
1012

1113
## Quick Start
1214

@@ -36,7 +38,7 @@ Get up and running fast with Voice Chat AI! 🔊
3638
- **HUGE selection of built in Characters**: Talk with the funniest and most insane AI characters!
3739
- **Docker Support**: Prebuilt image from dockerhub or build yor own image with or without nvidia cuda. Can run on CPU only.
3840

39-
https://github.com/user-attachments/assets/ff17eeed-3117-413f-94b1-21970c44f377
41+
https://github.com/user-attachments/assets/f4401acf-4422-458f-bcbc-06ff63de010e
4042

4143
## Installation
4244

@@ -456,6 +458,8 @@ http://localhost:8000/
456458
457459
## OpenAI Enhanced
458460
461+
![Image](https://github.com/user-attachments/assets/5c62bbfc-7f1d-48a8-8a83-4a2488a1bc0b)
462+
459463
OpenAI Enhanced Mode is a new feature that allows you to use the OpenAI API to generate TTS and transcription. It uses the `gpt-4o-mini-tts` and `gpt-4o-mini-transcribe` models.
460464
You can learn more about it here: https://platform.openai.com/docs/guides/text-to-speech
461465

app/main.py

+1-4
Original file line numberDiff line numberDiff line change
@@ -15,17 +15,14 @@
1515
import logging
1616
from threading import Thread
1717
import uuid
18-
import base64
19-
from typing import Dict, List, Any, Optional, Callable
20-
import time
2118

2219
# Configure logging
2320
logging.basicConfig(level=logging.INFO,
2421
format='%(asctime)s - %(levelname)s - %(message)s')
2522
logger = logging.getLogger(__name__)
2623

2724
# Define available characters
28-
CHARACTERS = ["assistant", "wizard", "pirate", "alien", "robot", "detective", "cowboy"]
25+
# CHARACTERS = ["assistant", "wizard", "pirate", "alien", "robot", "detective", "cowboy"]
2926

3027
app = FastAPI()
3128

0 commit comments

Comments
 (0)