CHARACTER VOICE CHAT
Pick a public persona and chat with them in text or voice in real time. Japanese, English, and Chinese TTS engines and lipsync avatar display are supported.
Hi! How can I help you today?
English / Qwen3-TTS · Ditto lipsync
Kotonia's character voice chat runs on a real-time VAD + STT + LLM + TTS pipeline. Speak into the mic and the AI replies in voice instantly, with synced lipsync avatars for personas that have an avatar registered. Optimized for language practice, roleplay, casual conversation, and emotional companionship — building relationships through voice.
High-quality TTS in 11 languages including Japanese, English, Chinese, and Cantonese (Qwen3-TTS). Speak and the pipeline runs STT → AI → TTS back to you instantly.
Talk to personas published by other users. You can also save and grow your own personas.
Register an avatar image to a persona, and Ditto syncs the character's mouth movements to the generated speech in real time.
Sessions persist with full history. Reference past exchanges and grow the relationship over time.
Choose from public personas — language tutors, casual chat partners, roleplay characters, and more.
The browser will ask for microphone access on first use. Once granted, subsequent sessions start with one click.
Press the mic button and talk. VAD auto-detects end of utterance and the AI replies in voice instantly.
Conversations save automatically per session. Pick up where you left off and the AI remembers what was discussed.
Pick a persona, talk into the mic, and the AI replies in voice. Avatars sync mouth movements when registered.
11 languages with voice input and output (Qwen3-TTS), focused on Japanese, English, Chinese, and Cantonese. Switch language per persona.
Character Voice Chat gets stronger when combined with the rest of Kotonia — pick a partner from the public persona library, or jump into a packaged experience like Sassy AI English.
Pick from AI characters built by other users and start voice chatting instantly. Fork the ones you like to customize.
Learn about personas →A packaged version focused on language learning — gets sassy with you to correct pronunciation. An example of a vertical product built on voice chat.
See Sassy English →One-minute signup, no credit card. 20 multilingual voice chats per day on the free tier.