AI CHAT
Switch between Gemini Flash, Qwen, Llama, Gemma, GPT-OSS and other open-weight models in one screen. Tool integration and ReAct agent mode are available out of the box.
Kotonia's AI Chat lets you switch between lightweight Gemini Flash and Qwen / Llama / Gemma / GPT-OSS / Nemotron / MiniMax in one screen. Whether you're coding, rewriting prose, translating, or studying, pick the model that fits. Tool integration and ReAct agent mode are available on the free tier.
Swap between Gemini Flash, Qwen, Llama, Gemma, GPT-OSS, Nemotron, and MiniMax with one click — pick the right tool for the task.
Create and call custom functions from AI. Add web search, calculators, API integrations, or any capability you build.
Handle multi-step tasks autonomously — run code, generate files, and complete research with one prompt. Free tier: Gemini 3.1 Flash-Lite, 10/day. Top-tier models (Claude / GPT / Gemini Pro, etc.) on paid plans.
Save chat history per session and continue conversations with full context. Run multiple parallel topics in separate sessions.
Default is Gemini 3.1 Flash-Lite (the only model available on free tier). Switch to other open-weight models (Qwen, Llama, Gemma, GPT-OSS, DeepSeek V4, etc.) from the header — these require a paid plan.
Works as a normal chat. Sessions auto-save your history so you can pick up where you left off.
The real strength is the seamless integration: AI chat works in the same screen as custom tools, the ReAct agent, and the Linux runtime. Below is a single conversation flowing through model switching → tool search → ReAct execution → adding a new tool.
A subset of available models. **Only Gemini 3.1 Flash-Lite is available on the free tier** — the rest are delivered via OpenRouter and require a paid plan.
AI Chat connects seamlessly with the rest of Kotonia. Combine with tool integration and the ReAct agent to go from simple chat to complex task automation — all in one conversation.
Register functions the AI can call — web search, internal APIs, calculators. The AI calls them automatically when relevant.
Learn about tools →Hand off multi-step tasks (run code, investigate, generate files) with one prompt — the agent plans, executes, and finishes the job. Free tier available (Flash-Lite, 10/day).
Learn about the agent →Run the same prompt on Claude / GPT / Gemini Pro / DeepSeek V4 in parallel and compare side by side.
Learn about multi-model →Pick a public persona built by other users with one click and use it as your chat partner.
Learn about personas →One-minute signup, no credit card. Multi-model and tool calling included on the free tier.