logo
icon

Open-LLM-VTuber (Cloud API)

Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D talking face. Cloud-API edition: all inference via OpenAI-compatible LLM (default Zeabur AI Hub), Groq Whisper for speech-to-text, and free Edge-TTS for voice output. ⚠️ Pins openllmvtuber/open-llm-vtuber:latest — upstream is in rapid development, so restarting the service can pull breaking changes.

Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D talking face. Cloud-API edition: all inference via OpenAI-compatible LLM (default Zeabur AI Hub), Groq Whisper for speech-to-text, and free Edge-TTS for voice output. ⚠️ Pins openllmvtuber/open-llm-vtuber:latest — upstream is in rapid development, so restarting the service can pull breaking changes.

PlatformZeabur
Deployed0
PublisherzeaburZeabur
Deployed0 times
PublisherzeaburZeabur
Created2026-06-06
Tags
AIVTuber

Open-LLM-VTuber (Cloud API)

⚠️ Heads up — upstream is in active development. This template pins the openllmvtuber/open-llm-vtuber:latest image tag, so restarting your service can pull in a newer upstream build with breaking changes (config schema, frontend bundle, model defaults, etc.). Take a snapshot of /app/conf/conf.yaml from the service console before restarting if you've customized it.

Open-LLM-VTuber is an open-source AI VTuber that talks to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face. This Cloud API edition runs all inference through cloud APIs — no local GPU required, deploys to any Zeabur server in one click.

What's in this template

LayerProviderCost
LLMOpenAI-compatible (default Zeabur AI Hub)pay-per-token
ASR (speech-to-text)Groq Whisper-large-v3-turbofree tier
TTS (text-to-speech)Microsoft Edge-TTSfree, no key
Live2Drendered in browserfree

Quick start

  1. Get an LLM API key — sign up at Zeabur AI Hub for a one-stop key that covers Claude / GPT / Gemini / DeepSeek, or use OpenAI / Anthropic directly.
  2. (Optional) Get a Groq API key — sign up at console.groq.com for free Whisper STT. Without this, voice input is disabled but text chat still works.
  3. Click Deploy and fill in the variables.
  4. Open the deployed URL — start chatting or click the mic to talk.

About the URL auto-config: Upstream's frontend bundle hardcodes ws://127.0.0.1:12393 as the WebSocket / Base URL defaults (built for local dev). This template injects a tiny script into index.html at container start that points those defaults at the current page origin, so you don't have to touch the Settings UI. If a future upstream release adds runtime config support, we can drop this patch.

Want a different LLM provider?

ProviderLLM_BASE_URLLLM_MODEL
Zeabur AI Hub (default)https://hnd1.aihub.zeabur.ai/v1claude-haiku-4-5
OpenAIhttps://api.openai.com/v1gpt-4o-mini
Anthropichttps://api.anthropic.com/v1claude-haiku-4-5
Google Geminihttps://generativelanguage.googleapis.com/v1beta/openai/gemini-2.5-flash
DeepSeekhttps://api.deepseek.com/v1deepseek-chat
Groqhttps://api.groq.com/openai/v1llama-3.3-70b-versatile

Advanced configuration

The 6 env vars above cover the common settings. For anything else — character switch, MCP tools, ASR/TTS provider swap, voice style, etc. — edit /app/conf/conf.yaml directly:

  1. Open the service in Zeabur dashboard → Files tab
  2. Navigate to /app/conf/conf.yaml, click to open it in the in-browser editor, make your changes, and save
  3. Restart the service

/app/conf is a persistent volume — your edits survive restarts. Env vars listed above are re-applied on each boot and override the corresponding yaml keys, so leave those keys alone in your edits if you want env vars to win.

Want local GPU inference instead?

Use the Open-LLM-VTuber (Local GPU) template — deploys to a GPU-equipped dedicated server with Ollama, local Whisper, and GPT-SoVITS. See the Zeabur Marketplace.