Open-LLM-VTuber (Cloud API)

⚠️ Heads up — upstream is in active development. This template pins the openllmvtuber/open-llm-vtuber:latest image tag, so restarting your service can pull in a newer upstream build with breaking changes (config schema, frontend bundle, model defaults, etc.). Take a snapshot of /app/conf/conf.yaml from the service console before restarting if you've customized it.

Open-LLM-VTuber is an open-source AI VTuber that talks to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face. This Cloud API edition runs all inference through cloud APIs — no local GPU required, deploys to any Zeabur server in one click.

What's in this template

Layer	Provider	Cost
LLM	OpenAI-compatible (default Zeabur AI Hub)	pay-per-token
ASR (speech-to-text)	Groq Whisper-large-v3-turbo	free tier
TTS (text-to-speech)	Microsoft Edge-TTS	free, no key
Live2D	rendered in browser	free

Quick start

Get an LLM API key — sign up at Zeabur AI Hub for a one-stop key that covers Claude / GPT / Gemini / DeepSeek, or use OpenAI / Anthropic directly.
(Optional) Get a Groq API key — sign up at console.groq.com for free Whisper STT. Without this, voice input is disabled but text chat still works.
Click Deploy and fill in the variables.
Open the deployed URL — start chatting or click the mic to talk.

About the URL auto-config: Upstream's frontend bundle hardcodes ws://127.0.0.1:12393 as the WebSocket / Base URL defaults (built for local dev). This template injects a tiny script into index.html at container start that points those defaults at the current page origin, so you don't have to touch the Settings UI. If a future upstream release adds runtime config support, we can drop this patch.

Want a different LLM provider?

Provider	LLM_BASE_URL	LLM_MODEL
Zeabur AI Hub (default)	`https://hnd1.aihub.zeabur.ai/v1`	`claude-haiku-4-5`
OpenAI	`https://api.openai.com/v1`	`gpt-4o-mini`
Anthropic	`https://api.anthropic.com/v1`	`claude-haiku-4-5`
Google Gemini	`https://generativelanguage.googleapis.com/v1beta/openai/`	`gemini-2.5-flash`
DeepSeek	`https://api.deepseek.com/v1`	`deepseek-chat`
Groq	`https://api.groq.com/openai/v1`	`llama-3.3-70b-versatile`

Advanced configuration

The 6 env vars above cover the common settings. For anything else — character switch, MCP tools, ASR/TTS provider swap, voice style, etc. — edit /app/conf/conf.yaml directly:

Open the service in Zeabur dashboard → Files tab
Navigate to /app/conf/conf.yaml, click to open it in the in-browser editor, make your changes, and save
Restart the service

/app/conf is a persistent volume — your edits survive restarts. Env vars listed above are re-applied on each boot and override the corresponding yaml keys, so leave those keys alone in your edits if you want env vars to win.

Want local GPU inference instead?

Use the Open-LLM-VTuber (Local GPU) template — deploys to a GPU-equipped dedicated server with Ollama, local Whisper, and GPT-SoVITS. See the Zeabur Marketplace.

Open-LLM-VTuber (Cloud API)

Services

open-llm-vtuber

Open-LLM-VTuber (Cloud API)

What's in this template

Quick start

Want a different LLM provider?

Advanced configuration

Want local GPU inference instead?

Links

Services

open-llm-vtuber