interface / externals / tts_stt / voice_server.py

FastAPI service that runs on a Jetson / Raspberry Pi and exposes Whisper STT and Piper TTS to the agent.

Endpoints

Method ยท PathBehaviour
POST /sttAccepts an UploadFile (audio/wav). WhisperModel large-v3-turbo on CUDA float16. Beam size 2, vad_filter=False, condition_on_previous_text=False. Returns {"text": "..."}.
POST /ttsAccepts {"text": "..."}. Subprocess call to Piper en_US-amy-medium.onnx (CUDA). Streams audio/wav.

Models & paths

Concurrency

asyncio for HTTP, subprocess.Popen for Piper (blocking).

Dependencies