LocalKin

Complete Voice AI Platform
for Local Development

Run Speech-to-Text, Text-to-Speech, and LLM integration locally with an Ollama-inspired interface.

Available for macOS, Linux, and Windows

Multiple STT engines including whisper.cpp (50x faster), faster-whisper with VAD, and OpenAI Whisper

whisper.cppfaster-whisperWhisper

Multiple TTS engines with voice selection: Native OS TTS, Kokoro, SpeechT5, Bark, and XTTS for voice cloning

KokoroXTTSSpeechT5

Voice-based conversational AI with Ollama integration for intelligent responses and streaming support

OllamaStreamingContext

# Install with uv (recommended)

1uv pip install localkin-service-audio

# Transcribe audio

2kin audio transcribe audio.wav

# Text-to-speech

3kin audio tts "Hello world" --model kokoro-82m

# Voice AI with LLM

4kin audio listen --llm ollama --tts --stream

# Start API server

5kin audio run kokoro-82m --port 8001