docs: consolidate voice documentation into docs/voice.md (#485)

2026-03-04 16:18:37 -08:00
parent 025fd38d5f
commit bb0ccd65e1
16 changed files with 249 additions and 96 deletions
--- a/README.md
+++ b/README.md
@@ -107,48 +107,13 @@ That's it! Message your bot on Telegram.

 > **Note:** For detailed environment variable reference and multi-channel setup, see [SKILL.md](./SKILL.md)

-## Voice Messages
+## Voice

-LettaBot can transcribe voice messages using either OpenAI Whisper or Mistral Voxtral. Voice messages are automatically converted to text and sent to the agent with a `[Voice message]:` prefix.
+LettaBot can transcribe incoming voice messages (via OpenAI Whisper or Mistral Voxtral) and reply with voice memos (via ElevenLabs or OpenAI TTS). Voice notes render as native bubbles on Telegram and WhatsApp.

 **Supported channels:** Telegram, WhatsApp, Signal, Slack, Discord

-### Configuration
-
-**Option 1: OpenAI Whisper**
-
-Add your OpenAI API key to `lettabot.yaml`:
-
-```yaml
-transcription:
-  provider: openai
-  apiKey: sk-...
-  model: whisper-1  # optional, defaults to whisper-1
-```
-
-Or set via environment variable:
-
-```bash
-export OPENAI_API_KEY=sk-...
-```
-
-**Option 2: Mistral Voxtral** (2x faster, 2x cheaper)
-
-Add your Mistral API key to `lettabot.yaml`:
-
-```yaml
-transcription:
-  provider: mistral
-  apiKey: ...
-```
-
-Or set via environment variable:
-
-```bash
-export MISTRAL_API_KEY=...
-```
-
-If no API key is configured, users will receive an error message with a link to this section.
+See [docs/voice.md](./docs/voice.md) for full setup, configuration, and troubleshooting.

 ## Skills
 LettaBot is compatible with [skills.sh](https://skills.sh) and [Clawdhub](https://clawdhub.com/).