docs: consolidate voice documentation into docs/voice.md (#485)

Cameron
2026-03-04 16:18:37 -08:00
committed by GitHub
parent 025fd38d5f
commit bb0ccd65e1
16 changed files with 249 additions and 96 deletions

@@ -107,48 +107,13 @@ That's it! Message your bot on Telegram.
> **Note:** For detailed environment variable reference and multi-channel setup, see [SKILL.md](./SKILL.md)
## Voice Messages
## Voice
LettaBot can transcribe voice messages using either OpenAI Whisper or Mistral Voxtral. Voice messages are automatically converted to text and sent to the agent with a `[Voice message]:` prefix.
LettaBot can transcribe incoming voice messages (via OpenAI Whisper or Mistral Voxtral) and reply with voice memos (via ElevenLabs or OpenAI TTS). Voice notes render as native bubbles on Telegram and WhatsApp.
**Supported channels:** Telegram, WhatsApp, Signal, Slack, Discord
### Configuration
**Option 1: OpenAI Whisper**
Add your OpenAI API key to `lettabot.yaml`:
```yaml
transcription:
  provider: openai
  apiKey: sk-...
  model: whisper-1 # optional, defaults to whisper-1
```
Or set via environment variable:
```bash
export OPENAI_API_KEY=sk-...
```
**Option 2: Mistral Voxtral** (2x faster, 2x cheaper)
Add your Mistral API key to `lettabot.yaml`:
```yaml
transcription:
  provider: mistral
  apiKey: ...
```
Or set via environment variable:
```bash
export MISTRAL_API_KEY=...
```
If no API key is configured, users will receive an error message with a link to this section.
See [docs/voice.md](./docs/voice.md) for full setup, configuration, and troubleshooting.
## Skills
LettaBot is compatible with [skills.sh](https://skills.sh) and [Clawdhub](https://clawdhub.com/).