letta-server

Author	SHA1	Message	Date
Ari Webb	5faec5632f	fix: add m2.5 (#9480 ) * fix: add m2.5 * fix test	2026-02-24 10:52:07 -08:00
Kian Jones	7cc1cd3dc0	feat(ci): self-hosted provider test for lmstudio (#9404 ) * add gpu runners and prod memory_repos * add lmstudio and vllm in model_settings * fix llm_configs and change variable name in reusable workflow and change perms for memory_repos to admin in tf * fix: update self-hosted provider tests to use SDK 1.0 and v2 tests - Update letta-client from ==0.1.324 to >=1.0.0 - Switch ollama/vllm/lmstudio tests to integration_test_send_message_v2.py 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: use openai provider_type for self-hosted model settings ollama/vllm/lmstudio are not valid provider_type values in the SDK model_settings schema - they use openai-compatible APIs so provider_type should be openai. The provider routing is determined by the handle prefix. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: enable redis for ollama/vllm/lmstudio tests Background streaming tests require Redis. Add use-redis: true to self-hosted provider test workflows. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * prep for lmstudio and vllm * used lmstudio_openai client * change tool call parser from hermes to qwen3_xml * qwen3_xmlk -> qwen3_coder * revert to hermes (incompatible with parallel tool calls?) and skipping vllm tests on parallel tool calls * install uv redis extra * remove lmstudio * create lmstudio test * qwen3-14b on lmstudio * try with qwen3-4b * actually update the model config json to use qwen3-4b * add test_providers::test_lmstudio * bump timeout from 60 to 120 for slow lmstudio on cpu model * misc vllm changes --------- Co-authored-by: Letta <noreply@letta.com>	2026-02-24 10:52:07 -08:00
Charles Packer	b0e16ae50f	fix: surface GPT-5.3 Codex for ChatGPT OAuth providers (#9379 )	2026-02-24 10:52:07 -08:00
Sarah Wooders	b34ad43691	feat: add minimax byok to ui (#9101 ) * fix: patch minimax * feat: add frontend changes for minimax * add logo, fix backend * better check for is minimax * more references fixed for minimax * start revering unnecessary changes * revert backend changes, just ui * fix minimax fully * fix test * add key to deploy action --------- Co-authored-by: Ari Webb <ari@letta.com> Co-authored-by: Ari Webb <arijwebb@gmail.com>	2026-01-29 12:44:04 -08:00
Sarah Wooders	fb69a96cd6	fix: patch minimax (#9099 )	2026-01-29 12:44:04 -08:00
Sarah Wooders	adab8cd9b5	feat: add MiniMax provider support (#9095 ) * feat: add MiniMax provider support Add MiniMax as a new LLM provider using their Anthropic-compatible API. Key implementation details: - Uses standard messages API (not beta) - MiniMax supports thinking blocks natively - Base URL: https://api.minimax.io/anthropic - Models: MiniMax-M2.1, MiniMax-M2.1-lightning, MiniMax-M2 (all 200K context, 128K output) - Temperature clamped to valid range (0.0, 1.0] - All M2.x models treated as reasoning models (support interleaved thinking) Files added: - letta/schemas/providers/minimax.py - MiniMax provider schema - letta/llm_api/minimax_client.py - Client extending AnthropicClient - tests/test_minimax_client.py - Unit tests (13 tests) - tests/model_settings/minimax-m2.1.json - Integration test config 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate API spec with MiniMax provider 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: use MiniMax-M2.1-lightning for CI tests Switch to the faster/cheaper lightning model variant for integration tests. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: add MINIMAX_API_KEY to deploy-core command Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com> * chore: regenerate web openapi spec with MiniMax provider Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com> 🐾 Generated with [Letta Code](https://letta.com) --------- Co-authored-by: Letta <noreply@letta.com> Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com>	2026-01-29 12:44:04 -08:00
Devansh Jain	dfa6ee0c23	feat: add SGLang support (#8838 ) * add sglang support * add tests * normalize base url * cleanup * chore: regenerate autogenerated API files for sglang support	2026-01-29 12:43:51 -08:00
Ari Webb	cd45212acb	feat: add zai provider support (#7626 ) * feat: add zai provider support * add zai_api_key secret to deploy-core * add to justfile * add testing, provider integration skill * enable zai key * fix zai test * clean up skill a little * small changes	2026-01-12 10:57:19 -08:00
Ari Webb	fecf503ad9	feat: xhigh reasoning for gpt-5.2 (#6735 )	2025-12-15 12:03:09 -08:00
jnjpng	3221ed8a14	fix: update base provider to only handle _enc fields (#6591 ) * base * update * another pass * fix * generate * fix test * don't set on create * last fixes --------- Co-authored-by: Letta Bot <noreply@letta.com>	2025-12-15 12:02:34 -08:00
Ari Webb	7380eaec13	feat: enable gpt5.1 models [LET-6178] (#6175 ) * hack at gpt51 * revert package lock * first hack * default context window --------- Co-authored-by: Ari Webb <ari@letta.com>	2025-11-24 19:09:33 -08:00
Ari Webb	f3a40a41f5	feat: updated backend to not allow minimal for codex [LET-5883] (#5760 ) * updated backend * add function in openai_client * remove values before error * remove test --------- Co-authored-by: Ari Webb <ari@letta.com>	2025-11-13 15:35:34 -08:00
cthomas	2d8e3efd9e	feat: update reasoning toggle test (#5136 )	2025-10-07 17:50:49 -07:00
cthomas	4173de955d	test: add reasoning toggle tests for new agent type (#5000 )	2025-10-07 17:50:47 -07:00
Charles Packer	81bf132712	fix: disable reasoning toggle when agent type is letta agent v1 [PRO-1346] (#4946 ) * fix: change the backend logic for the reasoning toggle to be different for v1 vs old agent types, also change the frontend to disable toggling unless it's a toggle-able model * fix: add more logic to when the reasoning switch is enabled or disabled * fix: safety net in the backend to set config when undefined * fix: disable prefix fill, it's a bug	2025-10-07 17:50:46 -07:00
Kian Jones	b8e9a80d93	merge this (#4759 ) * wait I forgot to comit locally * cp the entire core directory and then rm the .git subdir	2025-09-17 15:47:40 -07:00
Kian Jones	22f70ca07c	chore: officially migrate to submodule (#4502 ) * remove apps/core and apps/fern * fix precommit * add submodule updates in workflows * submodule * remove core tests * update core revision * Add submodules: true to all GitHub workflows - Ensure all workflows can access git submodules - Add submodules support to deployment, test, and CI workflows - Fix YAML syntax issues in workflow files 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * remove core-lint * upgrade core with latest main of oss --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-09-09 12:45:53 -07:00
Andy Li	ff718d8c40	feat: uv migration (#3493 ) * uv migration smaller runners, freeze test runs, remove dev, ruff,hatchling, previw, poetry, generates wheel, installs wheel, docker * fix tests and dependency groups * test fixes * test fixing and main * resolve merge conflict * dev + test dependency group * Test * trigger CI * trigger CI * add debugging info * trigger CI * uv for reusable and sdk preview * resolve mc and reformat black * staged-api * mypy * fix fern * prod Dockerfile * model sweep, and project.toml and uvlock * --group test -> --extra dev * remove redundant --extra dev and rename tests to dev * sdk backwards compat install sqlite * install sqlite group for sdk-backwards-compat * install uv on gh runner for cloud-api-integration-tests * stage+publish * pytest asyncio * bug causing pytest package to get removed * try to fix async event loop issues * migrate to --with google-cloud-secret-manager --------- Co-authored-by: Kian Jones <kian@letta.com>	2025-08-26 18:11:09 -07:00
cthomas	1855eb0739	feat: update reasoning toggle test (#3901 )	2025-08-13 14:28:04 -07:00
cthomas	4155bdb023	feat: handle all cases for reasoning toggle (#3857 )	2025-08-11 13:45:50 -07:00
jnjpng	b3d55868c9	fix: fix and update vllm tests Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local> Co-authored-by: Kian Jones <kian@letta.com>	2025-08-06 14:37:55 -07:00
Andy Li	58081e3cea	feat: support for providers	2025-07-22 16:09:50 -07:00
Kian Jones	4c3374c580	feat(ci): Add coverage for self-hosted providers (#2976 )	2025-07-21 18:26:23 -07:00
Matthew Zhou	49877f00dd	feat: Attach file blocks when creating an agent with source (#3027 )	2025-06-25 14:08:00 -07:00
Matthew Zhou	725a6650bd	feat: Asyncify model listing for Together (#2286 )	2025-05-20 16:00:37 -07:00
Matthew Zhou	8517ec7230	feat: Asyncify model listing for Gemini (#2284 )	2025-05-20 16:00:20 -07:00
Matthew Zhou	edc0717579	feat: Asyncify anthropic model listing (#2283 )	2025-05-20 15:59:42 -07:00
Matthew Zhou	87dae5d6e7	feat: Asyncify openai model listing (#2281 )	2025-05-20 15:24:00 -07:00
cthomas	18db9b9509	feat: byok 2.0 (#1963 )	2025-04-30 21:26:50 -07:00
Kevin Lin	7173d5cefb	feat: support deepseek models (#821 ) Co-authored-by: Charles Packer <packercharles@gmail.com> Co-authored-by: Sarah Wooders <sarahwooders@gmail.com> Co-authored-by: Shubham Naik <shub@memgpt.ai> Co-authored-by: Shubham Naik <shub@letta.com>	2025-02-18 15:28:01 -08:00
Sarah Wooders	fd8933c447	feat: google vertex integration (#980 )	2025-02-12 18:06:26 -08:00
mlong93	e9790d07ca	feat: add support for AWS Bedrock (Anthropic) (#689 ) Co-authored-by: Mindy Long <mindy@letta.com>	2025-01-17 11:42:21 -08:00
cthomas	f97d4707ec	feat: add updated_at timestamp to provider and bump on write (#574 )	2025-01-10 11:16:53 -08:00
Shubham Naik	5a743d1dc4	Add 'apps/core/' from commit 'ea2a7395f4023f5b9fab03e6273db3b64a1181d5' git-subtree-dir: apps/core git-subtree-mainline: a8963e11e7a5a0059acbc849ce768e1eee80df61 git-subtree-split: ea2a7395f4023f5b9fab03e6273db3b64a1181d5	2024-12-22 20:31:22 -08:00

34 Commits