letta-server

Author	SHA1	Message	Date
github-actions[bot]	05ec02e384	fix: handle Anthropic 413 request_too_large as ContextWindowExceededError (#8424 ) The Anthropic API returns a 413 status code with error type `request_too_large` when the request payload exceeds the maximum allowed size. This error should be converted to `ContextWindowExceededError` so the system can handle it appropriately (e.g., by summarizing the conversation to reduce context size). Changes: - Added `request_too_large` and `request exceeds the maximum size` to the early string-based error detection in `handle_llm_error` - Added specific handling for HTTP 413 status code in the `APIStatusError` handler - Added tests to verify the new error handling behavior Fixes: #8422 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Letta <noreply@letta.com> Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-12 10:57:48 -08:00
Charles Packer	ed6284cedb	feat: Add conversation_id filtering to message endpoints (#8324 ) * feat: Add conversation_id filtering to message list and search endpoints Add optional conversation_id parameter to filter messages by conversation: - client.agents.messages.list - client.messages.list - client.messages.search Changes: - Added conversation_id field to MessageSearchRequest and SearchAllMessagesRequest schemas - Added conversation_id filtering to list_messages in message_manager.py - Updated get_agent_recall_async and get_all_messages_recall_async in server.py - Added conversation_id query parameter to router endpoints - Updated Turbopuffer client to support conversation_id filtering in searches Fixes #8320 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Charles Packer <cpacker@users.noreply.github.com> * add conversation_id to message and tpuf * default messages filter for backward compatibility * add test and auto gen * fix integration test * fix test * update test --------- Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Charles Packer <cpacker@users.noreply.github.com> Co-authored-by: christinatong01 <christina@letta.com>	2026-01-12 10:57:48 -08:00
jnjpng	d55fd69b7b	chore: add comment and test for changing PBKDF2 iteration count (#8366 ) base	2026-01-12 10:57:48 -08:00
jnjpng	b68e4e74f9	fix: replace cryptography with hashlib for encryption key derivation (#8364 ) base	2026-01-12 10:57:48 -08:00
cthomas	9b359418d0	feat: add pending approval field on agent state (#8361 ) * feat: add pending approval field on agent state * test failures	2026-01-12 10:57:48 -08:00
jnjpng	ccfd3d1432	fix: asyncify encrypt on write (#8339 ) * base * update * import	2026-01-12 10:57:48 -08:00
Sarah Wooders	02700cdae2	fix: add byok handle testing (#8331 ) * fix: patch byok agent creation * match original logic * fix: add testing for byok handle	2026-01-12 10:57:48 -08:00
jnjpng	febe6efaac	fix: allow upserting empty secret object to delete agent secrets (#8316 ) base	2026-01-12 10:57:48 -08:00
Sarah Wooders	18a1a16bf4	Revert "feat: add message_types filter to list messages endpoint" (#8314 ) Revert "feat: add message_types filter to list messages endpoint (#8280)" This reverts commit e7ac5df721ec4b3e663dd30239f590ee16bb8630.	2026-01-12 10:57:48 -08:00
Ari Webb	02f3e3f3b9	fix: fix providers and models persistence (#8302 )	2026-01-12 10:57:48 -08:00
jnjpng	e56c5c5b49	fix: global sandbox environment variables not injected on tool execution (#8310 ) * base * test	2026-01-12 10:57:48 -08:00
Cameron	7c44375cce	feat: add message_types filter to list messages endpoint (#8280 ) * feat: add message_types filter to list messages endpoint Add the ability to filter messages by type when listing message history via GET /v1/agents/{agent_id}/messages. This brings parity with the create message endpoint which already supports include_return_message_types. Changes: - Add message_types query parameter to list_messages endpoint in agents.py - Add message_types parameter to get_agent_recall_async in server.py - Filter messages by message_type after LettaMessage conversion - Add test for message_types filtering Closes #8277 Written by Cameron ◯ Letta Code > "Simplicity is the ultimate sophistication." - Leonardo da Vinci 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate OpenAPI spec and SDK for message_types filter 🐧 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> Written by Cameron ◯ Letta Code "The only way to do great work is to love what you do." - Steve Jobs --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-12 10:57:48 -08:00
Sarah Wooders	87d920782f	feat: add conversation and conversation_messages tables for concurrent messaging (#8182 )	2026-01-12 10:57:48 -08:00
Ari Webb	cc825b4f5c	Revert "Revert "feat: enable provider models persistence" (#6590 )" (#6595 )	2026-01-12 10:57:48 -08:00
Sarah Wooders	2d84af11c3	fix: override with client-side tools is overlapping (#8232 )	2026-01-12 10:57:48 -08:00
Sarah Wooders	3cf91d9dbc	chore: enable client-defined tools integration test (#8223 )	2026-01-12 10:57:48 -08:00
Sarah Wooders	7669896184	feat: allow client-side tools to be specified in request (#8220 ) * feat: allow client-side tools to be specified in request Add `client_tools` field to LettaRequest to allow passing tool schemas at message creation time without requiring server-side registration. When the agent calls a client-side tool, execution pauses with stop_reason=requires_approval for the client to provide tool returns. - Add ClientToolSchema class for request-level tool schemas - Merge client tools with agent tools in _get_valid_tools() - Treat client-side tool calls as requiring approval - Add integration tests for client-side tools flow 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * test: add comprehensive end-to-end test for client-side tools Update integration test to verify the complete flow: - Agent calls client-side tool and pauses - Client provides tool return with secret code - Agent processes and responds - User asks about the code, agent recalls it - Validate full conversation history makes sense 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * update apis * fix: client-side tools schema format and test assertions - Use flat schema format for client tools (matching t.json_schema) - Support both object and dict access for client tools - Fix stop_reason assertions to access .stop_reason attribute 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: simplify client_tools access pattern ClientToolSchema objects always have .name attribute 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: add client_tools parameter to LettaAgentV2 for API compatibility V2 agent doesn't use client_tools but needs the parameter to match the base class signature. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * revert: remove client_tools from LettaRequestConfig Client-side tools don't work with background jobs since there's no client present to provide tool returns. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: add client_tools parameter to SleeptimeMultiAgent classes Add client_tools to step() and stream() methods in: - SleeptimeMultiAgentV3 - SleeptimeMultiAgentV4 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate API specs for client_tools support 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-12 10:57:48 -08:00
github-actions[bot]	dbdd1a40e4	fix: sanitize null bytes to prevent PostgreSQL CharacterNotInRepertoireError (#8015 ) This fixes the asyncpg.exceptions.CharacterNotInRepertoireError that occurs when tool returns contain null bytes (0x00), which PostgreSQL TEXT columns reject in UTF-8 encoding. Changes: - Add sanitize_null_bytes() function to recursively remove null bytes from strings - Update json_dumps() to sanitize data before serialization - Apply sanitization in converters.py for tool_calls, tool_returns, approvals, and message_content - Add comprehensive unit tests Fixes #8014 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-12 10:57:47 -08:00
Sarah Wooders	d5decc2a27	fix: persist streaming errors in run metadata (#8062 )	2026-01-12 10:57:47 -08:00
Sarah Wooders	f512d13bc9	feat: test token counting (#7943 )	2026-01-12 10:57:20 -08:00
Cameron	3e15159d83	fix: prevent human block overwrite when skills block missing (#7656 ) * fix: prevent human block overwrite when skills block missing Bug: When connecting to agents created before skills blocks were standard, the human block gets overwritten with skills directory content. Root cause: agent_manager.py:1893-1898 had `block = block` (no-op). When skills block doesn't exist, loop variable ends as last block in core_memory (often "human"), then updates that wrong block. Fix: Use `matched_block` variable to properly track found block. Now correctly raises NoResultFound when block label doesn't exist. Impact: Affects pre-December 2025 agents missing skills blocks. Written by Cameron ◯ Letta Code "The best error message is the one that never shows up." - Thomas Fuchs Co-Authored-By: Letta <noreply@letta.com> * fix: use correct method name in block update test Change get_block_by_label_async to get_block_with_label_async in test. Written by Cameron ◯ Letta Code Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-12 10:57:19 -08:00
Ari Webb	cd45212acb	feat: add zai provider support (#7626 ) * feat: add zai provider support * add zai_api_key secret to deploy-core * add to justfile * add testing, provider integration skill * enable zai key * fix zai test * clean up skill a little * small changes	2026-01-12 10:57:19 -08:00
cthomas	9a95a8f976	fix: duplicate session commit in step logging (#7512 ) * fix: duplicate session commit in step logging * update all callsites	2026-01-12 10:57:19 -08:00
Sarah Wooders	a7639a53eb	fix: fix summary message return for compaction (#7402 )	2026-01-12 10:57:19 -08:00
jnjpng	350f3a751c	fix: update more plaintext non async callsites (#7223 ) * bae * update * fix * clean up * last	2025-12-17 17:31:02 -08:00
Sarah Wooders	f1bd246e9b	feat: use token streaming for anthropic summarization (#7105 )	2025-12-17 17:31:02 -08:00
Kevin Lin	857139f907	feat: Set reasonable defaults for max output tokens [LET-6483] (#7084 )	2025-12-17 17:31:02 -08:00
jnjpng	00ba2d09f3	refactor: migrate mcp_servers and mcp_oauth to encrypted-only columns (#6751 ) * refactor: migrate mcp_servers and mcp_oauth to encrypted-only columns Complete migration to encrypted-only storage for sensitive fields: - Remove dual-write to plaintext columns (token, custom_headers, authorization_code, access_token, refresh_token, client_secret) - Read only from _enc columns, not from plaintext fallback - Remove helper methods (get_token_secret, set_token_secret, etc.) - Remove Secret.from_db() and Secret.to_dict() methods - Update tests to verify encrypted-only behavior After this change, plaintext columns can be set to NULL manually since they are no longer read from or written to. * fix test * rename * update * union * fix test	2025-12-17 17:31:02 -08:00
Sarah Wooders	bd9f3aca9b	fix: fix `prompt_acknowledgement` usage and update summarization prompts (#7012 )	2025-12-15 12:03:09 -08:00
Sarah Wooders	0c0ba5d03d	fix: remove letta-free embeddings from testing (#6870 )	2025-12-15 12:03:09 -08:00
Sarah Wooders	a731e01e88	fix: use `model` instead of `model_settings` (#6834 )	2025-12-15 12:03:09 -08:00
Sarah Wooders	a721a00899	feat: add `agent_id` to search results (#6867 )	2025-12-15 12:03:09 -08:00
Ari Webb	fecf503ad9	feat: xhigh reasoning for gpt-5.2 (#6735 )	2025-12-15 12:03:09 -08:00
jnjpng	4be813b956	fix: migrate sandbox and agent environment variables to encrypted only (#6623 ) * base * remove unnnecessary db migration * update * fix * update * update * comments * fix * revert * anotha --------- Co-authored-by: Letta Bot <noreply@letta.com>	2025-12-15 12:03:08 -08:00
Sarah Wooders	7ea297231a	feat: add `compaction_settings` to agents (#6625 ) * initial commit * Add database migration for compaction_settings field This migration adds the compaction_settings column to the agents table to support customized summarization configuration for each agent. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix * rename * update apis * fix tests * update web test --------- Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Kian Jones <kian@letta.com>	2025-12-15 12:02:34 -08:00
Ari Webb	4d90f37f50	feat: add gpt-5.2 support (#6698 )	2025-12-15 12:02:34 -08:00
jnjpng	b658c70063	test: add coverage for provider encryption without LETTA_ENCRYPTION_KEY (#6629 ) Add tests to verify that providers work correctly when no encryption key is configured. The Secret class stores values as plaintext in _enc columns and retrieves them successfully, but this code path had no test coverage. Co-authored-by: Letta Bot <noreply@letta.com>	2025-12-15 12:02:34 -08:00
jnjpng	17a90538ca	fix: exclude common API key prefixes from encryption detection (#6624 ) * fix: exclude common API key prefixes from encryption detection Add a list of known API key prefixes (OpenAI, Anthropic, GitHub, AWS, Slack, etc.) to prevent is_encrypted() from incorrectly identifying plaintext credentials as encrypted values. * update * test	2025-12-15 12:02:34 -08:00
Kian Jones	fbd89c9360	fix: replace all 'PRODUCTION' references with 'prod' for consistency (#6627 ) * fix: replace all 'PRODUCTION' references with 'prod' for consistency Problem: Codebase had 11 references to 'PRODUCTION' (uppercase) that should use 'prod' (lowercase) for consistency with the deployment workflows and environment normalization. Changes across 8 files: 1. Source files (using settings.environment): - letta/functions/function_sets/multi_agent.py - letta/services/tool_manager.py - letta/services/tool_executor/multi_agent_tool_executor.py - letta/services/helpers/agent_manager_helper.py All checks changed from: settings.environment == "PRODUCTION" To: settings.environment == "prod" 2. OTEL resource configuration: - letta/otel/resource.py - Updated _normalize_environment_tag() to handle 'prod' directly - Removed 'PRODUCTION' -> 'prod' mapping (no longer needed) - Updated device.id check from _env != "PRODUCTION" to _env != "prod" 3. Test files: - tests/managers/conftest.py - Fixture parameter changed from "PRODUCTION" to "prod" - tests/managers/test_agent_manager.py (3 occurrences) - tests/managers/test_tool_manager.py (2 occurrences) All test checks changed to use "prod" Result: Complete consistency across the codebase: - All environment checks use "prod" instead of "PRODUCTION" - Normalization function simplified (no special case for PRODUCTION) - Tests use correct "prod" value - Matches deployment workflow configuration from PR #6626 This completes the environment naming standardization effort. * fix: update settings.py environment description to use 'prod' instead of 'PRODUCTION' The field description still referenced PRODUCTION as an example value. Updated to use lowercase 'prod' for consistency with actual usage. Before: "Application environment (PRODUCTION, DEV, CANARY, etc. - normalized to lowercase for OTEL tags)" After: "Application environment (prod, dev, canary, etc. - lowercase values used for OTEL tags)"	2025-12-15 12:02:34 -08:00
jnjpng	3221ed8a14	fix: update base provider to only handle _enc fields (#6591 ) * base * update * another pass * fix * generate * fix test * don't set on create * last fixes --------- Co-authored-by: Letta Bot <noreply@letta.com>	2025-12-15 12:02:34 -08:00
Sarah Wooders	70c57c5072	fix: various patches to summarizer (#6597 )	2025-12-15 12:02:34 -08:00
Sarah Wooders	8440e319e2	Revert "feat: enable provider models persistence" (#6590 ) Revert "feat: enable provider models persistence (#6193)" This reverts commit 9682aff32640a6ee8cf71a6f18c9fa7cda25c40e.	2025-12-15 12:02:34 -08:00
Sarah Wooders	bbd52e291c	feat: refactor summarization and message persistence code [LET-6464] (#6561 )	2025-12-15 12:02:34 -08:00
Sarah Wooders	fca5774795	feat: store run errors on streaming (#6573 )	2025-12-15 12:02:34 -08:00
Ari Webb	848a73125c	feat: enable provider models persistence (#6193 ) * Revert "fix test" This reverts commit 5126815f23cefb4edad3e3bf9e7083209dcc7bf1. * fix server and better test * test fix, get api key for base and byok? * set letta default endpoint * try to fix timeout for test * fix for letta api key * Delete apps/core/tests/sdk_v1/conftest.py * Update utils.py * clean up a few issues * fix filterning on list_llm_models * soft delete models with provider * add one more test * fix ci * add timeout * band aid for letta embedding provider * info instead of error logs when creating models	2025-12-15 12:02:34 -08:00
Devansh Jain	d1536df6f6	chore: Update deepseek client for v3.2 models (#6556 ) * support for v3.2 models * streaming + context window fix * fix for no assitant text from deepseek	2025-12-15 12:02:34 -08:00
Ari Webb	4092820f3a	feat: add project id scoping for tools backend changes (#6529 )	2025-12-15 12:02:34 -08:00
Kian Jones	09c027692f	tests: assistant msg validation error (#6536 ) * add regression test for dict content in AssistantMessage Tests the fix for pydantic validation error when send_message tool returns dict content like {'tofu': 1, 'mofu': 1, 'bofu': 1}. The test verifies that dict content is properly serialized to JSON string before creating AssistantMessage. * improve type annotation for validate_function_response Changed return type from Any to str \| dict[str, Any] to match actual behavior. This enables static type checkers (pyright, mypy) to catch type mismatches like the AssistantMessage bug. With proper type annotations, pyright would have caught: error: Argument of type "str \| dict[str, Any]" cannot be assigned to parameter "content" of type "str" This prevents future bugs where dict is passed to string-only fields. * add regression test for dict content in AssistantMessage Moved test into existing test_message_manager.py suite alongside other message conversion tests. Tests the fix for pydantic validation error when send_message tool returns dict content like {'tofu': 1, 'mofu': 1, 'bofu': 1}. The test verifies that dict content is properly serialized to JSON string before creating AssistantMessage.	2025-12-15 12:02:34 -08:00
Ari Webb	eb547bb96e	fix: clear message history no longer deletes messages (#6515 ) * fix: clear message history no longer deletes messages * toast and make it stay for 8 secs * fix test --------- Co-authored-by: Ari Webb <ari@letta.com>	2025-12-15 12:02:33 -08:00
Sarah Wooders	3569721fd4	fix: avoid infinite summarization loops (#6506 )	2025-12-15 12:02:33 -08:00

1 2 3 4 5 ...

2123 Commits