OpenAI Chat Completions requires tool message content to be a string,
so images in tool returns were silently replaced with [Image omitted].
Now: text stays in the tool return, images get injected as a user
message right after. The model actually sees what the tool saw.
to_openai_dict also cleaned up — image handling lives in
to_openai_dicts_from_list where it can inject the extra message.
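A minimal sketch of the injection pattern, assuming the image parts have already been resolved to data URLs (names here are illustrative, not the actual converter internals):
```
def tool_return_to_openai_messages(tool_call_id, text, image_data_urls):
    # Chat Completions tool messages must carry string content, so the text
    # goes in the tool message and any images are re-sent as a user message
    # immediately after it.
    messages = [{"role": "tool", "tool_call_id": tool_call_id, "content": text}]
    if image_data_urls:
        messages.append({
            "role": "user",
            "content": [
                {"type": "image_url", "image_url": {"url": url}}
                for url in image_data_urls
            ],
        })
    return messages
```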
* auto fixes
* auto fix pt2 and transitive deps and undefined var checking locals()
* manual fixes (ignored or letta-code fixed)
* fix circular import
* remove all ignores, add FastAPI rules and Ruff rules
* add ty and precommit
* ruff stuff
* ty check fixes
* ty check fixes pt 2
* error on invalid
Avoid failing message-list endpoints when historical send_message tool calls are missing the expected message argument: log and skip malformed entries during conversion.
👾 Generated with [Letta Code](https://letta.com)
Co-authored-by: Letta <noreply@letta.com>
* fix: handle system messages with mixed TextContent + ImageContent
System messages injected by external tools (e.g. packify.ai MCP) can
contain both TextContent and ImageContent. The assertions in
to_openai_responses_dicts and to_anthropic_dict expected exactly one
TextContent, causing AssertionError in production.
Extract all text parts and join them, matching how to_openai_dict
already handles this case.
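Roughly, the extraction looks like this (a sketch using a stand-in TextContent type, not the real schema import):
```
from dataclasses import dataclass

@dataclass
class TextContent:
    text: str

def join_text_parts(content_parts) -> str:
    # Collect every text part and join them; image and other non-text parts
    # are simply skipped for the system prompt string.
    return "\n".join(p.text for p in content_parts if isinstance(p, TextContent))
```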
🤖 Generated with [Letta Code](https://letta.com)
Co-Authored-By: Letta <noreply@letta.com>
* fix: replace asserts with logger.warning + graceful skip
Asserts are the wrong tool for production input validation — if a
system message has only non-text content, we should warn and skip
rather than crash the request.
🤖 Generated with [Letta Code](https://letta.com)
Co-Authored-By: Letta <noreply@letta.com>
---------
Co-authored-by: Letta <noreply@letta.com>
* feat: add conversation_id to message search results
Add conversation_id field to all *MessageListResult classes
(SystemMessageListResult, UserMessageListResult, ReasoningMessageListResult,
AssistantMessageListResult) so that conversation IDs are returned from
the /messages/search endpoint alongside agent IDs.
Fixes #9055
Co-authored-by: Charles Packer <cpacker@users.noreply.github.com>
* chore: regenerate SDK and OpenAPI spec
Regenerate autogenerated files after adding conversation_id to
message search result schemas.
Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com>
---------
Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com>
Co-authored-by: Charles Packer <cpacker@users.noreply.github.com>
Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com>
fix: gracefully skip assistant messages with empty content in LLM format conversion
**Problem:**
Context window calculation crashed with AssertionError when converting messages
to Google/Anthropic/OpenAI format:
```
AssertionError at line 2047: assert self.tool_calls is not None or
text_content is not None or len(self.content) > 1
```
This happened when loading agents with old/malformed messages that had
`content=None` or `content=[]` in the database.
**Root Cause:**
The Message ORM model allows `content: Optional[List[...]] = None` (line 252),
but format conversion methods assumed content would always have extractable text
or tool calls.
Scenarios that triggered crashes:
1. Assistant message with `content=None` (old migrations/edge cases)
2. Assistant message with `content=[]` (message creation bugs)
3. Assistant message with single non-text content that doesn't match extraction logic
**Fix:**
Replaced assertions with defensive checks in 3 conversion methods:
1. `to_google_dict()` (line 2054) - Return None to skip unconvertible messages
2. `to_openai_responses_api_dicts()` (line 1476) - Return early to skip
3. `to_anthropic_dict()` (line 1794) - Return None to skip
Pattern: Check for empty content, return None/early to skip gracefully.
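In outline, each converter now follows roughly this shape (a sketch, not the actual method bodies):
```
import logging

logger = logging.getLogger(__name__)

def convert_or_skip(message):
    # Defensive check: if the message has neither tool calls nor any content
    # parts, warn and return None so the caller can skip it instead of
    # hitting an AssertionError.
    has_tool_calls = bool(getattr(message, "tool_calls", None))
    has_content = bool(getattr(message, "content", None))
    if not has_tool_calls and not has_content:
        logger.warning("Skipping message with no convertible content: %s",
                       getattr(message, "id", None))
        return None
    # ... provider-specific conversion continues here ...
    return {"role": message.role, "content": message.content}
```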
**Result:**
- Context window calculation no longer crashes on malformed/old messages
- Messages with no convertible content are silently skipped
- Consistent with existing Anthropic reasoning-only message handling (line 1308)
👾 Generated with [Letta Code](https://letta.com)
Co-authored-by: Letta <noreply@letta.com>
* feat(core): add image support in tool returns [LET-7140]
Enable tool_return to support both string and ImageContent content parts,
matching the pattern used for user message inputs. This allows tools
executed client-side to return images back to the agent.
Changes:
- Add LettaToolReturnContentUnion type for text/image content parts
- Update ToolReturn schema to accept Union[str, List[content parts]] (rough shape sketched after this list)
- Update converters for each provider:
- OpenAI Chat Completions: placeholder text for images
- OpenAI Responses API: full image support
- Anthropic: full image support with base64
- Google: placeholder text for images
- Add resolve_tool_return_images() for URL-to-base64 conversion
- Make create_approval_response_message_from_input() async
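The new content shape is roughly the following (a simplified sketch; field names approximate the pydantic schemas and other ToolReturn fields are omitted):
```
from typing import List, Union
from pydantic import BaseModel

class TextContent(BaseModel):
    type: str = "text"
    text: str

class ImageContent(BaseModel):
    type: str = "image"
    data: str        # base64-encoded image bytes
    media_type: str  # e.g. "image/png"

LettaToolReturnContentUnion = Union[TextContent, ImageContent]

class ToolReturn(BaseModel):
    tool_call_id: str
    content: Union[str, List[LettaToolReturnContentUnion]]
```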
🐾 Generated with [Letta Code](https://letta.com)
Co-Authored-By: Letta <noreply@letta.com>
* fix(core): support images in Google tool returns as sibling parts
Following the gemini-cli pattern: images in tool returns are sent as
sibling inlineData parts alongside the functionResponse, rather than
inside it.
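A sketch of the resulting part layout (dict shapes follow the public Gemini REST schema, not Letta's internal helper):
```
def tool_return_to_google_parts(function_name, text, image_blobs):
    # The text result lives inside the functionResponse part; each image rides
    # alongside as a sibling inlineData part in the same content turn.
    parts = [{"functionResponse": {"name": function_name,
                                   "response": {"result": text}}}]
    for blob in image_blobs:  # each blob: {"mime_type": "image/png", "data": "<base64>"}
        parts.append({"inlineData": {"mimeType": blob["mime_type"],
                                     "data": blob["data"]}})
    return parts
```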
🐾 Generated with [Letta Code](https://letta.com)
Co-Authored-By: Letta <noreply@letta.com>
* test(core): add integration tests for multi-modal tool returns [LET-7140]
Tests verify that:
- Models with image support (Anthropic, OpenAI Responses API) can see
images in tool returns and identify the secret text
- Models without image support (Chat Completions) get placeholder text
and cannot see the actual image content
- Tool returns with images persist correctly in the database
Uses secret.png test image containing hidden text "FIREBRAWL" that
models must identify to pass the test.
Also fixes misleading comment about Anthropic only supporting base64
images - they support URLs too, we just pre-resolve for consistency.
🐾 Generated with [Letta Code](https://letta.com)
Co-Authored-By: Letta <noreply@letta.com>
* refactor: simplify tool return image support implementation
Reduce code verbosity while maintaining all functionality:
- Extract _resolve_url_to_base64() helper in message_helper.py (eliminates duplication; sketched after this list)
- Add _get_text_from_part() helper for text extraction
- Add _get_base64_image_data() helper for image data extraction
- Add _tool_return_to_google_parts() to simplify Google implementation
- Add _image_dict_to_data_url() for OpenAI Responses format
- Use walrus operator and list comprehensions where appropriate
- Add integration_test_multi_modal_tool_returns.py to CI workflow
Net change: -120 lines while preserving all features and test coverage.
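For reference, a synchronous stand-in for the URL-to-base64 helper named above (the real helper likely uses the project's async HTTP client):
```
import base64
import urllib.request

def resolve_url_to_base64(url: str) -> str:
    # Fetch the image bytes and return them base64-encoded, so providers that
    # only accept inline image data can still consume URL-based tool returns.
    with urllib.request.urlopen(url) as resp:
        return base64.b64encode(resp.read()).decode("ascii")
```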
👾 Generated with [Letta Code](https://letta.com)
Co-Authored-By: Letta <noreply@letta.com>
* fix(tests): improve prompt for multi-modal tool return tests
Make prompts more direct to reduce LLM flakiness:
- Simplify tool description: "Retrieves a secret image with hidden text. Call this function to get the image."
- Change user prompt from verbose request to direct command: "Call the get_secret_image function now."
- Apply to both test methods
This reduces ambiguity and makes tool calling more reliable across different LLM models.
👾 Generated with [Letta Code](https://letta.com)
Co-Authored-By: Letta <noreply@letta.com>
* fix bugs
* test(core): add google_ai/gemini-2.0-flash-exp to multi-modal tests
Add Gemini model to test coverage for multi-modal tool returns. Google AI already supports images in tool returns via sibling inlineData parts.
👾 Generated with [Letta Code](https://letta.com)
Co-Authored-By: Letta <noreply@letta.com>
* fix(ui): handle multi-modal tool_return type in frontend components
Convert Union<string, LettaToolReturnContentUnion[]> to string for display:
- ViewRunDetails: Convert array to '[Image here]' placeholder
- ToolCallMessageComponent: Convert array to '[Image here]' placeholder
Fixes TypeScript errors in web, desktop-ui, and docker-ui type-checks.
👾 Generated with [Letta Code](https://letta.com)
Co-Authored-By: Letta <noreply@letta.com>
---------
Co-authored-by: Letta <noreply@letta.com>
Co-authored-by: Caren Thomas <carenthomas@gmail.com>
This fixes a 400 INVALID_ARGUMENT error from Google's Gemini API where
function calls were missing required thought_signature in functionCall parts.
Changes:
- Allow signatures when self.model is None (backwards compatibility for
older messages that may not have had their model field set)
- Only add thought_signature to the FIRST function call for parallel
tool calls, per Google's docs (sketched below)
- Take the first non-None signature found (don't keep overwriting)
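Sketch of the rule (the exact field name on the part object may differ in the real converter):
```
def attach_thought_signature(function_call_parts, signatures):
    # Take the first non-None signature and attach it only to the FIRST
    # functionCall part, so parallel tool calls don't each carry (or keep
    # overwriting) a thought_signature.
    signature = next((s for s in signatures if s is not None), None)
    if signature is not None and function_call_parts:
        function_call_parts[0]["thought_signature"] = signature
    return function_call_parts
```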
Reference: https://ai.google.dev/gemini-api/docs/thought-signatures
Closes #8589
🤖 Generated with [Letta Code](https://letta.com)
Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com>
Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com>
Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>
* fix: handle missing tool_call_id in Anthropic message conversion
- Add null check for self.tool_returns before iterating
- Fall back to message's tool_call_id when tool_return.tool_call_id is None
- Improve error message to show actual tool name from message.name
- Only raise error if no valid tool_call_id is available from either source
This fixes the error "Anthropic API requires tool_use_id to be set" that
occurs when a ToolReturn object in the database doesn't have tool_call_id
set, by using the message-level tool_call_id as a fallback.
Fixes #8379
🤖 Generated with [Letta Code](https://letta.com)
Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com>
Co-Authored-By: Letta <noreply@letta.com>
* fix: restrict tool_call_id fallback to single tool returns
The message-level `self.tool_call_id` is set to the first tool return's ID
for legacy compatibility. For parallel tool calls with multiple tool_returns,
using this as a fallback would incorrectly assign the first tool return's ID
to all subsequent returns missing their own ID.
This change:
- Only allows the fallback when there's exactly one tool return (see the sketch after this list)
- For multiple tool returns, each must have its own ID or raise an error
- Adds tool return index to error messages for better debugging
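Sketch of the restricted fallback (helper name and shapes are illustrative):
```
def resolve_tool_use_id(message_tool_call_id, tool_returns, index):
    # A tool return missing its own tool_call_id may borrow the message-level
    # ID only when it is the sole return; with multiple (parallel) returns, a
    # missing ID is an error rather than a silently wrong mapping.
    ret = tool_returns[index]
    if ret.tool_call_id is not None:
        return ret.tool_call_id
    if len(tool_returns) == 1 and message_tool_call_id is not None:
        return message_tool_call_id
    raise ValueError(
        f"Anthropic API requires tool_use_id, but tool return #{index} has no tool_call_id"
    )
```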
Co-authored-by: Kian Jones <kianjones9@users.noreply.github.com>
🤖 Generated with [Letta Code](https://letta.com)
Co-Authored-By: Letta <noreply@letta.com>
---------
Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com>
Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com>
Co-authored-by: Letta <noreply@letta.com>
Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>
* feat: Add conversation_id filtering to message list and search endpoints
Add optional conversation_id parameter to filter messages by conversation (usage sketched below):
- client.agents.messages.list
- client.messages.list
- client.messages.search
Changes:
- Added conversation_id field to MessageSearchRequest and SearchAllMessagesRequest schemas
- Added conversation_id filtering to list_messages in message_manager.py
- Updated get_agent_recall_async and get_all_messages_recall_async in server.py
- Added conversation_id query parameter to router endpoints
- Updated Turbopuffer client to support conversation_id filtering in searches
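Illustrative usage only (client construction and exact keyword names are assumptions based on the endpoints above, not verified against the generated SDK):
```
from letta_client import Letta  # assumed SDK import path

client = Letta(token="YOUR_API_KEY")

# Only messages belonging to one conversation for a given agent:
messages = client.agents.messages.list(
    agent_id="agent-xxxx",        # placeholder ID
    conversation_id="conv-xxxx",  # the new optional filter
)
```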
Fixes #8320
🤖 Generated with [Letta Code](https://letta.com)
Co-Authored-By: Charles Packer <cpacker@users.noreply.github.com>
* add conversation_id to message and tpuf
* default messages filter for backward compatibility
* add test and auto gen
* fix integration test
* fix test
* update test
---------
Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com>
Co-authored-by: Charles Packer <cpacker@users.noreply.github.com>
Co-authored-by: christinatong01 <christina@letta.com>
* fix: prevent empty reasoning messages in streaming interfaces
Prevents empty "Thinking..." indicators from appearing in clients by
filtering out reasoning messages with no content at the source.
Changes:
- Gemini: Don't emit ReasoningMessage when only thought_signature exists
- Gemini: Only emit reasoning content if text is non-empty
- Anthropic: Don't emit ReasoningMessage for BetaSignatureDelta
- Anthropic: Only emit reasoning content if thinking text is non-empty
This fixes the issue where providers send signature metadata before
actual thinking content, causing empty reasoning blocks to appear
in the UI after responses complete.
Affects: Gemini reasoning, Anthropic extended thinking
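The filtering rule, sketched (names are illustrative, not the stream-adapter internals):
```
def maybe_emit_reasoning(thinking_text, signature_only=False):
    # Only surface a reasoning message when there is actual thinking text;
    # signature-only chunks (Gemini thought_signature, Anthropic
    # BetaSignatureDelta) emit nothing.
    if signature_only or not (thinking_text and thinking_text.strip()):
        return None
    return {"message_type": "reasoning_message", "reasoning": thinking_text}
```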
👾 Generated with [Letta Code](https://letta.com)
Co-Authored-By: Letta <noreply@letta.com>
* fix: handle Anthropic thinking signature correctly
- Only include 'signature' in Anthropic message payload if it is not None (fixes BadRequestError).
- Capture and attach 'signature' to ReasoningMessage in streaming interface.
* fix(anthropic): attach signature to last reasoning message in stream
---------
Co-authored-by: Letta <noreply@letta.com>
fix AssistantMessage validation error when content is dict
validate_function_response can return either a string or dict, but
AssistantMessage.content expects a string. When a tool returns a dict
like {'tofu': 1, 'mofu': 1, 'bofu': 1}, it needs to be JSON-serialized
before passing to AssistantMessage.
Fixes: pydantic_core._pydantic_core.ValidationError: 2 validation errors for AssistantMessage
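A minimal sketch of the normalization (hypothetical helper, not the actual call site):
```
import json
from typing import Any

def normalize_function_response(func_response: Any) -> str:
    # Coerce a tool's return value to the string AssistantMessage.content
    # expects, JSON-serializing dicts and other non-string values.
    if isinstance(func_response, str):
        return func_response
    return json.dumps(func_response)

# e.g. {'tofu': 1, 'mofu': 1, 'bofu': 1} -> '{"tofu": 1, "mofu": 1, "bofu": 1}'
```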
* claude code first pass
* rename routes
* search_messages and list_messages
* revert agents messagesearch
* generate api
* fix backend for list all messages
* request for message search
* return list of letta message
* add tests
* error in archive endpoint
* archive delete return type wrong
* optional params for archive creation
* add passage to tpuf on create
* fix archive manager
* support global passage search
* search by agent
* just do basic org wide search for now
* change message test to be about fresh data, cleanup after
---------
Co-authored-by: Ari Webb <ari@letta.com>
* fix(core): sanitize messages to anthropic in the main path the same way (or similar to how) we do it in the token counter
* fix: also patch poison error in backend by filtering lazily
* fix: remap streaming errors (what the fuck)
* fix: dedupe tool calls
* fix: cleanup, removed try/catch
* fix: patch hole in the fallback summarizer where we weren't actually truncating
* fix: remove no-op
* chore: comment
* fix: simplify the new fallback
* fix: properly handle images in summarizer payload
* first hack
* clean up
* first implementation working
* revert package-lock
* remove openai test
* error throw
* typo
* Update integration_test_send_message_v2.py
* Update integration_test_send_message_v2.py
* refine test
* Only make changes for openai non streaming
* Add tests
---------
Co-authored-by: Ari Webb <ari@letta.com>
Co-authored-by: Matt Zhou <mattzh1314@gmail.com>
* letta coded
* migrate to stainless from fern
* revert core workflows
* fix if statement
* fix typo
* run on self-hosted ci runners
* add empty check
* change file
* fix upstream renaming and special character escaping
* fix letta-code with opus
* remove random client type import
* remove env localhost
* remove failing tests
* ignore ts for now
* fix caching maybe
* tar.gz -> whl
* retain name metadata
* don't build on cache hit
* add sdk_v1 tests
* refactor not to use warnings.warn
* temp circular import fix, maybe unnecessary/bad
* fix Deprecation warning
* fix deprecation warning and mcp thing?
* revert changes to mcp server test
* fix deprecation warning
* claude coded first pass
* fix test cases to expect errors instead
* fix this
* let's see how letta-code did
* claude
* fix tests, remove dangling comments, retrofit all managers functions with decorator
* revert to main for these since we are not erroring on invalid tool and block ids
* reorder decorators
* finish refactoring test cases
* reorder agent_manager decorators and fix test tool manager
* add decorator on missing managers
* fix id sources
* remove redundant check
* uses enum now
* move to enum
* fix: patch the issue with the GET failing (tested)
* fix(web): patch for the FE UI
* parse func_response
---------
Co-authored-by: Caren Thomas <carenthomas@gmail.com>
* wip
* Fix parallel tool calling interface
* wip
* wip adapt using id field
* Integrate new multi tool return schemas into parallel tool calling
* Remove example script
* Reset changes to llm stream adapter since old agent loop should not enable parallel tool calling
* Clean up fallback logic for extracting tool calls
* Remove redundant check
* Simplify logic
* Clean up logic in handle ai response
* Fix tests
* Write anthropic dict conversion to be back compatible
* wip
* Double write tool call id for legacy reasons
* Fix override args failures
* Patch for approvals
* Revert comments
* Remove extraneous prints