letta-server

Author	SHA1	Message	Date
Kevin Lin	23c94ec6d3	feat: add log probabilities from OpenAI-compatible servers and SGLang native endpoint (#9240 ) * Add log probabilities support for RL training This enables Letta server to request and return log probabilities from OpenAI-compatible providers (including SGLang) for use in RL training. Changes: - LLMConfig: Add return_logprobs and top_logprobs fields - OpenAIClient: Set logprobs in ChatCompletionRequest when enabled - LettaLLMAdapter: Add logprobs field and extract from response - LettaResponse: Add logprobs field to return log probs to client - LettaRequest: Add return_logprobs/top_logprobs for per-request override - LettaAgentV3: Store and pass logprobs through to response - agents.py: Handle request-level logprobs override Usage: response = client.agents.messages.create( agent_id=agent_id, messages=[...], return_logprobs=True, top_logprobs=5, ) print(response.logprobs) # Per-token log probabilities 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * Add multi-turn token tracking for RL training via SGLang native endpoint - Add TurnTokenData schema to track token IDs and logprobs per turn - Add return_token_ids flag to LettaRequest and LLMConfig - Create SGLangNativeClient for /generate endpoint (returns output_ids) - Create SGLangNativeAdapter that uses native endpoint - Modify LettaAgentV3 to accumulate turns across LLM calls - Include turns in LettaResponse when return_token_ids=True * Fix: Add SGLang native adapter to step() method, not just stream() * Fix: Handle Pydantic Message objects in SGLang native adapter * Fix: Remove api_key reference from LLMConfig (not present) * Fix: Add missing 'created' field to ChatCompletionResponse * Add full tool support to SGLang native adapter - Format tools into prompt in Qwen-style format - Parse tool calls from <tool_call> tags in response - Format tool results as <tool_response> in user messages - Set finish_reason to 'tool_calls' when tools are called * Use 
tokenizer.apply_chat_template for proper tool formatting - Add tokenizer caching in SGLang native adapter - Use apply_chat_template when tokenizer available - Fall back to manual formatting if not - Convert Letta messages to OpenAI format for tokenizer * Fix: Use func_response instead of tool_return for ToolReturn content * Fix: Get output_token_logprobs from meta_info in SGLang response * Fix: Allow None in output_token_logprobs (SGLang format includes null) * chore: remove unrelated files from logprobs branch 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: add missing call_type param to adapter constructors in letta_agent_v3 The SGLang refactor dropped call_type=LLMCallType.agent_step when extracting adapter creation into conditional blocks. Restores it for all 3 spots (SGLang in step, SimpleLLM in step, SGLang in stream). 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * just stage-api && just publish-api * fix: update expected LLMConfig fields in schema test for logprobs support 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: remove rllm provider references 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * just stage-api && just publish-api 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Ubuntu <ubuntu@ip-172-31-65-206.ec2.internal> Co-authored-by: Letta <noreply@letta.com>	2026-02-24 10:52:07 -08:00
Ari Webb	c08b67a26a	feat: add ToolReturnCreate to MessageCreateParams [LET-7366] (#9385 ) * fix: add ToolReturnCreate to sdk types * ci	2026-02-24 10:52:07 -08:00
Sarah Wooders	526da4c49b	Revert "perf: optimize prefix caching by skipping system prompt rebuild on every step" (#9380 ) Revert "perf: optimize prefix caching by skipping system prompt rebuild on ev…" This reverts commit eafa4144c2577a45b7007a177b701863b98d1dfa.	2026-02-24 10:52:07 -08:00
Sarah Wooders	9dbe28e8f1	perf: optimize prefix caching by skipping system prompt rebuild on every step (#9080 )	2026-02-24 10:52:07 -08:00
Sarah Wooders	21e880907f	feat(core): structure memory directory and block labels [LET-7336] (#9309 )	2026-02-24 10:52:06 -08:00
jnjpng	0bdedb3c0f	feat: agent generate endpoint (#9304 ) * base * update * clean up * update	2026-02-24 10:52:06 -08:00
Kevin Lin	34159ffa21	feat: add Anthropic Opus 4.6 model support (#9123 )	2026-02-24 10:52:06 -08:00
jnjpng	ff69c6a32e	feat: add /agents/{agent_id}/generate endpoint for direct LLM requests (#9272 ) * feat: add /agents/{agent_id}/generate endpoint for direct LLM requests Add new endpoint that makes direct LLM provider requests without agent context, memory, tools, or state modification. This enables: - Quick LLM queries without agent overhead - Testing model configurations - Simple chat completions using agent's credentials - Comparing responses across different models Features: - Uses agent's LLM config by default - Supports model override with full provider config resolution - Non-streaming, stateless operation - Proper error handling and validation - Request/response schemas with Pydantic validation Implementation: - Add GenerateRequest and GenerateResponse schemas - Implement generate_completion endpoint handler - Add necessary imports (LLMError, LLMClient, HandleNotFoundError) - Include logging and comprehensive error handling * fix: improve error handling and fix Message construction - Fix critical bug: use content=[TextContent(text=...)] instead of text=... - Add explicit error handling for NoResultFound and HandleNotFoundError - Add error handling for convert_response_to_chat_completion - Add structured logging for debugging - Remove unnecessary .get() calls since Pydantic validates messages * refactor: extract generate logic to AgentCompletionService Move the generate endpoint business logic out of the endpoint handler into a dedicated AgentCompletionService class for better code organization and separation of concerns. 
Changes: - Create new AgentCompletionService in services/agent_completion_service.py - Service handles all business logic: agent validation, LLM config resolution, message conversion, LLM client creation, and request/response processing - Integrate service with SyncServer initialization - Refactor generate_completion endpoint to use the service - Endpoint now only handles HTTP concerns (auth, error mapping) Benefits: - Cleaner endpoint code (reduced from ~140 lines to ~25 lines) - Better separation of concerns (HTTP vs business logic) - Service logic can be reused or tested independently - Follows established patterns in the codebase (AgentManager, etc.) * feat: simplify generate API to accept just prompt text Simplify the client interface by accepting a simple prompt string instead of requiring clients to format messages. Changes: - Update GenerateRequest schema: - Replace 'messages' array with simple 'prompt' string - Add optional 'system_prompt' for context/instructions - Keep 'override_model' for model selection - Update AgentCompletionService to format messages automatically: - Accepts prompt and optional system_prompt - Constructs message array internally (system + user messages) - Simpler API surface for clients - Update endpoint documentation with new simplified examples - Regenerate OpenAPI spec and TypeScript SDK Benefits: - Much simpler client experience - just send text - No need to understand message formatting - Still supports system prompts for context - Cleaner API that matches common use cases Example (before): { "messages": [{"role": "user", "content": "What is 2+2?"}] } Example (after): { "prompt": "What is 2+2?" 
} * test: add comprehensive integration tests for generate endpoint Add 9 integration tests covering various scenarios: Happy path tests: - test_agent_generate_basic: Basic prompt -> response flow - test_agent_generate_with_system_prompt: System prompt + user prompt - test_agent_generate_with_model_override: Override model selection - test_agent_generate_long_prompt: Handle longer prompts - test_agent_generate_no_persistence: Verify no messages saved to agent Error handling tests: - test_agent_generate_empty_prompt_error: Empty prompt validation (422) - test_agent_generate_invalid_agent_id: Invalid agent ID (404) - test_agent_generate_invalid_model_override: Invalid model handle (404) All tests verify: - Response structure (content, model, usage) - Proper status codes for errors - Usage statistics (tokens, counts) - No side effects on agent state Tests follow existing test patterns in test_client.py and use the letta_client SDK (assuming generate_completion method is auto-generated from the OpenAPI spec). * openapi * refactor: rename AgentCompletionService to AgentGenerateCompletionManager Rename for better clarity and consistency with codebase naming conventions: - Rename file: agent_completion_service.py → agent_generate_completion_manager.py - Rename class: AgentCompletionService → AgentGenerateCompletionManager - Rename attribute: server.agent_completion_service → server.agent_generate_completion_manager - Update docstrings: 'Service' → 'Manager' Changes: - apps/core/letta/services/agent_generate_completion_manager.py (renamed + updated class) - apps/core/letta/server/server.py (import + initialization) - apps/core/letta/server/rest_api/routers/v1/agents.py (usage in endpoint) No functional changes, purely a naming refactor. * fix: remove invalid Message parameters in generate manager Remove agent_id=None and user_id=None from Message construction. The Message model doesn't accept these as None values - only pass required parameters (role, content). 
Fixes validation error: 'Extra inputs are not permitted [type=extra_forbidden, input_value=None]' This aligns with other Message construction patterns in the codebase (see tools.py, memory.py examples). * feat: improve generate endpoint validation and tests - Add field validator for whitespace-only prompts - Always include system message (required by Anthropic) - Use default "You are a helpful assistant." when no system_prompt provided - Update tests to use direct HTTP calls via httpx - Fix test issues: - Use valid agent ID format (agent-{uuid}) - Use available model (openai/gpt-4o-mini) - Add whitespace validation test - All 9 integration tests passing	2026-02-24 10:52:06 -08:00
cthomas	530d33c254	feat: add skills support to agentfile (#9287 )	2026-02-24 10:52:06 -08:00
jnjpng	c801866d89	feat: add context token estimates to llm usage (#9295 ) * base * generate * update	2026-02-24 10:52:06 -08:00
Sarah Wooders	e0a23f7039	feat: add usage columns to steps table (#9270 ) * feat: add usage columns to steps table Adds denormalized usage fields to the steps table for easier querying: - model_handle: The model handle (e.g., "openai/gpt-4o-mini") - cached_input_tokens: Tokens served from cache - cache_write_tokens: Tokens written to cache (Anthropic) - reasoning_tokens: Reasoning/thinking tokens These fields mirror LettaUsageStatistics and are extracted from the existing prompt_tokens_details and completion_tokens_details JSON columns. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate OpenAPI specs and SDK for usage columns 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com> --------- Co-authored-by: Letta <noreply@letta.com> Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com>	2026-02-24 10:52:06 -08:00
Kian Jones	a206f7f345	feat: add ID format validation to agent and user schemas (#9151 ) * feat: add ID format validation to agent and user schemas Reuse existing validator types (ToolId, SourceId, BlockId, MessageId, IdentityId, UserId) from letta.validators to enforce ID format validation at the schema level. This ensures malformed IDs are rejected with a 422 validation error instead of causing 500 database errors. Changes: - CreateAgent: validate tool_ids, source_ids, folder_ids, block_ids, identity_ids - UpdateAgent: validate tool_ids, source_ids, folder_ids, block_ids, message_ids, identity_ids - UserUpdate: validate id 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate API spec and SDK * fix: override ID validation in AgentSchema for agent file portability AgentSchema extends CreateAgent but needs to allow arbitrary short IDs (e.g., tool-0, block-0) for portable agent files. Override the validated ID fields to use plain List[str] instead of the validated types. Also fix test_agent.af to use proper UUID-format IDs. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate API spec and SDK 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: revert test_agent.af - short IDs are valid for agent files 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix openapi schema --------- Co-authored-by: Letta <noreply@letta.com>	2026-02-24 10:52:06 -08:00
Kian Jones	025eeaa363	fix: override validation for group for agentfile import (#9248 ) * override validation for group for agentfile import * fix the rest of groupcreate * add api changes	2026-02-24 10:52:06 -08:00
Sarah Wooders	3fdf2b6c79	chore: deprecate old agent messaging (#9120 )	2026-02-24 10:52:06 -08:00
Ari Webb	0bbb9c9bc0	feat: add reasoning zai openrouter (#9189 ) * feat: add reasoning zai openrouter * add openrouter reasoning * stage + publish api * openrouter reasoning always on * revert * fix * remove reference * do	2026-02-24 10:52:06 -08:00
jnjpng	3f23a23227	feat: add compaction stats (#9219 ) * base * update * last * generate * fix test	2026-02-24 10:52:06 -08:00
jnjpng	d28ccc0be6	feat: add summary message and event on compaction (#9144 ) * base * update * update * revert formatting * routes * legacy * fix * review * update	2026-02-24 10:52:05 -08:00
Ari Webb	7b0b1f2531	fix: warning (#9179 ) * fix: warning * just stage publish api * note * api	2026-02-24 10:52:05 -08:00
Caren Thomas	72871ff923	bump version	2026-01-29 12:45:45 -08:00
Ari Webb	a798cc90c4	fix: openrouter provider (#9166 ) * fix: openrouter provider * just stage publish api * web openapi	2026-01-29 12:44:04 -08:00
Ari Webb	9ce1249738	feat: openrouter byok (#9148 ) * feat: openrouter byok * new client is unnecessary * revert json diffs	2026-01-29 12:44:04 -08:00
cthomas	d992aa0df4	fix: non-streaming conversation messages endpoint (#9159 ) * fix: non-streaming conversation messages endpoint Problems: 1. `AssertionError: run_id is required when enforce_run_id_set is True` - Non-streaming path didn't create a run before calling `step()` 2. `ResponseValidationError: Unable to extract tag using discriminator 'message_type'` - `response_model=LettaStreamingResponse` but non-streaming returns `LettaResponse` Fixes: 1. Add run creation before calling `step()` (mirrors agents endpoint) 2. Set run_id in Redis for cancellation support 3. Pass `run_id` to `step()` 4. Change `response_model` from `LettaStreamingResponse` to `LettaResponse` (streaming returns `StreamingResponse` which bypasses response_model validation) Test: Added `test_conversation_non_streaming_raw_http` to verify the fix. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * api sync --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:44:04 -08:00
Shubham Naik	bb2145c24c	connections (#9113 ) * chore: release code * chore: release code * chore: release code * chore: release code * chore: release code * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: change paths * chore: remote * chore: support multi project chat	2026-01-29 12:44:04 -08:00
Kian Jones	45c0a4cd0d	feat: add ID format validation to batch request schema (#9154 ) * feat: add ID format validation to batch request schema Add ID format validation to LettaBatchRequest using existing validator types from letta.validators. Changes: - LettaBatchRequest.agent_id: str → AgentId This ensures malformed agent IDs in batch requests are rejected with 422 validation errors instead of causing 500 database errors. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate API spec and SDK --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:44:04 -08:00
Kian Jones	eaaca141f7	feat: add ID format validation to identity schemas (#9153 ) * feat: add ID format validation to identity schemas Add ID format validation to IdentityCreate, IdentityUpsert, and IdentityUpdate schemas using existing validator types from letta.validators. Changes: - agent_ids: Optional[List[str]] → Optional[List[AgentId]] - block_ids: Optional[List[str]] → Optional[List[BlockId]] This ensures malformed IDs are rejected with 422 validation errors instead of causing 500 database errors. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate API spec and SDK --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:44:04 -08:00
Kian Jones	dd8be95142	feat: add ID format validation to group schemas (#9152 ) * feat: add ID format validation to group schemas Add ID format validation to GroupCreate, GroupUpdate, and manager config schemas using existing validator types from letta.validators. Changes: - GroupCreate/GroupUpdate: agent_ids → List[AgentId], shared_block_ids → List[BlockId] - SupervisorManager, DynamicManager, SleeptimeManager, VoiceSleeptimeManager: manager_agent_id → AgentId - Update variants: manager_agent_id → Optional[AgentId] This ensures malformed IDs are rejected with 422 validation errors instead of causing 500 database errors. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate API spec and SDK --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:44:04 -08:00
Kian Jones	34eed72150	feat: add user id validation (#9128 ) * add user id validation * relax conversation id check to allow default while I'm here * fix annotation validation * -api changes	2026-01-29 12:44:04 -08:00
github-actions[bot]	62a00cc672	fix: remove deprecation from agent passages endpoints (#9117 ) * fix: remove deprecation from agent passages endpoints The client.agent.passages endpoints (list, create, search, delete) were incorrectly marked as deprecated. This would break significant amounts of user code and negatively impact developer experience. Fixes #9116 Co-authored-by: Ari Webb <AriWebb@users.noreply.github.com> * stage publish api --------- Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Ari Webb <AriWebb@users.noreply.github.com> Co-authored-by: Ari Webb <ari@letta.com>	2026-01-29 12:44:04 -08:00
Shubham Naik	8f0ac630ab	chore: nw [LET-6982] (#9081 ) * chore: nw * chore: more * feat: redesign details view * feat: redesign details view * chore: poll every hour	2026-01-29 12:44:04 -08:00
Sarah Wooders	adab8cd9b5	feat: add MiniMax provider support (#9095 ) * feat: add MiniMax provider support Add MiniMax as a new LLM provider using their Anthropic-compatible API. Key implementation details: - Uses standard messages API (not beta) - MiniMax supports thinking blocks natively - Base URL: https://api.minimax.io/anthropic - Models: MiniMax-M2.1, MiniMax-M2.1-lightning, MiniMax-M2 (all 200K context, 128K output) - Temperature clamped to valid range (0.0, 1.0] - All M2.x models treated as reasoning models (support interleaved thinking) Files added: - letta/schemas/providers/minimax.py - MiniMax provider schema - letta/llm_api/minimax_client.py - Client extending AnthropicClient - tests/test_minimax_client.py - Unit tests (13 tests) - tests/model_settings/minimax-m2.1.json - Integration test config 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate API spec with MiniMax provider 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: use MiniMax-M2.1-lightning for CI tests Switch to the faster/cheaper lightning model variant for integration tests. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: add MINIMAX_API_KEY to deploy-core command Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com> * chore: regenerate web openapi spec with MiniMax provider Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com> 🐾 Generated with [Letta Code](https://letta.com) --------- Co-authored-by: Letta <noreply@letta.com> Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com>	2026-01-29 12:44:04 -08:00
Shubham Naik	55a89398e1	chore: rebuild api requests (#9069 )	2026-01-29 12:44:04 -08:00
github-actions[bot]	194c743223	refactor: rename `stream` to `streaming` in ConversationMessageRequest (#9063 )	2026-01-29 12:44:04 -08:00
github-actions[bot]	1d1bb29a43	feat: add override_model support for agent file import (#9058 )	2026-01-29 12:44:04 -08:00
Charles Packer	82c01368fc	feat: add conversation_id to message search results (#9056 ) * feat: add conversation_id to message search results Add conversation_id field to all MessageListResult classes (SystemMessageListResult, UserMessageListResult, ReasoningMessageListResult, AssistantMessageListResult) so that conversation IDs are returned from the /messages/search endpoint alongside agent IDs. Fixes #9055 Co-authored-by: Charles Packer <cpacker@users.noreply.github.com> chore: regenerate SDK and OpenAPI spec Regenerate autogenerated files after adding conversation_id to message search result schemas. Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com> --------- Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Charles Packer <cpacker@users.noreply.github.com> Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com>	2026-01-29 12:44:04 -08:00
Sarah Wooders	6c415b27f8	feat: add non-streaming option for conversation messages (#9044 ) * feat: add non-streaming option for conversation messages - Add ConversationMessageRequest with stream=True default (backwards compatible) - stream=true (default): SSE streaming via StreamingService - stream=false: JSON response via AgentLoop.load().step() 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate API schema for ConversationMessageRequest --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:44:04 -08:00
Sarah Wooders	25e9539a6e	feat: add batch passage create and optional search `query` (#8866 )	2026-01-29 12:44:04 -08:00
Shubham Naik	8ced2e0c82	Shub/let 7138 support custom feeds that recieve data via an endpoint [LET-7138] (#9027 ) * feat: support custom endpoint * feat: support custom endpoint * chore: add webhook * chore: add webhook * chore: fix types * chore: fix types * chore: docs	2026-01-29 12:44:04 -08:00
Kian Jones	e3fb00f970	feat(crouton): add orgId, userId, Compaction_Settings and LLM_Config (#9022 ) * LC one shot? * api changes * fix summarizer nameerror	2026-01-29 12:44:04 -08:00
Ari Webb	5c06918042	fix: don't need embedding model for self hosted [LET-7009] (#8935 ) * fix: don't need embedding model for self hosted * stage publish api * passes tests * add test * remove unnecessary upgrades * update revision order db migrations * add timeout for ci	2026-01-29 12:44:04 -08:00
Shubham Naik	16e3f10a56	Shub/let 7147 improved channel selector [LET-7147] (#9002 ) * feat; improve selector * chore: next * chore: next * wah * wah * wah * chore: next * chore: fix * chore: noverify * chroe; imporve selctor * chore: update api	2026-01-29 12:44:04 -08:00
Shubham Naik	6d453ea586	feat: fix template creation bogs [LET-7165] (#9015 ) feat: fix template creation bogs	2026-01-29 12:44:02 -08:00
Charles Packer	2fc592e0b6	feat(core): add image support in tool returns [LET-7140] (#8985 ) * feat(core): add image support in tool returns [LET-7140] Enable tool_return to support both string and ImageContent content parts, matching the pattern used for user message inputs. This allows tools executed client-side to return images back to the agent. Changes: - Add LettaToolReturnContentUnion type for text/image content parts - Update ToolReturn schema to accept Union[str, List[content parts]] - Update converters for each provider: - OpenAI Chat Completions: placeholder text for images - OpenAI Responses API: full image support - Anthropic: full image support with base64 - Google: placeholder text for images - Add resolve_tool_return_images() for URL-to-base64 conversion - Make create_approval_response_message_from_input() async 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix(core): support images in Google tool returns as sibling parts Following the gemini-cli pattern: images in tool returns are sent as sibling inlineData parts alongside the functionResponse, rather than inside it. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * test(core): add integration tests for multi-modal tool returns [LET-7140] Tests verify that: - Models with image support (Anthropic, OpenAI Responses API) can see images in tool returns and identify the secret text - Models without image support (Chat Completions) get placeholder text and cannot see the actual image content - Tool returns with images persist correctly in the database Uses secret.png test image containing hidden text "FIREBRAWL" that models must identify to pass the test. Also fixes misleading comment about Anthropic only supporting base64 images - they support URLs too, we just pre-resolve for consistency. 
🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: simplify tool return image support implementation Reduce code verbosity while maintaining all functionality: - Extract _resolve_url_to_base64() helper in message_helper.py (eliminates duplication) - Add _get_text_from_part() helper for text extraction - Add _get_base64_image_data() helper for image data extraction - Add _tool_return_to_google_parts() to simplify Google implementation - Add _image_dict_to_data_url() for OpenAI Responses format - Use walrus operator and list comprehensions where appropriate - Add integration_test_multi_modal_tool_returns.py to CI workflow Net change: -120 lines while preserving all features and test coverage. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix(tests): improve prompt for multi-modal tool return tests Make prompts more direct to reduce LLM flakiness: - Simplify tool description: "Retrieves a secret image with hidden text. Call this function to get the image." - Change user prompt from verbose request to direct command: "Call the get_secret_image function now." - Apply to both test methods This reduces ambiguity and makes tool calling more reliable across different LLM models. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix bugs * test(core): add google_ai/gemini-2.0-flash-exp to multi-modal tests Add Gemini model to test coverage for multi-modal tool returns. Google AI already supports images in tool returns via sibling inlineData parts. 
👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix(ui): handle multi-modal tool_return type in frontend components Convert Union<string, LettaToolReturnContentUnion[]> to string for display: - ViewRunDetails: Convert array to '[Image here]' placeholder - ToolCallMessageComponent: Convert array to '[Image here]' placeholder Fixes TypeScript errors in web, desktop-ui, and docker-ui type-checks. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Caren Thomas <carenthomas@gmail.com>	2026-01-29 12:43:53 -08:00
Ari Webb	4ec6649caf	feat: byok provider models in db also (#8317 ) * feat: byok provider models in db also * make tests and sync api * fix inconsistent state with recreating provider of same name * fix sync on byok creation * update revision * move stripe code for testing purposes * revert * add refresh byok models endpoint * just stage publish api * add tests * reorder revision * add test for name clashes	2026-01-29 12:43:53 -08:00
Devansh Jain	dfa6ee0c23	feat: add SGLang support (#8838 ) * add sglang support * add tests * normalize base url * cleanup * chore: regenerate autogenerated API files for sglang support	2026-01-29 12:43:51 -08:00
Cameron Pfiffer	399d04a3e1	feat: add shared memory block tutorial, update memory block guide (#5503 ) feat: update documentation and add new tutorials for memory blocks and agent collaboration - Updated navigation paths in docs.yml to reflect new tutorial locations. - Added comprehensive guides on shared memory blocks and attaching/detaching memory blocks. - Enhanced existing documentation for memory blocks with examples and best practices. - Corrected API key references in prebuilt tools documentation. These changes aim to improve user understanding and facilitate multi-agent collaboration through shared memory systems.	2026-01-19 15:52:23 -08:00
Sarah Wooders	b8a6496acb	feat: add `runs_metrics` table (#5169 )	2026-01-19 15:51:30 -08:00
Christina Tong	318498bde3	feat: filter internal runs endpoint by conversation id [LET-6886] (#8437 )	2026-01-12 10:57:49 -08:00
Shubham Naik	f3799fe4ee	Shub/let 6883 users can create a feed [LET-6883] (#8432 ) * chore: pdu * chore: pdu * chore: delete/disable * chore: delete/disable * chore: pdu * chore: pdu * chore: pdu * chore: pdu * chore: pdu * chore: pdu * chore: merge * chore: merge * cha * chore: hotfix for convo id	2026-01-12 10:57:49 -08:00
Ari Webb	754e750cc5	feat: add conversation_id filter to list runs [LET-6865] (#8404 ) feat: add conversation_id filter to list runs	2026-01-12 10:57:48 -08:00
Charles Packer	ed6284cedb	feat: Add conversation_id filtering to message endpoints (#8324 ) * feat: Add conversation_id filtering to message list and search endpoints Add optional conversation_id parameter to filter messages by conversation: - client.agents.messages.list - client.messages.list - client.messages.search Changes: - Added conversation_id field to MessageSearchRequest and SearchAllMessagesRequest schemas - Added conversation_id filtering to list_messages in message_manager.py - Updated get_agent_recall_async and get_all_messages_recall_async in server.py - Added conversation_id query parameter to router endpoints - Updated Turbopuffer client to support conversation_id filtering in searches Fixes #8320 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Charles Packer <cpacker@users.noreply.github.com> * add conversation_id to message and tpuf * default messages filter for backward compatibility * add test and auto gen * fix integration test * fix test * update test --------- Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Charles Packer <cpacker@users.noreply.github.com> Co-authored-by: christinatong01 <christina@letta.com>	2026-01-12 10:57:48 -08:00
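
The generate endpoint commits in the log (#9272) describe building the message array internally from a plain prompt, rejecting whitespace-only prompts, and always including a system message that defaults to "You are a helpful assistant." A minimal sketch of that behaviour follows. The function name `build_generate_messages`, the constant name, and the dict-based message shape are assumptions for illustration, not the actual AgentGenerateCompletionManager code.

```python
from typing import Dict, List, Optional

# Default taken from the commit message; the constant name is hypothetical.
DEFAULT_SYSTEM_PROMPT = "You are a helpful assistant."


def build_generate_messages(
    prompt: str, system_prompt: Optional[str] = None
) -> List[Dict[str, str]]:
    """Build a system + user message pair from plain prompt text.

    Hypothetical helper: it mirrors the behaviour described in the log,
    not the real Letta implementation.
    """
    # Reject empty and whitespace-only prompts (the API is described as
    # answering these with a 422 validation error).
    if not prompt.strip():
        raise ValueError("prompt must not be empty or whitespace-only")

    # Always include a system message, falling back to the default.
    return [
        {"role": "system", "content": system_prompt or DEFAULT_SYSTEM_PROMPT},
        {"role": "user", "content": prompt},
    ]
```

Calling `build_generate_messages("What is 2+2?")` gives a two-message list with the default system message first. Passing `system_prompt` replaces only that first entry.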

358 Commits
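
Several commits in the log (#9151, #9152, #9153, #9154) add ID format validation so that malformed IDs are rejected up front instead of causing database errors, while agent files keep short IDs such as tool-0. The sketch below shows one way such a check could look, assuming IDs have the form `<prefix>-<lowercase hyphenated UUID>` as suggested by the "agent-{uuid}" note in #9272. The helper `is_valid_id` is hypothetical and is not the code in letta.validators.

```python
import re

# Assumed UUID shape: lowercase hex in the usual 8-4-4-4-12 grouping.
_UUID_PATTERN = r"[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}"


def is_valid_id(value: str, prefix: str) -> bool:
    """Return True if value is exactly '<prefix>-<uuid>'.

    Hypothetical helper for illustration only.
    """
    pattern = rf"{re.escape(prefix)}-{_UUID_PATTERN}"
    return re.fullmatch(pattern, value) is not None


# A well-formed agent ID passes. A short agent-file style ID such as
# "tool-0" does not, which is why the log mentions overriding the
# validated fields for portable agent files.
print(is_valid_id("agent-123e4567-e89b-12d3-a456-426614174000", "agent"))
print(is_valid_id("tool-0", "tool"))
```

A strict check of this kind fits API request schemas. The log notes that AgentSchema falls back to plain string lists so that agent files stay portable.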