letta-server

Author	SHA1	Message	Date
Kian Jones	f5c4ab50f4	chore: add ty + pre-commit hook and repeal even more ruff rules (#9504 ) * auto fixes * auto fix pt2 and transitive deps and undefined var checking locals() * manual fixes (ignored or letta-code fixed) * fix circular import * remove all ignores, add FastAPI rules and Ruff rules * add ty and precommit * ruff stuff * ty check fixes * ty check fixes pt 2 * error on invalid	2026-02-24 10:55:11 -08:00
Kian Jones	25d54dd896	chore: enable F821, F401, W293 (#9503 ) * auto fixes * auto fix pt2 and transitive deps and undefined var checking locals() * manual fixes (ignored or letta-code fixed) * fix circular import	2026-02-24 10:55:08 -08:00
Kevin Lin	23c94ec6d3	feat: add log probabilities from OpenAI-compatible servers and SGLang native endpoint (#9240 ) * Add log probabilities support for RL training This enables Letta server to request and return log probabilities from OpenAI-compatible providers (including SGLang) for use in RL training. Changes: - LLMConfig: Add return_logprobs and top_logprobs fields - OpenAIClient: Set logprobs in ChatCompletionRequest when enabled - LettaLLMAdapter: Add logprobs field and extract from response - LettaResponse: Add logprobs field to return log probs to client - LettaRequest: Add return_logprobs/top_logprobs for per-request override - LettaAgentV3: Store and pass logprobs through to response - agents.py: Handle request-level logprobs override Usage: response = client.agents.messages.create( agent_id=agent_id, messages=[...], return_logprobs=True, top_logprobs=5, ) print(response.logprobs) # Per-token log probabilities 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * Add multi-turn token tracking for RL training via SGLang native endpoint - Add TurnTokenData schema to track token IDs and logprobs per turn - Add return_token_ids flag to LettaRequest and LLMConfig - Create SGLangNativeClient for /generate endpoint (returns output_ids) - Create SGLangNativeAdapter that uses native endpoint - Modify LettaAgentV3 to accumulate turns across LLM calls - Include turns in LettaResponse when return_token_ids=True * Fix: Add SGLang native adapter to step() method, not just stream() * Fix: Handle Pydantic Message objects in SGLang native adapter * Fix: Remove api_key reference from LLMConfig (not present) * Fix: Add missing 'created' field to ChatCompletionResponse * Add full tool support to SGLang native adapter - Format tools into prompt in Qwen-style format - Parse tool calls from <tool_call> tags in response - Format tool results as <tool_response> in user messages - Set finish_reason to 'tool_calls' when tools are called * Use tokenizer.apply_chat_template for proper tool formatting - Add tokenizer caching in SGLang native adapter - Use apply_chat_template when tokenizer available - Fall back to manual formatting if not - Convert Letta messages to OpenAI format for tokenizer * Fix: Use func_response instead of tool_return for ToolReturn content * Fix: Get output_token_logprobs from meta_info in SGLang response * Fix: Allow None in output_token_logprobs (SGLang format includes null) * chore: remove unrelated files from logprobs branch 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: add missing call_type param to adapter constructors in letta_agent_v3 The SGLang refactor dropped call_type=LLMCallType.agent_step when extracting adapter creation into conditional blocks. Restores it for all 3 spots (SGLang in step, SimpleLLM in step, SGLang in stream). 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * just stage-api && just publish-api * fix: update expected LLMConfig fields in schema test for logprobs support 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: remove rllm provider references 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * just stage-api && just publish-api 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Ubuntu <ubuntu@ip-172-31-65-206.ec2.internal> Co-authored-by: Letta <noreply@letta.com>	2026-02-24 10:52:07 -08:00
Sarah Wooders	21e880907f	feat(core): structure memory directory and block labels [LET-7336] (#9309 )	2026-02-24 10:52:06 -08:00
Ari Webb	5c6ca705f1	Revert "feat: bring back use message packing for timezone [LET-6846]" (#9302 ) Revert "feat: bring back use message packing for timezone [LET-6846] (#9256)" This reverts commit c5017cccdef95b84fc585b26a0ddc5b7e44eb7c9.	2026-02-24 10:52:06 -08:00
Ari Webb	426f6a8ca4	feat: bring back use message packing for timezone [LET-6846] (#9256 ) * feat: bring back use message packing for timezone * add tests	2026-02-24 10:52:06 -08:00
jnjpng	f48b60634f	refactor: extract compact logic to shared function for temporal (#9249 ) * refactor: extract compact logic to shared function Extract the compaction logic from LettaAgentV3.compact() into a standalone compact_messages() function that can be shared between the agent and temporal workflows. Changes: - Create apps/core/letta/services/summarizer/compact.py with: - compact_messages(): Core compaction logic - build_summarizer_llm_config(): LLM config builder for summarization - CompactResult: Dataclass for compaction results - Update LettaAgentV3.compact() to use compact_messages() - Update temporal summarize_conversation_history activity to use compact_messages() instead of the old Summarizer class - Add use_summary_role parameter to SummarizeParams This ensures consistent summarization behavior across different execution paths and prevents drift as we improve the implementation. * chore: clean up verbose comments * fix: correct CompactionSettings import path * fix: correct count_tokens import from summarizer_sliding_window * fix: update test patch path for count_tokens_with_tools After extracting compact logic to compact.py, the test was patching the old location. Update the patch path to the new module location. * fix: update test to use build_summarizer_llm_config from compact.py The function was moved from LettaAgentV3._build_summarizer_llm_config to compact.py as a standalone function. * fix: add early check for system prompt size in compact_messages Check if the system prompt alone exceeds the context window before attempting summarization. The system prompt cannot be compacted, so fail fast with SystemPromptTokenExceededError. * fix: properly propagate SystemPromptTokenExceededError from compact The exception handler in _step() was not setting the correct stop_reason for SystemPromptTokenExceededError, which caused the finally block to return early and swallow the exception. Add special handling to set stop_reason to context_window_overflow_in_system_prompt when SystemPromptTokenExceededError is caught. * revert: remove redundant SystemPromptTokenExceededError handling The special handling in the outer exception handler is redundant because stop_reason is already set in the inner handler at line 943. The actual fix for the test was the early check in compact_messages(), not this redundant handling. * fix: correctly re-raise SystemPromptTokenExceededError The inner exception handler was using 'raise e' which re-raised the outer ContextWindowExceededError instead of the current SystemPromptTokenExceededError. Changed to 'raise' to correctly re-raise the current exception. This bug was pre-existing but masked because _check_for_system_prompt_overflow was only called as a fallback. The new early check in compact_messages() exposed it. * revert: remove early check and restore raise e to match main behavior * fix: set should_continue=False and correctly re-raise exception - Add should_continue=False in SystemPromptTokenExceededError handler (matching main's _check_for_system_prompt_overflow behavior) - Fix raise e -> raise to correctly propagate SystemPromptTokenExceededError Note: test_large_system_prompt_summarization still fails locally but passes on main. Need to investigate why exception isn't propagating correctly on refactored branch. * fix: add SystemPromptTokenExceededError handler for post-step compaction The post-step compaction (line 1066) was missing a SystemPromptTokenExceededError exception handler. When compact_messages() raised this error, it would be caught by the outer exception handler which would: 1. Set stop_reason to "error" instead of "context_window_overflow_in_system_prompt" 2. Not set should_continue = False 3. Get swallowed by the finally block (line 1126) which returns early This caused test_large_system_prompt_summarization to fail because the exception never propagated to the test. The fix adds the same exception handler pattern used in the retry compaction flow (line 941-946), ensuring proper state is set before re-raising. This issue only affected the refactored code because on main, _check_for_system_prompt_overflow() was an instance method that set should_continue/stop_reason BEFORE raising. In the refactor, compact_messages() is a standalone function that cannot set instance state, so the caller must handle the exception and set the state.	2026-02-24 10:52:06 -08:00
Kian Jones	a206f7f345	feat: add ID format validation to agent and user schemas (#9151 ) * feat: add ID format validation to agent and user schemas Reuse existing validator types (ToolId, SourceId, BlockId, MessageId, IdentityId, UserId) from letta.validators to enforce ID format validation at the schema level. This ensures malformed IDs are rejected with a 422 validation error instead of causing 500 database errors. Changes: - CreateAgent: validate tool_ids, source_ids, folder_ids, block_ids, identity_ids - UpdateAgent: validate tool_ids, source_ids, folder_ids, block_ids, message_ids, identity_ids - UserUpdate: validate id 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate API spec and SDK * fix: override ID validation in AgentSchema for agent file portability AgentSchema extends CreateAgent but needs to allow arbitrary short IDs (e.g., tool-0, block-0) for portable agent files. Override the validated ID fields to use plain List[str] instead of the validated types. Also fix test_agent.af to use proper UUID-format IDs. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate API spec and SDK 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: revert test_agent.af - short IDs are valid for agent files 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix openapi schema --------- Co-authored-by: Letta <noreply@letta.com>	2026-02-24 10:52:06 -08:00
cthomas	3e49cf5d44	fix: load default provider config when summarizer uses different prov… (#9051 ) fix: load default provider config when summarizer uses different provider Problem: Summarization failed when agent used one provider (e.g., Google AI) but summarizer config specified a different provider (e.g., Anthropic): ```python # Agent LLM config model_endpoint_type='google_ai', handle='gemini-something/gemini-2.5-pro', context_window=100000 # Summarizer config model='anthropic/claude-haiku-4-5-20251001' # Bug: Resulting summarizer_llm_config mixed Google + Anthropic settings model='claude-haiku-4-5-20251001', model_endpoint_type='google_ai', # ❌ Wrong endpoint! context_window=100000 # ❌ Google's context window, not Anthropic's default! ``` This sent Claude requests to Google AI endpoints with incorrect parameters. Root Cause: `_build_summarizer_llm_config()` always copied the agent's LLM config as base, then patched model/provider fields. But this kept all provider-specific settings (endpoint, context_window, etc.) from the wrong provider. Fix: 1. Parse provider_name from summarizer handle 2. Check if it matches agent's model_endpoint_type (or provider_name for custom) 3. If YES → Use agent config as base, override model/handle (same provider) 4. If NO → Load default config via `provider_manager.get_llm_config_from_handle()` (new provider) Example Flow: ```python # Agent: google_ai/gemini-2.5-pro # Summarizer: anthropic/claude-haiku provider_name = "anthropic" # Parsed from handle provider_matches = ("anthropic" == "google_ai") # False ❌ # Different provider → load default Anthropic config base = await provider_manager.get_llm_config_from_handle( handle="anthropic/claude-haiku", actor=self.actor ) # Returns: model_endpoint_type='anthropic', endpoint='https://api.anthropic.com', etc. ✅ ``` Result: - Summarizer with different provider gets correct default config - No more mixing Google endpoints with Anthropic models - Same-provider summarizers still inherit agent settings efficiently 👾 Generated with [Letta Code](https://letta.com) Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:44:04 -08:00
jnjpng	85c242077e	feat: strict tool calling setting (#8810 ) base	2026-01-19 15:54:42 -08:00
Sarah Wooders	97cdfb4225	Revert "feat: add strict tool calling setting [LET-6902]" (#8720 ) Revert "feat: add strict tool calling setting [LET-6902] (#8577)" This reverts commit 697c9d0dee6af73ec4d5d98780e2ca7632a69173.	2026-01-19 15:54:39 -08:00
Sarah Wooders	bdede5f90c	feat: add strict tool calling setting [LET-6902] (#8577 )	2026-01-19 15:54:38 -08:00
cthomas	ab4ccfca31	feat: add tags support to blocks (#8474 ) * feat: add tags support to blocks * fix: add timestamps and org scoping to blocks_tags Addresses PR feedback: 1. Migration: Added timestamps (created_at, updated_at), soft delete (is_deleted), audit fields (_created_by_id, _last_updated_by_id), and organization_id to blocks_tags table for filtering support. Follows SQLite baseline pattern (composite PK of block_id+tag, no separate id column) to avoid insert failures. 2. ORM: Relationship already correct with lazy="raise" to prevent implicit joins and passive_deletes=True for efficient CASCADE deletes. 3. Schema: Changed normalize_tags() from Any to dict for type safety. 4. SQLite: Added blocks_tags to SQLite baseline schema to prevent table-not-found errors. 5. Code: Updated all tag row inserts to include organization_id. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: add ORM columns and update SQLite baseline for blocks_tags Fixes test failures (CompileError: Unconsumed column names: organization_id): 1. ORM: Added organization_id, timestamps, audit fields to BlocksTags ORM model to match database schema from migrations. 2. SQLite baseline: Added full column set to blocks_tags (organization_id, timestamps, audit fields) to match PostgreSQL schema. 3. Test: Added 'tags' to expected Block schema fields. This ensures SQLite and PostgreSQL have matching schemas and the ORM can consume all columns that the code inserts. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * revert change to existing alembic migration * fix: remove passive_deletes and SQLite support for blocks_tags 1. Removed passive_deletes=True from Block.tags relationship to match AgentsTags pattern (neither have ondelete CASCADE in DB schema). 2. Removed SQLite branch from _replace_block_pivot_rows_async since blocks_tags table is PostgreSQL-only (migration skips SQLite). 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * api sync --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:38 -08:00
cthomas	9b359418d0	feat: add pending approval field on agent state (#8361 ) * feat: add pending approval field on agent state * test failures	2026-01-12 10:57:48 -08:00
jnjpng	febe6efaac	fix: allow upserting empty secret object to delete agent secrets (#8316 ) base	2026-01-12 10:57:48 -08:00
Ari Webb	cc825b4f5c	Revert "Revert "feat: enable provider models persistence" (#6590 )" (#6595 )	2026-01-12 10:57:48 -08:00
Sarah Wooders	bd9f3aca9b	fix: fix `prompt_acknowledgement` usage and update summarization prompts (#7012 )	2025-12-15 12:03:09 -08:00
Sarah Wooders	0c0ba5d03d	fix: remove letta-free embeddings from testing (#6870 )	2025-12-15 12:03:09 -08:00
Sarah Wooders	a731e01e88	fix: use `model` instead of `model_settings` (#6834 )	2025-12-15 12:03:09 -08:00
jnjpng	4be813b956	fix: migrate sandbox and agent environment variables to encrypted only (#6623 ) * base * remove unnnecessary db migration * update * fix * update * update * comments * fix * revert * anotha --------- Co-authored-by: Letta Bot <noreply@letta.com>	2025-12-15 12:03:08 -08:00
Sarah Wooders	7ea297231a	feat: add `compaction_settings` to agents (#6625 ) * initial commit * Add database migration for compaction_settings field This migration adds the compaction_settings column to the agents table to support customized summarization configuration for each agent. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix * rename * update apis * fix tests * update web test --------- Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Kian Jones <kian@letta.com>	2025-12-15 12:02:34 -08:00
Kian Jones	fbd89c9360	fix: replace all 'PRODUCTION' references with 'prod' for consistency (#6627 ) * fix: replace all 'PRODUCTION' references with 'prod' for consistency Problem: Codebase had 11 references to 'PRODUCTION' (uppercase) that should use 'prod' (lowercase) for consistency with the deployment workflows and environment normalization. Changes across 8 files: 1. Source files (using settings.environment): - letta/functions/function_sets/multi_agent.py - letta/services/tool_manager.py - letta/services/tool_executor/multi_agent_tool_executor.py - letta/services/helpers/agent_manager_helper.py All checks changed from: settings.environment == "PRODUCTION" To: settings.environment == "prod" 2. OTEL resource configuration: - letta/otel/resource.py - Updated _normalize_environment_tag() to handle 'prod' directly - Removed 'PRODUCTION' -> 'prod' mapping (no longer needed) - Updated device.id check from _env != "PRODUCTION" to _env != "prod" 3. Test files: - tests/managers/conftest.py - Fixture parameter changed from "PRODUCTION" to "prod" - tests/managers/test_agent_manager.py (3 occurrences) - tests/managers/test_tool_manager.py (2 occurrences) All test checks changed to use "prod" Result: Complete consistency across the codebase: - All environment checks use "prod" instead of "PRODUCTION" - Normalization function simplified (no special case for PRODUCTION) - Tests use correct "prod" value - Matches deployment workflow configuration from PR #6626 This completes the environment naming standardization effort. * fix: update settings.py environment description to use 'prod' instead of 'PRODUCTION' The field description still referenced PRODUCTION as an example value. Updated to use lowercase 'prod' for consistency with actual usage. Before: "Application environment (PRODUCTION, DEV, CANARY, etc. - normalized to lowercase for OTEL tags)" After: "Application environment (prod, dev, canary, etc. - lowercase values used for OTEL tags)"	2025-12-15 12:02:34 -08:00
Ari Webb	4092820f3a	feat: add project id scoping for tools backend changes (#6529 )	2025-12-15 12:02:34 -08:00
Ari Webb	89c7ab5f14	feat: structured outputs for openai [LET-6233] (#6363 ) * first hack with test * remove changes integration test * Delete apps/core/tests/sdk_v1/integration/integration_test_send_message_v2.py * add test * remove comment * stage and publish api * deprecate base level response_schema * add param to llm_config test --------- Co-authored-by: Ari Webb <ari@letta.com>	2025-11-26 14:39:39 -08:00
Ari Webb	ce2ca8660b	feat: add effort dropdown for claude 4.5 opus (backend) (#6351 ) * feat: add effort support (backend) * fix test_agent_state_schema_unchanged --------- Co-authored-by: Ari Webb <ari@letta.com>	2025-11-24 19:10:27 -08:00
Ari Webb	699820cecd	fix: managers test (#6232 ) fix managers test Co-authored-by: Ari Webb <ari@letta.com>	2025-11-24 19:09:33 -08:00
Shelley Pham	b73545cd60	fix: agents created from templates cannot read attached files [LET-6146] (#6137 ) * fix: Ensure agents created from templates can read attached files * test: Add test for template-based agent file attachment from sources	2025-11-13 15:36:56 -08:00
Sarah Wooders	6eeb3c90bb	feat: bring back model_settings and remove validation again (#6104 )	2025-11-13 15:36:56 -08:00
Sarah Wooders	ddc87418f4	feat: revert model_settings (#6089 )	2025-11-13 15:36:56 -08:00
Sarah Wooders	0b1fe096ec	feat: split up handle and `model_settings` (#6022 )	2025-11-13 15:36:56 -08:00
Christina Tong	c76bc9e216	feat: add filters to count_agents endpoint [LET-5380] [LET-5497] (#6008 ) * feat: add filters to count_agents endpoint [LET-5380] * comment * update	2025-11-13 15:36:55 -08:00
jnjpng	849d0dc64a	feat: provider-specific model configuration (#5873 ) (#5874 )	2025-11-13 15:36:55 -08:00
Christina Tong	881831501a	feat: filter list agents by stop reason [LET-5928] (#5779 ) * feat: add last_stop_reason to AgentState [LET-5911] * feat: filter list agents by stop reason [LET-5928] * undo agent loop changes, use update_run_by_id_async * add run manager test * add integration tests * remove comment * fix duplicate * fix docs	2025-11-13 15:36:55 -08:00
Christina Tong	ef3df907c5	feat: add last_stop_reason to AgentState [LET-5911] (#5772 ) * feat: add last_stop_reason to AgentState [LET-5911] * undo agent loop changes, use update_run_by_id_async * add run manager test * add integration tests * remove comment * remove duplicate test	2025-11-13 15:36:55 -08:00
Sarah Wooders	cfeed463a9	Revert "feat: provider-specific model configuration " (#5873 ) Revert "feat: provider-specific model configuration (#5774)" This reverts commit 34a334949a3ef72cd49ff0ca3da9e85d16daa57c.	2025-11-13 15:36:20 -08:00
Sarah Wooders	aaa12a393c	feat: provider-specific model configuration (#5774 ) * initial code updates * add models * cleanup * support overriding * add apis * cleanup reasoning interfaces to match models * update schemas * update apis * add new field * remove parallel * various fixes * modify schemas * fix * fix * make model optional * undo model schema change * update schemas * update schemas * format * fix tests * attempt to patch web * fic docs * change schemas * update error * fix tests * delete tests * clean up undefined matching conditional --------- Co-authored-by: jnjpng <jin@letta.com> Co-authored-by: Letta Bot <noreply@letta.com>	2025-11-13 15:36:14 -08:00
cthomas	afdf0f80e3	feat: add backend support for agent relationship loads (#5693 )	2025-10-24 15:14:09 -07:00
cthomas	c3c38f2713	feat: rename multi agent group to managed group [LET-5799] (#5672 ) feat: rename multi agent group to managed group	2025-10-24 15:13:47 -07:00
cthomas	14faa27869	feat: replace agent.identity_ids with agent.identities [LET-5803] (#5673 ) feat: replace agent.identity_ids with agent.identities	2025-10-24 15:13:47 -07:00
cthomas	0a083459c6	feat: add new blocks field to agent state schema test (#5668 )	2025-10-24 15:13:47 -07:00
jnjpng	8275bdd7e3	feat: add agent state schema change test (#5573 ) base Co-authored-by: Letta Bot <noreply@letta.com>	2025-10-24 15:13:15 -07:00
jnjpng	b3fef4b5a8	feat: double write to all encrypted columns and decrypt on read (#5265 ) * base * use secret field * fix * auth code * stage publish * decouple backfill * revert uncomment * providers and agent vars * mcp * mcp * stage and publish * fix oauth * double encrypt * sandbox --------- Co-authored-by: Letta Bot <noreply@letta.com>	2025-10-24 15:11:31 -07:00
Sarah Wooders	53786ee102	feat: asyncify test_manager.py [LET-4494] (#4890 ) * add test_agent_manager.py * created shared conftest * add test_tool_manager.py * add tag tests * add message manager tests * add blocks * add org * add passage tests * add archive manager * add user manager * add identity * add job manager tests * add sandbox manager * add file manager * add group managers * add mcp manager * fix batch tests * update workflows * fix test_managers.py * more tests * comment out old test and add file --------- Co-authored-by: Matthew Zhou <mattzh1314@gmail.com>	2025-10-07 17:50:46 -07:00

43 Commits