letta-server

Author	SHA1	Message	Date
github-actions[bot]	2460b36f97	fix: handle asyncpg QueryCanceledError for statement timeouts (#8241 ) The handle_db_timeout decorator only caught SQLAlchemy's TimeoutError (for pool/connection timeouts) but not asyncpg's QueryCanceledError which is thrown when PostgreSQL's statement_timeout kills a long-running query. This fix: - Import asyncpg.exceptions.QueryCanceledError - Update handle_db_timeout decorator to catch QueryCanceledError and wrap it in DatabaseTimeoutError - Update _handle_dbapi_error method to also handle wrapped QueryCanceledError Fixes #8108 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Letta <noreply@letta.com> Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-19 15:54:38 -08:00
github-actions[bot]	bfb08e77f8	fix: prevent deadlock in bulk tool upsert by sorting tools by name (#8667 ) When multiple concurrent transactions try to upsert the same tools, they can deadlock if they acquire row locks in different orders. This fix sorts tools by name before the bulk INSERT to ensure all transactions acquire locks in a consistent order, preventing deadlocks. Fixes #8666 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com> Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-19 15:54:38 -08:00
github-actions[bot]	a5108c96b4	fix: handle ToolError exceptions in MCP clients to reduce production alerts (#8599 ) Add ToolError to exception handling alongside McpError in MCP client classes. ToolError is raised by fastmcp for input validation errors (e.g., missing required properties like 'filename'). Both error types are expected user-facing errors from external MCP servers and should be logged at warning/debug level to avoid triggering production alerts. Fixes issue with production error: "fastmcp.exceptions.ToolError: Input validation error: 'filename' is a required property" 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Letta <noreply@letta.com> Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-19 15:54:38 -08:00
github-actions[bot]	f67af1b13d	fix: Handle ExceptionGroup errors in MCP client cleanup (#8561 ) The MCP library internally uses TaskGroup for async operations, which can raise ExceptionGroup when cleanup fails. This was causing unhandled errors to propagate in production. Changes: - Update cleanup() method in AsyncBaseMCPClient to catch ExceptionGroup using except* syntax and log errors at debug level (best-effort cleanup) - Remove redundant try/except blocks in mcp_manager.py and mcp_server_manager.py that incorrectly re-raised cleanup exceptions Fixes #8560 🐾 Generated with [Letta Code](https://letta.com) Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-19 15:54:38 -08:00
Ari Webb	282df3a3fe	fix: increase limit for list mcp servers (#8674 )	2026-01-19 15:54:38 -08:00
Ari Webb	3682f87cbf	fix: fix pagination for blocks [LET-6359] (#8628 ) fix: fix pagination for blocks	2026-01-19 15:54:38 -08:00
Sarah Wooders	bdede5f90c	feat: add strict tool calling setting [LET-6902] (#8577 )	2026-01-19 15:54:38 -08:00
jnjpng	979062114c	chore: fix typo and improve MCP OAuth comments (#8629 ) - Fix typo "upate" -> "update" in TODO comments (mcp_manager.py, mcp_server_manager.py) - Improve comments in OAuth callback handler to explain why MCPOAuthSession is used directly (callback is unauthenticated, manager requires actor) - Clean up variable naming in callback handler 🐾 Generated with [Letta Code](https://letta.com) Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:38 -08:00
cthomas	ab4ccfca31	feat: add tags support to blocks (#8474 ) * feat: add tags support to blocks * fix: add timestamps and org scoping to blocks_tags Addresses PR feedback: 1. Migration: Added timestamps (created_at, updated_at), soft delete (is_deleted), audit fields (_created_by_id, _last_updated_by_id), and organization_id to blocks_tags table for filtering support. Follows SQLite baseline pattern (composite PK of block_id+tag, no separate id column) to avoid insert failures. 2. ORM: Relationship already correct with lazy="raise" to prevent implicit joins and passive_deletes=True for efficient CASCADE deletes. 3. Schema: Changed normalize_tags() from Any to dict for type safety. 4. SQLite: Added blocks_tags to SQLite baseline schema to prevent table-not-found errors. 5. Code: Updated all tag row inserts to include organization_id. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: add ORM columns and update SQLite baseline for blocks_tags Fixes test failures (CompileError: Unconsumed column names: organization_id): 1. ORM: Added organization_id, timestamps, audit fields to BlocksTags ORM model to match database schema from migrations. 2. SQLite baseline: Added full column set to blocks_tags (organization_id, timestamps, audit fields) to match PostgreSQL schema. 3. Test: Added 'tags' to expected Block schema fields. This ensures SQLite and PostgreSQL have matching schemas and the ORM can consume all columns that the code inserts. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * revert change to existing alembic migration * fix: remove passive_deletes and SQLite support for blocks_tags 1. Removed passive_deletes=True from Block.tags relationship to match AgentsTags pattern (neither have ondelete CASCADE in DB schema). 2. Removed SQLite branch from _replace_block_pivot_rows_async since blocks_tags table is PostgreSQL-only (migration skips SQLite). 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * api sync --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:38 -08:00
jnjpng	c550457b60	feat: static redirect callback for mcp server oauth (#8611 ) * base * base * more * final * remove * pass	2026-01-19 15:54:38 -08:00
cthomas	870c5955d9	fix: wrap tpuf operations in thread pool (#8615 ) * fix: wrap turbopuffer vector writes in thread pool Turbopuffer library does CPU-intensive base64 encoding of vectors synchronously in async functions (_async_transform_recursive → b64encode_vector), blocking the event loop during file uploads. Solution: Created _run_turbopuffer_write_in_thread() helper that runs turbopuffer writes in an isolated event loop within a worker thread. Applied to all vector write operations: - insert_tools() - insert_archival_memories() - insert_messages() - insert_file_passages() This prevents pybase64.b64encode_as_string() from blocking the main event loop during vector encoding. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: wrap all turbopuffer operations in thread pool Extended the thread pool wrapping to ALL turbopuffer write operations, including delete operations, for complete isolation from the main event loop. All turbopuffer namespace.write() calls now run in isolated event loops within worker threads, preventing any potential CPU work from blocking. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:37 -08:00
cthomas	c05f3cec0b	fix: wrap markitdown PDF processing in asyncio.to_thread (#8614 ) MarkItDown.convert() does blocking file I/O and CPU-intensive PDF parsing. This was blocking the event loop during file uploads. Now wraps the entire markitdown pipeline (tempfile write, convert, cleanup) in asyncio.to_thread() to run in thread pool. 🐾 Generated with [Letta Code](https://letta.com) Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:37 -08:00
cthomas	9b5067bed9	fix: remove unused sync code (#8613 ) * chore: remove unused sync code * chore: remove deprecated sync Google AI functions Removes unused sync functions that used httpx.Client (blocking): - google_ai_get_model_details() - google_ai_get_model_context_window() - GoogleGeminiProvider.get_model_context_window() All code now uses async versions with httpx.AsyncClient. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:37 -08:00
cthomas	57cb2d7566	fix: async functions must call async methods (#8612 ) Critical fixes: - llm_client_base.send_llm_request() now calls await self.request_async() instead of self.request() - Remove unused sync get_openai_embedding() that used sync OpenAI client - Remove deprecated compile_in_thread_async() from Memory These were blocking the event loop during LLM requests and embeddings. 🐾 Generated with [Letta Code](https://letta.com) Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:37 -08:00
Ari Webb	851798d71a	fix: step_id is none (#8528 )	2026-01-19 15:54:37 -08:00
jnjpng	aca5c99a2b	feat: use fast mcp 2.0 oauth provider (#8608 ) * base * scopes	2026-01-19 15:54:37 -08:00
jnjpng	35ecf2279f	chore: remove dead sync model validator code (#8606 ) Remove commented-out sync_value_and_value_enc model validator and unused imports (traceback, model_validator, logger). This code was disabled and replaced with async decryption via from_orm_async methods.	2026-01-19 15:54:37 -08:00
Sarah Wooders	b8a6496acb	feat: add `runs_metrics` table (#5169 )	2026-01-19 15:51:30 -08:00
Caren Thomas	a626dec278	uv lock	2026-01-12 11:00:12 -08:00
Ari Webb	6d859174c2	feat: make conversations throw http busy to stop race condition [LET-6842] (#8411 ) * feat: make conversations throw http busy to stop race condition * use redis lock instead * move acquire lock into redis client, integration tests, move lock release into run manager * fix tests, bug * conditional import * remove else * better release * run ci * final reordering lock * update tests * wrong naming of lock holder token	2026-01-12 10:57:49 -08:00
jnjpng	59c2b19812	fix: remove sync model validator for env var (#8518 ) * base * import	2026-01-12 10:57:49 -08:00
cthomas	03a64993cf	fix: make file reads async (#8513 )	2026-01-12 10:57:49 -08:00
Ari Webb	cdca1a564f	fix: conversation id not found in tpuf (#8469 ) * fix: conversation id not found in tpuf * add tests	2026-01-12 10:57:49 -08:00
cthomas	e964307f6a	feat: add lazy=raise for passage-org relationship (#8482 )	2026-01-12 10:57:49 -08:00
Sarah Wooders	0cbdf452fa	fix: temporarily disable structured outputs for anthropic (#8491 )	2026-01-12 10:57:49 -08:00
jnjpng	87e939deda	feat: add fastmcp v2 client (#8457 ) * base * testing code * update * nit	2026-01-12 10:57:49 -08:00
Christina Tong	318498bde3	feat: filter internal runs endpoint by conversation id [LET-6886] (#8437 )	2026-01-12 10:57:49 -08:00
cthomas	938bb78afe	fix: handle anthropic incorrect tool id bug (#8447 )	2026-01-12 10:57:49 -08:00
jnjpng	dceed06f84	fix: set value on agent environment variable as pydantic obj (#8452 ) base	2026-01-12 10:57:49 -08:00
jnjpng	28839f5180	fix: import cryptography default backend at top level (#8444 ) * base * comment	2026-01-12 10:57:49 -08:00
Sarah Wooders	96cf24264c	fix: avoid 'NoneType' object has no attribute 'name' error (#8407 )	2026-01-12 10:57:49 -08:00
cthomas	b8e7c14f16	feat: enable optimized json response parsing (#8436 )	2026-01-12 10:57:49 -08:00
cthomas	8adb88e7ea	feat: set task name in safe_create_task (#8433 )	2026-01-12 10:57:49 -08:00
github-actions[bot]	f2171447a8	fix: handle httpx.ReadError, WriteError, and ConnectError in LLM streaming clients (#8243 ) Adds explicit handling for httpx network errors (ReadError, WriteError, ConnectError) in AnthropicClient, OpenAIClient, and GoogleVertexClient. These errors can occur during streaming when the connection is unexpectedly closed while reading/writing data. Maps these errors to LLMConnectionError for consistent error handling. Fixes #8221 (and duplicate #8156) 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-12 10:57:49 -08:00
github-actions[bot]	b559bf8403	fix: handle missing tool_call_id in Anthropic message conversion (#8381 ) * fix: handle missing tool_call_id in Anthropic message conversion - Add null check for self.tool_returns before iterating - Fall back to message's tool_call_id when tool_return.tool_call_id is None - Improve error message to show actual tool name from message.name - Only raise error if no valid tool_call_id is available from either source This fixes the error "Anthropic API requires tool_use_id to be set" that occurs when a ToolReturn object in the database doesn't have tool_call_id set, by using the message-level tool_call_id as a fallback. Fixes #8379 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com> Co-Authored-By: Letta <noreply@letta.com> * fix: restrict tool_call_id fallback to single tool returns The message-level `self.tool_call_id` is set to the first tool return's ID for legacy compatibility. For parallel tool calls with multiple tool_returns, using this as a fallback would incorrectly assign the first tool return's ID to all subsequent returns missing their own ID. This change: - Only allows the fallback when there's exactly one tool return - For multiple tool returns, each must have its own ID or raise an error - Adds tool return index to error messages for better debugging Co-authored-by: Kian Jones <kianjones9@users.noreply.github.com> 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com> Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-12 10:57:49 -08:00
Christina Tong	d2e19bbc05	chore: conversation id filter to list_messages [LET-6875] (#8406 ) * chore: add default conversation id filter to list_messages * fix filter * update comment	2026-01-12 10:57:48 -08:00
github-actions[bot]	05ec02e384	fix: handle Anthropic 413 request_too_large as ContextWindowExceededError (#8424 ) The Anthropic API returns a 413 status code with error type `request_too_large` when the request payload exceeds the maximum allowed size. This error should be converted to `ContextWindowExceededError` so the system can handle it appropriately (e.g., by summarizing the conversation to reduce context size). Changes: - Added `request_too_large` and `request exceeds the maximum size` to the early string-based error detection in `handle_llm_error` - Added specific handling for HTTP 413 status code in the `APIStatusError` handler - Added tests to verify the new error handling behavior Fixes: #8422 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Letta <noreply@letta.com> Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-12 10:57:48 -08:00
github-actions[bot]	adbc47ddc9	fix: downgrade McpError logging from warning to debug level (#8371 ) MCP errors from external servers (e.g., "The specified key does not exist") are user-facing issues, not system errors. Downgrading the log level from warning to debug prevents these expected failures from triggering production alerts in Datadog/Sentry. Fixes #8370 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-12 10:57:48 -08:00
Ari Webb	754e750cc5	feat: add conversation_id filter to list runs [LET-6865] (#8404 ) feat: add conversation_id filter to list runs	2026-01-12 10:57:48 -08:00
Sarah Wooders	6fddcc0c57	fix: fix agent loop (#8401 )	2026-01-12 10:57:48 -08:00
Charles Packer	ed6284cedb	feat: Add conversation_id filtering to message endpoints (#8324 ) * feat: Add conversation_id filtering to message list and search endpoints Add optional conversation_id parameter to filter messages by conversation: - client.agents.messages.list - client.messages.list - client.messages.search Changes: - Added conversation_id field to MessageSearchRequest and SearchAllMessagesRequest schemas - Added conversation_id filtering to list_messages in message_manager.py - Updated get_agent_recall_async and get_all_messages_recall_async in server.py - Added conversation_id query parameter to router endpoints - Updated Turbopuffer client to support conversation_id filtering in searches Fixes #8320 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Charles Packer <cpacker@users.noreply.github.com> * add conversation_id to message and tpuf * default messages filter for backward compatibility * add test and auto gen * fix integration test * fix test * update test --------- Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Charles Packer <cpacker@users.noreply.github.com> Co-authored-by: christinatong01 <christina@letta.com>	2026-01-12 10:57:48 -08:00
jnjpng	737d6e2550	fix: remove noisy sync log and fix timeout type (#8392 ) base	2026-01-12 10:57:48 -08:00
cthomas	1d360cc239	feat: dump thread state on event loop hang (#8388 )	2026-01-12 10:57:48 -08:00
cthomas	a775bbfdf2	feat: offload llama index init logic (#8387 )	2026-01-12 10:57:48 -08:00
jnjpng	4cb037f28e	fix: async decrypt environment variable on create agent (#8367 ) * base * comment	2026-01-12 10:57:48 -08:00
jnjpng	d55fd69b7b	chore: add comment and test for changing PBKDF2 iteration count (#8366 ) base	2026-01-12 10:57:48 -08:00
jnjpng	b68e4e74f9	fix: replace cryptography with hashlib for encryption key derivation (#8364 ) base	2026-01-12 10:57:48 -08:00
cthomas	cc975b5d15	fix: reuse db session when loading pending approval (#8363 )	2026-01-12 10:57:48 -08:00
cthomas	9b359418d0	feat: add pending approval field on agent state (#8361 ) * feat: add pending approval field on agent state * test failures	2026-01-12 10:57:48 -08:00
Ari Webb	956e7783ad	feat: add ids to archival memory search [LET-6642] (#8355 ) * feat: add id to archival memory search tool * stage api	2026-01-12 10:57:48 -08:00

1 2 3 4 5 ...

4631 Commits