letta-server

Author	SHA1	Message	Date
Kian Jones	a92e868ee6	feat: centralize telemetry logging at LLM client level (#8815 ) * feat: centralize telemetry logging at LLM client level Moves telemetry logging from individual adapters to LLMClientBase: - Add TelemetryStreamWrapper for streaming telemetry on stream close - Add request_async_with_telemetry() for non-streaming requests - Add stream_async_with_telemetry() for streaming requests - Add set_telemetry_context() to configure agent_id, run_id, step_id Updates adapters and agents to use new pattern: - LettaLLMAdapter now accepts agent_id/run_id in constructor - Adapters call set_telemetry_context() before LLM requests - Removes duplicate telemetry logging from adapters - Enriches traces with agent_id, run_id, call_type metadata 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: accumulate streaming response content for telemetry TelemetryStreamWrapper now extracts actual response data from chunks: - Content text (concatenated from deltas) - Tool calls (id, name, arguments) - Model name, finish reason, usage stats 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: move streaming telemetry to caller (option 3) - Remove TelemetryStreamWrapper class - Add log_provider_trace_async() helper to LLMClientBase - stream_async_with_telemetry() now just returns raw stream - Callers log telemetry after processing with rich interface data Updated callers: - summarizer.py: logs content + usage after stream processing - letta_agent.py: logs tool_call, reasoning, model, usage 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: pass agent_id and run_id to parent adapter class LettaLLMStreamAdapter was not passing agent_id/run_id to parent, causing "unexpected keyword argument" errors. 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:43 -08:00
Kian Jones	9418ab9815	feat: add provider trace backend abstraction for multi-backend telemetry (#8814 ) * feat: add provider trace backend abstraction for multi-backend telemetry Introduces a pluggable backend system for provider traces: - Base class with async/sync create and read interfaces - PostgreSQL backend (existing behavior) - ClickHouse backend (via OTEL instrumentation) - Socket backend (writes to Unix socket for crouton sidecar) - Factory for instantiating backends from config Refactors TelemetryManager to use backends with support for: - Multi-backend writes (concurrent via asyncio.gather) - Primary backend for reads (first in config list) - Graceful error handling per backend Config: LETTA_TELEMETRY_PROVIDER_TRACE_BACKEND (comma-separated) Example: "postgres,socket" for dual-write to Postgres and crouton 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * feat: add protocol version to socket backend records Adds PROTOCOL_VERSION constant to socket backend: - Included in every telemetry record sent to crouton - Must match ProtocolVersion in apps/crouton/main.go - Enables crouton to detect and reject incompatible messages 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: remove organization_id from ProviderTraceCreate calls The organization_id is now handled via the actor parameter in the telemetry manager, not through ProviderTraceCreate schema. This fixes validation errors after changing ProviderTraceCreate to inherit from BaseProviderTrace which forbids extra fields. 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * consolidate provider trace * add clickhouse-connect to fix bug on main lmao * auto generated sdk changes, and deployment details, and clikchouse prefix bug and added fields to runs trace return api * auto generated sdk changes, and deployment details, and clikchouse prefix bug and added fields to runs trace return api * consolidate provider trace * consolidate provider trace bug fix --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:43 -08:00
jnjpng	8792e88e8b	feat: add letta code source mcp redirect (#8865 ) base	2026-01-19 15:54:43 -08:00
Sarah Wooders	7c1da1e9e2	feat: add TypeScript tool support for E2B sandbox execution (#8796 ) * feat: add TypeScript tool support for E2B sandbox execution This change implements TypeScript tool support using the same E2B path as Python tools: - Add TypeScript execution script generator (typescript_generator.py) - Modify E2B sandbox to detect TypeScript tools and use language='ts' - Add npm package installation for TypeScript tool dependencies - Add validation requiring json_schema for TypeScript tools - Add comprehensive integration tests for TypeScript tools TypeScript tools: - Require explicit json_schema (no docstring parsing) - Use JSON serialization instead of pickle for results - Support async functions with top-level await - Support npm package dependencies via npm_requirements field Closes #8793 Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com> * fix: disable AgentState for TypeScript tools & add letta-client injection Based on Sarah's feedback: 1. AgentState is a legacy Python-only feature, disabled for TS tools 2. Added @letta-ai/letta-client npm package injection for TypeScript (similar to letta_client for Python) Changes: - base.py: Explicitly set inject_agent_state=False for TypeScript tools - typescript_generator.py: Inject LettaClient initialization code - e2b_sandbox.py: Auto-install @letta-ai/letta-client for TS tools - Added tests verifying both behaviors 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Sarah Wooders <sarahwooders@users.noreply.github.com> Co-Authored-By: Letta <noreply@letta.com> * Update core-integration-tests.yml * fix: convert TypeScript test fixtures to async The OrganizationManager and UserManager no longer have sync methods, only async variants. Updated all fixtures to use: - create_organization_async - create_actor_async - create_or_update_tool_async 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: skip Python AST parsing for TypeScript tools in sandbox base The _init_async method was calling parse_function_arguments (which uses Python's ast.parse) before checking if the tool was TypeScript, causing SyntaxError when running TypeScript tools. Moved the is_typescript_tool() check to happen first, skipping Python AST parsing entirely for TypeScript tools. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * letta_agent_id * skip ast parsing for s * add tool execution test --------- Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com> Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Kian Jones <kian@letta.com>	2026-01-19 15:54:43 -08:00
Sarah Wooders	5c7bed7743	feat: add conversation_id to export export and compact (#8792 )	2026-01-19 15:54:43 -08:00
jnjpng	e2a8a95371	fix: chatgpt oauth model endpoint (#8855 ) base	2026-01-19 15:54:43 -08:00
cthomas	6599aa3b44	feat: populate seq_id for ping messages (#8844 ) * feat: populate seq_id for ping messages * fix import	2026-01-19 15:54:43 -08:00
Christina Tong	0d7c2c6380	fix: reverse list_conversation_messages when converting to LettaMessages (#8852 )	2026-01-19 15:54:43 -08:00
Kevin Lin	e5ed8ca0e8	feat: set default temperature to 1.0 [LET-6920] (#8618 ) * temp 1 * stage * update core tests	2026-01-19 15:54:43 -08:00
Kevin Lin	8872c2b5d3	feat: extend memory_apply_patch for memory block create and delete (#8832 ) * update memory_apply_patch for memory block create and delete * update docstring * update docstring * fix label * working	2026-01-19 15:54:42 -08:00
cthomas	c2f13cf5fa	feat: add seq id to error chunks (#8843 )	2026-01-19 15:54:42 -08:00
jnjpng	a98bc31bf3	fix: refactor enable strict mode for structured output (#8840 ) * base * test	2026-01-19 15:54:42 -08:00
cthomas	b62ce02930	fix: run status failed getting overridden (#8834 ) * fix: run status failed getting overridden * fix method name * missing await	2026-01-19 15:54:42 -08:00
jnjpng	21c70323df	fix: respect strict mode for temporal with fallback on client (#8839 ) * base * update	2026-01-19 15:54:42 -08:00
jnjpng	e1dd62f0a8	fix: correct datetime.timezone import in chatgpt_oauth (#8837 ) The import was `from datetime import datetime` but the code used `datetime.timezone.utc` which is incorrect - timezone is a sibling class in the datetime module, not an attribute of the datetime class. This caused the error: "type object 'datetime.datetime' has no attribute 'timezone'"	2026-01-19 15:54:42 -08:00
jnjpng	85c242077e	feat: strict tool calling setting (#8810 ) base	2026-01-19 15:54:42 -08:00
Ari Webb	5136a8eaed	fix: update max context window on base provider sync (#8803 )	2026-01-19 15:54:42 -08:00
jnjpng	637e320b11	chore: fix up minor nits and typing for chatgpt client (#8807 ) base	2026-01-19 15:54:42 -08:00
jnjpng	5017cb1d12	feat: add chatgpt oauth client for codex routing (#8774 ) * base * refresh * use default model fallback * patch * streaming * generate	2026-01-19 15:54:42 -08:00
Kian Jones	2368efd027	fix: add missing use_clickhouse_for_provider_traces setting (#8799 ) PR #8682 added code that references settings.use_clickhouse_for_provider_traces but never added the field to Settings, causing AttributeError in prod. 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:42 -08:00
Ari Webb	193c0e4b74	feat: add override_model to message endpoints (#8763 ) * feat: add override_model to message endpoints * add tests back * remove from ci	2026-01-19 15:54:42 -08:00
Kian Jones	d2c3350a7e	feat(runs): add run ID filter to runs page (#8726 ) feat(core): add run_id filter to internal runs endpoint Adds the ability to filter runs by a specific run ID in the internal runs list endpoint. 🐛 Generated with [Letta Code](https://letta.com) Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:42 -08:00
cthomas	6b6ca91183	fix: remove letta ping schema override (#8790 )	2026-01-19 15:54:41 -08:00
Sarah Wooders	ef0f8e20d9	fix: improve summarization prompts (#8764 )	2026-01-19 15:54:41 -08:00
cthomas	c3276dda17	fix: summarization drops reasoning content mid turn [LET-7039] (#8769 ) fix: summarization drops reasoning content mid turn	2026-01-19 15:54:41 -08:00
Charles Packer	97f7e95d1d	feat: add PATCH route for updating conversation summary (#8322 )	2026-01-19 15:54:41 -08:00
Kevin Lin	758b74b9c4	feat: rename memory tool arguments for consistency with edit tool [LET-6972] (#8631 ) * update names * stage * chore: retrigger linear workflow	2026-01-19 15:54:41 -08:00
Sarah Wooders	e2c6550363	fix: include assistant response in sliding window summarization slice (#8749 )	2026-01-19 15:54:41 -08:00
cthomas	ca43ee0b7b	fix: cancellation causing desync for parallel tool calling [LET-7028] (#8739 )	2026-01-19 15:54:41 -08:00
cthomas	487bb42231	fix: summarization causing desync for conversations [LET-7014] (#8734 )	2026-01-19 15:54:41 -08:00
Sarah Wooders	f91e77d971	fix: add cancel for conversations to SDK (#8742 )	2026-01-19 15:54:41 -08:00
jnjpng	58a5375c19	fix: test sdk client due to message batch route ordering (#8733 ) * base * generate	2026-01-19 15:54:40 -08:00
Sarah Wooders	aabd58628e	feat: add conversation cancellation endpoint (#8729 )	2026-01-19 15:54:40 -08:00
jnjpng	037c20ae1b	feat: query param parity for conversation messages (#8730 ) * base * add tests * generate	2026-01-19 15:54:40 -08:00
Sarah Wooders	9aac2abdfe	chore: deprecate identities/groups APIs and remove from SDK (#8580 ) * chore: deprecate identities/groups APIs and remove from SDK - Mark all /v1/identities/* endpoints as deprecated - Mark all /v1/groups/* endpoints as deprecated - Remove identities, groups, and batches resources from stainless.yml - Batch API remains active but hidden from SDK 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: update autogenerated SDK files * chore: regenerate SDK and OpenAPI spec Run `just stage-api` and `just publish-api` to sync generated files. 👾 Generated with [Letta Code](https://letta.com) Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com> * chore: remove schedule API from stainless SDK Remove schedule subresource from stainless.yml to hide scheduled messages endpoints from the SDK generation. 👾 Generated with [Letta Code](https://letta.com) Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com> --------- Co-authored-by: Letta <noreply@letta.com> Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Sarah Wooders <sarahwooders@users.noreply.github.com>	2026-01-19 15:54:40 -08:00
jnjpng	e3e758a8c0	feat: add retrieve message endpoint and to client sdk (#8719 ) * base * generate openapi * try again * now	2026-01-19 15:54:40 -08:00
Kian Jones	3eae81cf62	feat: add /v1/runs/{run_id}/trace endpoint for OTEL traces (#8682 ) * feat: add /v1/runs/{run_id}/trace endpoint for OTEL traces - Add new endpoint to retrieve filtered OTEL spans for a run - Filter to only return UI-relevant spans (agent_step, tool executions, root span, TTFT) - Skip Postgres writes when ClickHouse is enabled for provider traces - Add USE_CLICKHOUSE_FOR_PROVIDER_TRACES env var to helm/justfile - Move typecheck CI to self-hosted runners 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: add missing clickhouse_provider_traces.py The telemetry_manager.py imports ClickhouseProviderTraceReader from this module, but the file was not included when splitting the PR. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * autogen * fix: add trace.retrieve to stainless.yml for SDK generation Adds the runs.trace.retrieve method mapping so Stainless generates the useRunsServiceRetrieveTraceForRun hook. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:39 -08:00
Sarah Wooders	9d1ad00dd6	Revert "fix: filter orphaned approval_request messages to prevent Anthropic API errors" (#8721 ) Revert "fix: filter orphaned approval_request messages to prevent Anthropic A…" This reverts commit 2df946c0a821ab8346e8e9037e819be24004a51f.	2026-01-19 15:54:39 -08:00
Sarah Wooders	97cdfb4225	Revert "feat: add strict tool calling setting [LET-6902]" (#8720 ) Revert "feat: add strict tool calling setting [LET-6902] (#8577)" This reverts commit 697c9d0dee6af73ec4d5d98780e2ca7632a69173.	2026-01-19 15:54:39 -08:00
jnjpng	eb748b8f1a	fix: mcp oauth session user scoping (#8630 ) * base * update * revert a bit * revert package lock * clean up * update	2026-01-19 15:54:39 -08:00
Ari Webb	2233d141b1	feat: add codex 5.2 context window (#8704 )	2026-01-19 15:54:39 -08:00
Charles Packer	ca753a6d50	fix: filter orphaned approval_request messages to prevent Anthropic API errors (#8688 )	2026-01-19 15:54:39 -08:00
Ari Webb	20e4286382	fix: allow re-enable sleeptime after deleted [LET-6553] (#8680 ) fix: allow re-enable sleeptime after deleted	2026-01-19 15:54:39 -08:00
Sarah Wooders	b888c4c17a	feat: allow for conversation-level isolation of blocks (#8684 ) * feat: add conversation_id parameter to context endpoint [LET-6989] Add optional conversation_id query parameter to retrieve_agent_context_window. When provided, the endpoint uses messages from the specific conversation instead of the agent's default message_ids. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate SDK after context endpoint update [LET-6989] 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * feat: add isolated blocks support for conversations Allows conversations to have their own copies of specific memory blocks (e.g., todo_list) that override agent defaults, enabling conversation-specific state isolation. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * undo * update apis * test * cleanup * fix tests * simplify * move override logic * patch --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:39 -08:00
Sarah Wooders	9c4f191755	feat: add conversation_id parameter to context endpoint [LET-6989] (#8678 ) * feat: add conversation_id parameter to context endpoint [LET-6989] Add optional conversation_id query parameter to retrieve_agent_context_window. When provided, the endpoint uses messages from the specific conversation instead of the agent's default message_ids. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate SDK after context endpoint update [LET-6989] 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:39 -08:00
github-actions[bot]	5fbf8f93e2	fix: add explicit timeouts to httpx clients to prevent ReadTimeout errors (#8538 ) This commit addresses the httpx.ReadTimeout error detected in production by adding explicit timeout configurations to several httpx client usages: 1. MCP SSE client: Pass mcp_connect_to_server_timeout (30s) to sse_client() 2. MCP StreamableHTTP client: Pass mcp_connect_to_server_timeout (30s) to streamablehttp_client() 3. OpenAI model list API: Add 30s timeout with 10s connect timeout 4. Google AI model list/details API: Add 30s timeout with 10s connect timeout Previously, these httpx clients were created without explicit timeouts, which could cause ReadTimeout errors when remote servers are slow to respond. Fixes #8073 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-19 15:54:38 -08:00
github-actions[bot]	85c40c8154	fix: add streaming fallback for long-running Anthropic requests (#8564 ) When the Anthropic SDK detects a request may exceed 10 minutes, it raises a ValueError requiring streaming mode. This fix catches that specific error in request_async and automatically falls back to streaming mode, accumulating the response into the same format as non-streaming. This resolves the production error: "ValueError: Streaming is required for operations that may take longer than 10 minutes" Fixes #8516 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com> Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-19 15:54:38 -08:00
github-actions[bot]	ebc77d0950	fix: wrap MCP client connection errors in ConnectionError (#8569 ) The FastMCP clients were not properly wrapping exceptions from `connect_to_server()`, causing raw RuntimeErrors (like DNS resolution failures with "[Errno -2] Name or service not known") to propagate up unchanged. Changes: - Both `AsyncFastMCPSSEClient` and `AsyncFastMCPStreamableHTTPClient` now properly catch all exceptions and wrap them in `ConnectionError` - Added warning-level logging for failed connections - Provides user-friendly error messages with the server URL Fixes #8568 Related to #8499 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-19 15:54:38 -08:00
github-actions[bot]	e914075b04	fix: ensure thought_signature is included for Gemini 3 function calls (#8590 ) This fixes a 400 INVALID_ARGUMENT error from Google's Gemini API where function calls were missing required thought_signature in functionCall parts. Changes: - Allow signatures when self.model is None (backwards compatibility for older messages that may not have had their model field set) - Only add thought_signature to the FIRST function call for parallel tool calls, per Google's docs - Take the first non-None signature found (don't keep overwriting) Reference: https://ai.google.dev/gemini-api/docs/thought-signatures Closes #8589 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: datadog-official[bot] <datadog-official[bot]@users.noreply.github.com> Co-authored-by: Kian Jones <11655409+kianjones9@users.noreply.github.com>	2026-01-19 15:54:38 -08:00
Sarah Wooders	ea36633cd5	fix: make sure structured outputs turned on for openai (#8669 )	2026-01-19 15:54:38 -08:00

1 2 3 4 5 ...

4631 Commits