letta-server

Author	SHA1	Message	Date
Sarah Wooders	4096b30cd7	feat: log LLM traces to clickhouse (#9111 ) * feat: add non-streaming option for conversation messages - Add ConversationMessageRequest with stream=True default (backwards compatible) - stream=true (default): SSE streaming via StreamingService - stream=false: JSON response via AgentLoop.load().step() 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate API schema for ConversationMessageRequest * feat: add direct ClickHouse storage for raw LLM traces Adds ability to store raw LLM request/response payloads directly in ClickHouse, bypassing OTEL span attribute size limits. This enables debugging and analytics on large LLM payloads (>10MB system prompts, large tool schemas, etc.). New files: - letta/schemas/llm_raw_trace.py: Pydantic schema with ClickHouse row helper - letta/services/llm_raw_trace_writer.py: Async batching writer (fire-and-forget) - letta/services/llm_raw_trace_reader.py: Reader with query methods - scripts/sql/clickhouse/llm_raw_traces.ddl: Production table DDL - scripts/sql/clickhouse/llm_raw_traces_local.ddl: Local dev DDL - apps/core/clickhouse-init.sql: Local dev initialization Modified: - letta/settings.py: Added 4 settings (store_llm_raw_traces, ttl, batch_size, flush_interval) - letta/llm_api/llm_client_base.py: Integration into request_async_with_telemetry - compose.yaml: Added ClickHouse service for local dev - justfile: Added clickhouse, clickhouse-cli, clickhouse-traces commands Feature disabled by default (LETTA_STORE_LLM_RAW_TRACES=false). Uses ZSTD(3) compression for 10-30x reduction on JSON payloads. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: address code review feedback for LLM raw traces Fixes based on code review feedback: 1. Fix ClickHouse endpoint parsing - default to secure=False for raw host:port inputs (was defaulting to HTTPS which breaks local dev) 2. Make raw trace writes truly fire-and-forget - use asyncio.create_task() instead of awaiting, so JSON serialization doesn't block request path 3. Add bounded queue (maxsize=10000) - prevents unbounded memory growth under load. Drops traces with warning if queue is full. 4. Fix deprecated asyncio usage - get_running_loop() instead of get_event_loop() 5. Add org_id fallback - use _telemetry_org_id if actor doesn't have it 6. Remove unused imports - json import in reader 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: add missing asyncio import and simplify JSON serialization - Add missing 'import asyncio' that was causing 'name asyncio is not defined' error - Remove unnecessary clean_double_escapes() function - the JSON is stored correctly, the clickhouse-client CLI was just adding extra escaping when displaying - Update just clickhouse-trace to use Python client for correct JSON output 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * test: add clickhouse raw trace integration test * test: simplify clickhouse trace assertions * refactor: centralize usage parsing and stream error traces Use per-client usage helpers for raw trace extraction and ensure streaming errors log requests with error metadata. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * test: exercise provider usage parsing live Make live OpenAI/Anthropic/Gemini requests with credential gating and validate Anthropic cache usage mapping when present. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * test: fix usage parsing tests to pass - Use GoogleAIClient with GEMINI_API_KEY instead of GoogleVertexClient - Update model to gemini-2.0-flash (1.5-flash deprecated in v1beta) - Add tools=[] for Gemini/Anthropic build_request_data 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: extract_usage_statistics returns LettaUsageStatistics Standardize on LettaUsageStatistics as the canonical usage format returned by client helpers. Inline UsageStatistics construction for ChatCompletionResponse where needed. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * feat: add is_byok and llm_config_json columns to ClickHouse traces Extend llm_raw_traces table with: - is_byok (UInt8): Track BYOK vs base provider usage for billing analytics - llm_config_json (String, ZSTD): Store full LLM config for debugging and analysis This enables queries like: - BYOK usage breakdown by provider/model - Config parameter analysis (temperature, max_tokens, etc.) - Debugging specific request configurations * feat: add tests for error traces, llm_config_json, and cache tokens - Update llm_raw_trace_reader.py to query new columns (is_byok, cached_input_tokens, cache_write_tokens, reasoning_tokens, llm_config_json) - Add test_error_trace_stored_in_clickhouse to verify error fields - Add test_cache_tokens_stored_for_anthropic to verify cache token storage - Update existing tests to verify llm_config_json is stored correctly - Make llm_config required in log_provider_trace_async() - Simplify provider extraction to use provider_name directly 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * ci: add ClickHouse integration tests to CI pipeline - Add use-clickhouse option to reusable-test-workflow.yml - Add ClickHouse service container with otel database - Add schema initialization step using clickhouse-init.sql - Add ClickHouse env vars (CLICKHOUSE_ENDPOINT, etc.) - Add separate clickhouse-integration-tests job running integration_test_clickhouse_llm_raw_traces.py 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: simplify provider and org_id extraction in raw trace writer - Use model_endpoint_type.value for provider (not provider_name) - Simplify org_id to just self.actor.organization_id (actor is always pydantic) 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: simplify LLMRawTraceWriter with _enabled flag - Check ClickHouse env vars once at init, set _enabled flag - Early return in write_async/flush_async if not enabled - Remove ValueError raises (never used) - Simplify _get_client (no validation needed since already checked) 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: add LLMRawTraceWriter shutdown to FastAPI lifespan Properly flush pending traces on graceful shutdown via lifespan instead of relying only on atexit handler. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * feat: add agent_tags column to ClickHouse traces Store agent tags as Array(String) for filtering/analytics by tag. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * cleanup * fix(ci): fix ClickHouse schema initialization in CI - Create database separately before loading SQL file - Remove CREATE DATABASE from SQL file (handled in CI step) - Add verification step to confirm table was created - Use -sf flag for curl to fail on HTTP errors 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: simplify LLM trace writer with ClickHouse async_insert - Use ClickHouse async_insert for server-side batching instead of manual queue/flush loop - Sync cloud DDL schema with clickhouse-init.sql (add missing columns) - Remove redundant llm_raw_traces_local.ddl - Remove unused batch_size/flush_interval settings - Update tests for simplified writer Key changes: - async_insert=1, wait_for_async_insert=1 for reliable server-side batching - Simple per-trace retry with exponential backoff (max 3 retries) - ~150 lines removed from writer 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: consolidate ClickHouse direct writes into TelemetryManager backend - Add clickhouse_direct backend to provider_trace_backends - Remove duplicate ClickHouse write logic from llm_client_base.py - Configure via LETTA_TELEMETRY_PROVIDER_TRACE_BACKEND=postgres,clickhouse_direct The clickhouse_direct backend: - Converts ProviderTrace to LLMRawTrace - Extracts usage stats from response JSON - Writes via LLMRawTraceWriter with async_insert 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: address PR review comments and fix llm_config bug Review comment fixes: - Rename clickhouse_direct -> clickhouse_analytics (clearer purpose) - Remove ClickHouse from OSS compose.yaml, create separate compose.clickhouse.yaml - Delete redundant scripts/test_llm_raw_traces.py (use pytest tests) - Remove unused llm_raw_traces_ttl_days setting (TTL handled in DDL) - Fix socket description leak in telemetry_manager docstring - Add cloud-only comment to clickhouse-init.sql - Update justfile to use separate compose file Bug fix: - Fix llm_config not being passed to ProviderTrace in telemetry - Now correctly populates provider, model, is_byok for all LLM calls - Affects both request_async_with_telemetry and log_provider_trace_async DDL optimizations: - Add secondary indexes (bloom_filter for agent_id, model, step_id) - Add minmax indexes for is_byok, is_error - Change model and error_type to LowCardinality for faster GROUP BY 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: rename llm_raw_traces -> llm_traces Address review feedback that "raw" is misleading since we denormalize fields. Renames: - Table: llm_raw_traces -> llm_traces - Schema: LLMRawTrace -> LLMTrace - Files: llm_raw_trace_{reader,writer}.py -> llm_trace_{reader,writer}.py - Setting: store_llm_raw_traces -> store_llm_traces 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: update workflow references to llm_traces Missed renaming table name in CI workflow files. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: update clickhouse_direct -> clickhouse_analytics in docstring 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: remove inaccurate OTEL size limit comments The 4MB limit is our own truncation logic, not an OTEL protocol limit. The real benefit is denormalized columns for analytics queries. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: remove local ClickHouse dev setup (cloud-only feature) - Delete clickhouse-init.sql and compose.clickhouse.yaml - Remove local clickhouse just commands - Update CI to use cloud DDL with MergeTree for testing clickhouse_analytics is a cloud-only feature. For local dev, use postgres backend. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: restore compose.yaml to match main 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: merge clickhouse_analytics into clickhouse backend Per review feedback - having two separate backends was confusing. Now the clickhouse backend: - Writes to llm_traces table (denormalized for cost analytics) - Reads from OTEL traces table (will cut over to llm_traces later) Config: LETTA_TELEMETRY_PROVIDER_TRACE_BACKEND=postgres,clickhouse 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: correct path to DDL file in CI workflow 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: add provider index to DDL for faster filtering 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: configure telemetry backend in clickhouse tests Tests need to set telemetry_settings.provider_trace_backends to include 'clickhouse', otherwise traces are routed to default postgres backend. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: set provider_trace_backend field, not property provider_trace_backends is a computed property, need to set the underlying provider_trace_backend string field instead. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: error trace test and error_type extraction - Add TelemetryManager to error trace test so traces get written - Fix error_type extraction to check top-level before nested error dict 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: use provider_trace.id for trace correlation across backends - Pass provider_trace.id to LLMTrace instead of auto-generating - Log warning if ID is missing (shouldn't happen, helps debug) - Fallback to new UUID only if not set 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: trace ID correlation and concurrency issues - Strip "provider_trace-" prefix from ID for UUID storage in ClickHouse - Add asyncio.Lock to serialize writes (clickhouse_connect not thread-safe) - Fix Anthropic prompt_tokens to include cached tokens for cost analytics - Log warning if provider_trace.id is missing 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Caren Thomas <carenthomas@gmail.com>	2026-02-24 10:52:06 -08:00
jnjpng	24ea7dbaed	feat: include tools as part of token estimate in compact (#9242 ) * base * fix	2026-02-24 10:52:06 -08:00
jnjpng	3f23a23227	feat: add compaction stats (#9219 ) * base * update * last * generate * fix test	2026-02-24 10:52:06 -08:00
github-actions[bot]	9471fab2cf	fix: Add agent_id property to BaseAgentV2 for backward compatibility (#8806 ) The LettaAgentV3 (and LettaAgentV2) agents inherit from BaseAgentV2, which unlike the original BaseAgent class, did not expose an agent_id attribute. This caused AttributeError: 'LettaAgentV3' object has no attribute 'agent_id' when code attempted to access self.agent_id. This fix adds an agent_id property to BaseAgentV2 that returns self.agent_state.id, maintaining backward compatibility with code that expects the self.agent_id interface from the original BaseAgent. Closes #8805 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: letta-code <248085862+letta-code@users.noreply.github.com> Co-authored-by: Letta <noreply@letta.com>	2026-02-24 10:52:06 -08:00
jnjpng	d28ccc0be6	feat: add summary message and event on compaction (#9144 ) * base * update * update * revert formatting * routes * legacy * fix * review * update	2026-02-24 10:52:05 -08:00
cthomas	372c8dcc85	fix: add conversation_id support to LettaAgentV3 constructor (#9156 ) Error: ``` TypeError: LettaAgentV2.__init__() got an unexpected keyword argument 'conversation_id' ``` Trace: https://letta.grafana.net/goto/afbk4da3fuxhcf?orgId=stacks-1189126 Problem: The `POST /v1/conversations/{conversation_id}/compact` endpoint was failing because `LettaAgentV3` inherits from `LettaAgentV2` without overriding `__init__`, so passing `conversation_id` to the constructor failed. Fix: 1. Add `__init__` to `LettaAgentV3` that accepts optional `conversation_id` 2. Remove redundant `conversation_id` param from `_checkpoint_messages` - use `self.conversation_id` consistently instead 3. Clean up internal callers that were passing `conversation_id=self.conversation_id` Backward compatible - existing code creating `LettaAgentV3(agent_state, actor)` still works since `conversation_id` defaults to `None`. 👾 Generated with [Letta Code](https://letta.com) Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:44:04 -08:00
cthomas	6f8f227e64	fix: improve approval retry idempotency check for server-side tool calls (#9136 ) Problem: When retrying an approval response, the idempotency check only looked at the last message. If the approved tool triggered server-side tool calls (e.g., `memory`), those tool returns would be the last message, causing the idempotency check to fail with: "Cannot process approval response: No tool call is currently awaiting approval." Root Cause: The check at line 249 only validated `current_in_context_messages[-1]`, but server-side tool calls can add additional tool return messages after the original approved tool's return. Fix: Search the last 10 messages (instead of just the last one) for a tool return matching the approval's tool_call_ids. This handles the case where server-side tool calls happen after the approved tool executes, while keeping the search bounded and efficient. 👾 Generated with [Letta Code](https://letta.com) Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:44:04 -08:00
cthomas	18274b5a42	fix: handle null step_id on approval request messages (#9074 ) Problem: Runs failed with error: ``` Argument step_id does not match type <class 'str'>; is None of type <class 'NoneType'> ``` This happened when processing approval responses where the original approval request message had `step_id=None`. Root Cause: Line 672 in `_step()` directly used `approval_request.step_id`: ```python step_id = approval_request.step_id # Can be None! step_metrics = await self.step_manager.get_step_metrics_async(step_id=step_id, ...) ``` `Message.step_id` is `Optional[str]` (default None), but `get_step_metrics_async` has `step_id: str` with `@enforce_types` validation. Old approval messages or edge cases could have `step_id=None`, causing the enforce_types decorator to reject the call. Fix: Check if `step_id is None` and generate a new step_id + initialize step checkpoint if needed, instead of assuming step_id always exists. Note: Similar issue exists in letta_agent_v2.py and temporal agents, but v2 is deprecated. 👾 Generated with [Letta Code](https://letta.com) Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:44:04 -08:00
cthomas	3e49cf5d44	fix: load default provider config when summarizer uses different prov… (#9051 ) fix: load default provider config when summarizer uses different provider Problem: Summarization failed when agent used one provider (e.g., Google AI) but summarizer config specified a different provider (e.g., Anthropic): ```python # Agent LLM config model_endpoint_type='google_ai', handle='gemini-something/gemini-2.5-pro', context_window=100000 # Summarizer config model='anthropic/claude-haiku-4-5-20251001' # Bug: Resulting summarizer_llm_config mixed Google + Anthropic settings model='claude-haiku-4-5-20251001', model_endpoint_type='google_ai', # ❌ Wrong endpoint! context_window=100000 # ❌ Google's context window, not Anthropic's default! ``` This sent Claude requests to Google AI endpoints with incorrect parameters. Root Cause: `_build_summarizer_llm_config()` always copied the agent's LLM config as base, then patched model/provider fields. But this kept all provider-specific settings (endpoint, context_window, etc.) from the wrong provider. Fix: 1. Parse provider_name from summarizer handle 2. Check if it matches agent's model_endpoint_type (or provider_name for custom) 3. If YES → Use agent config as base, override model/handle (same provider) 4. If NO → Load default config via `provider_manager.get_llm_config_from_handle()` (new provider) Example Flow: ```python # Agent: google_ai/gemini-2.5-pro # Summarizer: anthropic/claude-haiku provider_name = "anthropic" # Parsed from handle provider_matches = ("anthropic" == "google_ai") # False ❌ # Different provider → load default Anthropic config base = await provider_manager.get_llm_config_from_handle( handle="anthropic/claude-haiku", actor=self.actor ) # Returns: model_endpoint_type='anthropic', endpoint='https://api.anthropic.com', etc. ✅ ``` Result: - Summarizer with different provider gets correct default config - No more mixing Google endpoints with Anthropic models - Same-provider summarizers still inherit agent settings efficiently 👾 Generated with [Letta Code](https://letta.com) Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:44:04 -08:00
cthomas	4c2253dc76	fix: use repr() fallback for empty exception messages in error logging (#9047 ) Problem: Error logs showed empty detail fields when exceptions had no message: ``` Error during step processing: Run run-xxx stopped with unknown error: , error_data: {...'detail': ''} ``` This made debugging production issues difficult as the actual error type was hidden. Root Cause: Python exceptions created with no arguments (e.g., `Exception()` or caught and re-raised in certain ways) have `str(e) == ""`: ```python e = Exception() str(e) # Returns "" repr(e) # Returns "Exception()" ``` When exceptions with empty string representations were caught, all logging and error messages showed blank details. Fix: Use `str(e) or repr(e)` fallback pattern in 3 places: 1. `letta_agent_v3.py` stream() exception handler (line 406) 2. `letta_agent_v3.py` step() exception handler (line 928) 3. `streaming_service.py` generic exception handler (line 469) Result: - Error logs now show `Exception()` or similar instead of empty string - Helps identify exception types even when message is missing - Better production debugging without changing exception handling logic 👾 Generated with [Letta Code](https://letta.com) Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:44:04 -08:00
cthomas	57ab117437	feat: dedupe approval response retries on server (#9038 )	2026-01-29 12:44:04 -08:00
Kian Jones	e3fb00f970	feat(crouton): add orgId, userId, Compaction_Settings and LLM_Config (#9022 ) * LC one shot? * api changes * fix summarizer nameerror	2026-01-29 12:44:04 -08:00
Kian Jones	273ca9ec44	feat(tests): add crouton telemetry tests (#9000 ) * test: add comprehensive provider trace telemetry tests Add two test files for provider trace telemetry: 1. test_provider_trace.py - Integration tests for: - Basic agent steps (streaming and non-streaming) - Tool calls - Telemetry context fields (agent_id, agent_tags, step_id, run_id) - Multi-step conversations - Request/response JSON content 2. test_provider_trace_summarization.py - Unit tests for: - simple_summary() telemetry context passing - summarize_all() telemetry pass-through - summarize_via_sliding_window() telemetry pass-through - Summarizer class runtime vs constructor telemetry - LLMClient.set_telemetry_context() method 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * test: add telemetry tests for tool generation, adapters, and agent versions Add comprehensive unit tests for provider trace telemetry: - TestToolGenerationTelemetry: Verify /generate-tool endpoint sets call_type="tool_generation" and has no agent context - TestLLMClientTelemetryContext: Verify LLMClient.set_telemetry_context accepts all telemetry fields - TestAdapterTelemetryAttributes: Verify base adapter and subclasses (LettaLLMRequestAdapter, LettaLLMStreamAdapter) support telemetry attrs - TestSummarizerTelemetry: Verify Summarizer stores and passes telemetry - TestAgentAdapterInstantiation: Verify LettaAgentV2 creates Summarizer with correct agent_id 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * ci: add provider trace telemetry tests to unit test workflow Add the new provider trace test files to the CI matrix: - test_provider_trace_backends.py - test_provider_trace_summarization.py - test_provider_trace_agents.py 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: update socket backend test to match new record structure The socket backend record structure changed - step_id/run_id are now at top level, and model/usage are nested in request/response objects. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: add step_id to V1 agent telemetry context Pass step_id to set_telemetry_context in both streaming and non-streaming paths in LettaAgent (v1). The step_id is available via step_metrics.id in the non-streaming path and passed explicitly in the streaming path. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:43:53 -08:00
Kian Jones	4d256b3399	feat: add agent_id, run_id, step_id to summarization provider traces (#8996 ) * feat: add agent_id, run_id, step_id to summarization provider traces Summarization LLM calls were missing telemetry context (agent_id, agent_tags, run_id, step_id), making it impossible to attribute summarization costs to specific agents or trace them back to the step that triggered compaction. Changes: - Add step_id param to simple_summary() and set_telemetry_context() - Add agent_id, agent_tags, run_id, step_id to summarize_all() and summarize_via_sliding_window() - Update Summarizer class to accept and pass telemetry context - Update LettaAgentV3.compact() to pass full telemetry context - Update LettaAgentV2.summarize_conversation_history() with run_id/step_id - Update LettaAgent (v1) streaming methods with run_id param - Add run_id/step_id to SummarizeParams for Temporal activities 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: update test mock to accept new summarization params 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:43:53 -08:00
Kian Jones	7133083b81	fix: agent_tags for provider traces (#8989 ) * add include tags * include agent_tags and pass them into the adapter	2026-01-29 12:43:53 -08:00
Charles Packer	2fc592e0b6	feat(core): add image support in tool returns [LET-7140] (#8985 ) * feat(core): add image support in tool returns [LET-7140] Enable tool_return to support both string and ImageContent content parts, matching the pattern used for user message inputs. This allows tools executed client-side to return images back to the agent. Changes: - Add LettaToolReturnContentUnion type for text/image content parts - Update ToolReturn schema to accept Union[str, List[content parts]] - Update converters for each provider: - OpenAI Chat Completions: placeholder text for images - OpenAI Responses API: full image support - Anthropic: full image support with base64 - Google: placeholder text for images - Add resolve_tool_return_images() for URL-to-base64 conversion - Make create_approval_response_message_from_input() async 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix(core): support images in Google tool returns as sibling parts Following the gemini-cli pattern: images in tool returns are sent as sibling inlineData parts alongside the functionResponse, rather than inside it. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * test(core): add integration tests for multi-modal tool returns [LET-7140] Tests verify that: - Models with image support (Anthropic, OpenAI Responses API) can see images in tool returns and identify the secret text - Models without image support (Chat Completions) get placeholder text and cannot see the actual image content - Tool returns with images persist correctly in the database Uses secret.png test image containing hidden text "FIREBRAWL" that models must identify to pass the test. Also fixes misleading comment about Anthropic only supporting base64 images - they support URLs too, we just pre-resolve for consistency. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: simplify tool return image support implementation Reduce code verbosity while maintaining all functionality: - Extract _resolve_url_to_base64() helper in message_helper.py (eliminates duplication) - Add _get_text_from_part() helper for text extraction - Add _get_base64_image_data() helper for image data extraction - Add _tool_return_to_google_parts() to simplify Google implementation - Add _image_dict_to_data_url() for OpenAI Responses format - Use walrus operator and list comprehensions where appropriate - Add integration_test_multi_modal_tool_returns.py to CI workflow Net change: -120 lines while preserving all features and test coverage. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix(tests): improve prompt for multi-modal tool return tests Make prompts more direct to reduce LLM flakiness: - Simplify tool description: "Retrieves a secret image with hidden text. Call this function to get the image." - Change user prompt from verbose request to direct command: "Call the get_secret_image function now." - Apply to both test methods This reduces ambiguity and makes tool calling more reliable across different LLM models. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix bugs * test(core): add google_ai/gemini-2.0-flash-exp to multi-modal tests Add Gemini model to test coverage for multi-modal tool returns. Google AI already supports images in tool returns via sibling inlineData parts. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix(ui): handle multi-modal tool_return type in frontend components Convert Union<string, LettaToolReturnContentUnion[]> to string for display: - ViewRunDetails: Convert array to '[Image here]' placeholder - ToolCallMessageComponent: Convert array to '[Image here]' placeholder Fixes TypeScript errors in web, desktop-ui, and docker-ui type-checks. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Caren Thomas <carenthomas@gmail.com>	2026-01-29 12:43:53 -08:00
Christina Tong	fa92f711fe	add conversation_id to message obj before persisting (#8984 )	2026-01-29 12:43:53 -08:00
Kian Jones	81b5d71889	feat: add agents and log error properly (#8914 ) * add agents and log error properly * fix llm stream adapter	2026-01-19 15:54:43 -08:00
Kian Jones	a92e868ee6	feat: centralize telemetry logging at LLM client level (#8815 ) * feat: centralize telemetry logging at LLM client level Moves telemetry logging from individual adapters to LLMClientBase: - Add TelemetryStreamWrapper for streaming telemetry on stream close - Add request_async_with_telemetry() for non-streaming requests - Add stream_async_with_telemetry() for streaming requests - Add set_telemetry_context() to configure agent_id, run_id, step_id Updates adapters and agents to use new pattern: - LettaLLMAdapter now accepts agent_id/run_id in constructor - Adapters call set_telemetry_context() before LLM requests - Removes duplicate telemetry logging from adapters - Enriches traces with agent_id, run_id, call_type metadata 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: accumulate streaming response content for telemetry TelemetryStreamWrapper now extracts actual response data from chunks: - Content text (concatenated from deltas) - Tool calls (id, name, arguments) - Model name, finish reason, usage stats 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: move streaming telemetry to caller (option 3) - Remove TelemetryStreamWrapper class - Add log_provider_trace_async() helper to LLMClientBase - stream_async_with_telemetry() now just returns raw stream - Callers log telemetry after processing with rich interface data Updated callers: - summarizer.py: logs content + usage after stream processing - letta_agent.py: logs tool_call, reasoning, model, usage 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: pass agent_id and run_id to parent adapter class LettaLLMStreamAdapter was not passing agent_id/run_id to parent, causing "unexpected keyword argument" errors. 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:43 -08:00
Kian Jones	9418ab9815	feat: add provider trace backend abstraction for multi-backend telemetry (#8814 ) * feat: add provider trace backend abstraction for multi-backend telemetry Introduces a pluggable backend system for provider traces: - Base class with async/sync create and read interfaces - PostgreSQL backend (existing behavior) - ClickHouse backend (via OTEL instrumentation) - Socket backend (writes to Unix socket for crouton sidecar) - Factory for instantiating backends from config Refactors TelemetryManager to use backends with support for: - Multi-backend writes (concurrent via asyncio.gather) - Primary backend for reads (first in config list) - Graceful error handling per backend Config: LETTA_TELEMETRY_PROVIDER_TRACE_BACKEND (comma-separated) Example: "postgres,socket" for dual-write to Postgres and crouton 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * feat: add protocol version to socket backend records Adds PROTOCOL_VERSION constant to socket backend: - Included in every telemetry record sent to crouton - Must match ProtocolVersion in apps/crouton/main.go - Enables crouton to detect and reject incompatible messages 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: remove organization_id from ProviderTraceCreate calls The organization_id is now handled via the actor parameter in the telemetry manager, not through ProviderTraceCreate schema. This fixes validation errors after changing ProviderTraceCreate to inherit from BaseProviderTrace which forbids extra fields. 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * consolidate provider trace * add clickhouse-connect to fix bug on main lmao * auto generated sdk changes, and deployment details, and clikchouse prefix bug and added fields to runs trace return api * auto generated sdk changes, and deployment details, and clikchouse prefix bug and added fields to runs trace return api * consolidate provider trace * consolidate provider trace bug fix --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:43 -08:00
jnjpng	a98bc31bf3	fix: refactor enable strict mode for structured output (#8840 ) * base * test	2026-01-19 15:54:42 -08:00
jnjpng	85c242077e	feat: strict tool calling setting (#8810 ) base	2026-01-19 15:54:42 -08:00
cthomas	487bb42231	fix: summarization causing desync for conversations [LET-7014] (#8734 )	2026-01-19 15:54:41 -08:00
Sarah Wooders	97cdfb4225	Revert "feat: add strict tool calling setting [LET-6902]" (#8720 ) Revert "feat: add strict tool calling setting [LET-6902] (#8577)" This reverts commit 697c9d0dee6af73ec4d5d98780e2ca7632a69173.	2026-01-19 15:54:39 -08:00
Sarah Wooders	b888c4c17a	feat: allow for conversation-level isolation of blocks (#8684 ) * feat: add conversation_id parameter to context endpoint [LET-6989] Add optional conversation_id query parameter to retrieve_agent_context_window. When provided, the endpoint uses messages from the specific conversation instead of the agent's default message_ids. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate SDK after context endpoint update [LET-6989] 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * feat: add isolated blocks support for conversations Allows conversations to have their own copies of specific memory blocks (e.g., todo_list) that override agent defaults, enabling conversation-specific state isolation. 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * undo * update apis * test * cleanup * fix tests * simplify * move override logic * patch --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-19 15:54:39 -08:00
Sarah Wooders	bdede5f90c	feat: add strict tool calling setting [LET-6902] (#8577 )	2026-01-19 15:54:38 -08:00
Sarah Wooders	96cf24264c	fix: avoid 'NoneType' object has no attribute 'name' error (#8407 )	2026-01-12 10:57:49 -08:00
Sarah Wooders	87d920782f	feat: add conversation and conversation_messages tables for concurrent messaging (#8182 )	2026-01-12 10:57:48 -08:00
Sarah Wooders	2d84af11c3	fix: override with client-side tools is overlapping (#8232 )	2026-01-12 10:57:48 -08:00
Charles Packer	3cdee2e78f	fix: include client_tools in streaming requires_approval_tools (#8230 ) When streaming, the LLM adapter needs to know which tools require approval so it can emit ApprovalRequestMessage instead of ToolCallMessage. Client-side tools were being passed to the agent but not included in the requires_approval_tools list passed to the streaming interface. This caused the streaming interface to emit tool_call_message for client tools, but the stop_reason was still requires_approval, resulting in empty approvals arrays on the client side. 🤖 Generated with [Letta Code](https://letta.com) Co-authored-by: Letta <noreply@letta.com>	2026-01-12 10:57:48 -08:00
Sarah Wooders	7669896184	feat: allow client-side tools to be specified in request (#8220 ) * feat: allow client-side tools to be specified in request Add `client_tools` field to LettaRequest to allow passing tool schemas at message creation time without requiring server-side registration. When the agent calls a client-side tool, execution pauses with stop_reason=requires_approval for the client to provide tool returns. - Add ClientToolSchema class for request-level tool schemas - Merge client tools with agent tools in _get_valid_tools() - Treat client-side tool calls as requiring approval - Add integration tests for client-side tools flow 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * test: add comprehensive end-to-end test for client-side tools Update integration test to verify the complete flow: - Agent calls client-side tool and pauses - Client provides tool return with secret code - Agent processes and responds - User asks about the code, agent recalls it - Validate full conversation history makes sense 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * update apis * fix: client-side tools schema format and test assertions - Use flat schema format for client tools (matching t.json_schema) - Support both object and dict access for client tools - Fix stop_reason assertions to access .stop_reason attribute 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * refactor: simplify client_tools access pattern ClientToolSchema objects always have .name attribute 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: add client_tools parameter to LettaAgentV2 for API compatibility V2 agent doesn't use client_tools but needs the parameter to match the base class signature. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * revert: remove client_tools from LettaRequestConfig Client-side tools don't work with background jobs since there's no client present to provide tool returns. 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: add client_tools parameter to SleeptimeMultiAgent classes Add client_tools to step() and stream() methods in: - SleeptimeMultiAgentV3 - SleeptimeMultiAgentV4 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * chore: regenerate API specs for client_tools support 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-12 10:57:48 -08:00
Sarah Wooders	0722877423	fix: validate parallel tool calls with tool rules at create/update time (#8060 ) * fix: validate parallel tool calls with tool rules at create/update time Move validation from runtime to agent create/update time for better UX. Add client-side enforcement to truncate parallel tool calls when disabled (handles providers like Gemini that ignore the setting). 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * update apis * undo --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-12 10:57:47 -08:00
Sarah Wooders	6bf5c50f42	fix: fix summarization for claude max plans (#8020 ) Co-authored-by: Letta Bot <jinjpeng@gmail.com>	2026-01-12 10:57:44 -08:00
Sarah Wooders	a7639a53eb	fix: fix summary message return for compaction (#7402 )	2026-01-12 10:57:19 -08:00
Sarah Wooders	f9f1b1e82d	feat: allow for configuration compaction and return message delta (#7378 )	2026-01-12 10:57:19 -08:00
Sarah Wooders	ae4490c5b3	fix: filter out stop reason from response streaming (#7332 )	2025-12-17 17:31:03 -08:00
jnjpng	25d75d6528	fix: update aysnc get plaintext callsites (#7069 ) * base * resolve * fix * fix	2025-12-17 17:31:02 -08:00
cthomas	e6a4b3e874	fix: nonetype not iterable (#7216 )	2025-12-17 17:31:02 -08:00
Sooty	6f48d4bd48	Correct provider name for openai-proxy in LLMConfig (#3097 )	2025-12-16 19:37:54 -08:00
Sarah Wooders	bd9f3aca9b	fix: fix `prompt_acknowledgement` usage and update summarization prompts (#7012 )	2025-12-15 12:03:09 -08:00
Sarah Wooders	a731e01e88	fix: use `model` instead of `model_settings` (#6834 )	2025-12-15 12:03:09 -08:00
Kevin Lin	4b9485a484	feat: Add max tokens exceeded to stop reasons [LET-6480] (#6576 )	2025-12-15 12:03:09 -08:00
Sarah Wooders	c9ad2fd7c4	chore: move things to debug logging (#6610 )	2025-12-15 12:03:09 -08:00
cthomas	bffb9064b8	fix: step logging error (#6755 )	2025-12-15 12:03:08 -08:00
jnjpng	4be813b956	fix: migrate sandbox and agent environment variables to encrypted only (#6623 ) * base * remove unnnecessary db migration * update * fix * update * update * comments * fix * revert * anotha --------- Co-authored-by: Letta Bot <noreply@letta.com>	2025-12-15 12:03:08 -08:00
Sarah Wooders	7ea297231a	feat: add `compaction_settings` to agents (#6625 ) * initial commit * Add database migration for compaction_settings field This migration adds the compaction_settings column to the agents table to support customized summarization configuration for each agent. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix * rename * update apis * fix tests * update web test --------- Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Kian Jones <kian@letta.com>	2025-12-15 12:02:34 -08:00
Sarah Wooders	a2dfa5af17	fix: reorder summarization (#6606 )	2025-12-15 12:02:34 -08:00
Sarah Wooders	70c57c5072	fix: various patches to summarizer (#6597 )	2025-12-15 12:02:34 -08:00
Charles Packer	1c30ad6991	fix(core): patch anthropic context caching busting (#6516 ) Co-authored-by: Sarah Wooders <sarahwooders@gmail.com>	2025-12-15 12:02:34 -08:00
Sarah Wooders	bbd52e291c	feat: refactor summarization and message persistence code [LET-6464] (#6561 )	2025-12-15 12:02:34 -08:00

1 2 3 4 5 ...

369 Commits