letta-server

Author	SHA1	Message	Date
Kian Jones	6f746c5225	fix(core): handle Anthropic overloaded errors and Unicode encoding issues (#9305 ) * fix: handle Anthropic overloaded_error in streaming interfaces * fix: handle Unicode surrogates in OpenAI requests Sanitize Unicode surrogate pairs before sending requests to OpenAI API. Surrogate pairs (U+D800-U+DFFF) are UTF-16 encoding artifacts that cause UnicodeEncodeError when encoding to UTF-8. Fixes Datadog error: 'utf-8' codec can't encode character '\ud83c' in position 326605: surrogates not allowed * fix: handle UnicodeEncodeError from lone Unicode surrogates in OpenAI requests Improved sanitize_unicode_surrogates() to explicitly filter out lone surrogate characters (U+D800 to U+DFFF) which are invalid in UTF-8. Previous implementation used errors='ignore' which could still fail in edge cases. New approach directly checks Unicode code points and removes any surrogates before data reaches httpx encoding. Also added sanitization to stream_async_responses() method which was missing it. Fixes: 'utf-8' codec can't encode character '\ud83c' in position X: surrogates not allowed	2026-02-24 10:52:06 -08:00
amysguan	16c96cc3c0	Fix sliding window cutoff logic (#9261 ) * fix sliding window cutoff calculations to use agent instead of summarizer config * allow approval messages with tool_calls as valid cutoffs, prevent approval pairs from being split * update tests with updated sliding window parameters --------- Co-authored-by: Amy Guan <amy@letta.com>	2026-02-24 10:52:06 -08:00
jnjpng	f48b60634f	refactor: extract compact logic to shared function for temporal (#9249 ) * refactor: extract compact logic to shared function Extract the compaction logic from LettaAgentV3.compact() into a standalone compact_messages() function that can be shared between the agent and temporal workflows. Changes: - Create apps/core/letta/services/summarizer/compact.py with: - compact_messages(): Core compaction logic - build_summarizer_llm_config(): LLM config builder for summarization - CompactResult: Dataclass for compaction results - Update LettaAgentV3.compact() to use compact_messages() - Update temporal summarize_conversation_history activity to use compact_messages() instead of the old Summarizer class - Add use_summary_role parameter to SummarizeParams This ensures consistent summarization behavior across different execution paths and prevents drift as we improve the implementation. * chore: clean up verbose comments * fix: correct CompactionSettings import path * fix: correct count_tokens import from summarizer_sliding_window * fix: update test patch path for count_tokens_with_tools After extracting compact logic to compact.py, the test was patching the old location. Update the patch path to the new module location. * fix: update test to use build_summarizer_llm_config from compact.py The function was moved from LettaAgentV3._build_summarizer_llm_config to compact.py as a standalone function. * fix: add early check for system prompt size in compact_messages Check if the system prompt alone exceeds the context window before attempting summarization. The system prompt cannot be compacted, so fail fast with SystemPromptTokenExceededError. * fix: properly propagate SystemPromptTokenExceededError from compact The exception handler in _step() was not setting the correct stop_reason for SystemPromptTokenExceededError, which caused the finally block to return early and swallow the exception. Add special handling to set stop_reason to context_window_overflow_in_system_prompt when SystemPromptTokenExceededError is caught. * revert: remove redundant SystemPromptTokenExceededError handling The special handling in the outer exception handler is redundant because stop_reason is already set in the inner handler at line 943. The actual fix for the test was the early check in compact_messages(), not this redundant handling. * fix: correctly re-raise SystemPromptTokenExceededError The inner exception handler was using 'raise e' which re-raised the outer ContextWindowExceededError instead of the current SystemPromptTokenExceededError. Changed to 'raise' to correctly re-raise the current exception. This bug was pre-existing but masked because _check_for_system_prompt_overflow was only called as a fallback. The new early check in compact_messages() exposed it. * revert: remove early check and restore raise e to match main behavior * fix: set should_continue=False and correctly re-raise exception - Add should_continue=False in SystemPromptTokenExceededError handler (matching main's _check_for_system_prompt_overflow behavior) - Fix raise e -> raise to correctly propagate SystemPromptTokenExceededError Note: test_large_system_prompt_summarization still fails locally but passes on main. Need to investigate why exception isn't propagating correctly on refactored branch. * fix: add SystemPromptTokenExceededError handler for post-step compaction The post-step compaction (line 1066) was missing a SystemPromptTokenExceededError exception handler. When compact_messages() raised this error, it would be caught by the outer exception handler which would: 1. Set stop_reason to "error" instead of "context_window_overflow_in_system_prompt" 2. Not set should_continue = False 3. Get swallowed by the finally block (line 1126) which returns early This caused test_large_system_prompt_summarization to fail because the exception never propagated to the test. The fix adds the same exception handler pattern used in the retry compaction flow (line 941-946), ensuring proper state is set before re-raising. This issue only affected the refactored code because on main, _check_for_system_prompt_overflow() was an instance method that set should_continue/stop_reason BEFORE raising. In the refactor, compact_messages() is a standalone function that cannot set instance state, so the caller must handle the exception and set the state.	2026-02-24 10:52:06 -08:00
jnjpng	24ea7dbaed	feat: include tools as part of token estimate in compact (#9242 ) * base * fix	2026-02-24 10:52:06 -08:00
jnjpng	3f23a23227	feat: add compaction stats (#9219 ) * base * update * last * generate * fix test	2026-02-24 10:52:06 -08:00
jnjpng	e25a0c9cdf	feat: update compact endpoint to store summary message (#9215 ) * base * add tests	2026-02-24 10:52:06 -08:00
jnjpng	d28ccc0be6	feat: add summary message and event on compaction (#9144 ) * base * update * update * revert formatting * routes * legacy * fix * review * update	2026-02-24 10:52:05 -08:00
Kian Jones	4d256b3399	feat: add agent_id, run_id, step_id to summarization provider traces (#8996 ) * feat: add agent_id, run_id, step_id to summarization provider traces Summarization LLM calls were missing telemetry context (agent_id, agent_tags, run_id, step_id), making it impossible to attribute summarization costs to specific agents or trace them back to the step that triggered compaction. Changes: - Add step_id param to simple_summary() and set_telemetry_context() - Add agent_id, agent_tags, run_id, step_id to summarize_all() and summarize_via_sliding_window() - Update Summarizer class to accept and pass telemetry context - Update LettaAgentV3.compact() to pass full telemetry context - Update LettaAgentV2.summarize_conversation_history() with run_id/step_id - Update LettaAgent (v1) streaming methods with run_id param - Add run_id/step_id to SummarizeParams for Temporal activities 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: update test mock to accept new summarization params 🤖 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-29 12:43:53 -08:00
Sarah Wooders	a7639a53eb	fix: fix summary message return for compaction (#7402 )	2026-01-12 10:57:19 -08:00
Sarah Wooders	f1bd246e9b	feat: use token streaming for anthropic summarization (#7105 )	2025-12-17 17:31:02 -08:00
Sarah Wooders	bd9f3aca9b	fix: fix `prompt_acknowledgement` usage and update summarization prompts (#7012 )	2025-12-15 12:03:09 -08:00
Sarah Wooders	a731e01e88	fix: use `model` instead of `model_settings` (#6834 )	2025-12-15 12:03:09 -08:00
Sarah Wooders	7ea297231a	feat: add `compaction_settings` to agents (#6625 ) * initial commit * Add database migration for compaction_settings field This migration adds the compaction_settings column to the agents table to support customized summarization configuration for each agent. 🐾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix * rename * update apis * fix tests * update web test --------- Co-authored-by: Letta <noreply@letta.com> Co-authored-by: Kian Jones <kian@letta.com>	2025-12-15 12:02:34 -08:00
Sarah Wooders	70c57c5072	fix: various patches to summarizer (#6597 )	2025-12-15 12:02:34 -08:00
Sarah Wooders	bbd52e291c	feat: refactor summarization and message persistence code [LET-6464] (#6561 )	2025-12-15 12:02:34 -08:00
Sarah Wooders	3569721fd4	fix: avoid infinite summarization loops (#6506 )	2025-12-15 12:02:33 -08:00
Sarah Wooders	bd97b23025	feat: fallback to `all` mode for summarizer if error (#6465 )	2025-12-15 12:02:19 -08:00
Sarah Wooders	91e3dd8b3e	feat: fix new summarizer code and add more tests (#6461 )	2025-12-15 12:02:19 -08:00
Sarah Wooders	c0b422c4c6	fix: patch summarizer and add tests (#6457 )	2025-12-15 12:02:19 -08:00
Sarah Wooders	1939a9d185	feat: patch summarizer without changes to `AgentState` (#6450 )	2025-12-15 12:02:19 -08:00
Sarah Wooders	57bb051ea4	feat: add tool return truncation to summarization as a fallback [LET-5970] (#5859 )	2025-11-13 15:36:30 -08:00
Sarah Wooders	307c85ca9a	fix: patch summarizer tests (#5196 )	2025-10-07 17:50:50 -07:00
Sarah Wooders	eb95c1330e	fix: patch summarizer for gpt-5 [LET-4562] (#5040 )	2025-10-07 17:50:48 -07:00
Matthew Zhou	96deccc45d	fix: Remove integration test summarizer (#4925 ) Remove integration test summarizer	2025-10-07 17:50:46 -07:00
Sarah Wooders	4df0a27eb0	chore: remove sync db (#4873 )	2025-10-07 17:50:45 -07:00
Kian Jones	b8e9a80d93	merge this (#4759 ) * wait I forgot to comit locally * cp the entire core directory and then rm the .git subdir	2025-09-17 15:47:40 -07:00
Kian Jones	22f70ca07c	chore: officially migrate to submodule (#4502 ) * remove apps/core and apps/fern * fix precommit * add submodule updates in workflows * submodule * remove core tests * update core revision * Add submodules: true to all GitHub workflows - Ensure all workflows can access git submodules - Add submodules support to deployment, test, and CI workflows - Fix YAML syntax issues in workflow files 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * remove core-lint * upgrade core with latest main of oss --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-09-09 12:45:53 -07:00
Andy Li	04e9f43220	chore: strings lint cleanup (#3374 )	2025-07-18 09:20:45 -07:00
Matthew Zhou	87f4bcad9a	feat: Add summarization for more scenarios (#2499 )	2025-05-29 11:10:13 -07:00
Matthew Zhou	917821a735	refactor: Deprecate local client (#2344 )	2025-05-22 18:57:14 -07:00
Andy Li	a45739444f	fix: summarization includes tool call message before truncation (#2084 ) Co-authored-by: Sarah Wooders <sarahwooders@gmail.com>	2025-05-09 15:01:12 -07:00
Andy Li	acda68c0a8	fix: summarization trims tool call without trimming tool response (#2010 ) Co-authored-by: cthomas <caren@letta.com> Co-authored-by: Sarah Wooders <sarahwooders@gmail.com>	2025-05-05 21:02:23 -07:00
Charles Packer	2ab267aa84	fix: patch .utcnow warning (#1702 )	2025-04-14 12:55:34 -07:00
Matthew Zhou	bb0ea1844f	fix: Fix cascade deletion (#1641 )	2025-04-09 10:55:32 -07:00
Sarah Wooders	b4e19f9a70	fix: patch summarizer for google and use new client (#1639 )	2025-04-08 21:10:48 -07:00
Matthew Zhou	227b76fe0e	feat: Add testing for SDK `send_message` variants (#1520 )	2025-04-01 16:54:09 -07:00
cthomas	e29f333cbe	chore: message schema api improvements (#1267 )	2025-03-13 12:04:03 -07:00
cthomas	eddd167f43	chore: remove message.text property (#1253 )	2025-03-12 10:58:31 -07:00
cthomas	c6293f2ac9	feat: extend message model to support more content types (#756 )	2025-01-23 17:24:52 -08:00
Matthew Zhou	50de3cb4b7	feat: Rework summarizer (#654 )	2025-01-22 11:19:26 -08:00
Caren Thomas	fd8961c39e	run black, add isort config to pyproject.toml	2024-12-26 19:43:11 -08:00
Shubham Naik	0b8017853a	fix: add tests to cypress	2024-12-23 14:44:08 -08:00
Shubham Naik	5a743d1dc4	Add 'apps/core/' from commit 'ea2a7395f4023f5b9fab03e6273db3b64a1181d5' git-subtree-dir: apps/core git-subtree-mainline: a8963e11e7a5a0059acbc849ce768e1eee80df61 git-subtree-split: ea2a7395f4023f5b9fab03e6273db3b64a1181d5	2024-12-22 20:31:22 -08:00

43 Commits