jnjpng f48b60634f refactor: extract compact logic to shared function for temporal (#9249 )

* refactor: extract compact logic to shared function

Extract the compaction logic from LettaAgentV3.compact() into a
standalone compact_messages() function that can be shared between
the agent and temporal workflows.

Changes:
- Create apps/core/letta/services/summarizer/compact.py with:
  - compact_messages(): Core compaction logic
  - build_summarizer_llm_config(): LLM config builder for summarization
  - CompactResult: Dataclass for compaction results
- Update LettaAgentV3.compact() to use compact_messages()
- Update temporal summarize_conversation_history activity to use
  compact_messages() instead of the old Summarizer class
- Add use_summary_role parameter to SummarizeParams

This ensures consistent summarization behavior across different
execution paths and prevents drift as we improve the implementation.

* chore: clean up verbose comments

* fix: correct CompactionSettings import path

* fix: correct count_tokens import from summarizer_sliding_window

* fix: update test patch path for count_tokens_with_tools

After extracting compact logic to compact.py, the test was patching
the old location. Update the patch path to the new module location.

* fix: update test to use build_summarizer_llm_config from compact.py

The function was moved from LettaAgentV3._build_summarizer_llm_config
to compact.py as a standalone function.

* fix: add early check for system prompt size in compact_messages

Check if the system prompt alone exceeds the context window before
attempting summarization. The system prompt cannot be compacted,
so fail fast with SystemPromptTokenExceededError.

* fix: properly propagate SystemPromptTokenExceededError from compact

The exception handler in _step() was not setting the correct stop_reason
for SystemPromptTokenExceededError, which caused the finally block to
return early and swallow the exception.

Add special handling to set stop_reason to context_window_overflow_in_system_prompt
when SystemPromptTokenExceededError is caught.
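
The swallowing behaviour described above can be reproduced in a minimal sketch. The function `run_step` and its `stop_reason` argument are hypothetical stand-ins, not the actual `_step()` code; the sketch only assumes that the `finally` block returns early when the stop reason is not the expected one. Newer Python versions may warn about `return` inside `finally` for exactly this reason.

```python
class SystemPromptTokenExceededError(Exception):
    """Empty stand-in that borrows the name used in this commit message."""


def run_step(stop_reason):
    """Hypothetical step: raises, then runs a finally block that may return."""
    try:
        raise SystemPromptTokenExceededError("system prompt too large")
    finally:
        # A `return` inside `finally` discards the exception in flight.
        if stop_reason != "context_window_overflow_in_system_prompt":
            return "returned early"


def outcome(stop_reason):
    """Report whether the exception reached the caller."""
    try:
        return run_step(stop_reason)
    except SystemPromptTokenExceededError:
        return "propagated"


print(outcome("error"))  # returned early
print(outcome("context_window_overflow_in_system_prompt"))  # propagated
```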

* revert: remove redundant SystemPromptTokenExceededError handling

The special handling in the outer exception handler is redundant because
stop_reason is already set in the inner handler at line 943. The actual
fix for the test was the early check in compact_messages(), not this
redundant handling.

* fix: correctly re-raise SystemPromptTokenExceededError

The inner exception handler was using 'raise e' which re-raised the outer
ContextWindowExceededError instead of the current SystemPromptTokenExceededError.

Changed to 'raise' to correctly re-raise the current exception. This bug
was pre-existing but masked because _check_for_system_prompt_overflow was
only called as a fallback. The new early check in compact_messages() exposed it.
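
The difference between `raise e` and a bare `raise` in nested handlers can be shown with a minimal sketch. Both exception classes are empty stand-ins that only borrow the names from this commit message, and the two functions are hypothetical, not the actual handler code.

```python
class ContextWindowExceededError(Exception):
    """Stand-in for the outer error (assumed shape, not Letta's class)."""


class SystemPromptTokenExceededError(Exception):
    """Stand-in for the inner error (assumed shape, not Letta's class)."""


def reraise_with_name():
    try:
        raise ContextWindowExceededError("context window exceeded")
    except ContextWindowExceededError as e:
        try:
            raise SystemPromptTokenExceededError("system prompt too large")
        except SystemPromptTokenExceededError:
            # `e` is still bound to the OUTER exception at this point.
            raise e


def reraise_bare():
    try:
        raise ContextWindowExceededError("context window exceeded")
    except ContextWindowExceededError:
        try:
            raise SystemPromptTokenExceededError("system prompt too large")
        except SystemPromptTokenExceededError:
            # A bare `raise` re-raises the exception being handled right now.
            raise


def raised_type(fn):
    """Call fn and return the name of the exception type it raises."""
    try:
        fn()
    except Exception as exc:
        return type(exc).__name__
    return None


print(raised_type(reraise_with_name))  # ContextWindowExceededError
print(raised_type(reraise_bare))       # SystemPromptTokenExceededError
```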

* revert: remove early check and restore raise e to match main behavior

* fix: set should_continue=False and correctly re-raise exception

- Add should_continue=False in SystemPromptTokenExceededError handler (matching main's _check_for_system_prompt_overflow behavior)
- Fix raise e -> raise to correctly propagate SystemPromptTokenExceededError

Note: test_large_system_prompt_summarization still fails locally but passes on main.
Need to investigate why exception isn't propagating correctly on refactored branch.

* fix: add SystemPromptTokenExceededError handler for post-step compaction

The post-step compaction (line 1066) was missing a SystemPromptTokenExceededError
exception handler. When compact_messages() raised this error, it would be caught
by the outer exception handler which would:
1. Set stop_reason to "error" instead of "context_window_overflow_in_system_prompt"
2. Not set should_continue = False
3. Get swallowed by the finally block (line 1126) which returns early

This caused test_large_system_prompt_summarization to fail because the exception
never propagated to the test.

The fix adds the same exception handler pattern used in the retry compaction flow
(line 941-946), ensuring proper state is set before re-raising.

This issue only affected the refactored code because on main, _check_for_system_prompt_overflow()
was an instance method that set should_continue/stop_reason BEFORE raising. In the refactor,
compact_messages() is a standalone function that cannot set instance state, so the caller
must handle the exception and set the state.
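
The caller-side pattern described above can be sketched as follows. `FakeAgent`, `post_step_compaction`, and the token-count arguments are hypothetical, and this toy `compact_messages()` does not match the real signature. Only the exception name, the `should_continue` flag, and the `stop_reason` value come from this commit message.

```python
class SystemPromptTokenExceededError(Exception):
    """Empty stand-in that borrows the name used in this commit message."""


def compact_messages(system_prompt_tokens, context_window):
    """Toy standalone function: it has no access to any agent state."""
    if system_prompt_tokens > context_window:
        raise SystemPromptTokenExceededError("system prompt exceeds context window")
    return "summary"


class FakeAgent:
    """Hypothetical agent holding the state the caller must set."""

    def __init__(self):
        self.should_continue = True
        self.stop_reason = None

    def post_step_compaction(self, system_prompt_tokens, context_window):
        try:
            return compact_messages(system_prompt_tokens, context_window)
        except SystemPromptTokenExceededError:
            # Record the state first, then re-raise the current exception.
            self.should_continue = False
            self.stop_reason = "context_window_overflow_in_system_prompt"
            raise
```

With this shape, a caller that catches the error can still inspect `should_continue` and `stop_reason` on the agent afterwards.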

2026-02-24 10:52:06 -08:00

.github

fix: update gh templates (#3155 )

2026-01-18 13:50:17 -08:00

.skills

refactor: add extract_usage_statistics returning LettaUsageStatistics (#9065 )

2026-01-29 12:44:04 -08:00

alembic

feat: add metadata-only provider trace storage option (#9155 )

2026-01-29 12:44:04 -08:00

assets

chore: Update README.md (#2215 )

2024-12-10 19:20:27 -08:00

certs

feat: support local https mode (#2217 )

2024-12-10 13:36:20 -08:00

chore: migrate package name to letta (#1775 )

2024-09-23 09:15:18 -07:00

examples/notebooks/data

chore: remove old examples (#6255 )

2025-11-24 19:09:33 -08:00

fern

feat: add ID format validation to agent and user schemas (#9151 )

2026-02-24 10:52:06 -08:00

letta

refactor: extract compact logic to shared function for temporal (#9249 )

2026-02-24 10:52:06 -08:00

otel

feat: Ship traces to datadog and add trace correlation (#6311 )

2025-11-24 19:10:26 -08:00

sandbox

fix: safer type coersion for tools (#8990 )

2026-01-29 12:43:53 -08:00

scripts

cleanup

2025-04-21 08:43:29 -07:00

tests

refactor: extract compact logic to shared function for temporal (#9249 )

2026-02-24 10:52:06 -08:00

.dockerignore

fix: patch Dockerfile for purpose of docker run (#2177 )

2024-12-09 15:03:11 -08:00

.env.example

fix example

2024-12-27 11:28:00 +04:00

.gitattributes

chore: .gitattributes (#1511 )

2024-07-04 14:45:35 -07:00

.gitignore

feat: Write tests for search messages [LET-4212] (#4447 )

2025-09-05 17:52:13 -07:00

.pre-commit-config.yaml

chore: migrate to ruff (#4305 )

2025-08-29 11:11:19 -07:00

.python-version

feat: add custom version of ddtrace which supports anthropic (#8419 )

2026-01-12 10:57:48 -08:00

alembic.ini

chore: support alembic (#1867 )

2024-10-11 15:51:14 -07:00

CITATION.cff

fix: Update CITATION.cff (#2009 )

2024-11-06 23:00:17 -08:00

compose.yaml

fix: correct external db (#2163 )

2025-05-13 15:32:09 -07:00

conf.yaml

feat: add support for YAML config file (#8999 )

2026-02-24 10:52:06 -08:00

CONTRIBUTING.md

Update contributing.md with corrected local setup steps (#3123 )

2025-12-31 12:24:14 -08:00

dev-compose.yaml

Shelley/let 7218 editor should be compatible with typescript [LET-7218] (#9087 )

2026-01-29 12:44:04 -08:00

development.compose.yml

fix: fix core memory heartbeat issue (#1929 )

2024-10-23 12:22:37 -07:00

docker-compose-vllm.yaml

feat: rename docker to letta/letta (#2010 )

2024-11-06 23:15:25 -08:00

Dockerfile

chore: add redis to oss docker (#7347 )

2025-12-17 17:32:25 -08:00

init.sql

chore: migrate package name to letta (#1775 )

2024-09-23 09:15:18 -07:00

LICENSE

chore: migrate package name to letta (#1775 )

2024-09-23 09:15:18 -07:00

nginx.conf

fix: Fix Docker compose startup issues (letta-ai#2056) (#2057 )

2024-11-17 19:28:53 -08:00

package-lock.json

feat: add sonnet 3.7 support (#1302 )

2025-03-24 16:36:16 -10:00

PRIVACY.md

chore: migrate package name to letta (#1775 )

2024-09-23 09:15:18 -07:00

project.json

fix: try and patch the PATCH/update issue with MCP server URL [LET-3933]

2025-09-03 09:42:57 -07:00

pyproject.toml

bump version

2026-01-29 12:45:45 -08:00

README.md

fix: update gh templates (#3155 )

2026-01-18 13:50:17 -08:00

TERMS.md

chore: migrate package name to letta (#1775 )

2024-09-23 09:15:18 -07:00

test_watchdog_hang.py

Add lightweight event loop watchdog monitoring (#6209 )

2025-11-24 19:09:33 -08:00

uv.lock

bump version

2026-01-29 12:45:45 -08:00

WEBHOOK_SETUP.md

feat: support webhooks for step completions (#5904 )

2025-11-13 15:36:50 -08:00

README.md

Letta (formerly MemGPT)

Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.

Letta Code: run agents locally in your terminal
Letta API: build agents into your applications

Get started in the CLI

Requires Node.js 18+

Install the Letta Code CLI tool: npm install -g @letta-ai/letta-code
Run letta in your terminal to launch an agent with memory running on your local computer

When you run the CLI tool, your agent can help you code and carry out any task you could do on your computer yourself.

Letta Code supports skills and subagents, and bundles pre-built skills/subagents for advanced memory and continual learning. Letta is fully model-agnostic, though we recommend Opus 4.5 and GPT-5.2 for best performance (see our model leaderboard for our rankings).

Get started with the Letta API

Use the Letta API to integrate stateful agents into your own applications. Letta has a full-featured agents API, with Python and TypeScript SDKs (view our API reference).

Installation

TypeScript / Node.js:

npm install @letta-ai/letta-client

Python:

pip install letta-client

Hello World example

Below is a quick example of creating a stateful agent and sending it a message (requires a Letta API key). See the full quickstart guide for complete documentation.

TypeScript:

import Letta from "@letta-ai/letta-client";

const client = new Letta({ apiKey: process.env.LETTA_API_KEY });

// Create your agent
const agentState = await client.agents.create({
  model: "openai/gpt-5.2",
  memory_blocks: [
    {
      label: "human",
      value:
        "Name: Timber. Status: dog. Occupation: building Letta, infrastructure to democratize self-improving superintelligence",
    },
    {
      label: "persona",
      value:
        "I am a self-improving superintelligence. Timber is my best friend and collaborator.",
    },
  ],
  tools: ["web_search", "fetch_webpage"],
});

console.log("Agent created with ID:", agentState.id);

// Send your agent a message
const response = await client.agents.messages.create(agentState.id, {
  input: "What do you know about me?",
});

for (const message of response.messages) {
  console.log(message);
}

Python:

from letta_client import Letta
import os

client = Letta(api_key=os.getenv("LETTA_API_KEY"))

# Create your agent
agent_state = client.agents.create(
    model="openai/gpt-5.2",
    memory_blocks=[
        {
            "label": "human",
            "value": "Name: Timber. Status: dog. Occupation: building Letta, infrastructure to democratize self-improving superintelligence",
        },
        {
            "label": "persona",
            "value": "I am a self-improving superintelligence. Timber is my best friend and collaborator.",
        },
    ],
    tools=["web_search", "fetch_webpage"],
)

print(f"Agent created with ID: {agent_state.id}")

# Send your agent a message
response = client.agents.messages.create(
    agent_id=agent_state.id,
    input="What do you know about me?"
)

for message in response.messages:
    print(message)

Contributing

Letta is an open source project built by over a hundred contributors from around the world. There are many ways to get involved in the Letta OSS project!

Join the Discord: Chat with the Letta devs and other AI developers.
Chat on our forum: If you're not into Discord, check out our developer forum.
Follow our socials: Twitter/X, LinkedIn, YouTube

Legal notices: By using Letta and related Letta services (such as the Letta endpoint or hosted service), you are agreeing to our privacy policy and terms of service.