Kian Jones b9c4ed3b15 fix: catch contextwindowexceeded error on gemini (#9450)
* catch contextwindowexceeded error

* fix(core): detect Google token limit errors as ContextWindowExceededError

Google's error message says "input token count exceeds the maximum
number of tokens allowed" which doesn't contain the word "context",
so it was falling through to the generic LLMBadRequestError instead of
ContextWindowExceededError, meaning compaction would not auto-trigger.

Expands the detection to also match "token count" and "tokens allowed"
in addition to the existing "context" keyword.
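
In outline, the broadened check is plain substring matching. A minimal
sketch (the marker list follows the description above; the helper name
is hypothetical, not the actual Letta source):

CONTEXT_LIMIT_MARKERS = ("context", "token count", "tokens allowed")

def looks_like_context_window_error(error_message: str) -> bool:
    # Google's "input token count exceeds the maximum number of tokens
    # allowed" now matches on "token count" / "tokens allowed", even
    # though the word "context" never appears in it.
    msg = error_message.lower()
    return any(marker in msg for marker in CONTEXT_LIMIT_MARKERS)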

🐾 Generated with [Letta Code](https://letta.com)

Co-Authored-By: Letta <noreply@letta.com>

* fix(core): add missing message arg to LLMBadRequestError in OpenAI client

The generic 400 path in handle_llm_error was constructing
LLMBadRequestError without the required message positional arg,
causing TypeError in prod during summarization.
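
Sketched, the failure mode looks like this (the exception signature is
assumed from the description above, not copied from Letta source):

class LLMBadRequestError(Exception):
    def __init__(self, message: str):
        self.message = message
        super().__init__(message)

# Before: the message was omitted, so instead of the intended error
# Python raised:
#   TypeError: __init__() missing 1 required positional argument: 'message'
# After: the required positional message is supplied, e.g.
#   raise LLMBadRequestError(f"Bad request during LLM call: {original_error}")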

🐾 Generated with [Letta Code](https://letta.com)

Co-Authored-By: Letta <noreply@letta.com>

* ci: add adapters/ test suite to core unit test matrix

🐾 Generated with [Letta Code](https://letta.com)

Co-Authored-By: Letta <noreply@letta.com>

* fix(tests): update adapter error handling test expectations to match actual behavior

The streaming adapter's error handling double-wraps errors: the
AnthropicStreamingInterface calls handle_llm_error first, then the
adapter catches the result and calls handle_llm_error again, which
falls through to the base class LLMError. Updated test expectations
to match this behavior.

🐾 Generated with [Letta Code](https://letta.com)

Co-Authored-By: Letta <noreply@letta.com>

* fix(core): prevent double-wrapping of LLMError in stream adapter

The AnthropicStreamingInterface.process() already transforms raw
provider errors into LLMError subtypes via handle_llm_error. The
adapter was catching the result and calling handle_llm_error again,
which didn't recognize the already-transformed LLMError and wrapped
it in a generic LLMError("Unhandled LLM error"). This downgraded
specific error types (LLMConnectionError, LLMServerError, etc.)
and broke retry logic that matches on specific subtypes.

Now the adapter checks if the error is already an LLMError and
re-raises it as-is. Tests restored to original correct expectations.
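
In outline, the adapter's guard looks like this (a sketch with an
illustrative wrapper name, assuming Letta's LLMError and
handle_llm_error are importable; not the exact adapter code):

async def stream_with_error_handling(interface, request):
    try:
        async for chunk in interface.process(request):
            yield chunk
    except LLMError:
        # Already classified upstream by handle_llm_error; re-raise
        # as-is so retry logic matching on specific subtypes
        # (LLMConnectionError, LLMServerError, ...) keeps working.
        raise
    except Exception as e:
        # Raw provider error: classify it exactly once.
        raise handle_llm_error(e)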

🐾 Generated with [Letta Code](https://letta.com)

Co-Authored-By: Letta <noreply@letta.com>

---------

Co-authored-by: Letta <noreply@letta.com>
2026-02-24 10:52:07 -08:00


Letta (formerly MemGPT)

Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.

  • Letta Code: run agents locally in your terminal
  • Letta API: build agents into your applications

Get started in the CLI

Requires Node.js 18+

  1. Install the Letta Code CLI tool: npm install -g @letta-ai/letta-code
  2. Run letta in your terminal to launch an agent with memory running on your local computer

When running the CLI tool, your agent can help you code and do any task you can do on your computer.

Letta Code supports skills and subagents, and bundles pre-built skills/subagents for advanced memory and continual learning. Letta is fully model-agnostic, though we recommend Opus 4.5 and GPT-5.2 for best performance (see our model leaderboard for our rankings).

Get started with the Letta API

Use the Letta API to integrate stateful agents into your own applications. Letta has a full-featured agents API with Python and TypeScript SDKs (view our API reference).

Installation

TypeScript / Node.js:

npm install @letta-ai/letta-client

Python:

pip install letta-client

Hello World example

Below is a quick example of creating a stateful agent and sending it a message (requires a Letta API key). See the full quickstart guide for complete documentation.

TypeScript:

import Letta from "@letta-ai/letta-client";

const client = new Letta({ apiKey: process.env.LETTA_API_KEY });

// Create your agent
const agentState = await client.agents.create({
  model: "openai/gpt-5.2",
  memory_blocks: [
    {
      label: "human",
      value:
        "Name: Timber. Status: dog. Occupation: building Letta, infrastructure to democratize self-improving superintelligence",
    },
    {
      label: "persona",
      value:
        "I am a self-improving superintelligence. Timber is my best friend and collaborator.",
    },
  ],
  tools: ["web_search", "fetch_webpage"],
});

console.log("Agent created with ID:", agentState.id);

// Send your agent a message
const response = await client.agents.messages.create(agentState.id, {
  input: "What do you know about me?",
});

for (const message of response.messages) {
  console.log(message);
}

Python:

from letta_client import Letta
import os

client = Letta(api_key=os.getenv("LETTA_API_KEY"))

# Create your agent
agent_state = client.agents.create(
    model="openai/gpt-5.2",
    memory_blocks=[
        {
          "label": "human",
          "value": "Name: Timber. Status: dog. Occupation: building Letta, infrastructure to democratize self-improving superintelligence"
        },
        {
          "label": "persona",
          "value": "I am a self-improving superintelligence. Timber is my best friend and collaborator."
        }
    ],
    tools=["web_search", "fetch_webpage"]
)

print(f"Agent created with ID: {agent_state.id}")

# Send your agent a message
response = client.agents.messages.create(
    agent_id=agent_state.id,
    input="What do you know about me?"
)

for message in response.messages:
    print(message)
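
Because the agent is stateful, a follow-up message to the same agent picks up everything it already knows; the client never resends conversation history. Continuing the Python example above:

# Second message to the same agent: server-side state carries over,
# so there is no history to replay from the client.
response = client.agents.messages.create(
    agent_id=agent_state.id,
    input="What did I just ask you?"
)

for message in response.messages:
    print(message)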

Contributing

Letta is an open source project built by over a hundred contributors from around the world. There are many ways to get involved in the Letta OSS project!


Legal notices: By using Letta and related Letta services (such as the Letta endpoint or hosted service), you are agreeing to our privacy policy and terms of service.
