Go to file

Charles Packer 2fc592e0b6 feat(core): add image support in tool returns [LET-7140] (#8985 )

* feat(core): add image support in tool returns [LET-7140]

Enable tool_return to support both string and ImageContent content parts,
matching the pattern used for user message inputs. This allows tools
executed client-side to return images back to the agent.

Changes:
- Add LettaToolReturnContentUnion type for text/image content parts
- Update ToolReturn schema to accept Union[str, List[content parts]]
- Update converters for each provider:
  - OpenAI Chat Completions: placeholder text for images
  - OpenAI Responses API: full image support
  - Anthropic: full image support with base64
  - Google: placeholder text for images
- Add resolve_tool_return_images() for URL-to-base64 conversion
- Make create_approval_response_message_from_input() async

🐾 Generated with [Letta Code](https://letta.com)

Co-Authored-By: Letta <noreply@letta.com>

* fix(core): support images in Google tool returns as sibling parts

Following the gemini-cli pattern: images in tool returns are sent as
sibling inlineData parts alongside the functionResponse, rather than
inside it.

🐾 Generated with [Letta Code](https://letta.com)

Co-Authored-By: Letta <noreply@letta.com>

* test(core): add integration tests for multi-modal tool returns [LET-7140]

Tests verify that:
- Models with image support (Anthropic, OpenAI Responses API) can see
  images in tool returns and identify the secret text
- Models without image support (Chat Completions) get placeholder text
  and cannot see the actual image content
- Tool returns with images persist correctly in the database

Uses secret.png test image containing hidden text "FIREBRAWL" that
models must identify to pass the test.

Also fixes misleading comment about Anthropic only supporting base64
images - they support URLs too, we just pre-resolve for consistency.

🐾 Generated with [Letta Code](https://letta.com)

Co-Authored-By: Letta <noreply@letta.com>

* refactor: simplify tool return image support implementation

Reduce code verbosity while maintaining all functionality:
- Extract _resolve_url_to_base64() helper in message_helper.py (eliminates duplication)
- Add _get_text_from_part() helper for text extraction
- Add _get_base64_image_data() helper for image data extraction
- Add _tool_return_to_google_parts() to simplify Google implementation
- Add _image_dict_to_data_url() for OpenAI Responses format
- Use walrus operator and list comprehensions where appropriate
- Add integration_test_multi_modal_tool_returns.py to CI workflow

Net change: -120 lines while preserving all features and test coverage.

👾 Generated with [Letta Code](https://letta.com)

Co-Authored-By: Letta <noreply@letta.com>

* fix(tests): improve prompt for multi-modal tool return tests

Make prompts more direct to reduce LLM flakiness:
- Simplify tool description: "Retrieves a secret image with hidden text. Call this function to get the image."
- Change user prompt from verbose request to direct command: "Call the get_secret_image function now."
- Apply to both test methods

This reduces ambiguity and makes tool calling more reliable across different LLM models.

👾 Generated with [Letta Code](https://letta.com)

Co-Authored-By: Letta <noreply@letta.com>

* fix bugs

* test(core): add google_ai/gemini-2.0-flash-exp to multi-modal tests

Add Gemini model to test coverage for multi-modal tool returns. Google AI already supports images in tool returns via sibling inlineData parts.

👾 Generated with [Letta Code](https://letta.com)

Co-Authored-By: Letta <noreply@letta.com>

* fix(ui): handle multi-modal tool_return type in frontend components

Convert Union<string, LettaToolReturnContentUnion[]> to string for display:
- ViewRunDetails: Convert array to '[Image here]' placeholder
- ToolCallMessageComponent: Convert array to '[Image here]' placeholder

Fixes TypeScript errors in web, desktop-ui, and docker-ui type-checks.

👾 Generated with [Letta Code](https://letta.com)

Co-Authored-By: Letta <noreply@letta.com>

---------

Co-authored-by: Letta <noreply@letta.com>
Co-authored-by: Caren Thomas <carenthomas@gmail.com>

2026-01-29 12:43:53 -08:00

.github

fix: update gh templates (#3155 )

2026-01-18 13:50:17 -08:00

.skills/db-migrations-schema-changes

feat: add conversation and conversation_messages tables for concurrent messaging (#8182 )

2026-01-12 10:57:48 -08:00

alembic

feat: byok provider models in db also (#8317 )

2026-01-29 12:43:53 -08:00

assets

chore: Update README.md (#2215 )

2024-12-10 19:20:27 -08:00

certs

feat: support local https mode (#2217 )

2024-12-10 13:36:20 -08:00

chore: migrate package name to letta (#1775 )

2024-09-23 09:15:18 -07:00

examples/notebooks/data

chore: remove old examples (#6255 )

2025-11-24 19:09:33 -08:00

fern

feat(core): add image support in tool returns [LET-7140] (#8985 )

2026-01-29 12:43:53 -08:00

letta

feat(core): add image support in tool returns [LET-7140] (#8985 )

2026-01-29 12:43:53 -08:00

otel

feat: Ship traces to datadog and add trace correlation (#6311 )

2025-11-24 19:10:26 -08:00

sandbox

feat: tool function arguments passed in at runtime

2025-08-15 16:24:56 -07:00

scripts

cleanup

2025-04-21 08:43:29 -07:00

tests

feat(core): add image support in tool returns [LET-7140] (#8985 )

2026-01-29 12:43:53 -08:00

.dockerignore

fix: patch Dockerfile for purpose of docker run (#2177 )

2024-12-09 15:03:11 -08:00

.env.example

fix example

2024-12-27 11:28:00 +04:00

.gitattributes

chore: .gitattributes (#1511 )

2024-07-04 14:45:35 -07:00

.gitignore

feat: Write tests for search messages [LET-4212] (#4447 )

2025-09-05 17:52:13 -07:00

.pre-commit-config.yaml

chore: migrate to ruff (#4305 )

2025-08-29 11:11:19 -07:00

.python-version

feat: add custom version of ddtrace which supports anthropic (#8419 )

2026-01-12 10:57:48 -08:00

alembic.ini

chore: support alembic (#1867 )

2024-10-11 15:51:14 -07:00

CITATION.cff

fix: Update CITATION.cff (#2009 )

2024-11-06 23:00:17 -08:00

compose.yaml

fix: correct external db (#2163 )

2025-05-13 15:32:09 -07:00

CONTRIBUTING.md

Update contributing.md with corrected local setup steps (#3123 )

2025-12-31 12:24:14 -08:00

dev-compose.yaml

fix: refactor into common uri parsing logic, fix test, and fix compose file (#5261 )

2025-10-24 15:10:35 -07:00

development.compose.yml

fix: fix core memory heartbeat issue (#1929 )

2024-10-23 12:22:37 -07:00

docker-compose-vllm.yaml

feat: rename docker to letta/letta (#2010 )

2024-11-06 23:15:25 -08:00

Dockerfile

chore: add redis to oss docker (#7347 )

2025-12-17 17:32:25 -08:00

init.sql

chore: migrate package name to letta (#1775 )

2024-09-23 09:15:18 -07:00

LICENSE

chore: migrate package name to letta (#1775 )

2024-09-23 09:15:18 -07:00

nginx.conf

fix: Fix Docker compose startup issues (letta-ai#2056) (#2057 )

2024-11-17 19:28:53 -08:00

package-lock.json

feat: add sonnet 3.7 support (#1302 )

2025-03-24 16:36:16 -10:00

PRIVACY.md

chore: migrate package name to letta (#1775 )

2024-09-23 09:15:18 -07:00

project.json

fix: try and patch the PATCH/update issue with MCP server URL [LET-3933]

2025-09-03 09:42:57 -07:00

pyproject.toml

fix: max tokens and context window size [LET-6481] (#8298 )

2026-01-29 12:43:53 -08:00

README.md

fix: update gh templates (#3155 )

2026-01-18 13:50:17 -08:00

TERMS.md

chore: migrate package name to letta (#1775 )

2024-09-23 09:15:18 -07:00

test_watchdog_hang.py

Add lightweight event loop watchdog monitoring (#6209 )

2025-11-24 19:09:33 -08:00

uv.lock

fix: max tokens and context window size [LET-6481] (#8298 )

2026-01-29 12:43:53 -08:00

WEBHOOK_SETUP.md

feat: support webhooks for step completions (#5904 )

2025-11-13 15:36:50 -08:00

README.md

Letta (formerly MemGPT)

Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.

Letta Code: run agents locally in your terminal
Letta API: build agents into your applications

Get started in the CLI

Requires Node.js 18+

Install the Letta Code CLI tool: npm install -g @letta-ai/letta-code
Run letta in your terminal to launch an agent with memory running on your local computer

When running the CLI tool, your agent help you code and do any task you can do on your computer.

Letta Code supports skills and subagents, and bundles pre-built skills/subagents for advanced memory and continual learning. Letta is fully model-agnostic, though we recommend Opus 4.5 and GPT-5.2 for best performance (see our model leaderboard for our rankings).

Get started with the Letta API

Use the Letta API to integrate stateful agents into your own applications. Letta has a full-featured agents API, and a Python and Typescript SDK (view our API reference).

Installation

TypeScript / Node.js:

npm install @letta-ai/letta-client

Python:

pip install letta-client

Hello World example

Below is a quick example of creating a stateful agent and sending it a message (requires a Letta API key). See the full quickstart guide for complete documentation.

TypeScript:

import Letta from "@letta-ai/letta-client";

const client = new Letta({ apiKey: process.env.LETTA_API_KEY });

// Create your agent
const agentState = await client.agents.create({
  model: "openai/gpt-5.2",
  memory_blocks: [
    {
      label: "human",
      value:
        "Name: Timber. Status: dog. Occupation: building Letta, infrastructure to democratize self-improving superintelligence",
    },
    {
      label: "persona",
      value:
        "I am a self-improving superintelligence. Timber is my best friend and collaborator.",
    },
  ],
  tools: ["web_search", "fetch_webpage"],
});

console.log("Agent created with ID:", agentState.id);

// Send your agent a message
const response = await client.agents.messages.create(agentState.id, {
  input: "What do you know about me?",
});

for (const message of response.messages) {
  console.log(message);
}

Python:

from letta_client import Letta
import os

client = Letta(api_key=os.getenv("LETTA_API_KEY"))

# Create your agent
agent_state = client.agents.create(
    model="openai/gpt-5.2",
    memory_blocks=[
        {
          "label": "human",
          "value": "Name: Timber. Status: dog. Occupation: building Letta, infrastructure to democratize self-improving superintelligence"
        },
        {
          "label": "persona",
          "value": "I am a self-improving superintelligence. Timber is my best friend and collaborator."
        }
    ],
    tools=["web_search", "fetch_webpage"]
)

print(f"Agent created with ID: {agent_state.id}")

# Send your agent a message
response = client.agents.messages.create(
    agent_id=agent_state.id,
    input="What do you know about me?"
)

for message in response.messages:
    print(message)

Contributing

Letta is an open source project built by over a hundred contributors from around the world. There are many ways to get involved in the Letta OSS project!

Join the Discord: Chat with the Letta devs and other AI developers.
Chat on our forum: If you're not into Discord, check out our developer forum.
Follow our socials: Twitter/X, LinkedIn, YouTube

Legal notices: By using Letta and related Letta services (such as the Letta endpoint or hosted service), you are agreeing to our privacy policy and terms of service.