letta-server

Author	SHA1	Message	Date
Ari Webb	851798d71a	fix: step_id is none (#8528 )	2026-01-19 15:54:37 -08:00
Kian Jones	82e5d70807	fix: prevent empty reasoning messages in streaming interfaces (#7207 ) * fix: prevent empty reasoning messages in streaming interfaces Prevents empty "Thinking..." indicators from appearing in clients by filtering out reasoning messages with no content at the source. Changes: - Gemini: Don't emit ReasoningMessage when only thought_signature exists - Gemini: Only emit reasoning content if text is non-empty - Anthropic: Don't emit ReasoningMessage for BetaSignatureDelta - Anthropic: Only emit reasoning content if thinking text is non-empty This fixes the issue where providers send signature metadata before actual thinking content, causing empty reasoning blocks to appear in the UI after responses complete. Affects: Gemini reasoning, Anthropic extended thinking 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: handle Anthropic thinking signature correctly - Only include 'signature' in Anthropic message payload if it is not None (fixes BadRequestError). - Capture and attach 'signature' to ReasoningMessage in streaming interface. * fix(anthropic): attach signature to last reasoning message in stream --------- Co-authored-by: Letta <noreply@letta.com>	2026-01-12 10:57:19 -08:00
Sarah Wooders	8729a037b9	fix: handle new openai overflow error format (#7110 )	2025-12-17 17:31:02 -08:00
Devansh Jain	d1536df6f6	chore: Update deepseek client for v3.2 models (#6556 ) * support for v3.2 models * streaming + context window fix * fix for no assitant text from deepseek	2025-12-15 12:02:34 -08:00
Kian Jones	647e271c2a	fix: add more logging for stream error (#6490 ) * trying tout gpt-5.1-codex * add unit test for message content * try to support multimodal * remove ValueError and add logging on stream error * prevent stream termination from api spec implementation errors * fix: remove final_response references from non-Responses API interfaces * fix: add diagnostic attributes to SimpleOpenAIResponsesStreamingInterface * fix: remove final_response from SimpleOpenAIStreamingInterface (Chat Completions API)	2025-12-15 12:02:33 -08:00
Sarah Wooders	91e3dd8b3e	feat: fix new summarizer code and add more tests (#6461 )	2025-12-15 12:02:19 -08:00
Charles Packer	1f7165afc4	fix: patch counting of tokens for anthropic (#6458 ) * fix: patch counting of tokens for anthropic * fix: patch ui to be simpler * fix: patch undercounting bug in anthropic when caching is on	2025-12-15 12:02:19 -08:00
Charles Packer	4af6465226	feat(core+web): store raw usage data on streams (and visualize properly in ADE) (#6452 ) * feat(core): store raw usage data on streams * fix(web): various fixes to deal w/ hardcoding against openai	2025-12-15 12:02:19 -08:00
Charles Packer	88a3743cc8	fix(core): distinguish between null and 0 for prompt caching (#6451 ) * fix(core): distinguish between null and 0 for prompt caching * fix: runtime errors * fix: just publish just sgate	2025-12-15 12:02:19 -08:00
Charles Packer	131891e05f	feat: add tracking of advanced usage data (eg caching) [LET-6372] (#6449 ) * feat: init refactor * feat: add helper code * fix: missing file + test * fix: just state/publish api	2025-12-15 12:02:19 -08:00
Charles Packer	e142d440d5	fix: patch gemini token counting (#6445 ) fix: use usage_metadata.candidates_token_count for counting total tokens	2025-12-15 12:02:18 -08:00
Charles Packer	081a1f6920	fix(core): patch responses api parallel tool calling not returning tool call IDs (#6079 ) * fix(core): patch responses api parallel tool calling not returning tool call ids * fix(core): patch chatcompletions as well * fix: patch problem with gpt-4.1	2025-11-13 15:36:56 -08:00
Matthew Zhou	72e80395cc	fix: Fix gemini streaming interface string growth [LET-6067] (#5975 ) * Fix gemini streaming interface * Add comments	2025-11-13 15:36:55 -08:00
Matthew Zhou	6f57ae829a	fix: Reduce string growth for anthropic (#5974 ) Reduce string growth for anthropic	2025-11-13 15:36:55 -08:00
Matthew Zhou	a699aca626	fix: Eliminate O(n^2) string growth for OpenAI [LET-6065] (#5973 ) Finish	2025-11-13 15:36:55 -08:00
Matthew Zhou	7b3cb0224a	feat: Add gemini parallel tool call streaming for gemini [LET-6027] (#5913 ) * Make changes to gemini streaming interface to support parallel tool calling * Finish send message integration test * Add comments	2025-11-13 15:36:39 -08:00
Matthew Zhou	d3ab51b822	feat: Support parallel tool calling streaming for responses OpenAI [LET-5977] (#5867 )	2025-11-13 15:36:20 -08:00
Matthew Zhou	ff81f4153b	feat: Support parallel tool calling streaming for OpenAI chat completions [LET-4594] (#5865 ) * Finish chat completions parallel tool calling * Undo comments * Add comments * Remove test file	2025-11-13 15:36:14 -08:00
Ari Webb	9d5fdc6de7	feat: migrate integration test mcp serverspy to use 1.0 client [LET-5945] (#5814 ) * new test first hack, should still break --------- Co-authored-by: Ari Webb <ari@letta.com>	2025-11-13 15:36:14 -08:00
cthomas	1848df2daa	feat: add special approval request otid for openai streaming (#5744 ) * feat: add special approval request otid for openai streaming * fix import	2025-10-24 15:14:39 -07:00
cthomas	c67bdd9c64	fix: special approval message otid for gemini streaming (#5742 )	2025-10-24 15:14:39 -07:00
cthomas	6b37ef2cb7	fix: special otid handling for approval request (#5726 )	2025-10-24 15:14:31 -07:00
cthomas	4823416af9	feat: default unpack assistant message content [LET-5404] (#5707 ) feat: default unpack assistant message content	2025-10-24 15:14:20 -07:00
cthomas	73dcc0d4b7	feat: latest hitl + parallel tool call changes (#5565 )	2025-10-24 15:12:49 -07:00
Matthew Zhou	643ec8fe2f	fix: Double write tool call deltas [LET-5545] (#5461 ) * Double write tool call deltas * Fix	2025-10-24 15:12:11 -07:00
Kevin Lin	08da1a64bb	feat: parse `reasoning_content` from OAI proxies (eg. vLLM / OpenRouter) (#5372 ) * reasonig_content support * fix * comment * fix * rm comment --------- Co-authored-by: Charles Packer <packercharles@gmail.com>	2025-10-24 15:11:31 -07:00
Kian Jones	c2e474e03a	feat: refactor logs to parse as a single log line each and filter out 404s from sentry (#5242 ) * add multiline log auto detect * implement logger.exception() * filter out 404 * remove potentially problematic changes	2025-10-24 15:11:31 -07:00
Matthew Zhou	7511b0f4fe	feat: Write anthropic streaming interface that supports parallel tool calling [LET-5355] (#5295 ) Write anthropic streaming interface that supports parallel tool calling	2025-10-09 15:25:21 -07:00
Matthew Zhou	5593f1450b	feat: Double write to `ToolCallMessage`'s new list `tool_calls` field (#5268 ) * Add new tool_calls field to ToolCallMessage * fern autogen * Double write to new tool_calls field * Update straggling instances	2025-10-09 13:20:52 -07:00
cthomas	cc913df27c	feat: add signature to content parts (#5134 ) * feat: add signature to content parts * always base64 encode thought signature * propagate thought signature back to request	2025-10-07 17:50:49 -07:00
cthomas	93d9ff01c6	feat: add gemini native thinking (#5124 ) * feat: add gemini native thinking * update test * revert comments	2025-10-07 17:50:49 -07:00
cthomas	3e17b4289a	feat: gracefully handle gemini empty content parts (#5116 )	2025-10-07 17:50:48 -07:00
cthomas	f7755d837a	feat: add gemini streaming to new agent loop (#5109 ) * feat: add gemini streaming to new agent loop * add google as required dependency * support storing all content parts * remove extra google references	2025-10-07 17:50:48 -07:00
Sarah Wooders	ef07e03ee3	feat: add `run_id` to input messages and `step_id` to messages (#5099 )	2025-10-07 17:50:48 -07:00
cthomas	a3545110cf	feat: add full responses api support in new agent loop (#5051 ) * feat: add full responses api support in new agent loop * update matrix in workflow * relax check for reasoning messages for high effort gpt 5 * fix indent * one more relax	2025-10-07 17:50:48 -07:00
cthomas	67f8e46619	feat: add run id to streamed messages (#5037 )	2025-10-07 17:50:47 -07:00
cthomas	f235dfb356	feat: add tool call test for new agent loop (#5034 )	2025-10-07 17:50:47 -07:00
Charles Packer	a4041879a4	feat: add new agent loop (squash rebase of OSS PR) (#4815 ) * feat: squash rebase of OSS PR * fix: revert changes that weren't on manual rebase * fix: caught another one * fix: disable force * chore: drop print * fix: just stage-api && just publish-api * fix: make agent_type consistently an arg in the client * fix: patch multi-modal support * chore: put in todo stub * fix: disable hardcoding for tests * fix: patch validate agent sync (#4882) patch validate agent sync * fix: strip bad merge diff * fix: revert unrelated diff * fix: react_v2 naming -> letta_v1 naming * fix: strip bad merge --------- Co-authored-by: Kevin Lin <klin5061@gmail.com>	2025-10-07 17:50:45 -07:00
Kian Jones	b8e9a80d93	merge this (#4759 ) * wait I forgot to comit locally * cp the entire core directory and then rm the .git subdir	2025-09-17 15:47:40 -07:00
Kian Jones	22f70ca07c	chore: officially migrate to submodule (#4502 ) * remove apps/core and apps/fern * fix precommit * add submodule updates in workflows * submodule * remove core tests * update core revision * Add submodules: true to all GitHub workflows - Ensure all workflows can access git submodules - Add submodules support to deployment, test, and CI workflows - Fix YAML syntax issues in workflow files 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * remove core-lint * upgrade core with latest main of oss --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-09-09 12:45:53 -07:00
cthomas	cb7296c81d	fix: approval request for streaming (#4445 ) * fix: approval request for streaming * fix: claude code attempt, unit test passing (add on to #4445) (#4448) * fix: claude code attempt, unit test passing * chore: update locks to 0.1.314 from 0.1.312 * chore: just stage-api && just publish-api * chore: drop dead poetry lock --------- Co-authored-by: Charles Packer <packercharles@gmail.com>	2025-09-05 17:43:21 -07:00
Charles Packer	264171f327	fix: patch streaming hidden reasoning event [LET-4167] (#4367 ) * fix: patch streaming hidden reasoning event * fix: patch reasoning_effort not getting passed to openai	2025-09-02 16:21:18 -07:00
Charles Packer	9d49eff204	fix: patch the streaming issue in the openai client for when inner_thoughts_in_kwargs is off [LET-4146] (#4350 ) fix: patch the streaming issue in the openai client for when inner_thoughts_in_kwargs is off	2025-09-02 12:44:25 -07:00
Charles Packer	e741f84add	fix: patch bug w/ extended thinking mode involving `text` leaking into `reasoning` (#4341 ) * fix: patch for bad native reasoning behavior w/ sonnet * fix: cleanup * fix: cleanup * fix: another prompt tune for less flaking	2025-09-01 20:26:24 -07:00
cthomas	1edcc13778	feat: support filtering out messages when converting to openai dict (#4337 ) * feat: support filtering out messages when converting to openai dict * fix imports	2025-09-01 12:48:45 -07:00
Kian Jones	fecf6decfb	chore: migrate to ruff (#4305 ) * base requirements * autofix * Configure ruff for Python linting and formatting - Set up minimal ruff configuration with basic checks (E, W, F, I) - Add temporary ignores for common issues during migration - Configure pre-commit hooks to use ruff with pass_filenames - This enables gradual migration from black to ruff * Delete sdj * autofixed only * migrate lint action * more autofixed * more fixes * change precommit * try changing the hook * try this stuff	2025-08-29 11:11:19 -07:00
cthomas	c8b370466e	fix: duplicate message stream error (#3834 )	2025-08-11 14:27:35 -07:00
cthomas	db41f01ac2	feat: continue stream processing on client cancel (#3796 )	2025-08-07 13:17:36 -07:00
Andy Li	ca6f474c4e	feat: track metrics for runs in db	2025-08-06 15:46:50 -07:00
cthomas	7d33254f5f	feat: log stream cancellation to sentry (#3759 )	2025-08-05 16:07:30 -07:00

86 Commits