letta-server/letta at db9e0f42afe5c97d13f5f5836147193606d9411e

Fimeg/letta-server

jnjpng db9e0f42af fix(core): prevent ModelSettings default max_output_tokens from overriding agent config (#9739)

* fix(core): prevent ModelSettings default max_output_tokens from overriding agent config

When a conversation's model_settings were saved, the Pydantic default
of max_output_tokens=4096 was always persisted to the DB even when the
client never specified it. On subsequent messages, this default would
overwrite the agent's max_tokens (typically None) with 4096, silently
capping output.

Two changes:
1. Use model_dump(exclude_unset=True) when persisting model_settings
   to the DB so Pydantic defaults are not saved.
2. Add model_fields_set guards at all callsites that apply
   _to_legacy_config_params() to skip max_tokens when it was not
   explicitly provided by the caller.
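
The failure mode and the guard from step 2 can be sketched with Pydantic v2. This is a minimal illustration, not the actual Letta code: the `ModelSettings` and `AgentConfig` classes and the `apply_settings` helper are hypothetical stand-ins, and the `temperature` field is assumed only to give the client something to set. Only `model_dump(exclude_unset=True)`, `model_fields_set`, and `model_copy` are real Pydantic APIs.

```python
from typing import Optional

from pydantic import BaseModel


class ModelSettings(BaseModel):
    # Hypothetical stand-in; the 4096 default mirrors the one described above.
    temperature: float = 0.7
    max_output_tokens: int = 4096


class AgentConfig(BaseModel):
    # Hypothetical stand-in for the agent's legacy config.
    max_tokens: Optional[int] = None


def apply_settings(agent: AgentConfig, settings: ModelSettings) -> AgentConfig:
    """Copy max_output_tokens onto the agent only if the caller set it."""
    if "max_output_tokens" in settings.model_fields_set:
        return agent.model_copy(update={"max_tokens": settings.max_output_tokens})
    return agent


# The client sets temperature only; max_output_tokens is never specified.
client_settings = ModelSettings(temperature=0.2)

# A plain dump includes the default, while exclude_unset drops it.
full_dump = client_settings.model_dump()
sparse_dump = client_settings.model_dump(exclude_unset=True)

# With the guard, the agent's max_tokens stays None.
guarded_agent = apply_settings(AgentConfig(), client_settings)

# An explicitly provided value is still applied.
explicit_agent = apply_settings(AgentConfig(), ModelSettings(max_output_tokens=8192))
```

Here `full_dump` carries the 4096 default even though the client never asked for it, which is the value that would otherwise be persisted and later applied to the agent.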

Also conditionally set max_output_tokens in the OpenAI Responses API
request builder so None is not sent as null (which some models treat
as a hard 4096 cap).
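
A minimal sketch of the conditional request-builder change, assuming a hypothetical `build_responses_request` helper and a placeholder model name (neither is taken from the Letta codebase). The point is only that the key is omitted entirely rather than sent as null:

```python
from typing import Any, Optional


def build_responses_request(
    model: str,
    input_text: str,
    max_output_tokens: Optional[int] = None,
) -> dict[str, Any]:
    """Hypothetical builder: include max_output_tokens only when it is set."""
    payload: dict[str, Any] = {"model": model, "input": input_text}
    if max_output_tokens is not None:
        # Only an explicit limit is forwarded to the API.
        payload["max_output_tokens"] = max_output_tokens
    return payload


# "example-model" is a placeholder, not a real model identifier.
uncapped = build_responses_request("example-model", "hello")
capped = build_responses_request("example-model", "hello", max_output_tokens=2048)
```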

* nit

* Fix model_settings serialization to preserve provider_type discriminator

Replace blanket exclude_unset=True with targeted removal of only
max_output_tokens when not explicitly set. The previous approach
stripped the provider_type field (a Literal with a default), which
broke discriminated union deserialization when reading back from DB.
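
The difference between the two serialization approaches can be sketched as follows. The settings classes, the `dump_for_db` helper, and the `can_round_trip` check are hypothetical names for illustration; the real models and persistence code may differ.

```python
from typing import Annotated, Any, Literal, Union

from pydantic import BaseModel, Field, TypeAdapter, ValidationError


class OpenAISettings(BaseModel):
    # Hypothetical provider-specific settings with a Literal discriminator.
    provider_type: Literal["openai"] = "openai"
    max_output_tokens: int = 4096
    temperature: float = 0.7


class AnthropicSettings(BaseModel):
    provider_type: Literal["anthropic"] = "anthropic"
    max_output_tokens: int = 4096


Settings = Annotated[
    Union[OpenAISettings, AnthropicSettings],
    Field(discriminator="provider_type"),
]
adapter = TypeAdapter(Settings)


def dump_for_db(settings: BaseModel) -> dict[str, Any]:
    """Dump every field, then drop max_output_tokens if it was never set."""
    data = settings.model_dump()
    if "max_output_tokens" not in settings.model_fields_set:
        data.pop("max_output_tokens", None)
    return data


def can_round_trip(data: dict[str, Any]) -> bool:
    """Return True if the dict validates against the discriminated union."""
    try:
        adapter.validate_python(data)
        return True
    except ValidationError:
        return False


settings = OpenAISettings(temperature=0.2)

# Blanket exclude_unset also strips the defaulted discriminator.
blanket = settings.model_dump(exclude_unset=True)

# Targeted removal keeps provider_type and drops only the unset token limit.
targeted = dump_for_db(settings)
restored = adapter.validate_python(targeted)
```

After the round trip, `restored` picks up the class default for `max_output_tokens` again, but the field is absent from `restored.model_fields_set`, so a `model_fields_set` guard would still skip it.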

2026-03-03 18:34:02 -08:00

fix(core): handle ResponseIncompleteEvent in OpenAI Responses API streaming (#9535)

2026-02-24 10:55:11 -08:00

fix: set otid for summary message (#9654)

2026-03-03 18:34:01 -08:00

chore: enable F821, F401, W293 (#9503)

2026-02-24 10:55:08 -08:00

chore: remove sync db (#4873)

2025-10-07 17:50:45 -07:00

chore: add ty + pre-commit hook and repeal even more ruff rules (#9504)

2026-02-24 10:55:11 -08:00

feat: global exception middleware (#6017)

2025-11-13 15:36:55 -08:00

fix(memory): standardize tool parameter names (#9552)

2026-02-24 10:55:24 -08:00

chore: add ty + pre-commit hook and repeal even more ruff rules (#9504)

2026-02-24 10:55:11 -08:00

chore: add ty + pre-commit hook and repeal even more ruff rules (#9504)

2026-02-24 10:55:11 -08:00

merge this (#4759)

2025-09-17 15:47:40 -07:00

fix(core): raise LLMEmptyResponseError for empty Anthropic responses (#9624)

2026-03-03 18:34:01 -08:00

chore: add ty + pre-commit hook and repeal even more ruff rules (#9504)

2026-02-24 10:55:11 -08:00

fix(core): prevent ModelSettings default max_output_tokens from overriding agent config (#9739)

2026-03-03 18:34:02 -08:00

chore: add ty + pre-commit hook and repeal even more ruff rules (#9504)

2026-02-24 10:55:11 -08:00

feat(core): add gpt-5.3-codex model support (#9628)

2026-03-03 18:34:01 -08:00

feat: dump thread state on event loop hang (#8388)

2026-01-12 10:57:48 -08:00

openai_backcompat

merge this (#4759)

2025-09-17 15:47:40 -07:00

fix: lazy load conversations [LET-7682] (#9629)

2026-03-03 18:34:01 -08:00

chore: add ty + pre-commit hook and repeal even more ruff rules (#9504)

2026-02-24 10:55:11 -08:00

merge this (#4759)

2025-09-17 15:47:40 -07:00

chore: enable F821, F401, W293 (#9503)

2026-02-24 10:55:08 -08:00

Add modes self and self_sliding_window for prompt caching (#9372)

2026-02-24 10:55:26 -08:00

Fix: Change Z.ai context window to account for max_token subtraction (#9710)

2026-03-03 18:34:02 -08:00

serialize_schemas

chore: add ty + pre-commit hook and repeal even more ruff rules (#9504)

2026-02-24 10:55:11 -08:00

fix(core): prevent ModelSettings default max_output_tokens from overriding agent config (#9739)

2026-03-03 18:34:02 -08:00

fix(core): prevent ModelSettings default max_output_tokens from overriding agent config (#9739)

2026-03-03 18:34:02 -08:00

merge this (#4759)

2025-09-17 15:47:40 -07:00

chore: enable F821, F401, W293 (#9503)

2026-02-24 10:55:08 -08:00

__init__.py

bump version

2026-02-24 10:58:16 -08:00

agent.py

chore: remove sync db (#4873)

2025-10-07 17:50:45 -07:00

config_file.py

feat: change default context window from 32000 to 128000 (#9673)

2026-03-03 18:34:01 -08:00

config.py

merge this (#4759)

2025-09-17 15:47:40 -07:00

constants.py

Fix: Change Z.ai context window to account for max_token subtraction (#9710)

2026-03-03 18:34:02 -08:00

database_utils.py

chore: sync 0.12.0 version (#3023)

2025-10-08 16:10:51 -07:00

errors.py

fix(core): raise LLMEmptyResponseError for empty Anthropic responses (#9624)

2026-03-03 18:34:01 -08:00

interface.py

chore: add ty + pre-commit hook and repeal even more ruff rules (#9504)

2026-02-24 10:55:11 -08:00

log_context.py

feat(logs): Enrich logs with context-aware primtive types (#5949)

2025-11-13 15:36:55 -08:00

log.py

feat: Ship traces to datadog and add trace correlation (#6311)

2025-11-24 19:10:26 -08:00

main.py

chore: enable F821, F401, W293 (#9503)

2026-02-24 10:55:08 -08:00

pytest.ini

merge this (#4759)

2025-09-17 15:47:40 -07:00

settings.py

feat(core): increase Gemini timeout to 10 minutes (#9714)

2026-03-03 18:34:02 -08:00

streaming_interface.py

chore: add ty + pre-commit hook and repeal even more ruff rules (#9504)

2026-02-24 10:55:11 -08:00

streaming_utils.py

merge this (#4759)

2025-09-17 15:47:40 -07:00

system.py

Add modes self and self_sliding_window for prompt caching (#9372)

2026-02-24 10:55:26 -08:00

test_gemini.py

feat(gemini): add 3.1 pro preview support (#9553)

2026-02-24 10:55:11 -08:00

utils.py

chore: add ty + pre-commit hook and repeal even more ruff rules (#9504)

2026-02-24 10:55:11 -08:00

validators.py

feat: add default convo support to conversations endpoint (#9706)

2026-03-03 18:34:02 -08:00