letta-server

Author	SHA1	Message	Date
jnjpng	37d32be58b	fix: retry on 500 and 503 for gemini [LET-4185] * handle 500 and 503 * timeout --------- Co-authored-by: Letta Bot <noreply@letta.com>	2025-09-03 14:02:58 -07:00
jnjpng	4df4c5ea2e	fix: retry on MALFORMED_FUNCTION_CALL for gemini [LET-4089] --------- Co-authored-by: Letta Bot <noreply@letta.com>	2025-09-01 07:26:13 -07:00
Charles Packer	aecb6e32d7	fix(core): update default value [LET-4117] (#4321 ) * fix(core): update default value * fix: just stage just publish * fix(core): temporary hardcoding of the anthropic max tokens * fix(core): patch the gemini	2025-08-29 16:42:19 -07:00
Kian Jones	ca6cfa5914	chore: migrate to ruff (#4305 ) * base requirements * autofix * Configure ruff for Python linting and formatting - Set up minimal ruff configuration with basic checks (E, W, F, I) - Add temporary ignores for common issues during migration - Configure pre-commit hooks to use ruff with pass_filenames - This enables gradual migration from black to ruff * Delete sdj * autofixed only * migrate lint action * more autofixed * more fixes * change precommit * try changing the hook * try this stuff	2025-08-29 11:11:19 -07:00
cthomas	8b617c9e0d	fix: gemini flash integration test [LET-4060] (#4242 ) * fix: gemini flash integration test * also update google flash * catch error in test * revert test changes * do try catch again * remove try catch from streaming tests * add try catch for summarize test also	2025-08-27 11:59:15 -07:00
cthomas	02de0effa6	feat: improve error message for vertex response parsing (#4043 )	2025-08-20 09:52:20 -07:00
jnjpng	6562868251	fix: include `google_ai` model endpoint type when setting reasoning tokens for google reasoning models Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>	2025-08-19 14:58:35 -07:00
jnjpng	ccdcbb3b01	fix: require function declarations to be present for setting gemini tool config Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>	2025-08-19 12:56:31 -07:00
Matthew Zhou	5772365285	fix: Fix error messages with malformed gemini call (#3865 )	2025-08-11 16:59:19 -07:00
cthomas	639641137d	feat: consolidate reasoning model checks (#3862 )	2025-08-11 16:55:45 -07:00
jnjpng	b62955f0bf	fix: gemini 2.5 thinking models fail to call functions if thinking is fully disabled Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>	2025-08-08 16:34:32 -07:00
Andy Li	e2d8b35201	chore: remove excessive warning logging	2025-07-30 14:10:41 -07:00
Andy Li	343dbb5359	feat: timeout configuration for LLM clients + vertex (#2972 )	2025-06-23 16:55:23 -07:00
Sarah Wooders	041421e75e	fix: remove from mcp so that it works with gemini (#2961 )	2025-06-21 21:32:18 -07:00
cthomas	6a23bc743d	feat: support multi content part input (#2717 )	2025-06-10 13:36:17 -07:00
cthomas	2823e4447a	feat: add multi-modal input support (#2590 )	2025-06-08 18:28:01 -07:00
Andy Li	80f6e97ca9	feat: otel metrics and expanded collecting (#2647 ) (passed tests in last run)	2025-06-05 17:20:14 -07:00
cthomas	7d6c79cf7a	feat: add property ordering for vertex structured outputs (#2487 )	2025-05-28 09:27:26 -07:00
cthomas	ced3cac558	fix: google clients thinking config (#2414 ) Co-authored-by: Sarah Wooders <sarahwooders@gmail.com>	2025-05-24 09:42:34 -07:00
cthomas	c8c8de0ad1	fix: set thinking budget for vertex tokens (#2367 )	2025-05-23 09:07:32 -07:00
cthomas	ed9f3e6abf	feat: add tracing to llm clients (#2340 )	2025-05-22 13:55:32 -07:00
cthomas	32e2d78957	fix: google vertex client errors (#2307 )	2025-05-21 12:03:50 -07:00
Sarah Wooders	ed2ddad6a5	chore: add gemini + vertex to new agent loop (#2230 )	2025-05-17 21:47:42 -07:00
cthomas	856d98d33d	feat: add informative error for max tokens exceeded (#2223 )	2025-05-16 14:02:11 -07:00
cthomas	5dbaa0e5e5	fix: missing json import (#2073 )	2025-05-08 13:33:39 -07:00
cthomas	93d702141d	fix: inner thoughts constant (#2070 )	2025-05-07 21:33:34 -07:00
cthomas	3a2a7e4001	fix: vertex imports (#2063 )	2025-05-07 15:44:44 -07:00
Andy Li	abd3fb3204	chore: enable vertex experimental (#2053 ) Co-authored-by: Caren Thomas <carenthomas@gmail.com>	2025-05-07 14:52:26 -07:00
cthomas	f5b224602e	feat: add vertex to new agent loop (#2054 )	2025-05-07 14:03:07 -07:00
Sarah Wooders	80db705db7	chore: enable flash and fix vertex integration (#2003 )	2025-05-05 13:47:28 -07:00
Andy Li	93cc510839	fix: handle malformed function calls in vertex (#1987 )	2025-05-02 11:45:41 -07:00
Matthew Zhou	09cbb00d80	feat: Add thinking budget for vertex gemini (#1960 )	2025-04-30 18:37:42 -07:00
cthomas	7605675f23	feat: add llm config per request (#1866 )	2025-04-23 16:37:05 -07:00
cthomas	167c6a1d2e	fix: pydantic serializer warning datetime -> int (#1701 )	2025-04-23 14:53:58 -07:00
Charles Packer	ee8e095b69	fix: patch gemini 2.5 pro (#1643 ) Co-authored-by: Sarah Wooders <sarahwooders@gmail.com>	2025-04-09 18:35:53 -07:00
Matthew Zhou	6a2312a02d	chore: Inject LLM config directly to batch api request func (#1652 )	2025-04-09 15:56:54 -07:00
Matthew Zhou	59ca61da58	feat: Simplify arguments for LLM clients (#1536 )	2025-04-02 14:26:27 -07:00
cthomas	6a20f85189	feat: add new llm client framework and migrate google apis (#1209 )	2025-03-07 16:34:06 -08:00

38 Commits