Commit Graph

43 Commits

Author SHA1 Message Date
jnjpng
2899171e2c fix: retry on gemini 504 (#2861)
Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>
2025-09-12 15:58:56 -07:00
cthomas
33bee28d08 feat: use LLMServerError to distinguish provider errors (#2859) 2025-09-12 14:43:14 -07:00
Caren Thomas
0a58326daf feat: handle google errors in our stack correctly 2025-09-12 10:58:06 -07:00
cthomas
32b9f228dd fix: missing await in agent loop (#2835) 2025-09-11 11:54:43 -07:00
cthomas
694be7b136 feat: support filtering out approval messages for gemini (#2833) 2025-09-11 00:26:24 -07:00
jnjpng
37d32be58b fix: retry on 500 and 503 for gemini [LET-4185]
* handle 500 and 503

* timeout

---------

Co-authored-by: Letta Bot <noreply@letta.com>
2025-09-03 14:02:58 -07:00
jnjpng
4df4c5ea2e fix: retry on MALFORMED_FUNCTION_CALL for gemini [LET-4089]
---------

Co-authored-by: Letta Bot <noreply@letta.com>
2025-09-01 07:26:13 -07:00
Charles Packer
aecb6e32d7 fix(core): update default value [LET-4117] (#4321)
* fix(core): update default value

* fix: just stage just publish

* fix(core): temporary hardcoding of the anthropic max tokens

* fix(core): patch the gemini
2025-08-29 16:42:19 -07:00
Kian Jones
ca6cfa5914 chore: migrate to ruff (#4305)
* base requirements

* autofix

* Configure ruff for Python linting and formatting

- Set up minimal ruff configuration with basic checks (E, W, F, I)
- Add temporary ignores for common issues during migration
- Configure pre-commit hooks to use ruff with pass_filenames
- This enables gradual migration from black to ruff

* Delete sdj

* autofixed only

* migrate lint action

* more autofixed

* more fixes

* change precommit

* try changing the hook

* try this stuff
2025-08-29 11:11:19 -07:00
cthomas
8b617c9e0d fix: gemini flash integration test [LET-4060] (#4242)
* fix: gemini flash integration test

* also update google flash

* catch error in test

* revert test changes

* do try catch again

* remove try catch from streaming tests

* add try catch for summarize test also
2025-08-27 11:59:15 -07:00
cthomas
02de0effa6 feat: improve error message for vertex response parsing (#4043) 2025-08-20 09:52:20 -07:00
jnjpng
6562868251 fix: include google_ai model endpoint type when setting reasoning tokens for google reasoning models
Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>
2025-08-19 14:58:35 -07:00
jnjpng
ccdcbb3b01 fix: require function declarations to be present for setting gemini tool config
Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>
2025-08-19 12:56:31 -07:00
Matthew Zhou
5772365285 fix: Fix error messages with malformed gemini call (#3865) 2025-08-11 16:59:19 -07:00
cthomas
639641137d feat: consolidate reasoning model checks (#3862) 2025-08-11 16:55:45 -07:00
jnjpng
b62955f0bf fix: gemini 2.5 thinking models fail to call functions if thinking is fully disabled
Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>
2025-08-08 16:34:32 -07:00
Andy Li
e2d8b35201 chore: remove excessive warning logging 2025-07-30 14:10:41 -07:00
Andy Li
343dbb5359 feat: timeout configuration for LLM clients + vertex (#2972) 2025-06-23 16:55:23 -07:00
Sarah Wooders
041421e75e fix: remove from mcp so that it works with gemini (#2961) 2025-06-21 21:32:18 -07:00
cthomas
6a23bc743d feat: support multi content part input (#2717) 2025-06-10 13:36:17 -07:00
cthomas
2823e4447a feat: add multi-modal input support (#2590) 2025-06-08 18:28:01 -07:00
Andy Li
80f6e97ca9 feat: otel metrics and expanded collecting (#2647)
(passed tests in last run)
2025-06-05 17:20:14 -07:00
cthomas
7d6c79cf7a feat: add property ordering for vertex structured outputs (#2487) 2025-05-28 09:27:26 -07:00
cthomas
ced3cac558 fix: google clients thinking config (#2414)
Co-authored-by: Sarah Wooders <sarahwooders@gmail.com>
2025-05-24 09:42:34 -07:00
cthomas
c8c8de0ad1 fix: set thinking budget for vertex tokens (#2367) 2025-05-23 09:07:32 -07:00
cthomas
ed9f3e6abf feat: add tracing to llm clients (#2340) 2025-05-22 13:55:32 -07:00
cthomas
32e2d78957 fix: google vertex client errors (#2307) 2025-05-21 12:03:50 -07:00
Sarah Wooders
ed2ddad6a5 chore: add gemini + vertex to new agent loop (#2230) 2025-05-17 21:47:42 -07:00
cthomas
856d98d33d feat: add informative error for max tokens exceeded (#2223) 2025-05-16 14:02:11 -07:00
cthomas
5dbaa0e5e5 fix: missing json import (#2073) 2025-05-08 13:33:39 -07:00
cthomas
93d702141d fix: inner thoughts constant (#2070) 2025-05-07 21:33:34 -07:00
cthomas
3a2a7e4001 fix: vertex imports (#2063) 2025-05-07 15:44:44 -07:00
Andy Li
abd3fb3204 chore: enable vertex experimental (#2053)
Co-authored-by: Caren Thomas <carenthomas@gmail.com>
2025-05-07 14:52:26 -07:00
cthomas
f5b224602e feat: add vertex to new agent loop (#2054) 2025-05-07 14:03:07 -07:00
Sarah Wooders
80db705db7 chore: enable flash and fix vertex integration (#2003) 2025-05-05 13:47:28 -07:00
Andy Li
93cc510839 fix: handle malformed function calls in vertex (#1987) 2025-05-02 11:45:41 -07:00
Matthew Zhou
09cbb00d80 feat: Add thinking budget for vertex gemini (#1960) 2025-04-30 18:37:42 -07:00
cthomas
7605675f23 feat: add llm config per request (#1866) 2025-04-23 16:37:05 -07:00
cthomas
167c6a1d2e fix: pydantic serializer warning datetime -> int (#1701) 2025-04-23 14:53:58 -07:00
Charles Packer
ee8e095b69 fix: patch gemini 2.5 pro (#1643)
Co-authored-by: Sarah Wooders <sarahwooders@gmail.com>
2025-04-09 18:35:53 -07:00
Matthew Zhou
6a2312a02d chore: Inject LLM config directly to batch api request func (#1652) 2025-04-09 15:56:54 -07:00
Matthew Zhou
59ca61da58 feat: Simplify arguments for LLM clients (#1536) 2025-04-02 14:26:27 -07:00
cthomas
6a20f85189 feat: add new llm client framework and migrate google apis (#1209) 2025-03-07 16:34:06 -08:00