jnjpng
37d32be58b
fix: retry on 500 and 503 for gemini [LET-4185]
...
* handle 500 and 503
* timeout
---------
Co-authored-by: Letta Bot <noreply@letta.com >
2025-09-03 14:02:58 -07:00
jnjpng
4df4c5ea2e
fix: retry on MALFORMED_FUNCTION_CALL for gemini [LET-4089]
...
---------
Co-authored-by: Letta Bot <noreply@letta.com >
2025-09-01 07:26:13 -07:00
Charles Packer
aecb6e32d7
fix(core): update default value [LET-4117] ( #4321 )
...
* fix(core): update default value
* fix: just stage just publish
* fix(core): temporary hardcoding of the anthropic max tokens
* fix(core): patch the gemini
2025-08-29 16:42:19 -07:00
Kian Jones
ca6cfa5914
chore: migrate to ruff ( #4305 )
...
* base requirements
* autofix
* Configure ruff for Python linting and formatting
- Set up minimal ruff configuration with basic checks (E, W, F, I)
- Add temporary ignores for common issues during migration
- Configure pre-commit hooks to use ruff with pass_filenames
- This enables gradual migration from black to ruff
* Delete sdj
* autofixed only
* migrate lint action
* more autofixed
* more fixes
* change precommit
* try changing the hook
* try this stuff
2025-08-29 11:11:19 -07:00
cthomas
8b617c9e0d
fix: gemini flash integration test [LET-4060] ( #4242 )
...
* fix: gemini flash integration test
* also update google flash
* catch error in test
* revert test changes
* do try catch again
* remove try catch from streaming tests
* add try catch for summarize test also
2025-08-27 11:59:15 -07:00
cthomas
02de0effa6
feat: improve error message for vertex response parsing ( #4043 )
2025-08-20 09:52:20 -07:00
jnjpng
6562868251
fix: include google_ai model endpoint type when setting reasoning tokens for google reasoning models
...
Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local >
2025-08-19 14:58:35 -07:00
jnjpng
ccdcbb3b01
fix: require function declarations to be present for setting gemini tool config
...
Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local >
2025-08-19 12:56:31 -07:00
Matthew Zhou
5772365285
fix: Fix error messages with malformed gemini call ( #3865 )
2025-08-11 16:59:19 -07:00
cthomas
639641137d
feat: consolidate reasoning model checks ( #3862 )
2025-08-11 16:55:45 -07:00
jnjpng
b62955f0bf
fix: gemini 2.5 thinking models fail to call functions if thinking is fully disabled
...
Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local >
2025-08-08 16:34:32 -07:00
Andy Li
e2d8b35201
chore: remove excessive warning logging
2025-07-30 14:10:41 -07:00
Andy Li
343dbb5359
feat: timeout configuration for LLM clients + vertex ( #2972 )
2025-06-23 16:55:23 -07:00
Sarah Wooders
041421e75e
fix: remove from mcp so that it works with gemini ( #2961 )
2025-06-21 21:32:18 -07:00
cthomas
6a23bc743d
feat: support multi content part input ( #2717 )
2025-06-10 13:36:17 -07:00
cthomas
2823e4447a
feat: add multi-modal input support ( #2590 )
2025-06-08 18:28:01 -07:00
Andy Li
80f6e97ca9
feat: otel metrics and expanded collecting ( #2647 )
...
(passed tests in last run)
2025-06-05 17:20:14 -07:00
cthomas
7d6c79cf7a
feat: add property ordering for vertex structured outputs ( #2487 )
2025-05-28 09:27:26 -07:00
cthomas
ced3cac558
fix: google clients thinking config ( #2414 )
...
Co-authored-by: Sarah Wooders <sarahwooders@gmail.com >
2025-05-24 09:42:34 -07:00
cthomas
c8c8de0ad1
fix: set thinking budget for vertex tokens ( #2367 )
2025-05-23 09:07:32 -07:00
cthomas
ed9f3e6abf
feat: add tracing to llm clients ( #2340 )
2025-05-22 13:55:32 -07:00
cthomas
32e2d78957
fix: google vertex client errors ( #2307 )
2025-05-21 12:03:50 -07:00
Sarah Wooders
ed2ddad6a5
chore: add gemini + vertex to new agent loop ( #2230 )
2025-05-17 21:47:42 -07:00
cthomas
856d98d33d
feat: add informative error for max tokens exceeded ( #2223 )
2025-05-16 14:02:11 -07:00
cthomas
5dbaa0e5e5
fix: missing json import ( #2073 )
2025-05-08 13:33:39 -07:00
cthomas
93d702141d
fix: inner thoughts constant ( #2070 )
2025-05-07 21:33:34 -07:00
cthomas
3a2a7e4001
fix: vertex imports ( #2063 )
2025-05-07 15:44:44 -07:00
Andy Li
abd3fb3204
chore: enable vertex experimental ( #2053 )
...
Co-authored-by: Caren Thomas <carenthomas@gmail.com >
2025-05-07 14:52:26 -07:00
cthomas
f5b224602e
feat: add vertex to new agent loop ( #2054 )
2025-05-07 14:03:07 -07:00
Sarah Wooders
80db705db7
chore: enable flash and fix vertex integration ( #2003 )
2025-05-05 13:47:28 -07:00
Andy Li
93cc510839
fix: handle malformed function calls in vertex ( #1987 )
2025-05-02 11:45:41 -07:00
Matthew Zhou
09cbb00d80
feat: Add thinking budget for vertex gemini ( #1960 )
2025-04-30 18:37:42 -07:00
cthomas
7605675f23
feat: add llm config per request ( #1866 )
2025-04-23 16:37:05 -07:00
cthomas
167c6a1d2e
fix: pydantic serializer warning datetime -> int ( #1701 )
2025-04-23 14:53:58 -07:00
Charles Packer
ee8e095b69
fix: patch gemini 2.5 pro ( #1643 )
...
Co-authored-by: Sarah Wooders <sarahwooders@gmail.com >
2025-04-09 18:35:53 -07:00
Matthew Zhou
6a2312a02d
chore: Inject LLM config directly to batch api request func ( #1652 )
2025-04-09 15:56:54 -07:00
Matthew Zhou
59ca61da58
feat: Simplify arguments for LLM clients ( #1536 )
2025-04-02 14:26:27 -07:00
cthomas
6a20f85189
feat: add new llm client framework and migrate google apis ( #1209 )
2025-03-07 16:34:06 -08:00