Commit Graph

287 Commits

Author SHA1 Message Date
cthomas
1eb692f62a feat: add azure llm client (#3882) 2025-08-12 14:43:03 -07:00
cthomas
6289ca55a2 feat: refactor byok logic in llm clients (#3880) 2025-08-12 14:19:02 -07:00
cthomas
956a4ad011 feat: add together byok support (#3877) 2025-08-12 13:44:01 -07:00
cthomas
227e472282 feat: add new together llm client (#3875) 2025-08-12 13:37:20 -07:00
Matthew Zhou
5772365285 fix: Fix error messages with malformed gemini call (#3865) 2025-08-11 16:59:19 -07:00
cthomas
639641137d feat: consolidate reasoning model checks (#3862) 2025-08-11 16:55:45 -07:00
jnjpng
b62955f0bf fix: gemini 2.5 thinking models fail to call functions if thinking is fully disabled
Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>
2025-08-08 16:34:32 -07:00
Sarah Wooders
e5497b1d0d chore: merge oss (#3712) 2025-08-07 22:20:26 -07:00
Sarah Wooders
ce44386011 feat: support opus 4.1 and gpt-5 (#3806) 2025-08-07 15:11:57 -07:00
Matthew Zhou
4b788d11de feat: Support streaming and move endpoint for letta-free (#3780) 2025-08-06 15:33:35 -07:00
Sarah Wooders
f5187583d2 feat: move ollama to new agent loop (#3615) 2025-07-31 13:40:26 -07:00
Andy Li
e2d8b35201 chore: remove excessive warning logging 2025-07-30 14:10:41 -07:00
jnjpng
de3c99d7f5 fix: lmstudio support for qwen and llama
Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>
Co-authored-by: Charles Packer <packercharles@gmail.com>
2025-07-29 15:57:20 -07:00
jnjpng
cf8c59aab9 feat: allow mcp authentication overrides per agent (#3318)
Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>
2025-07-28 18:20:58 -07:00
Matthew Zhou
84ea52172a feat: Add ability to disable reasoning (#3594) 2025-07-28 15:30:10 -07:00
Andy Li
9ca1664ed8 feat: support for providers 2025-07-22 16:09:50 -07:00
Andy Li
904d9ba5a2 chore: strings lint cleanup (#3374) 2025-07-18 09:20:45 -07:00
Eric Ly
5f3e23fd2a feat: create 'test connection' bedrock api + fix endpoints for test connection (ant, openai, gemini) (#3227)
Co-authored-by: Eric Ly <lyyeric@letta.com>
2025-07-17 11:39:46 -07:00
Charles Packer
72b472896d fix: add frequency penalty for gpt-4o-mini (#3166) 2025-07-06 11:05:31 -07:00
Matthew Zhou
804ec12ee2 feat: Only add suffix on duplication (#3120) 2025-07-01 13:48:38 -07:00
Matthew Zhou
241f653eb6 fix: Fix constraints and also implement bulk attach (#3107) 2025-06-30 14:27:57 -07:00
Charles Packer
d41cfe4d0f fix: patch annoying user warning caused by not having sonnet/opus 4 listed (#3017) 2025-06-24 19:07:25 -07:00
Andy Li
343dbb5359 feat: timeout configuration for LLM clients + vertex (#2972) 2025-06-23 16:55:23 -07:00
Sarah Wooders
041421e75e fix: remove from mcp so that it works with gemini (#2961) 2025-06-21 21:32:18 -07:00
cthomas
fa362c404f feat: add bedrock client (#2913) 2025-06-19 12:07:00 -07:00
cthomas
5cc180ee47 feat: rename aws env vars for bedrock (#2907)
Co-authored-by: Andy Li <55300002+cliandy@users.noreply.github.com>
2025-06-19 10:36:47 -07:00
cthomas
cbd7fe3926 feat: add bedrock to byok (#2891) 2025-06-18 16:03:28 -07:00
Kevin Lin
6ea0a7e74e fix: add exceptions to accept_developer_role (#2848) 2025-06-16 15:14:40 -07:00
Matthew Zhou
440305ad74 fix: Harden string matching for context window exceeded error (#2847) 2025-06-16 14:34:41 -07:00
Kevin Lin
a8895ad017 feat: add reasoning models to integration_test_send_message (#2710) 2025-06-13 14:54:37 -07:00
cthomas
654f6b5053 feat: make tool calls required for model proxy (#2756) 2025-06-11 11:36:45 -07:00
Matthew Zhou
ba3d59bba5 feat: Add prompting to guide tool rule usage (#2742) 2025-06-10 16:21:27 -07:00
cthomas
484a6f1d37 fix: parallel tool calling OpenAI (#2738) 2025-06-10 14:27:01 -07:00
cthomas
6a23bc743d feat: support multi content part input (#2717) 2025-06-10 13:36:17 -07:00
Matthew Zhou
530f9488b6 fix: Turn parallel tool calling off for OpenAI (#2737) 2025-06-10 13:27:00 -07:00
Matthew Zhou
96b8f155d8 fix: Turn off parallel tool calling for Claude (#2736) 2025-06-10 13:04:20 -07:00
Matthew Zhou
a5ee49e579 feat: Remove debug artifacts (#2734) 2025-06-10 12:26:45 -07:00
cthomas
2823e4447a feat: add multi-modal input support (#2590) 2025-06-08 18:28:01 -07:00
cthomas
91e859d461 fix: incorrect anthropic tool format hack (#2685) 2025-06-06 15:48:18 -07:00
cthomas
e72f0f68ef fix: gracefully handle too long responses from llm provider (#2677) 2025-06-06 13:13:32 -07:00
Andy Li
80f6e97ca9 feat: otel metrics and expanded collecting (#2647)
(passed tests in last run)
2025-06-05 17:20:14 -07:00
cthomas
64ac7a7deb fix: send message tests (#2656) 2025-06-05 13:57:43 -07:00
Matthew Zhou
bf74e53915 feat: Add tools for opening and closing files (#2638) 2025-06-04 17:33:18 -07:00
Matthew Zhou
3ccd79efa5 fix: Make OpenAI context window exceeded error more specific (#2624) 2025-06-04 12:57:51 -07:00
Matthew Zhou
fb54072de6 fix: Add additional testing for anthropic token counting (#2619) 2025-06-03 20:56:39 -07:00
Matthew Zhou
285acd0324 feat: Add summarization for more scenarios (#2499) 2025-05-29 11:10:13 -07:00
Sarah Wooders
729414e7a9 feat: concurrently make embedding request and use async client for OpenAI (#2482)
Co-authored-by: Matthew Zhou <mattzh1314@gmail.com>
2025-05-28 11:35:22 -07:00
cthomas
7d6c79cf7a feat: add property ordering for vertex structured outputs (#2487) 2025-05-28 09:27:26 -07:00
Sarah Wooders
cab3733280 feat: add more trace methods (#2471) 2025-05-27 16:56:30 -07:00
cthomas
20470844a7 feat(asyncify): byok in async loop (#2421) 2025-05-25 19:47:20 -07:00