Matthew Zhou
|
285acd0324
|
feat: Add summarization for more scenarios (#2499)
|
2025-05-29 11:10:13 -07:00 |
|
Sarah Wooders
|
729414e7a9
|
feat: concurrently make embedding request and use async client for OpenAI (#2482)
Co-authored-by: Matthew Zhou <mattzh1314@gmail.com>
|
2025-05-28 11:35:22 -07:00 |
|
cthomas
|
7d6c79cf7a
|
feat: add property ordering for vertex structured outputs (#2487)
|
2025-05-28 09:27:26 -07:00 |
|
Sarah Wooders
|
cab3733280
|
feat: add more trace methods (#2471)
|
2025-05-27 16:56:30 -07:00 |
|
cthomas
|
20470844a7
|
feat(asyncify): byok in async loop (#2421)
|
2025-05-25 19:47:20 -07:00 |
|
Shangyin Tan
|
19efa1a89a
|
fix: do not pass temperature to request if model is oai reasoning model (#2189)
Co-authored-by: Charles Packer <packercharles@gmail.com>
|
2025-05-24 21:34:18 -07:00 |
|
cthomas
|
ced3cac558
|
fix: google clients thinking config (#2414)
Co-authored-by: Sarah Wooders <sarahwooders@gmail.com>
|
2025-05-24 09:42:34 -07:00 |
|
cthomas
|
c8c8de0ad1
|
fix: set thinking budget for vertex tokens (#2367)
|
2025-05-23 09:07:32 -07:00 |
|
cthomas
|
ed9f3e6abf
|
feat: add tracing to llm clients (#2340)
|
2025-05-22 13:55:32 -07:00 |
|
cthomas
|
32e2d78957
|
fix: google vertex client errors (#2307)
|
2025-05-21 12:03:50 -07:00 |
|
cthomas
|
012e1cab4d
|
ci: use experimental for send message tests (#2290)
Co-authored-by: Sarah Wooders <sarahwooders@gmail.com>
|
2025-05-20 18:39:27 -07:00 |
|
cthomas
|
00d893e7c5
|
feat: add anthropic token counter for cloud (#2289)
|
2025-05-20 17:10:50 -07:00 |
|
Matthew Zhou
|
0103ea6fcf
|
feat: Asyncify model listing for Gemini (#2284)
|
2025-05-20 16:00:20 -07:00 |
|
Matthew Zhou
|
b0f38cd2b1
|
feat: Asyncify anthropic model listing (#2283)
|
2025-05-20 15:59:42 -07:00 |
|
Matthew Zhou
|
e4fda7d6c7
|
feat: Asyncify openai model listing (#2281)
|
2025-05-20 15:24:00 -07:00 |
|
Matthew Zhou
|
068f27d83d
|
feat: Add tavily search builtin tool (#2257)
|
2025-05-19 16:38:11 -07:00 |
|
cthomas
|
2dde1f34ef
|
feat: configure retries on anthropic client (#2254)
|
2025-05-19 16:19:27 -07:00 |
|
cthomas
|
5df7bd5af6
|
feat: protect against anthropic nested tool args (#2250)
|
2025-05-19 16:01:59 -07:00 |
|
Andy Li
|
0a54b998a8
|
feat: track llm provider traces and tracking steps in async agent loop (#2219)
|
2025-05-19 15:50:56 -07:00 |
|
Sarah Wooders
|
ed2ddad6a5
|
chore: add gemini + vertex to new agent loop (#2230)
|
2025-05-17 21:47:42 -07:00 |
|
Sarah Wooders
|
f076964bd1
|
feat: support together in new agent loop and add tests (#2231)
|
2025-05-17 19:17:08 -07:00 |
|
cthomas
|
856d98d33d
|
feat: add informative error for max tokens exceeded (#2223)
|
2025-05-16 14:02:11 -07:00 |
|
Kevin Lin
|
d133ca248f
|
fix: patch lmstudio qwen (#2155)
|
2025-05-13 14:42:00 -07:00 |
|
cthomas
|
d67c425532
|
test: add additional new agent messaging tests (#2120)
|
2025-05-12 15:58:52 -07:00 |
|
Kevin Lin
|
6a33859224
|
fix: use auto function calling for together models (#2097)
|
2025-05-09 17:46:35 -07:00 |
|
Andy Li
|
bf1874dbc9
|
fix: summarization includes tool call message before truncation (#2084)
Co-authored-by: Sarah Wooders <sarahwooders@gmail.com>
|
2025-05-09 15:01:12 -07:00 |
|
Charles Packer
|
8bb194541e
|
fix: make togetherai nebius xai etc usable via the openaiprovider (#1981)
Co-authored-by: Kevin Lin <klin5061@gmail.com>
Co-authored-by: Kevin Lin <kl2806@columbia.edu>
|
2025-05-09 10:50:55 -07:00 |
|
cthomas
|
5dbaa0e5e5
|
fix: missing json import (#2073)
|
2025-05-08 13:33:39 -07:00 |
|
cthomas
|
93d702141d
|
fix: inner thoughts constant (#2070)
|
2025-05-07 21:33:34 -07:00 |
|
jnjpng
|
05e1623389
|
feat: add endpoint to test connection to llm provider (#2032)
Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>
|
2025-05-07 16:26:55 -07:00 |
|
cthomas
|
3a2a7e4001
|
fix: vertex imports (#2063)
|
2025-05-07 15:44:44 -07:00 |
|
Andy Li
|
abd3fb3204
|
chore: enable vertex experimental (#2053)
Co-authored-by: Caren Thomas <carenthomas@gmail.com>
|
2025-05-07 14:52:26 -07:00 |
|
cthomas
|
f5b224602e
|
feat: add vertex to new agent loop (#2054)
|
2025-05-07 14:03:07 -07:00 |
|
cthomas
|
d8151d76e2
|
feat: add provider_category field to distinguish byok (#2038)
|
2025-05-06 17:31:36 -07:00 |
|
jnjpng
|
7956da93a8
|
fix: skip warning logs for valid event types for anthropic (#2014)
Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>
|
2025-05-05 17:07:38 -07:00 |
|
Sarah Wooders
|
80db705db7
|
chore: enable flash and fix vertex integration (#2003)
|
2025-05-05 13:47:28 -07:00 |
|
Charles Packer
|
77f9e2d127
|
fix: patch o1 support (#1978)
|
2025-05-02 14:54:25 -07:00 |
|
cthomas
|
65280d6f3c
|
feat: add gemini to byok (#1993)
|
2025-05-02 14:01:23 -07:00 |
|
Andy Li
|
93cc510839
|
fix: handle malformed function calls in vertex (#1987)
|
2025-05-02 11:45:41 -07:00 |
|
cthomas
|
9280d85ba4
|
feat: always add user id to openai requests (#1969)
|
2025-04-30 23:23:01 -07:00 |
|
cthomas
|
25c188689a
|
feat: byok 2.0 (#1963)
|
2025-04-30 21:26:50 -07:00 |
|
Matthew Zhou
|
09cbb00d80
|
feat: Add thinking budget for vertex gemini (#1960)
|
2025-04-30 18:37:42 -07:00 |
|
cthomas
|
5187e79a43
|
feat: use new model-proxy in production (#1908)
|
2025-04-30 15:20:54 -07:00 |
|
Sarah Wooders
|
c6650de5fc
|
fix: trigger gemini retry with ValueError (#1918)
|
2025-04-28 18:47:29 -07:00 |
|
cthomas
|
dc1ecdc902
|
fix: put inner thoughts in kwargs for anthropic (#1911)
|
2025-04-28 16:37:29 -07:00 |
|
cthomas
|
83157c1949
|
feat: add letta-free endpoint constant (#1907)
|
2025-04-27 12:57:06 -07:00 |
|
Charles Packer
|
1568f6b5b4
|
fix: patch weird OSS / cloud drift by copying in OSS into cloud (#1904)
|
2025-04-26 23:21:00 -07:00 |
|
cthomas
|
7605675f23
|
feat: add llm config per request (#1866)
|
2025-04-23 16:37:05 -07:00 |
|
cthomas
|
167c6a1d2e
|
fix: pydantic serializer warning datetime -> int (#1701)
|
2025-04-23 14:53:58 -07:00 |
|
Charles Packer
|
e1fa7f0b40
|
fix: patch o-series (#1699)
|
2025-04-23 13:41:34 -07:00 |
|