Commit Graph

3072 Commits

Author SHA1 Message Date
Sarah Wooders
08ef32ace2 feature/anthropic-system-ephemeral-caching (#2684) 2025-06-18 10:10:59 -07:00
Astrotalk
1531829cc3 - Add cache control to system messages
- Cache static system content to reduce token costs on repeated requests
- Maintain existing conversation flow while optimizing API costs
- Support both string and list format system messages

This change leverages Anthropic's ephemeral caching to reduce costs for
frequently used system prompts and personas without affecting functionality.
2025-06-17 18:25:07 +05:30
Sarah Wooders
12146579c4 chore: bugfixes (#2682) 2025-06-15 14:59:23 -07:00
Sarah Wooders
32911892f2 fix: fix poetry.lock 2025-06-15 11:20:50 -07:00
Sarah Wooders
b2a4f7e173 Merge branch 'main' into 0.8.4-release-2 2025-06-15 11:19:51 -07:00
jnjpng
729acaedb2 feat: add token column to mcp_servers and pipe through to sse server config (#2775)
Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>
2025-06-14 18:48:10 -07:00
Matthew Zhou
c71a6d1917 feat: Fall back to Optimistic JSON parser if Pydantic one fails (#2820) 2025-06-14 14:29:59 -07:00
Matthew Zhou
093f65e891 feat: Change reset messages to always preserve the original system message (#2813) 2025-06-13 18:58:27 -07:00
Sarah Wooders
78e81391ba fix: avoid calling model_dump on stop reason messages twice (#2811) 2025-06-13 18:25:35 -07:00
Matthew Zhou
96a6e21dd6 fix: Relax constraints on pip requirements (#2809) 2025-06-13 17:21:04 -07:00
Matthew Zhou
17fc3420ed fix: Make voice beta endpoint VAPI compatible (#2808) 2025-06-13 17:20:25 -07:00
Kevin Lin
8148dab8a8 feat: tune file tools prompting (#2652) 2025-06-13 16:24:59 -07:00
cthomas
97986b0f03 feat: send stop reason in letta APIs (#2789) 2025-06-13 16:04:48 -07:00
Kevin Lin
a8895ad017 feat: add reasoning models to integration_test_send_message (#2710) 2025-06-13 14:54:37 -07:00
Andy Li
562a404e27 fix: metrics imports (#2801) 2025-06-13 14:39:24 -07:00
Andy Li
bf7bd3b905 fix: metric tracking (#2785) 2025-06-13 13:53:10 -07:00
Matthew Zhou
b6affd1a16 feat: Add optional pip requirements to tool object (#2793) 2025-06-13 13:20:36 -07:00
Andy Li
0e45a90609 chore: workflow consolidation and fern caching for tests on pr (#2787) 2025-06-13 12:33:53 -07:00
Kian Jones
aeadd1aebe fix(ci): autosummarize test fails (pydantic version dependant?) (#2792) 2025-06-13 12:16:48 -07:00
Andy Li
336896dc5c feat: redis user caching (#2774) 2025-06-12 17:32:07 -07:00
Shangyin Tan
22b640a5dd fix: adjust memory summary's recall message count (#2778) 2025-06-12 17:11:36 -07:00
cthomas
8f13c078ef feat: add stop reason object (#2783) 2025-06-12 16:54:29 -07:00
cthomas
ccb203b264 feat: consolidate logic for finish tokens (#2779) 2025-06-12 15:24:06 -07:00
Kevin Lin
916fe6c1da fix: use max_completion_tokens in chat completion request (#2764) 2025-06-12 10:58:23 -07:00
Sarah Wooders
6be55bd982 chore: 0.8.4 release (#2680) 2025-06-12 09:33:25 -07:00
Sarah Wooders
af70e7c9e3 chore: bump version 0.8.4 2025-06-11 23:20:42 -07:00
Sarah Wooders
b1949c7d17 Merge branch 'main' into 0.8.4-release 2025-06-11 23:18:44 -07:00
Charles Packer
5498061e58 docs: add vibecoding prompts to docs site (#2579)
Co-authored-by: Sarah Wooders <sarahwooders@gmail.com>
2025-06-11 22:10:17 -07:00
Matthew Zhou
28fdaaff74 fix: Fix claude sonnet 3_7 tests (#2767) 2025-06-11 17:53:03 -07:00
Kevin Lin
78dad0db7a fix: patch reasoning models (#2703)
Co-authored-by: Charles Packer <packercharles@gmail.com>
2025-06-11 17:20:04 -07:00
Matthew Zhou
d0c2ef89ea feat: Add errors when tool call violates tool rules (#2766) 2025-06-11 17:12:39 -07:00
cthomas
03f4867cbe fix: composio tools with array args (#2762) 2025-06-11 16:24:37 -07:00
cthomas
cf357b931d feat: bump letta client version (#2759) 2025-06-11 13:39:02 -07:00
cthomas
aedc95056a test: add new agent fixture to send message test (#2758) 2025-06-11 11:48:34 -07:00
cthomas
654f6b5053 feat: make tool calls required for model proxy (#2756) 2025-06-11 11:36:45 -07:00
cthomas
f3bee4658c fix: not enough values to unpack error (#2753) 2025-06-11 11:05:46 -07:00
Matthew Zhou
d616a99916 fix: Get rid of lru cache (#2748) 2025-06-11 10:44:36 -07:00
cthomas
1bf54e1d6c fix: test managers failure (#2754) 2025-06-11 10:41:03 -07:00
Matthew Zhou
ba3d59bba5 feat: Add prompting to guide tool rule usage (#2742) 2025-06-10 16:21:27 -07:00
Matthew Zhou
8ced9e57ba feat: Fix test managers and patch default org id (#2746) 2025-06-10 16:20:36 -07:00
cthomas
883050e761 feat: set request heartbeat for max steps (#2739) 2025-06-10 15:26:07 -07:00
cthomas
484a6f1d37 fix: parallel tool calling OpenAI (#2738) 2025-06-10 14:27:01 -07:00
cthomas
6a23bc743d feat: support multi content part input (#2717) 2025-06-10 13:36:17 -07:00
Matthew Zhou
530f9488b6 fix: Turn parallel tool calling off for OpenAI (#2737) 2025-06-10 13:27:00 -07:00
Matthew Zhou
96b8f155d8 fix: Turn off parallel tool calling for Claude (#2736) 2025-06-10 13:04:20 -07:00
cthomas
142a4f0ed8 fix: async loop eligibility check (#2727) 2025-06-10 12:56:01 -07:00
Matthew Zhou
a5ee49e579 feat: Remove debug artifacts (#2734) 2025-06-10 12:26:45 -07:00
Matthew Zhou
c4d7abb0ff feat: Cache actor calls (#2732) 2025-06-10 12:15:13 -07:00
Andy Li
9b91c94d48 feat: otel endpoint tracking for messages (#2715) 2025-06-10 11:02:36 -07:00
Andy Li
617d208bcc fix: list ollama embeddings models (#2711) 2025-06-10 10:44:26 -07:00