letta-server

Author	SHA1	Message	Date
Sarah Wooders	08ef32ace2	feature/anthropic-system-ephemeral-caching (#2684 )	2025-06-18 10:10:59 -07:00
Astrotalk	1531829cc3	- Add cache control to system messages - Cache static system content to reduce token costs on repeated requests - Maintain existing conversation flow while optimizing API costs - Support both string and list format system messages This change leverages Anthropic's ephemeral caching to reduce costs for frequently used system prompts and personas without affecting functionality.	2025-06-17 18:25:07 +05:30
Sarah Wooders	12146579c4	chore: bugfixes (#2682 )	2025-06-15 14:59:23 -07:00
Sarah Wooders	32911892f2	fix: fix poetry.lock	2025-06-15 11:20:50 -07:00
Sarah Wooders	b2a4f7e173	Merge branch 'main' into 0.8.4-release-2	2025-06-15 11:19:51 -07:00
jnjpng	729acaedb2	feat: add token column to mcp_servers and pipe through to sse server config (#2775 ) Co-authored-by: Jin Peng <jinjpeng@Jins-MacBook-Pro.local>	2025-06-14 18:48:10 -07:00
Matthew Zhou	c71a6d1917	feat: Fall back to Optimistic JSON parser if Pydantic one fails (#2820 )	2025-06-14 14:29:59 -07:00
Matthew Zhou	093f65e891	feat: Change reset messages to always preserve the original system message (#2813 )	2025-06-13 18:58:27 -07:00
Sarah Wooders	78e81391ba	fix: avoid calling `model_dump` on stop reason messages twice (#2811 )	2025-06-13 18:25:35 -07:00
Matthew Zhou	96a6e21dd6	fix: Relax constraints on pip requirements (#2809 )	2025-06-13 17:21:04 -07:00
Matthew Zhou	17fc3420ed	fix: Make voice beta endpoint VAPI compatible (#2808 )	2025-06-13 17:20:25 -07:00
Kevin Lin	8148dab8a8	feat: tune file tools prompting (#2652 )	2025-06-13 16:24:59 -07:00
cthomas	97986b0f03	feat: send stop reason in letta APIs (#2789 )	2025-06-13 16:04:48 -07:00
Kevin Lin	a8895ad017	feat: add reasoning models to `integration_test_send_message` (#2710 )	2025-06-13 14:54:37 -07:00
Andy Li	562a404e27	fix: metrics imports (#2801 )	2025-06-13 14:39:24 -07:00
Andy Li	bf7bd3b905	fix: metric tracking (#2785 )	2025-06-13 13:53:10 -07:00
Matthew Zhou	b6affd1a16	feat: Add optional pip requirements to tool object (#2793 )	2025-06-13 13:20:36 -07:00
Andy Li	0e45a90609	chore: workflow consolidatation and fern caching for tests on pr (#2787 )	2025-06-13 12:33:53 -07:00
Kian Jones	aeadd1aebe	fix(ci): autosummarize test fails (pydantic version dependant?) (#2792 )	2025-06-13 12:16:48 -07:00
Andy Li	336896dc5c	feat: redis user caching (#2774 )	2025-06-12 17:32:07 -07:00
Shangyin Tan	22b640a5dd	fix: adjust memory summary's recall message count (#2778 )	2025-06-12 17:11:36 -07:00
cthomas	8f13c078ef	feat: add stop reason object (#2783 )	2025-06-12 16:54:29 -07:00
cthomas	ccb203b264	feat: consolidate logic for finish tokens (#2779 )	2025-06-12 15:24:06 -07:00
Kevin Lin	916fe6c1da	fix: use `max_completion_tokens` in chat completion request (#2764 )	2025-06-12 10:58:23 -07:00
Sarah Wooders	6be55bd982	chore: 0.8.4 release (#2680 )	2025-06-12 09:33:25 -07:00
Sarah Wooders	af70e7c9e3	chore: bump version 0.8.4	2025-06-11 23:20:42 -07:00
Sarah Wooders	b1949c7d17	Merge branch 'main' into 0.8.4-release	2025-06-11 23:18:44 -07:00
Charles Packer	5498061e58	docs: add vibecoding prompts to docs site (#2579 ) Co-authored-by: Sarah Wooders <sarahwooders@gmail.com>	2025-06-11 22:10:17 -07:00
Matthew Zhou	28fdaaff74	fix: Fix claude sonnet 3_7 tests (#2767 )	2025-06-11 17:53:03 -07:00
Kevin Lin	78dad0db7a	fix: patch reasoning models (#2703 ) Co-authored-by: Charles Packer <packercharles@gmail.com>	2025-06-11 17:20:04 -07:00
Matthew Zhou	d0c2ef89ea	feat: Add errors when tool call violates tool rules (#2766 )	2025-06-11 17:12:39 -07:00
cthomas	03f4867cbe	fix: composio tools with array args (#2762 )	2025-06-11 16:24:37 -07:00
cthomas	cf357b931d	feat: bump letta client version (#2759 )	2025-06-11 13:39:02 -07:00
cthomas	aedc95056a	test: add new agent fixture to send message test (#2758 )	2025-06-11 11:48:34 -07:00
cthomas	654f6b5053	feat: make tool calls required for model proxy (#2756 )	2025-06-11 11:36:45 -07:00
cthomas	f3bee4658c	fix: not enough values to unpack error (#2753 )	2025-06-11 11:05:46 -07:00
Matthew Zhou	d616a99916	fix: Get rid of lru cache (#2748 )	2025-06-11 10:44:36 -07:00
cthomas	1bf54e1d6c	fix: test managers failure (#2754 )	2025-06-11 10:41:03 -07:00
Matthew Zhou	ba3d59bba5	feat: Add prompting to guide tool rule usage (#2742 )	2025-06-10 16:21:27 -07:00
Matthew Zhou	8ced9e57ba	feat: Fix test managers and patch default org id (#2746 )	2025-06-10 16:20:36 -07:00
cthomas	883050e761	feat: set request heartbeat for max steps (#2739 )	2025-06-10 15:26:07 -07:00
cthomas	484a6f1d37	fix: parallel tool calling OpenAI (#2738 )	2025-06-10 14:27:01 -07:00
cthomas	6a23bc743d	feat: support multi content part input (#2717 )	2025-06-10 13:36:17 -07:00
Matthew Zhou	530f9488b6	fix: Turn parallel tool calling off for OpenAI (#2737 )	2025-06-10 13:27:00 -07:00
Matthew Zhou	96b8f155d8	fix: Turn off parallel tool calling for Claude (#2736 )	2025-06-10 13:04:20 -07:00
cthomas	142a4f0ed8	fix: async loop eligibility check (#2727 )	2025-06-10 12:56:01 -07:00
Matthew Zhou	a5ee49e579	feat: Remove debug artifacts (#2734 )	2025-06-10 12:26:45 -07:00
Matthew Zhou	c4d7abb0ff	feat: Cache actor calls (#2732 )	2025-06-10 12:15:13 -07:00
Andy Li	9b91c94d48	feat: otel endpoint tracking for messages (#2715 )	2025-06-10 11:02:36 -07:00
Andy Li	617d208bcc	fix: list ollama embeddings models (#2711 )	2025-06-10 10:44:26 -07:00

1 2 3 4 5 ...

3072 Commits