* feat: add provider trace backend abstraction for multi-backend telemetry Introduces a pluggable backend system for provider traces: - Base class with async/sync create and read interfaces - PostgreSQL backend (existing behavior) - ClickHouse backend (via OTEL instrumentation) - Socket backend (writes to Unix socket for crouton sidecar) - Factory for instantiating backends from config Refactors TelemetryManager to use backends with support for: - Multi-backend writes (concurrent via asyncio.gather) - Primary backend for reads (first in config list) - Graceful error handling per backend Config: LETTA_TELEMETRY_PROVIDER_TRACE_BACKEND (comma-separated) Example: "postgres,socket" for dual-write to Postgres and crouton 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * feat: add protocol version to socket backend records Adds PROTOCOL_VERSION constant to socket backend: - Included in every telemetry record sent to crouton - Must match ProtocolVersion in apps/crouton/main.go - Enables crouton to detect and reject incompatible messages 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * fix: remove organization_id from ProviderTraceCreate calls The organization_id is now handled via the actor parameter in the telemetry manager, not through ProviderTraceCreate schema. This fixes validation errors after changing ProviderTraceCreate to inherit from BaseProviderTrace which forbids extra fields. 🐙 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta <noreply@letta.com> * consolidate provider trace * add clickhouse-connect to fix bug on main lmao * auto generated sdk changes, and deployment details, and clikchouse prefix bug and added fields to runs trace return api * auto generated sdk changes, and deployment details, and clikchouse prefix bug and added fields to runs trace return api * consolidate provider trace * consolidate provider trace bug fix --------- Co-authored-by: Letta <noreply@letta.com>
53 lines
1.8 KiB
Python
53 lines
1.8 KiB
Python
"""Factory for creating provider trace backends."""
|
|
|
|
from functools import lru_cache
|
|
|
|
from letta.services.provider_trace_backends.base import ProviderTraceBackend, ProviderTraceBackendClient
|
|
|
|
|
|
def _create_backend(backend: ProviderTraceBackend | str) -> ProviderTraceBackendClient:
|
|
"""Create a single backend instance."""
|
|
from letta.settings import telemetry_settings
|
|
|
|
backend_str = backend.value if isinstance(backend, ProviderTraceBackend) else backend
|
|
|
|
match backend_str:
|
|
case "clickhouse":
|
|
from letta.services.provider_trace_backends.clickhouse import ClickhouseProviderTraceBackend
|
|
|
|
return ClickhouseProviderTraceBackend()
|
|
|
|
case "socket":
|
|
from letta.services.provider_trace_backends.socket import SocketProviderTraceBackend
|
|
|
|
return SocketProviderTraceBackend(socket_path=telemetry_settings.socket_path)
|
|
|
|
case "postgres" | _:
|
|
from letta.services.provider_trace_backends.postgres import PostgresProviderTraceBackend
|
|
|
|
return PostgresProviderTraceBackend()
|
|
|
|
|
|
@lru_cache(maxsize=1)
|
|
def get_provider_trace_backends() -> list[ProviderTraceBackendClient]:
|
|
"""
|
|
Get all configured provider trace backends.
|
|
|
|
Returns cached singleton instances for each configured backend.
|
|
Supports multiple backends for dual-write scenarios (e.g., migration).
|
|
"""
|
|
from letta.settings import telemetry_settings
|
|
|
|
backends = telemetry_settings.provider_trace_backends
|
|
return [_create_backend(b) for b in backends]
|
|
|
|
|
|
def get_provider_trace_backend() -> ProviderTraceBackendClient:
|
|
"""
|
|
Get the primary (first) configured provider trace backend.
|
|
|
|
For backwards compatibility and read operations.
|
|
"""
|
|
backends = get_provider_trace_backends()
|
|
return backends[0] if backends else _create_backend("postgres")
|