TruLens Eval
Piotr Mardziel edited this page Jun 28, 2024 · 4 revisions
- Awaitable and generator method inputs/outputs not properly recorded.
- Conversation/session tracking, display, feedbacks.
- Correlations between feedback results.
- Feedback clustering (f1, f2 behave similarly, f3, f4 behave similarly).
- Derived metrics/feedbacks: e.g. "Does high context relevance imply low abstain?"
- Statistical validity computation and presentation.
- Safety with LlamaGuard
- Goal: Export records to otel.
- Goal: Record via otel interface.
- Goal: Record and view other tools' instrumented spans alongside/within ours.
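
To make the "correlations between feedback results" item concrete, here is a minimal sketch of a pairwise Pearson correlation over per-record feedback scores. It does not use any TruLens API: the functions `pearson` and `correlation_matrix`, the feedback names, and the scores are all hypothetical and exist only for illustration. It assumes each feedback yields one numeric score per record, in the same record order.

```python
from math import sqrt
from statistics import mean
from typing import Dict, List, Tuple


def pearson(xs: List[float], ys: List[float]) -> float:
    """Pearson correlation of two equally long score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length >= 2")
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # A constant feedback has no defined correlation.
        return float("nan")
    return cov / sqrt(var_x * var_y)


def correlation_matrix(
    scores: Dict[str, List[float]]
) -> Dict[Tuple[str, str], float]:
    """Correlation for every unordered pair of feedbacks (names sorted)."""
    names = sorted(scores)
    return {
        (a, b): pearson(scores[a], scores[b])
        for i, a in enumerate(names)
        for b in names[i + 1:]
    }


if __name__ == "__main__":
    # Hypothetical per-record scores, not real TruLens output.
    example = {
        "context_relevance": [0.1, 0.4, 0.8, 0.9],
        "abstain": [0.9, 0.6, 0.2, 0.1],
    }
    for pair, r in correlation_matrix(example).items():
        print(pair, round(r, 3))
```

A strongly negative value for a pair such as (`abstain`, `context_relevance`) would be one way to approach the derived question "Does high context relevance imply low abstain?", and grouping feedbacks whose pairwise correlations are high is one possible starting point for the clustering item. Correlation alone says nothing about statistical validity, which the list tracks separately.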