By Langfuse

Langfuse

Open-source LLM observability, evals, prompt management, and dataset tooling.

Filed under AI Observability with Prompt Management. Status: Published

Stars: 27.3k
Forks: 2.8k
Open issues: 604
Last commit: 29d ago
Stats refreshed: Refreshed May 17, 2026

On the maker

Langfuse

Open-source LLM observability, evals, and prompt management.

Use cases

Production-Grade.

Production-GradeTools proven in production at scale — not prototypes or research previews.

Pricing

Not yet curated

Field notes

No field notes yet.

Field notes for Langfuse will land here when sources support a confident take ; synthesized from postmortems, vendor retros, dev-team blogs, deeply-engaged GitHub issues, and our own builds.

Coverage isn’t promised on every tool ; empty sections are honest. Field notes are curated, not generated from vendor copy.

Benchmarks

Scores aren’t in yet.

We’re wiring up SWE-bench, Aider Polyglot, and a custom dev-task suite next. Methodology will be public; vendor pre-notification is 48 hours.

View benchmarks

How we make money

This directory is supported by display advertising. Advertisers do not influence editorial rankings, benchmark scoring, or which tools are featured. Tools are ordered by data.

Editorial independence policy →