Principal Software Engineer, DataRobot
LLM judges can mislead you. We built a human-labeled dataset and tested alternatives to uncover what works best. Read the blog to see the results.
Discover how to find “silver bullet” agentic AI flows that boost accuracy and cut latency — plus the full list of 23 top-performing setups.
Explore syftr, an open source framework for discovering Pareto-optimal generative AI workflows. Learn how to optimize for accuracy, cost, and latency in real-world use cases.