~/aboneto.dev/category/Continuous Calibration
#Continuous Calibration
posts:1 last_update:2026-05-13 avg_read:~9min
// all posts — ordered by date desc
LLM-as-a-Judge: how to build scalable AI evaluation pipelines
Prompts in systems with Large Language Models (LLMs) don't behave like deterministic code. In traditional software development, if you modify a function and all tests pass, you can...