Tag

operational-judgment

AI Quality Systems Series #1: AI Quality Is an Operating System

A concrete operator guide to ai quality is an operating system: what changes, who owns it, how to verify it, and where it breaks.

AI Quality Systems Series #3: Gold Sets and Regression Tests

A concrete operator guide to gold sets and regression tests: what changes, who owns it, how to verify it, and where it breaks.

AI Quality Systems Series #2: Evals Are the Quality Bar

A concrete operator guide to evals are the quality bar: what changes, who owns it, how to verify it, and where it breaks.

AI Quality Systems Series #5: Confidence Thresholds and Escalation

A concrete operator guide to confidence thresholds and escalation: what changes, who owns it, how to verify it, and where it breaks.

AI Quality Systems Series #4: Judgment Calibration

A concrete operator guide to judgment calibration: what changes, who owns it, how to verify it, and where it breaks.

AI Quality Systems Series #8: Quality Ownership

A concrete operator guide to quality ownership: what changes, who owns it, how to verify it, and where it breaks.

AI Quality Systems Series #7: Failure Analysis and Drift

A concrete operator guide to failure analysis and drift: what changes, who owns it, how to verify it, and where it breaks.

AI Quality Systems Series #6: Review Loops That Improve the System

A concrete operator guide to review loops that improve the system: what changes, who owns it, how to verify it, and where it breaks.
You've successfully subscribed to Antoine Buteau
Great! Next, complete checkout to get full access to all premium content.
Welcome back! You've successfully signed in.
Unable to sign you in. Please try again.
Success! Your account is fully activated, you now have access to all content.
Error! Stripe checkout failed.
Success! Your billing info is updated.
Error! Billing info update failed.