Case finder

Find the right file without naming the trap.

Filter by broad issue, setting, evidence format, difficulty, and local progress. Precise concepts stay out of view until the case replay.

Matched files

25 cases

View pathways
Case 01unopened

The Dashboard Spike

A launch-week chart jumps, a deck is due, and several teams have reasons to claim the movement.

introProduct analytics8 minchartaudio
Case 02unopened

The Checkout Readout

A checkout readout lands just before planning closes, and different artifacts point toward different launch stories.

standardExperimentation10 minchartaudio
Case 03unopened

The Churn Model Pitch

A polished retention model arrives with a renewal deadline, a crowded outreach queue, and a promise that the save team can act sooner.

standardML evaluation12 minmodel-outputtable
Case 04unopened

The Inspection Queue

A city inspection team has a new routing screen, a long backlog, and one week to decide how much authority the score should have.

standardPublic policy analytics12 minmappolicy
Case 05unopened

The Spring Tutoring Brief

A district impact brief is headed to a funding vote after students who used a tutoring platform show stronger spring gains.

standardEducation analytics11 minpress-releasechart
Case 06unopened

The Winter Shelter Forecast

A city housing office must set winter overflow capacity from a forecast that fits ordinary nights better than pressure weeks.

standardPublic service forecasting12 mincharttimeline
Case 07unopened

The Benefits Queue Score

A state benefits agency wants to use a verification score to cut backlog, but the burden may land unevenly on applicants with messier administrative records.

advancedGovernment benefits analytics12 minmodel-outputpolicy
Case 08unopened

The Claimant Chatbot

A benefits agency chatbot handles routine questions well, but evaluation logs show confident wrong answers on high-stakes claim situations.

standardPublic sector AI evaluation12 mintranscriptrubric
Case 09unopened

The Payment Hold Dial

The same benefits agency must choose a payment-hold threshold that catches fraud without turning suspicion into broad payment delay.

standardGovernment risk operations12 minchartsimulator
Case 10unopened

The Clearance Rate Metric

The benefits modernization program changes its executive metric, and the new dashboard may reward faster closure while hiding reopened cases and payment delay.

standardPublic administration analytics11 mintablememo
Case 11unopened

The Survey Sample Mirage

A customer research survey appears decisive until response patterns reveal who never had a real chance to answer.

standardSurvey analytics12 mintablechart
Case 12unopened

The Bed-Ready Field

A familiar hospital operations field powers a clean improvement story while source systems leave conflicting traces.

standardHealthcare operations12 mintablememo
Case 13unopened

The Missingness Report

A clinical risk report looks stable after dropping incomplete records, but missingness follows staffing, language access, and acuity.

standardClinical analytics12 minheatmaptable
Case 14unopened

The Privacy-Safe Export

A de-identified public health export clears a checklist, but linkage, consent scope, and lifecycle controls make the release less simple.

advancedData governance12 minpolicymemo
Case 15unopened

The Board Slide

A board packet turns an early operational shift into a dramatic story, and the chart frame is doing more work than it first appears.

introExecutive reporting9 minchartmemo
Case 16unopened

The Geo Test Winner

A regional media test appears to win, but market matching, spillover, seasonality, and operational changes keep the counterfactual unsettled.

standardRetail media12 mincharttable
Case 17unopened

The Parallel Trends Slide

A policy brief claims a workforce pilot raised employment, but the comparison group was already drifting away before launch.

standardLabor policy12 mincharttable
Case 18unopened

The Cutoff Policy Claim

An eligibility cutoff seems to prove a rental-assistance navigator prevented evictions, until sorting around the threshold weakens the design.

advancedBenefits eligibility13 mincharttable
Case 19unopened

The QuickStart Readout

A product experiment gets a fast no-go recommendation, but the exposure record and interval width leave more than one interpretation alive.

standardProduct experimentation12 mincharttable
Case 20unopened

The Short-Term Lift

A subscription checkout test lifts paid starts, but refunds, retention, and support burden make the growth claim less settled.

standardSubscription growth12 mincharttable
Case 21unopened

The Discharge Score

A hospital readmission score looks unusually strong in validation, and the launch team wants it in the discharge workflow next month.

advancedHospital readmission13 mincharttable
Case 22unopened

The Labeling Vendor Benchmark

A moderation model beats the old rules engine on a vendor benchmark, but the benchmark labels may be measuring vendor behavior more than policy truth.

advancedTrust and safety13 mincharttable
Case 23unopened

The Drift Alarm Nobody Owned

An ETA model drift alarm is real, but the deeper failure is that monitoring is not connected to owned operational response.

advancedLogistics ETA13 mincharttable
Case 24unopened

The Holiday Override

A holiday replenishment model looks accurate enough to override planners, but store operations leave clues that demand may not be fully visible.

advancedRetail supply chain13 mincharttable
Case 25unopened

The DealDesk Pilot

An enterprise assistant performs well on clean sales workflows, and revenue operations wants tool-enabled expansion before renewal season.

advancedEnterprise AI13 mincharttable