Tasks · chi-bench-v1.0.0

75 long-horizon healthcare workflow tasks

Three domains · 25 tasks each. Each task is a single end-to-end clinical or administrative workflow scored with a rubric judge. Click a task to read the agent-facing instructions.

Prior Authorization
25 tasks
Utilization Management
25 tasks
Care Management
25 tasks
25 / 75