promptdojo_

Your refund-fraud tool feeds a small review queue. Missing real fraud costs the company about $5,000 per case. A false alarm costs a reviewer two minutes to clear. Leadership says "do not let fraud through."

Which way should you tune the threshold, and which metric are you protecting?