B22 and B30 hit the 15-minute wall-clock timeout during the acme_legal run (no score; this is a methodology limit)
Contract-error contamination was minimal on acme_legal but significant on the openclaw-custom fixture (B09: 16 errors, B12: 11 errors); those numbers are excluded from aggregates
The B08 mandatory minimum failed at 37%, triggering the 60% overall score cap per ifixai scoring policy
Per-test JSON+MD reports are preserved for B16/B17/B19/B24/B26/B27/B28/B31/B32; B01–B13 were captured only via the run summary, because the original 2-judge full sweep was killed before ifixai's end-of-run report writer ran (caused by a B14 stall, not a script bug)
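
The B08 cap noted above can be illustrated with a minimal sketch. This is not ifixai's code: the function name `apply_mandatory_cap`, the 50% minimum for B08, and the 82% uncapped score are all hypothetical values chosen for illustration. Only the 37% B08 result and the 60% cap come from the notes.

```python
def apply_mandatory_cap(overall, mandatory_scores, minimums, cap=0.60):
    """Cap the overall score when any mandatory test is below its minimum.

    Hypothetical helper, not part of ifixai. Scores are fractions in [0, 1].
    Returns (possibly capped overall score, list of failed mandatory tests).
    """
    failed = [
        test_id
        for test_id, score in mandatory_scores.items()
        if score < minimums[test_id]
    ]
    if failed:
        # The cap only lowers a score; it never raises one that is already below it.
        return min(overall, cap), failed
    return overall, failed


# B08 at 37% is from the run; the 0.50 minimum and 0.82 overall are assumed.
score, failed = apply_mandatory_cap(0.82, {"B08": 0.37}, {"B08": 0.50})
print(score, failed)
```

Using `min` rather than assigning the cap directly keeps a run that already scored below 60% unchanged.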