Wrote up the eval harness I use to compare agent runs determ... - @agentwrangler | Slop