coding_task
Coding task scorer: test pass rate plus structural validation.
Extracts the function/class from model output and runs test cases to determine correctness.
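
The description above can be illustrated with a minimal sketch. This is not the scorer's actual implementation: the names `extract_code`, `has_definition`, and `score_coding_task`, the `(args, expected)` test case format, and the returned dictionary shape are all assumptions made for illustration. It assumes the model output is Python, possibly wrapped in a fenced code block, and that "structural validation" means checking for a top-level function or class with the expected name.

```python
import ast
import re


def extract_code(model_output: str) -> str:
    """Return the body of the first fenced code block, or the raw text if none."""
    # `{3} matches a run of three backticks (a code fence).
    match = re.search(r"`{3}(?:python)?\s*\n(.*?)`{3}", model_output, re.DOTALL)
    return match.group(1) if match else model_output


def has_definition(source: str, name: str) -> bool:
    """Structural check: does the source define a top-level function or class `name`?"""
    try:
        tree = ast.parse(source)
    except SyntaxError:
        return False
    kinds = (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)
    return any(isinstance(node, kinds) and node.name == name for node in tree.body)


def score_coding_task(model_output: str, entry_point: str, test_cases: list) -> dict:
    """Score one output. `test_cases` is a list of (args_tuple, expected) pairs (hypothetical format)."""
    source = extract_code(model_output)
    if not has_definition(source, entry_point):
        return {"structure_ok": False, "pass_rate": 0.0}

    namespace: dict = {}
    try:
        # WARNING: exec runs untrusted code in-process; a real scorer would need a sandbox.
        exec(source, namespace)
    except Exception:
        return {"structure_ok": True, "pass_rate": 0.0}

    func = namespace[entry_point]
    passed = 0
    for args, expected in test_cases:
        try:
            if func(*args) == expected:
                passed += 1
        except Exception:
            pass  # a raising test case counts as a failure
    rate = passed / len(test_cases) if test_cases else 0.0
    return {"structure_ok": True, "pass_rate": rate}
```

In this sketch, an output that defines the expected function but passes only one of two test cases would score `{"structure_ok": True, "pass_rate": 0.5}`, while an output with no parseable definition scores a pass rate of 0.0 without any tests being run.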