coding_assistant

coding_assistant ¶

coding_assistant scorer — test-based evaluation of bug fixes.

Extracts fixed code from model output, runs the test suite, and computes: - fix_rate: fraction of originally-failing tests now passing - regressions: count of originally-passing tests that broke

Classes¶

CodingAssistantScorer ¶

CodingAssistantScorer(judge_backend=None, judge_model: str = '')

Bases: Scorer

Score coding assistant bug fixes by test execution.

Source code in src/openjarvis/evals/scorers/coding_assistant.py

def __init__(self, judge_backend=None, judge_model: str = "") -> None:
    pass