sft_trainer
sft_trainer
¶
General-purpose SFT trainer -- fine-tune any local LM on trace-derived pairs.
Delegates to :class:LoRATrainer from training/lora.py when use_lora=True.
Supports train(trace_store) for end-to-end pipeline and train_on_pairs()
for pre-extracted data.
Classes¶
SFTTrainer
¶
SFTTrainer(config: SFTConfig)
General-purpose supervised fine-tuning trainer.
| PARAMETER | DESCRIPTION |
|---|---|
config
|
SFTConfig controlling model, LoRA params, and training hyperparams.
TYPE:
|
Source code in src/openjarvis/learning/intelligence/sft_trainer.py
Attributes¶
target_module_list
property
¶
Parse comma-separated target_modules string into a list.
Functions¶
train
¶
End-to-end: mine SFT pairs from traces, then train.
| PARAMETER | DESCRIPTION |
|---|---|
trace_store
|
Object with
TYPE:
|
| RETURNS | DESCRIPTION |
|---|---|
dict with at least ``status`` key.
|
|
Source code in src/openjarvis/learning/intelligence/sft_trainer.py
train_on_pairs
¶
Train on pre-extracted SFT pairs.
| PARAMETER | DESCRIPTION |
|---|---|
pairs
|
List of dicts with at least
TYPE:
|
| RETURNS | DESCRIPTION |
|---|---|
dict with ``status``, ``training_samples``, and training metrics.
|
|