multi
multi
¶
Multi-engine wrapper — routes requests to the right backend by model name.
Classes¶
MultiEngine
¶
MultiEngine(engines: list[tuple[str, InferenceEngine]])
Bases: InferenceEngine
Wraps multiple engines and routes by model name.
Models from each engine are discovered via list_models().
When generate() or stream() is called, the model name
is looked up to find which engine owns it.
Source code in src/openjarvis/engine/multi.py
Functions¶
stream_full
async
¶
stream_full(messages: Sequence[Message], *, model: str, **kwargs: Any) -> AsyncIterator['StreamChunk']
Delegate stream_full() to the engine that owns the model.