Model Family Catalog
Overview
This page is the human-readable rendering of
contracts/model_family_catalog.json.
Use it to answer three distinct questions without weakening the public meaning of the support matrix:
- What is supported as a public lane?
- What families are implemented in code but not publicly supported?
- What families or capabilities should be added next?
Support Tier vs Coverage State
| Term | Meaning | Source of truth |
|---|---|---|
support tier | Public support/assurance posture for a declared lane. Values stay aligned with support_matrix.json. | contracts/support_matrix.json |
coverage state | Repo implementation maturity outside the public support matrix, such as profile_first_class, profile_shared_alias, auto_or_loader_only, loader_only, or backlog states. | contracts/model_family_catalog.json |
The support matrix remains strict. The model family catalog is broader by design and records code-level visibility, usage-only checkpoints, and recommended additions. Access-gated vendor checkpoints are intentionally kept out of declared support lanes and included preset inventory.
Declared Support
| Family | State | Representative models | Notes |
|---|---|---|---|
| GPT-2 causal LM | published_basis | openai-community/gpt2 | Public lane derived from gpt2-causal-hf. |
| BERT / RoBERTa MLM | published_basis | bert-base-uncased, roberta-base | Public lane derived from bert-mlm-hf. |
| Mistral 7B causal LM | supported_experimental | mistralai/Mistral-7B-v0.1 | Pilot preset and calibration config are included. |
| Ministral 3 causal LM (text-only eval) | supported_experimental | mistralai/Ministral-3-8B-Instruct-2512-BF16, mistralai/Ministral-3-14B-Instruct-2512-BF16 | Text-only pilot presets and calibration configs are included for both 8B and 14B checkpoints. |
| Qwen2 7B causal LM | supported_experimental | Qwen/Qwen2-7B | Pilot preset and calibration config are included. |
| Qwen2.5 7B causal LM | supported_experimental | Qwen/Qwen2.5-7B | Pilot preset and calibration config are included. |
| Qwen2.5 14B causal LM | supported_experimental | Qwen/Qwen2.5-14B | Pilot preset and calibration config are included. |
| Qwen3 causal LM | supported_experimental | Qwen/Qwen3-8B | Pilot preset and calibration config are included. |
| DeepSeek-R1-Distill-Qwen causal LM | supported_experimental | deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | Pilot preset and calibration config are included. |
| Phi-4 causal LM (text-only eval) | supported_experimental | microsoft/Phi-4-reasoning-plus | Text-only pilot preset and calibration config are included. Current HF runtime validation closes cleanly when the lane opts into trust_remote_code. |
| Gemma 4 E2B causal LM (text-only eval) | supported_experimental | google/gemma-4-E2B-it | Text-only pilot preset and calibration config are included. Image-text evaluation uses the explicit hf_multimodal + vision_text path. |
| TinyLlama 1.1B causal LM | supported_experimental | TinyLlama/TinyLlama-1.1B-Chat-v1.0 | Ungated Llama-family pilot lane with included preset and calibration config. |
| OLMo 2 causal LM | supported_experimental | allenai/OLMo-2-1124-7B, allenai/OLMo-2-1124-13B-Instruct | Pilot presets and calibration configs are included for both 7B and 13B scale points. |
| Qwen3.5 causal LM | supported_experimental | Qwen/Qwen3.5-9B | Pilot preset and calibration config are included. |
| Seq2Seq / local pairs | community_experimental | t5-small, facebook/bart-base | Generic seq2seq lane without a published-basis claim. |
Implemented Coverage
| Family | Coverage state | Representative models | Notes |
|---|---|---|---|
| Mixtral | profile_first_class | mistralai/Mixtral-8x7B-v0.1 | Profile and loader code recognize the family directly. |
| Llama | profile_first_class | openlm-research/open_llama_7b, TinyLlama/TinyLlama-1.1B-Chat-v1.0 | Generic Llama-family profile handling is first-class. TinyLlama provides the ungated declared support lane, while access-gated vendor checkpoints remain omitted. |
| Qwen family aliases (Qwen1.5/Qwen2.5/Qwen3 naming) | profile_first_class | Qwen/Qwen2.5-14B, Qwen/Qwen3.5-9B | Shared qwen-family heuristics cover aliases beyond the declared Qwen2, Qwen2.5 14B, Qwen3, and Qwen3.5 lanes, including usage-only Qwen2.5 checkpoints. |
| Yi | profile_first_class | 01-ai/Yi-34B | Treated as a RoPE decoder family in profile logic. |
| Phi family | profile_first_class | microsoft/Phi-3-mini-4k-instruct, microsoft/Phi-4-reasoning-plus | Dedicated phi-family selectors exist. Phi-4 has a declared text-only lane, while multimodal Phi-4 remains backlog-only. |
| Gemma family | profile_first_class | google/gemma-3-4b-it, google/gemma-4-E2B-it | Gemma 3/4 selectors and loaders are first-class. Gemma 4 E2B has a declared text-only lane, image-text evaluation uses hf_multimodal + vision_text, and audio remains deferred. |
| OPT / GPT-NeoX / GPT-J | profile_shared_alias | facebook/opt-1.3b, EleutherAI/gpt-neox-20b | Available through shared GPT-style paths. |
| GPT-OSS | profile_first_class | openai/gpt-oss-20b | Dedicated profile selectors and HF causal decoder spec now cover the open-weight checkpoint directly. |
| Falcon | auto_or_loader_only | tiiuae/falcon-7b | Visible through adapter-auto heuristics only. |
| GLM | auto_or_loader_only | THUDM/glm-4-9b-chat | Visible through adapter-auto heuristics only. |
| DeepSeek | profile_first_class | deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | DeepSeek distill checkpoints share the qwen-family route. Oversized FP8 checkpoint-specific repo hooks and included configs are omitted because they do not fit the supported hardware/runtime path. |
| Broader BERT-like MLMs (DistilBERT/ALBERT/DeBERTa/ELECTRA) | auto_or_loader_only | distilbert-base-uncased, microsoft/deberta-v3-base | Loader/auto support exceeds the public BERT / RoBERTa lane. |
| Broader seq2seq families (mBART/PEGASUS/Marian) | auto_or_loader_only | facebook/mbart-large-50, Helsinki-NLP/opus-mt-en-de | Loader support is broader than the generic seq2seq public lane. |
Usage Only
| Family | State | Representative models | Notes |
|---|---|---|---|
| Qwen2.5 32B | usage_only | Qwen/Qwen2.5-32B | Used in evidence-pack suites and validation defaults outside the declared Qwen2.5 14B support lane. |
| Qwen1.5 72B | usage_only | Qwen/Qwen1.5-72B | Used concretely in evidence-pack suites. |
| Yi 34B | usage_only | 01-ai/Yi-34B | Used in workshop and full evidence-pack suites. |
| Mixtral 8x7B | usage_only | mistralai/Mixtral-8x7B-v0.1 | Used in evidence-pack flows without a public support lane. |
<=14B Text Candidate Inventory
This section summarizes the contract-tracked <=14B text and MLM candidates
that sit outside, adjacent to, or have recently graduated into declared
support.
It is a catalog view, not a run ledger. Exact criterion-by-criterion status and
decision codes live under promotion_candidates_text_le_14b in
contracts/model_family_catalog.json.
| Family | Representative model | Promotion status | Catalog location | Notes |
|---|---|---|---|---|
| Qwen2.5 7B causal LM | Qwen/Qwen2.5-7B | promote_now | supported_experimental | This candidate already graduated into declared support and remains listed here because the contract tracks the reviewed candidate set. |
| OpenLLaMA 7B causal LM | openlm-research/open_llama_7b | blocked_missing_artifacts | implemented_coverage | Recognition and targeted tests exist, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing. |
| Phi-3 Mini 4K Instruct causal LM | microsoft/Phi-3-mini-4k-instruct | explicitly_out_of_scope | implemented_coverage | The current declared Phi support surface remains the shipped Phi-4 text-only lane. |
| Gemma 3 4B IT | google/gemma-3-4b-it | explicitly_out_of_scope | implemented_coverage | The broader Gemma family remains multimodal/audio-capable, so this text-only candidate is not part of the tracked promotion set. |
| OPT 1.3B causal LM | facebook/opt-1.3b | blocked_missing_artifacts | implemented_coverage | Shared GPT-style recognition exists, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing. |
| Falcon 7B causal LM | tiiuae/falcon-7b | blocked_missing_artifacts | implemented_coverage | Recognition and targeted variant-path tests exist, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing. |
| GLM 4 9B Chat | THUDM/glm-4-9b-chat | blocked_missing_artifacts | implemented_coverage | Recognition and targeted variant-path tests exist, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing. |
| Broader BERT-like MLMs (DistilBERT/ALBERT/DeBERTa/ELECTRA) | distilbert-base-uncased | blocked_missing_artifacts | implemented_coverage | Loader and adapter tests exist for DistilBERT and DeBERTa, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing. |
| mBART large 50 seq2seq | facebook/mbart-large-50 | explicitly_out_of_scope | implemented_coverage | Generic seq2seq and community lanes remain outside this tracked candidate set. |
The machine-readable criterion-by-criterion ledger for this candidate set lives
under promotion_candidates_text_le_14b in
contracts/model_family_catalog.json.
Recommended Additions
| Priority | Family | Planned support mode | Representative models | Notes |
|---|---|---|---|---|
P2 | Audio-text evaluation pipeline | phase2_audio_eval | google/gemma-4-E2B-it, google/gemma-4-E4B-it | Image-text evaluation is included. Audio-capable evaluation for the smaller Gemma 4 checkpoints remains deferred. |
Promotion Criteria
A family only moves into support_matrix.json after all of the following are
present:
- explicit adapter/profile recognition
- an included preset
- an included calibration config
- targeted tests
- CLI smoke evidence
- approved calibration/evaluation evidence