Model Family Catalog

Overview

This page is the human-readable rendering of contracts/model_family_catalog.json.

Use it to answer three distinct questions without weakening the public meaning of the support matrix:

  • What is supported as a public lane?
  • What families are implemented in code but not publicly supported?
  • What families or capabilities should be added next?

Support Tier vs Coverage State

TermMeaningSource of truth
support tierPublic support/assurance posture for a declared lane. Values stay aligned with support_matrix.json.contracts/support_matrix.json
coverage stateRepo implementation maturity outside the public support matrix, such as profile_first_class, profile_shared_alias, auto_or_loader_only, loader_only, or backlog states.contracts/model_family_catalog.json
lifecycle classificationWhether a lane, catalog family, or candidate is published, backlog, blocked, smoke-only, usage-only, or out of scope.contracts/model_classification.json

The support matrix remains strict. The model family catalog is broader by design and records code-level visibility, usage-only checkpoints, and recommended additions. The model classification contract records the promotion decision and blocker state for those entries, including the blocked named checkpoint list used by repo checks. The support matrix is ordered by evidence readiness: baseline fixtures, published decoder evidence, repo-maintained experimental lanes, community candidate backlog, and concrete task-family bases. Within those sections, related families and scale points stay adjacent without adding another visible column. Access-gated vendor checkpoints are intentionally kept out of included preset inventory.

Declared Support

FamilyStateRepresentative modelsNotes
GPT-2 causal LMpublished_basisopenai-community/gpt2Public lane derived from gpt2-causal-hf.
BERT / RoBERTa MLMpublished_basisbert-base-uncased, roberta-basePublic lane derived from bert-mlm-hf.
Mistral 7B causal LMpublished_basismistralai/Mistral-7B-v0.1Public container-backed evidence fixture is included, plus a real guard-value scenario package with PM-pass, baseline-relative spectral/RMT/variance evidence.
Ministral 3 8B causal LM (text-only eval)published_basismistralai/Ministral-3-8B-Instruct-2512-BF16Public container-backed evidence fixture is included.
Ministral 3 14B causal LM (text-only eval)published_basismistralai/Ministral-3-14B-Instruct-2512-BF16Public container-backed evidence fixture is included.
TinyLlama 1.1B causal LMpublished_basisTinyLlama/TinyLlama-1.1B-Chat-v1.0Public container-backed evidence fixture is included.
OLMo 2 7B causal LMpublished_basisallenai/OLMo-2-1124-7BPublic container-backed evidence fixture is included.
OLMo 2 13B causal LMpublished_basisallenai/OLMo-2-1124-13B-InstructPublic container-backed evidence fixture is included.
OLMoE 1B-active/7B-total causal LMpublished_basisallenai/OLMoE-1B-7B-0924Public release-profile container-backed evidence fixture is included for the smaller MoE validation basis.
OpenLLaMA 7B causal LMpublished_basisopenlm-research/open_llama_7bPublic release-profile container-backed evidence fixture is included.
Falcon 7B causal LMpublished_basistiiuae/falcon-7bPublic release-profile container-backed evidence fixture is included.
Qwen2 7B causal LMpublished_basisQwen/Qwen2-7BPublic container-backed evidence fixture is included.
Qwen2.5 7B causal LMpublished_basisQwen/Qwen2.5-7BPublic container-backed evidence fixture is included.
Qwen2.5 14B causal LMpublished_basisQwen/Qwen2.5-14BPublic container-backed evidence fixture is included.
Qwen3 causal LMpublished_basisQwen/Qwen3-8BPublic container-backed evidence fixture is included.
DeepSeek-R1-Distill-Qwen causal LMpublished_basisdeepseek-ai/DeepSeek-R1-Distill-Qwen-7BPublic container-backed evidence fixture is included.
DeepSeek-R1-0528-Qwen3 8B causal LMpublished_basisdeepseek-ai/DeepSeek-R1-0528-Qwen3-8BContainer-backed public report, runtime manifest, and signed evidence pack are included.
DeepSeek-R1-Distill-Qwen 14B causal LMpublished_basisdeepseek-ai/DeepSeek-R1-Distill-Qwen-14BPublic release-profile container-backed evidence fixture is included.
Phi-4 causal LM (text-only eval)published_basismicrosoft/Phi-4-reasoning-plusPublic text-only container-backed evidence fixture is included; guard-overhead measurement is skipped by preset policy.
Gemma 4 E2B causal LM (text-only eval)published_basisgoogle/gemma-4-E2B-itPublic release-profile container-backed evidence fixture is included. Image-text evaluation uses the explicit hf_multimodal + vision_text path.
Qwen3.5 causal LMpublished_basisQwen/Qwen3.5-9BPublic container-backed evidence fixture is included.
Ministral 3 3B causal LM (text-only eval)published_basismistralai/Ministral-3-3B-Instruct-2512-BF16Public release-profile container-backed evidence fixture is included.
Granite 4.1 3B causal LMpublished_basisibm-granite/granite-4.1-3bPublic release-profile container-backed evidence fixture is included.
Granite 4.1 8B causal LMpublished_basisibm-granite/granite-4.1-8bPublic release-profile container-backed evidence fixture is included.
Gemma 4 12B any-to-any LMpublished_basisgoogle/gemma-4-12B-itPublic release-profile image-text evidence fixture is included on pinned public VQAv2 materialization. The no-op report passes strict policy with 0.565 final accuracy over 400 examples and no guard warnings; audio and broader any-to-any behavior remain out of scope.
Gemma 4 E4B image-text LMpublished_basisgoogle/gemma-4-E4B-itPublic release-profile image-text evidence fixture is included on pinned public VQAv2 materialization. The no-op report passes strict policy with 0.500 final accuracy over 400 examples and no guard warnings.
Gemma 4 E2B image-text LMpublished_basisgoogle/gemma-4-E2B-itPublic release-profile image-text evidence fixture is included on pinned public VQAv2 materialization. The no-op report passes strict policy with 0.388 final accuracy over 400 examples and no guard warnings.
Qwen3.5 4B image-text LMpublished_basisQwen/Qwen3.5-4BPublic release-profile image-text evidence fixture is included on pinned public VQAv2 materialization. The no-op report passes strict policy with 0.855 final accuracy over 400 examples and no guard warnings.
Qwen3.5 2B image-text LMpublished_basisQwen/Qwen3.5-2BPublic release-profile container-backed image-text evidence fixture is included on pinned public VQAv2 materialization; it is no-op preservation/null-behavior evidence, not guard-value proof.
SmolLM3 3B causal LMpublished_basisHuggingFaceTB/SmolLM3-3BPublic release-profile container-backed evidence fixture is included; guard-overhead measurement is skipped by preset policy.
Phi-4 mini causal LMpublished_basismicrosoft/Phi-4-mini-instructPublic release-profile container-backed evidence fixture is included.
FLAN-T5 base seq2seq LMpublished_basisgoogle/flan-t5-basePublic release-profile container-backed evidence fixture is included on pinned CNN/DailyMail validation data through hf_seq2seq.

Implemented Coverage

FamilyCoverage stateRepresentative modelsNotes
Qwen3 30B-A3B MoE causal LMpublished_basisQwen/Qwen3-30B-A3B-Instruct-2507Public release-profile evidence fixture is included on public WikiText-103 with all-8 80GB-GPU sharding and scoped attention/router/shared-expert guard scans; it is no-op preservation evidence, not benchmark-quality, exhaustive expert-bank, or MoE routing-quality assurance.
Gemma 4 26B-A4B MoE image-text LMpublished_basisgoogle/gemma-4-26B-A4B-itPublic release-profile image-text evidence fixture is included on pinned public VQAv2 materialization. The no-op report passes strict policy with 0.555 final accuracy over 400 examples and no guard warnings; it is not audio, exhaustive expert-bank, or MoE routing-quality evidence.
Mixtral 8x7B MoE causal LMpublished_basismistralai/Mixtral-8x7B-v0.1Public release-profile evidence fixture is included as a no-op preservation basis with full guard scans; it is not a benchmark-quality or MoE routing-quality claim, and guard-overhead measurement is skipped by preset policy.
Llamaprofile_first_classTinyLlama/TinyLlama-1.1B-Chat-v1.0Generic Llama-family profile handling is first-class. OpenLLaMA and TinyLlama provide ungated declared support lanes, while access-gated vendor checkpoints remain omitted.
Qwen family aliases (Qwen1.5/Qwen2.5/Qwen3 naming)profile_first_classQwen/Qwen2.5-14B, Qwen/Qwen3.5-9B, Qwen/Qwen3.5-4BShared qwen-family heuristics cover aliases beyond the declared text-only Qwen2, Qwen2.5 14B, Qwen3, and Qwen3.5 9B public lanes. Qwen3.5 2B and Qwen3.5 4B have published image-text evidence through hf_multimodal and pinned public VQAv2 materialization.
Yiprofile_first_class01-ai/Yi-34BTreated as a RoPE decoder family in profile logic.
Phi familyprofile_first_classmicrosoft/Phi-3-mini-4k-instruct, microsoft/Phi-4-reasoning-plusDedicated phi-family selectors exist. Phi-4 has a declared published text-only lane, while multimodal Phi-4 remains backlog-only.
Gemma familyprofile_first_classgoogle/gemma-4-E2B-it, google/gemma-4-E4B-it, google/gemma-4-12B-it, google/gemma-4-26B-A4B-itGemma-family selectors and loaders remain first-class for compatible local or user-supplied checkpoints. Repo-declared published Gemma support includes Gemma 4 E2B text-only plus Gemma 4 E2B, E4B, 12B, and 26B-A4B image-text published-basis evidence.
OPT / GPT-NeoX / GPT-Jprofile_shared_aliasEleutherAI/gpt-neox-20bAvailable through shared GPT-style paths. The common OPT-1.3B hosted checkpoint is intentionally not named in repo support inventory because its license is not Apache-2.0 or MIT.
GPT-OSSprofile_first_classopenai/gpt-oss-20bDedicated profile selectors and HF causal decoder spec cover the open-weight checkpoint directly.
Falconprofile_shared_aliastiiuae/falcon-7bFalcon 7B has a declared support lane; remaining Falcon-family coverage is available through adapter-auto heuristics and variant-path tests.
GLMauto_or_loader_onlylocal-glm-compatible-checkpointVisible through adapter-auto heuristics only. The public catalog intentionally avoids naming hosted GLM chat checkpoints that fall outside the repo's Apache-2.0/MIT named-checkpoint policy.
DeepSeekprofile_first_classdeepseek-ai/DeepSeek-R1-Distill-Qwen-7BDeepSeek distill checkpoints share the qwen-family route. DeepSeek-R1-Distill-Qwen 7B has a declared published lane; oversized FP8 checkpoint-specific repo hooks and included configs are omitted because they do not fit the supported hardware/runtime path.
Broader BERT-like MLMs (DistilBERT/ALBERT/DeBERTa/ELECTRA)auto_or_loader_onlydistilbert-base-uncased, microsoft/deberta-v3-baseLoader/auto support exceeds the public BERT / RoBERTa lane.
Broader seq2seq families (mBART/PEGASUS/Marian)auto_or_loader_onlyfacebook/mbart-large-50Loader support is broader than the FLAN-T5 public seq2seq basis. CC-BY-4.0-only hosted checkpoints are intentionally not named because they are outside the repo's strict Apache-2.0/MIT named-checkpoint policy.

Usage Only

FamilyStateRepresentative modelsNotes
Qwen2.5 32Busage_onlyQwen/Qwen2.5-32BUsed in evidence-pack suites and validation defaults outside the declared Qwen2.5 14B support lane.
Yi 34Busage_only01-ai/Yi-34BUsed in workshop and full evidence-pack suites.

<=14B Text Candidate Inventory

This section summarizes the contract-tracked <=14B text and MLM candidates that sit outside, adjacent to, or have recently graduated into declared support.

It is a catalog view, not a run ledger. Exact criterion-by-criterion status and decision codes live under promotion_candidates_text_le_14b in contracts/model_family_catalog.json.

FamilyRepresentative modelPromotion statusCatalog locationNotes
Qwen2.5 7B causal LMQwen/Qwen2.5-7Bpromoted_published_basispublished_basisPromoted with container-backed public report, runtime manifest, and signed evidence pack.
Qwen2.5 14B causal LMQwen/Qwen2.5-14Bpromoted_published_basispublished_basisPromoted with container-backed public report, runtime manifest, and signed evidence pack.
Qwen3 8B causal LMQwen/Qwen3-8Bpromoted_published_basispublished_basisPromoted with container-backed public report, runtime manifest, and signed evidence pack.
DeepSeek-R1-Distill-Qwen causal LMdeepseek-ai/DeepSeek-R1-Distill-Qwen-7Bpromoted_published_basispublished_basisPromoted with container-backed public report, runtime manifest, and signed evidence pack.
Phi-4 reasoning-plus causal LMmicrosoft/Phi-4-reasoning-pluspromoted_published_basispublished_basisPromoted with container-backed public report, runtime manifest, and signed evidence pack; this fixture is text-only and skips guard-overhead measurement by preset policy.
OpenLLaMA 7B causal LMopenlm-research/open_llama_7bpromoted_published_basispublished_basisPromoted with release-profile container-backed public report, runtime manifest, and signed evidence pack.
Phi-3 Mini 4K Instruct causal LMmicrosoft/Phi-3-mini-4k-instructexplicitly_out_of_scopeimplemented_coverageThe current declared Phi support surface remains the shipped Phi-4 text-only lane.
Falcon 7B causal LMtiiuae/falcon-7bpromoted_published_basispublished_basisPromoted with release-profile container-backed public report, runtime manifest, and signed evidence pack.
Broader BERT-like MLMs (DistilBERT/ALBERT/DeBERTa/ELECTRA)distilbert-base-uncasedblocked_missing_artifactsimplemented_coverageLoader and adapter tests exist for DistilBERT and DeBERTa, and the repo ships a lane preset plus calibration config with dry-run sweep coverage, but approved calibration/evaluation evidence is still missing.
mBART large 50 seq2seqfacebook/mbart-large-50explicitly_out_of_scopeimplemented_coverageFLAN-T5 base supplies the concrete public seq2seq basis; mBART still needs its own evidence.

The machine-readable criterion-by-criterion ledger for this candidate set lives under promotion_candidates_text_le_14b in contracts/model_family_catalog.json.

PriorityFamilyPlanned support modeRepresentative modelsNotes
P2Audio-text evaluation pipelinephase2_audio_evalgoogle/gemma-4-E2B-it, google/gemma-4-E4B-itImage-text evaluation is included. Audio-capable evaluation for the smaller Gemma 4 checkpoints remains deferred.

Promotion Criteria

A family only moves into support_matrix.json after all of the following are present:

  1. explicit adapter/profile recognition
  2. an included preset
  3. an included calibration config
  4. targeted tests
  5. CLI smoke evidence
  6. approved calibration/evaluation evidence