Models · The PredictLM family

Two checkpoints. One interface. Open weights.

PredictLM is a family of open-weight tabular foundation models. Same architecture family, same calibrated-uncertainty head, two size tiers. Both Apache-2.0.

Ship recipe · Duo + TTT

0.751 classification accuracy. 0.609 regression R². Locked 25-dataset OpenML benchmark.

The recommended way to use PredictLM is the Duo + TTT recipe: load both models, run ~15 inner Adam steps of self-supervised fine-tuning on the user's in-context examples (test-time training), then average their softmax predictions at w = 0.4. +7.8 pp of classification accuracy and +7.3 pp of regression R² over zero-tuning. Implemented in one Python call.

Classification

0.751

mean accuracy (n=10 datasets)

vs 0.673 Mini-v1 alone (+7.8 pp) · 0.685 Base alone (+6.6 pp)

Regression

0.609

mean R² (n=10 datasets)

vs 0.536 Mini-v1 alone (+7.3 pp) · 0.589 Base alone (+2.0 pp)

19 of 20 evaluated datasets improved with TTT versus zero-tuning; no dataset regressed by more than 0.006. Single-model alternatives: Mini-v1 + TTT alone gets 0.742 / 0.595 (one model loaded); Base + TTT alone gets 0.748 / 0.608.

Recommended · Edge / CPU

PredictLM‑Mini

13M parameters · 54 MB · Apache-2.0

The smallest open-weight tabular FM with calibrated uncertainty. Distilled from PredictLM-Base. Statistically tied with Base on classification accuracy and within ~4 percentage points R² on regression. Runs on any laptop.

What it does

Runs on a laptop — CPU only, no GPU required
Returns full prediction confidence intervals, not just guesses
~95% of Base's accuracy at half the size

Evaluation

Classification: mean accuracy = 0.673 (25-dataset OpenML)
Regression: mean R² = 0.536
Statistically tied with Base on classification
Trained in 3.3 hours for ~$1.30 of cloud compute

View on Hugging Face

PredictLM‑Base

26M parameters · 105 MB · Apache-2.0

The best-accuracy model in the family. Teacher for Mini and the architecture of record for our published evaluations. Trends ahead of XGBoost on regression; CI within sampling noise.

What it does

Highest-accuracy model in the family
Returns full prediction confidence intervals, not just guesses
Reference architecture for the PredictLM family

Evaluation

Classification: mean accuracy = 0.685 (25-dataset OpenML)
Regression: mean R² = 0.589
With Mini + test-time training: 0.751 / 0.609

View on Hugging Face

Side-by-side

Mini vs Base

Spec	Mini	Base
Parameters	13M	26M
Checkpoint size	54 MB	105 MB
License	Apache-2.0	Apache-2.0
Best for	CPU / edge	Highest accuracy
Classification (alone)	0.673	0.685
Regression R² (alone)	0.536	0.589
With test-time training	0.742 / 0.595	0.748 / 0.608
Combined (Duo + TTT recipe)	0.751 cls / 0.609 reg

Single-model numbers are point estimates on the same locked 25-dataset OpenML benchmark. The Duo + TTT recipe uses both checkpoints together.

How to use

Same model. Three ways to call it.

Python · one call, ship recipe by default

pip install predictlm

from predictlm import PredictLM

model = PredictLM.from_pretrained("zerooneresearch/predictlm-mini-13m")

# Just .fit().predict() — the package silently downloads the partner
# checkpoint and runs the published Duo + TTT ensemble under the hood.
# Returns the 0.751 cls / 0.609 reg result on the locked OpenML eval.
preds = model.fit(X_train, y_train).predict(X_test)

# Single-model fast path (no Duo, no TTT) — pass auto_duo=False:
# model = PredictLM.from_pretrained(..., auto_duo=False)

Hosted API · HTTPS, free preview

# no install — free API key at /predictlm/register
curl -s https://predictlm-api.zerooneresearch.ai/v1/predict \
  -H 'X-API-Key: ml_sk_...' -H 'content-type: application/json' \
  -d '{"X_train":[[5.1,3.5,1.4]],"y_train":[0],
       "X_query":[[5.9,3.0,5.1]]}'

MCP tool (Claude / Cursor / Continue)

pip install predictlm-mcp

# in claude_desktop_config.json:
{"mcpServers": {"predictlm": {
  "command": "predictlm-mcp",
  "args": ["--model", "predictlm-mini-13m"]
}}}

Get a free API key API docs