TimesFM 2.5 200M

online

google/timesfm-2.5-200m-pytorch

200M (+ optional 30M quantile head) params | 16K context | $0.00025 per forecast

TimesFM 2.5 200M is Google's smaller and more deployment-friendly open TimesFM checkpoint. The official repo describes it as an upstream 16K-context model with an optional continuous quantile head, no frequency indicator requirement, and speed-oriented structural updates such as QKV fusion. TSFM.ai currently serves it with a 1,024-point hosted context budget, reflecting the compiled runtime configuration rather than the upstream theoretical maximum.

Model Classification

Family

TimesFM

Type

time series foundation model

Pretrained time-series model exposed on TSFM.ai for zero-shot or few-shot forecasting workloads.

Resources

HuggingFace Paper

Training Data

GiftEvalPretrain, Wikimedia Pageviews through November 2023, Google Trends through end-2022, plus synthetic and augmented data, as listed in the official model card.

Recommended For

• Long-context zero-shot forecasting with strong open-model baselines
• Workloads where point forecasting quality matters more than broad task coverage

Strengths

• Large open checkpoints with long context windows
• Efficient patched-transformer design with strong zero-shot behavior

Limitations

• Primarily a forecasting family rather than a general multi-task TSFM
• Quantile support is not the main identity of the family

Capabilities

forecastingpoint-forecastingquantile-forecastingcovariateszero-shotlong-context

Specifications

Parameters: 200M (+ optional 30M quantile head)
Architecture: decoder-only patched transformer with optional continuous quantile head
Context length: 16,384
Max context: 16,384
Minimum history: n/a
Recommended history: n/a
Input step: n/a
Required target series: 1
Temperature: Ignored
Top P: Ignored
Max output: 1,024
Avg latency: n/a
Uptime: n/a
Plan limits: 1,000 rpm free · 1,000,000 rpm with billing
Accelerator: T4
Regions: Virginia, US
License: n/a

Pricing

Per forecast: $0.00025

Performance

Average latency: n/a
Availability: n/a
Plan limits: 1,000 rpm free · 1,000,000 rpm with billing