Public Docs
OpenAPI Source of Truth
MCP Streamable HTTP
CLI for Consumers

TSFM.ai developer documentation.

Multiple pages, one contract. API, MCP, and CLI are aligned on the same schema so teams can move from manual calls to production automation with zero drift.

Learn

Choosing the right model

TSFM.ai offers 17 models across different architectures, sizes, and specializations. This guide helps you pick the right one based on your latency, cost, accuracy, and task requirements.

If you are not sure where to start

Start with Chronos-Bolt Base for most workloads. It offers a strong balance of accuracy, speed (130ms), and cost ($0.07/1M tokens) with probabilistic output support. From there, move to TimesFM 2.5 if you need longer context, Moirai 2.0 for multivariate, or Granite TTM if you need to minimize cost.

Decision factors

Key dimensions to consider when selecting a model.

FactorLowMidHigh
Latency requirement< 150ms150-400ms> 400ms
Budget per 1M tokens< $0.10$0.10-0.25> $0.25
Context needed< 5K tokens5K-12K> 12K
Task complexityPoint forecastProbabilisticMulti-task

Recommendations by scenario

Lowest latency

You need sub-150ms responses for real-time dashboards, alerting, or streaming applications.

Recommended

Chronos-Bolt Small (88ms)Granite TTM 1M (95ms)PatchTST Large (115ms)

These models use direct prediction or lightweight architectures that minimize inference time.

Lowest cost

You are processing millions of series in batch and need to minimize per-request cost.

Recommended

Granite TTM 1M ($0.03)Chronos-Bolt Small ($0.04)PatchTST Large ($0.06)

Smaller parameter counts mean lower GPU utilization per request. Combined with high rate limits, these are ideal for batch workloads.

Best forecast quality

Accuracy is the primary concern and you can tolerate higher latency and cost.

Recommended

Moirai 2.0 LargeTimesFM 2.5TiRex Large

Larger models with longer context windows capture more complex patterns. Moirai Large excels at multivariate, TimesFM at long context, TiRex at covariate-heavy data.

Multivariate series

You have multiple correlated variables that should be modeled jointly.

Recommended

Moirai 2.0 (any size)Time-MoE 200MPatchTST Large

Moirai's Any-Variate Attention captures cross-variate dependencies natively. Time-MoE's expert routing handles diverse multivariate domains.

Anomaly detection

You need to detect unusual patterns in monitoring, sensor, or transactional data.

Recommended

MOMENT LargeToto (for observability)

MOMENT is purpose-built for multi-task including anomaly detection. Toto is specialized for infrastructure metrics from Datadog's telemetry.

Limited history

You have fewer than 50 historical observations (new products, new sensors).

Recommended

Chronos-2Chronos-Bolt BaseMoirai 2.0 Small

These models have strong zero-shot transfer from pre-training and produce reasonable forecasts even with minimal context.

Covariates and external signals

Your forecasts depend on external factors like promotions, holidays, or weather.

Recommended

TiRex LargeChronos-2

TiRex has native covariate encoding. Chronos-2 added covariate support in v2. Both accept past and future covariates in the request payload.

Full model comparison

All 17 models sorted by latency. Click any model name to see full details.

ModelParamsLatencyInput costContextBest for
Chronos-Bolt Small48M88ms$0.046KReal-time and batch applications at lowest cost
Granite TTM 1M~1M95ms$0.034KUltra-low-cost batch forecasting and edge deployment
PatchTST Large~40M115ms$0.064KStable baseline for benchmarking and comparison
Chronos-Bolt Base205M130ms$0.078KFast inference with strong accuracy
Sundial128M140ms$0.088KCalibrated uncertainty estimates (diffusion-based)
Moirai 2.0 Small14M210ms$0.098KLow-cost multivariate forecasting
Chronos-2120M240ms$0.168KGeneral-purpose probabilistic forecasting with covariate support
Time-MoE 200M200M/2.4B240ms$0.1112KCross-domain transfer with expert specialization
Toto151M250ms$0.138KObservability and infrastructure telemetry signals
TiRex Large~300M260ms$0.1516KCovariate-heavy workloads (holidays, promotions)
Lag-Llama~10M280ms$0.148KFull predictive distributions with uncertainty
TimesFM 2.5200M290ms$0.1816KLong-context forecasting with large historical windows
Moirai 2.0 Base91M330ms$0.1712KBalanced multivariate quality and cost
MOMENT Large385M410ms$0.2012KMulti-task: forecast + anomaly + classify + impute
Moirai 2.0 Large311M520ms$0.2916KMaximum multivariate forecast quality
Time-LLM 7B7B650ms$0.328KResearch and LLM-based reasoning over series

Browse models

See all models with live status, pricing, and detailed specifications.

Model catalog

Quickstart

Make your first API call and see forecasts in under 5 minutes.

Get started

Playground

Test models interactively with your own data before committing to an integration.

Try it now