Routeframe - Run time series foundation models locally

Just 3 steps needed

curl -fsSL https://www.routeframe.com/install.sh | bash

15 MB binary. No dependencies.

routeframe pull toto

Downloads a foundation model trained on 2 trillion time series data points

routeframe run toto --input "45,48,52,49,55" --horizon 8

Predict the next 8 values. Runs on GPU automatically.

Forecasting use cases

Routeframe puts foundation model forecasting directly in your workflow - reduce infrastructure costs with proactive scaling, catch anomalies before they become incidents, predict demand before it spikes, and embed forecasting into any service through a local API.

Capacity planning

Scale before it hurts, not after

Predict CPU, memory, and traffic weeks ahead. Provision capacity proactively and cut cloud costs by eliminating emergency over-provisioning. Teach the model about deploy days with --exogenous is_deploy to account for known spikes.

Anomaly detection

Catch what's off before it pages

Routeframe forecasts what your metrics should look like. When reality diverges from the prediction, that's your signal. Pipe live metrics through routeframe monitor and catch anomalies in real time before they become incidents.

Demand forecasting

Know what's coming before it arrives

Forecast revenue, orders, API traffic, or inventory from a CSV with one command. Fine-tune on your own data to capture your business's seasonal patterns. Flag known events like holidays or launches with --exogenous is_holiday.

Embedded forecasting

Add forecasting to any service

Run routeframe serve and call POST /api/forecast from Go, Java, Python, or any language. 4ms latency. The model stays warm between requests. No ML dependencies in your service, no round-trip to a cloud API.

Why run it locally

Cloud ML APIs charge per prediction, add latency, and require sending your data off-network. Routeframe is a CLI that runs on your hardware - and because it exposes a local REST API, it can be used as a tool by any AI agent of your choice to fine-tune models like Toto on your data and reason over your time series forecasts.

Data stays on your machine

Your metrics never leave your network. No API keys, no data processing agreements, no compliance reviews.

No per-prediction costs

Run a million forecasts a day. It's your CPU. Cloud APIs charge $0.01-0.10 per call -- that adds up.

4ms, not 400ms

Local GPU inference is 100x faster than a round-trip to a cloud endpoint. Fast enough for real-time dashboards.

Works offline

Air-gapped environments, edge deployments, planes. The model runs with no internet connection.

What this replaces

	Before	With Routeframe
Get a forecast	Ask the data team, wait 2 weeks	Run one command, get it now
Add to your service	Deploy a Python ML service	POST to localhost:11435
Train on your data	Set up PyTorch, write training loop	routeframe finetune --data csv
Handle known events	Manual adjustments, tribal knowledge	--exogenous holidays,deploys
Dependencies	Python, PyTorch, CUDA, 4+ GB	15 MB binary, nothing else

Run time series foundation models locally