Skip to content

Time Series Foundation Models as Strong Baselines in Transportation Forecasting: A Large-Scale Benchmark Analysis

Home / Papers / Time Series Foundation Models as Strong Baselines in Transportation Forecasting: A Large-Scale Benchmark Analysis

Time Series Foundation Models as Strong Baselines in Transportation Forecasting: A Large-Scale Benchmark Analysis

Authors: Javier Pulido, Filipe Rodrigues Date: 2026-02-27 Paper ID: openalex:2602.24238

Summary

This paper investigates the zero-shot effectiveness of the Chronos-2 time-series foundation model as a general-purpose forecaster for transportation dynamics across ten real-world datasets, including traffic volume, speed, and demand. The evaluation protocol rigorously benchmarks Chronos-2 against traditional statistical methods and specialized deep learning architectures without any task-specific training. Results indicate that the foundation model frequently achieves state-of-the-art or competitive accuracy, especially for longer forecasting horizons, confirming its utility as a strong baseline. Furthermore, the model’s inherent probabilistic outputs are shown to offer reliable uncertainty quantification through prediction interval coverage and sharpness metrics. This work strongly suggests that large, pre-trained time-series models can significantly simplify the baseline establishment process in specialized forecasting domains like transportation.

Key Contributions

  • Benchmarked the zero-shot performance of the Chronos-2 time-series foundation model across ten diverse transportation forecasting datasets.
  • Demonstrated that Chronos-2 achieves state-of-the-art or competitive accuracy compared to specialized deep learning models and statistical baselines without any fine-tuning.
  • Showed that Chronos-2’s native probabilistic outputs provide useful uncertainty quantification (coverage and sharpness) for transportation forecasting tasks.
  • Advocated for the adoption of general-purpose time-series foundation models as strong, standardized baselines in transportation forecasting research.

Limitations

The evaluation is strictly zero-shot, leaving open the potential gains from lightweight fine-tuning or prompt engineering on these specialized domains.

Open Questions & Future Work

Key Concepts

  • Chronos-2: A large-scale, general-purpose time-series foundation model evaluated for transportation forecasting tasks.

Datasets

Limitations

The evaluation is strictly zero-shot, leaving open the potential gains from lightweight fine-tuning or prompt engineering on these specialized domains.

Metadata & Links

url
https://arxiv.org/abs/2602.24238
paper_id
2602.24238
paper_source
openalex
domain
time-series
tags
language-modeltime-seriesforecastingzero-shot-learningevaluationbenchmarkfoundation-model
architectures
datasets
highway traffic volume and flow datasetsurban traffic speed datasetsbike-sharing demand datasetselectric vehicle charging station data datasets
skill
TimeSeriesSkill
created_at
2026-03-27T14:09:50Z