Skip to content

PRISM: Dynamic Primitive-Based Forecasting for Large-Scale GPU Cluster Workloads

Home / Papers / PRISM: Dynamic Primitive-Based Forecasting for Large-Scale GPU Cluster Workloads

PRISM: Dynamic Primitive-Based Forecasting for Large-Scale GPU Cluster Workloads

Authors: Xin Wu, Fei Teng, Xingwang Li, Bin Zheng, Qiang Duan Date: 2026-03-26 Paper ID: openalex:2603.25378

Summary

The paper introduces PRISM, a dynamic primitive-based compositional forecasting framework designed to accurately predict highly volatile and multi-periodic GPU cluster workloads essential for efficient AI infrastructure management. PRISM utilizes a dual representation method that merges dictionary-driven temporal decomposition with adaptive spectral refinement to extract stable and interpretable workload signatures. Evaluated on large-scale production traces, PRISM demonstrates state-of-the-art accuracy, showing significant improvement in reducing burst-phase forecasting errors. This robust approach provides a crucial foundation for enabling advanced, architecture-aware dynamic resource allocation in modern GPU-powered platforms.

Key Contributions

  • Proposal of PRISM, a novel primitive-based compositional forecasting framework for highly volatile and multi-periodic GPU cluster workloads.
  • Introduction of a dual representation combining dictionary-driven temporal decomposition with adaptive spectral refinement to extract stable workload signatures.
  • Achieving state-of-the-art forecasting performance on large-scale production GPU traces, particularly demonstrating significant error reduction in burst phases.
  • Establishing an architecture-aware foundation for dynamic resource management in GPU-powered AI infrastructures.

Limitations

The abstract does not explicitly detail known limitations, but the reliance on ‘production traces’ suggests potential challenges in generalization to entirely unseen cluster configurations or job types.

Open Questions & Future Work

Key Concepts

Archivist Review

Two concepts related to the core mechanism of PRISM were approved: the overarching compositional approach and the specific spectral refinement technique used within it. One open question was approved as it rigorously probes the synergistic interaction between the two main components, which is a valuable architectural insight for future compositional models. Other proposed candidates were rejected as they were either application-specific results, application domains, or subcomponents of the approved core mechanisms.

Approved Concepts

  • Primitive-Based Compositional Forecasting: This compositional framework directly addresses the complexity of modern GPU workloads by decomposing them into stable, interpretable components.
  • Adaptive Spectral Refinement: This adaptive technique refines the temporal decomposition using spectral methods, necessary for capturing subtle, rapidly changing dynamics in volatile GPU workloads.

Approved Open Questions

  • Characterize Primitive-Spectral Synergy: Understanding the synergistic interaction between component-based decomposition (primitives) and frequency-domain refinement (spectral) is key to designing future compositional forecasting models for complex systems.

Rejected Candidates

  • [concept] GPU Workload Forecasting (gpu-workload-forecasting) - paper_local: This is the application domain rather than a novel reusable method, concept, or architecture.
  • [concept] Dictionary-Driven Temporal Decomposition (dictionary-driven-temporal-decomposition) - subcomponent_of_broader_mechanism: This is a component sub-function, and the broader, reusable concept is ‘Primitive-Based Compositional Forecasting’.
  • [concept] Burst-Phase Error Reduction (burst-phase-error-reduction) - low_impact: This is a result/benefit of the model, not a reusable technique or representation.

Metadata & Links

url
https://arxiv.org/abs/2603.25378
paper_id
2603.25378
paper_source
openalex
domain
time-series
tags
time-seriesforecastinganomaly-detectionscaling-law
architectures
datasets
concept_slugs
primitive-based-compositional-forecastingadaptive-spectral-refinement-workload
dataset_slugs
skill
TimeSeriesSkill
created_at
2026-03-29T06:07:19Z