In December 2023, we embarked on an ambitious initiative to develop a comprehensive digital twin of the Frontier supercomputer. The twin combines 3D asset modeling with virtual and augmented reality capabilities, telemetry data assimilation, AI/ML integration, simulations, and reinforcement learning for optimization. The initial goal was to develop four main modules:

  1. A transient simulation of the thermo-fluid cooling system from cooling tower to cold plate.
  2. A resource allocator and power simulator, which models workloads and the resulting dynamic power, along with energy conversion losses.
  3. A visual analytics module consisting of both an augmented reality model based on Unreal Engine 5 and a web-based dashboard for launching experiments.
  4. A network digital twin to study dynamic network power and congestion.

After modeling Frontier, we set out to generalize these modules into a framework called ExaDigiT for modeling a variety of supercomputer architectures. This digital twin framework offers insights into operational strategies and “what-if” scenarios, and elucidates complex, cross-disciplinary transient behaviors. It also serves as a design tool for future system prototyping. The framework is built on an open software stack (Modelica, SST Macro, Unreal Engine) with the aim of fostering community-driven development, and we have formed a partnership with supercomputer centers around the world to develop an open framework for modeling supercomputers. The source code is available here:

ExaDigiT Source Repositories

For more information, contact Wes Brewer at brewerwh@ornl.gov.

Meetings / Events

Ongoing

  • Monthly Large Group Meeting: fourth Monday of each month, 9am ET.
    (Check exadigit.slack.com for schedule and invite.)

2024

  • High Performance Data-centre Digital Twins - Birds of a Feather - CUG-2024, Perth, WA, Australia (May 6th, 2024)

2023

  • Initial Invitational Meeting: SC'23, Denver, CO, USA (Nov 15th, 2023)

Working Groups

Publications

2024

  • J. Athavale, C. Bash, W. Brewer, M. Maiterth, D. Milojicic, H. Petty, and S. Sarkar, “Data center digital twins.” Computer, IEEE, in press.
  • W. Brewer, M. Maiterth, V. Kumar, R. Wojda, S. Bouknight, J. Hines, W. Shin, S. Greenwood, D. Grant, W. Williams, F. Wang, “A digital twin framework for liquid-cooled supercomputers as demonstrated at exascale.” In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'24), Atlanta, GA, USA, November 2024.
  • J. Holmen, N. Newaz, S. Yoginath, M. Maiterth, A. Shehata, N. Hagerty, C. Zimmer, W. Brewer, “Towards the development of an exascale network digital twin.” In Cray User Group (CUG) 2024 Proceedings, Perth, WA, Australia, May 2024.
  • V. Kumar, S. Greenwood, W. Brewer, D. Grant, N. Parkison, W. Williams, “Thermo-fluid modeling framework for supercomputer digital twins: Part 1, demonstration at exascale.” In Proceedings of the 2024 American Modelica Conference, Mansfield, CT, USA, October 2024.
  • S. Greenwood, V. Kumar, W. Brewer, “Thermo-fluid modeling framework for supercomputer digital twins: Part 2, automated cooling models.” In Proceedings of the 2024 American Modelica Conference, Mansfield, CT, USA, October 2024.
  • M. Maiterth, W. Brewer, D. De Wet, S. Greenwood, V. Kumar, J. Hines, S. Bouknight, Z. Wang, T. Dykes, F. Wang, “Visualizing an exascale data center digital twin: Considerations, challenges and opportunities.” In 2024 IEEE Visualization and Visual Analytics (VIS), St. Pete Beach, FL, USA, October 2024.
  • F. Suter, W. Brewer, M. Maiterth, R. Ferreira da Silva, H. Casanova, “Comprehensive digital twins of leadership computing facilities to gain full insight on energy-efficiency optimization.” DOE/ASCR Energy-Efficient Computing for Science Workshop, Bethesda, MD, USA, September 2024.
  • R. Wojda, M. Maiterth, S. Bouknight, W. Brewer, “Dynamic modeling of power conversion stages for an exascale supercomputer.” In 2024 IEEE Energy Conversion Congress & Expo (ECCE), Phoenix, AZ, USA, October 2024.

Participating Organizations

Industry Partners