First EuroHPC19 Workshop to Seed and Foster Collaborations Across Europe

19-20 September 2022 at Campus Puerta de Toledo, UC3M, in Madrid Find Workshop materials here Program Monday 19th September Time Topic 12:30 – 13:00 pm Welcome by Jesus, Hans-Christian and Peter 13:00 – 13:30 pm Collaboration on IO – traces suggested by Maike Gilliot, Philippe Deniel and André Brinkmann – including IO-SEA, Admire, MAELSTROM 13:30 … Read more

HIPEAC 2023. EuroHPC JU Projects Shaping Europe’s HPC Landscape.

Ten research & innovation projects were started by the EuroHPC joint undertaking to address the challenge at the hardware, system architecture, system software and software development tool levels. The results achieved will have a lasting impact on the European HPC ecosystem. This workshop presents the progress made by them in the last 20 months, putting … Read more

SC23 BoF: Enabling I/O and Computation Malleability in High-Performance Computing

The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC22) Nov 13–18, 2022 • Dallas, Texas. BoF Session 127. Schedule: November 16th, Wednesday 5:15pm-6:45pm See here BOF presentations Traditional interest in increasing parallelism for individual jobs in HPC systems is being conditioned by the variety and dynamicity of resource demands of jobs at … Read more

1+1 is not 2 in I/O: Interference Mitigation

I/O Scheduling is not an easy task. Some time ago, in the realm of HDD technology, our primary concern was sending (or receiving) data to the HDD in a well-organized manner, in large blocks to minimize disk movements. Every disk movement was a wasted time slot. Numerous techniques and schedulers were developed through simulations. Accurate … Read more

Modeling I/O Performance With Extra-P

In high-performance computing (HPC), large scientific applications are usually executed on huge clusters with a vast number of resources. Twice a year, the Top 500 list presents the top performers in this field. As these systems increasingly become more complex and powerful, so do the applications across various domains (e.g., fluid dynamics, molecular dynamics, and … Read more

Deep Learning and Dynamic Ressources

Applications  In the ADMIRE project, the dynamic allocation of resources to jobs is crucial. This is particularly true for applications that involve training Deep Learning (DL) models on large datasets. DL has unleashed advances in applications from various disciplines, such as physics or medicine,  reaching unprecedented performances compared to traditional Machine Learning. Remote sensing One … Read more

Quality of Service at Scale

the ADMIRE approach In large systems HPC, the common assumption that resource sharing can result from self-organization in the name of a common good tends to not match observation. Fig. 2: Resource sharing as hoped, and as observed on a large HPC systems The inherent complexity of HPC systems, the difficulties for end-users to get … Read more

Towards I/O monitoring at scale

Designing a self-tuning I/O environment in HPC Download in PDF I/O Challenges in HPC In High-Performance Computing (HPC) data movements are one of the biggest challenges. Indeed, large computation is necessarily leading to large datasets. Current HPC workflows favor a feed-forward way of launching programs, loading their dataset, and then storing the result in persistent … Read more