Execution‐Cache‐Memory modeling and performance tuning of sparse matrix‐vector multiplication and Lattice quantum chromodynamics on A64FX

Alappat, Christie ; Meyer, Nils ; Laukemann, Jan ; Gruber, Thomas ; Hager, Georg ; Wellein, Gerhard ; Wettig, Tilo

Abstract

The A64FX CPU is arguably the most powerful Arm-based processor design to date. Although it is a traditional cache-based multicore processor, its peak performance and memory bandwidth rival accelerator devices. A good understanding of its performance features is of paramount importance for developers who wish to leverage its full potential. We present an architectural analysis of the A64FX used ...
