Staggered Dslash Performance on Intel Xeon Phi Architecture

Preprint English OPEN
Li, Ruizi; Gottlieb, Steven;
  • Subject: High Energy Physics - Lattice | Physics - Computational Physics

The conjugate gradient (CG) algorithm is among the most essential and time consuming parts of lattice calculations with staggered quarks. We test the performance of CG and dslash, the key step in the CG algorithm, on the Intel Xeon Phi, also known as the Many Integrated... View more
  • References (15)
    15 references, page 1 of 2

    [1] ExaScale Computing Study: Technology Challenges in Achieving Exascale Systems, P. Kogge et al.,

    [2] K. Barros, R. Babich, R. Brower, M. A. Clark and C. Rebbi, PoS LATTICE2008, 045 (2008) [arXiv:0810.5365 [hep-lat]].

    [3] R. Babich, M. A. Clark and B. Joo, Proc. Intl. Conf. for High Performance Computing, Networking, Storage and Analysis (Supercomputing 2010), New Orleans, LA, Nov. 2010.

    [4] M. A. Clark, R. Babich, K. Barros, R. C. Brower and C. Rebbi, Comput. Phys. Commun. 181, 1517 (2010) [arXiv:0911.3191 [hep-lat]].

    [5] G. Shi, S. Gottlieb, A. Torok, V. Kindratenko, Proc. Symposium on Application Accelerators in HPC (SAAHPC'10), Knoxville, TN, July 2010.

    [6] S. Gottlieb, G. Shi, A. Torok and V. Kindratenko, PoS LATTICE 2010, 026 (2010).

    [7] R. Babich, R. Brower, M. Clark, S. Gottlieb, B. Joo and G. Shi, PoS LATTICE 2011, 033 (2011).

    [8] R. Babich, M. A. Clark, B. Joo, G. Shi, R. C. Brower and S. Gottlieb, arXiv:1109.2935 [hep-lat].



  • Metrics
Share - Bookmark