Asynchronous Execution of the Fast Multipole Method Using Charm++

Preprint English OPEN
AbdulJabbar, Mustafa; Yokota, Rio; Keyes, David
  • Subject: Computer Science - Distributed, Parallel, and Cluster Computing | 70F10 | D.1.3 | D.1.2 | G.1.0 | G.1.2

Fast multipole methods (FMM) on distributed memory have traditionally used a bulk-synchronous model of communicating the local essential tree (LET) and overlapping it with computation of the local data. This could be perceived as an extreme case of data aggregation,...