Fast multipole methods (FMM) on distributed mem- ory have traditionally used a bulk-synchronous model of com- municating the local essential tree (LET) and overlapping it with computation of the local data. This could be perceived as an extreme case of data aggregation,... View more
 M. S. Warren and J. K. Salmon, “A parallel hashed oct-tree N-body algorithm,” in Proceedings of the 1993 ACM/IEEE conference on Supercomputing, 1993, pp. 12-21.
Different distributions of particles. N = 108 on 1; 025 (not 1; 024) cores 1
 P. Jetley, F. Gioachin, C. Mendes, and L. V. Kale´, “Massively parallel cosmological simulations with ChaNGa,” in Proceedings of the 2008 IEEE International Parallel and Distributed Processing Symposium, 2008, pp. 1-12.
 P. Jetley, L. Wesolowski, F. Gioachin, L. V. Kale´, and T. R. Quinn, “Scaling hierarchical N-body simulations on GPU clusters,” in Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, 2010.
 B. Lange and P. Fortin, “Parallel dual tree traversal on multi-core and many-core architectures for astrophysical N-body simulations,” Sorbonne Universite´s UPMC, Tech. Rep. hal-00947130.
 C. Dekate, M. Anderson, M. Brodowicz, H. Kaiser, B. AdelsteinLelbach, and T. Sterling, “Improving the scalability of parallel N-body applications with an event-driven constraint-based execution model,” International Journal of High Performance Computing Applications, vol. 26, no. 3, pp. 319-332, 2012.
 J. Barnes and P. Hut, “O(NlogN) force-calculation algorithm,” Nature, vol. 324, pp. 446-449, 1986.
 E. Agullo, B. Bramas, O. Coulaud, E. Darve, M. Messner, and T. Takahashi, “Pipelining the fast multipole method over a runtime system,” INRIA, Research Report 7981, 2012.
 C. Bordage, “Parallelization on heterogeneous multicore and multi-GPU systems of the fast multipole method for the Helmholtz equation using a runtime system,” in The Sixth International Conference on Advanced Engineering Computing and Applications in Sciences, 2012.
 M. Perica`s, A. Amer, K. Fukuda, N. Maruyama, R. Yokota, and S. Matsuoka, “Towards a dataflow FMM using the OmpSs programming model,” IPSJ SIG, Tech. Rep., 2012.