Downloads provided by UsageCounts
handle: 2117/86316
The Trotter-Suzuki approximation leads to an efficient algorithm for solving the timedependent Schrödinger equation. Using existing highly optimized CPU and GPU kernels, we developed a distributed version of the algorithm that runs efficiently on a cluster. Our implementation also improves single node performance, and is able to use multiple GPUs within a node. The scaling is close to linear using the CPU kernels, whereas the efficiency of GPU kernels improve with larger matrices. We also introduce a hybrid kernel that simultaneously uses multicore CPUs and GPUs in a distributed system. This kernel is shown to be efficient when the matrix size would not fit in the GPU memory. Larger quantum systems scale especially well with a high number nodes. The code is available under an open source license. This work was carried out while P. W. was visiting the Department of Computer Applications in Science & Engineering at the Barcelona Supercomputing Center, funded by the \Access to BSC Facilities" project of the HPC-Europe2 programme (contract no. 228398). Peer Reviewed
Computer and Information Sciences, Beräkningsmatematik, Quantum Evolution, Hybrid Kernel, GPU Accleration, Computació distribuïda, Data- och informationsvetenskap, Cluster analysis--Computer programs, Algorithmic language, Hamiltonian, Trotter-Suzuki Algorithm, Computational Mathematics, :Enginyeria mecànica [Àrees temàtiques de la UPC], Àrees temàtiques de la UPC::Enginyeria mecànica, GPU Computing, Algorismes computacionals, Trotter-Suzuki, MPI, Distributed Computing
Computer and Information Sciences, Beräkningsmatematik, Quantum Evolution, Hybrid Kernel, GPU Accleration, Computació distribuïda, Data- och informationsvetenskap, Cluster analysis--Computer programs, Algorithmic language, Hamiltonian, Trotter-Suzuki Algorithm, Computational Mathematics, :Enginyeria mecànica [Àrees temàtiques de la UPC], Àrees temàtiques de la UPC::Enginyeria mecànica, GPU Computing, Algorismes computacionals, Trotter-Suzuki, MPI, Distributed Computing
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 10 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 63 | |
| downloads | 189 |

Views provided by UsageCounts
Downloads provided by UsageCounts