The Research Center for Many-core High-Performance Computing fosters the uptake of current and next generation Intel many- and multicore technology in high performance computing and big data analytics. The activities of the research center are focused on enhancing selected workloads with impact on the HPC community to improve their performance and scalability on many-core processor technologies and platform architectures. The selected applications cover a wide range of scientific disciplines including materials science and nanotechnology, atmosphere and ocean flow dynamics, astrophysics, drug design, particle physics and big data analytics. Additionally, novel programming models and algorithms are being evaluated for efficient execution of HPC workloads on many-core processors. The workload optimization for many-core processors is supported by research activities associated with many-core architectures at ZIB. Furthermore, the parallelization work is complemented by dissemination and education activities within the North-German Supercomputing Alliance (HLRN) to diminish the barriers involved with the introduction of upcoming highly parallel processor and platform technologies.

Publications

2020
Leveraging a Heterogeneous Memory System for a Legacy Fortran Code: The Interplay of Storage Class Memory, DRAM and OS 2020 IEEE/ACM Workshop on Memory Centric High Performance Computing (MCHPC), pp. 17-24, 2020, ISBN: 978-0-7381-1067-7 Steffen Christgau, Thomas Steinke BibTeX
DOI
IPCC
Porting a Legacy CUDA Stencil Code to oneAPI 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2020, New Orleans, LA, USA, May 18-22, 2020, pp. 359-367, 2020, ISBN: 978-1-7281-7445-7 Steffen Christgau, Thomas Steinke BibTeX
DOI
IPCC
2019
Advancing the Heterogeneous Active Messages Approach Master's thesis, Humboldt-Universität zu Berlin, Alexander Reinefeld, Björn Scheuermann (Advisors), 2019 Daniel Deppisch BibTeX
IPCC
Heterogeneous Active Messages (HAM) — Implementing Lightweight Remote Procedure Calls in C++ Proceedings of the 5th International Workshop on OpenCL, The Distributed & Heterogeneous Programming in C/C++ (DHPCC++ 2019) Conference, 2019 Matthias Noack BibTeX
DOI
IPCC
Heterogeneous Active Messages for Offloading on the NEC SX-Aurora TSUBASA 2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Heterogeneity in Computing Workshop (HCW 2019), 2019 Matthias Noack, Erich Focht, Thomas Steinke BibTeX
IPCC
2018
DM-HEOM: A Portable and Scalable Solver-Framework for the Hierarchical Equations of Motion 2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), 19th IEEE Int. Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC 2018), pp. 947-956, 2018, ISBN: 978-1-5386-5555-9 Matthias Noack, Alexander Reinefeld, Tobias Kramer, Thomas Steinke BibTeX
DOI
IPCC
OpenMP in VASP: Threading and SIMD International Journal of Quantum Chemistry, p. e25851, 2018 (in press) Florian Wende, Martijn Marsman, Jeongnim Kim, Fedor Vasilev, Zhengji Zhao, Thomas Steinke BibTeX
DOI
IPCC
2017
KART - A Runtime Compilation Library for Improving HPC Application Performance High Performance Computing: ISC High Performance 2017 International Workshops, DRBSD, ExaComm, HCPM, HPC-IODC, IWOPH, IXPUG, P^3MA, VHPC, Visualization at Scale, WOPSSS, Frankfurt, Germany, June 18-22, 2017, Revised Selected Papers, Springer International Publishing, pp. 389-403, 2017 (preprint available as ZIB-Report 16-48) Matthias Noack, Florian Wende, Georg Zitzlsberger, Michael Klemm, Thomas Steinke PDF (ZIB-Report)
BibTeX
DOI
IPCC
OpenCL in Scientific High Performance Computing: The Good, the Bad, and the Ugly Proceedings of the 5th International Workshop on OpenCL, pp. 12:1-12:3, IWOCL 2017, 2017 Matthias Noack BibTeX
DOI
IPCC
Performance of Hybrid MPI/OpenMP VASP on Cray XC40 Based on Intel Knights Landing Many Integrated Core Architecture CUG Conference Proceedings, 2017 Zhengji Zhao, Martijn Marsman, Florian Wende, Jeongnim Kim PDF
BibTeX
IPCC
Porting VASP from MPI to MPI+OpenMP [SIMD] Scaling OpenMP for Exascale Performance and Portability - 13th International Workshop on OpenMP, IWOMP 2017, Stony Brook, NY, USA, September 20-22, 2017, pp. 107-122, Vol.8766, LNCS, 2017 Florian Wende, Martijn Marsman, Zhengji Zhao, Jeongnim Kim BibTeX
DOI
IPCC
Porting the MPI-parallelized LES model PALM to multi-GPU systems and many integrated core processors: an experience report International Journal of Computational Science and Engineering. Special Issue on: Novel Strategies for Programming Accelerators, 2017 (accepted for publication) Helge Knoop, Tobias Gronemeier, Matthias Sühring, Peter Steinbach, Matthias Noack, Florian Wende, Thomas Steinke, Christoph Knigge, Siegfried Raasch, Klaus Ketelsen BibTeX
IPCC
2016
Dynamic SIMD Vector Lane Scheduling High Performance Computing, ISC High Performance 2016 International Workshops, ExaComm, E-MuCoCoS, HPC-IODC, IXPUG, IWOPH, P^3MA, VHPC, WOPSSS, pp. 354-365, Vol.9945, LNCS, 2016 Olaf Krzikalla, Florian Wende, Markus Höhnerbach BibTeX
DOI
IPCC
Evaluierung autovektorisierender Compiler anhand von Schleifenklassen Bachelor's thesis, Freie Universität Berlin, Thomas Steinke, Katinka Wolter, Christian Maurer (Advisors), 2016 Alexander Kammeyer BibTeX
IPCC
On Enhancing 3D-FFT Performance in VASP CUG Proceedings, 2016 Florian Wende, Martijn Marsman, Thomas Steinke PDF
BibTeX
IPCC
Portable SIMD Performance with OpenMP* 4.x Compiler Directives Pierre-Francois Dutot, Denis Trystram (Eds.), Vol.Euro-Par 2016: Parallel Processing: 22nd International Conference on Parallel and Distributed Computing, LNCS, 2016, ISBN: 978-3-319-43659-3 Florian Wende, Matthias Noack, Thomas Steinke, Michael Klemm, Georg Zitzlsberger, Chris J. Newburn BibTeX
DOI
IPCC
2015
Application Performance on a Cray XC30 Evaluation System with Xeon Phi Coprocessors at HLRN-III Cray User Group, 2015 Florian Wende, Matthias Noack, Thorsten Schütt, Stephen Sachs, Thomas Steinke BibTeX
IPCC
DataWarp: First Experiences Cray User Group, 2015 Stefan Andersson, Stephen Sachs, Christian Tuma, Thorsten Schütt BibTeX
IPCC
OpenCL: There and Back Again High Performance Parallelism Pearls, James Reinders, Jim Jeffers (Eds.), Morgan Kaufman, Elsevier, pp. 355-378, 2015, ISBN: 978-0-12-803819-2 Matthias Noack, Florian Wende, Klaus-Dieter Oertel BibTeX
IPCC
SIMD Enabled Functions on Intel Xeon CPU and Intel Xeon Phi Coprocessor ZIB-Report 15-17 Florian Wende PDF
BibTeX
URN
IPCC
2014
A Unified Programming Model for Intra- and Inter-Node Offloading on Xeon Phi Clusters SC '14: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis. SC14, November 16-21, 2014, New Orleans, Louisiana, USA, 2014 Matthias Noack, Florian Wende, Thomas Steinke, Frank Cordes BibTeX
DOI
IPCC
Concurrent Kernel Execution on Xeon Phi within Parallel Heterogeneous Workloads Euro-Par 2014: Parallel Processing. 20th International Conference, Porto, Portugal, August 25-29, 2014, Proceedings, pp. 788-799, Vol.8632, Lecture Notes in Computer Science, 2014 Florian Wende, Frank Cordes, Thomas Steinke BibTeX
DOI
IPCC
Concurrent Kernel Offloading High Performance Parallelism Pearls, James Reinders, Jim Jeffers (Eds.), Morgan Kaufman, Elsevier, 2014, ISBN: 978-0128021187 (in press) Florian Wende, Thomas Steinke, Michael Klemm, Alexander Reinefeld BibTeX
IPCC
HAM - Heterogenous Active Messages for Efficient Offloading on the Intel Xeon Phi ZIB-Report 14-23 Matthias Noack PDF
BibTeX
URN
IPCC
Hybrid Transactions for Intel’s Transactional Synchronization Extensions Master's thesis, Humboldt-Universität zu Berlin, Thomas Steinke, Alexander Reinefeld (Advisors), 2014 Gunnar Ruthenberg BibTeX
IPCC
Integration of Intel Xeon Phi Servers into the HLRN-III Complex: Experiences, Performance and Lessons Learned CUG2014 Proceedings, 2014 (preprint available as ZIB-Report 14-15) Florian Wende, Guido Laubender, Thomas Steinke PDF
PDF (ZIB-Report)
BibTeX
IPCC
Multi-threaded Kernel Offloading to GPGPU Using Hyper-Q on Kepler Architecture ZIB-Report 14-19 Florian Wende, Thomas Steinke, Frank Cordes PDF
BibTeX
URN
IPCC
2013
An Automated Approach for Estimating the Memory Footprint of Non-Linear Data Objects ZIB-Report 13-46 (Appeared in: Euro-Par 2013: Parallel Processing Workshops. Lecture Notes in Computer Science 8374. 2014, pp. 249-258) Sebastian Dreßler, Thomas Steinke PDF
BibTeX
URN
DOI
IPCC