IPCC - Research Center for Many-core High-Performance Computing
The Research Center for Many-core High-Performance Computing fosters the uptake of current and next-generation Intel many-core and multi-core technology in high-performance computing (HPC) and big data analytics. Its activities focus on optimizing selected workloads that matter to the HPC community, improving their performance and scalability on many-core processor technologies and platform architectures. The selected applications span a wide range of scientific disciplines, including materials science and nanotechnology, atmosphere and ocean flow dynamics, astrophysics, drug design, particle physics, and big data analytics. In addition, novel programming models and algorithms are evaluated for efficient execution of HPC workloads on many-core processors. The workload optimization is supported by research on many-core architectures at ZIB. The parallelization work is complemented by dissemination and education activities within the North-German Supercomputing Alliance (HLRN), which aim to lower the barriers to adopting upcoming highly parallel processor and platform technologies.
Publications
2020
Steffen Christgau, Thomas Steinke | Leveraging a Heterogeneous Memory System for a Legacy Fortran Code: The Interplay of Storage Class Memory, DRAM and OS | 2020 IEEE/ACM Workshop on Memory Centric High Performance Computing (MCHPC), pp. 17-24, 2020, ISBN: 978-0-7381-1067-7 |
Steffen Christgau, Thomas Steinke | Porting a Legacy CUDA Stencil Code to oneAPI | 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2020, New Orleans, LA, USA, May 18-22, 2020, pp. 359-367, 2020, ISBN: 978-1-7281-7445-7 |
2019
Daniel Deppisch | Advancing the Heterogeneous Active Messages Approach | Master's thesis, Humboldt-Universität zu Berlin, Alexander Reinefeld, Björn Scheuermann (Advisors), 2019 |
Matthias Noack | Heterogeneous Active Messages (HAM) — Implementing Lightweight Remote Procedure Calls in C++ | Proceedings of the 5th International Workshop on OpenCL, The Distributed & Heterogeneous Programming in C/C++ (DHPCC++ 2019) Conference, 2019 |
Matthias Noack, Erich Focht, Thomas Steinke | Heterogeneous Active Messages for Offloading on the NEC SX-Aurora TSUBASA | 2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Heterogeneity in Computing Workshop (HCW 2019), 2019 |
2018
Matthias Noack, Alexander Reinefeld, Tobias Kramer, Thomas Steinke | DM-HEOM: A Portable and Scalable Solver-Framework for the Hierarchical Equations of Motion | 2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), 19th IEEE Int. Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC 2018), pp. 947-956, 2018, ISBN: 978-1-5386-5555-9 |
Florian Wende, Martijn Marsman, Jeongnim Kim, Fedor Vasilev, Zhengji Zhao, Thomas Steinke | OpenMP in VASP: Threading and SIMD | International Journal of Quantum Chemistry, p. e25851, 2018 (in press) |
2017
Matthias Noack, Florian Wende, Georg Zitzlsberger, Michael Klemm, Thomas Steinke | KART - A Runtime Compilation Library for Improving HPC Application Performance | High Performance Computing: ISC High Performance 2017 International Workshops, DRBSD, ExaComm, HCPM, HPC-IODC, IWOPH, IXPUG, P^3MA, VHPC, Visualization at Scale, WOPSSS, Frankfurt, Germany, June 18-22, 2017, Revised Selected Papers, Springer International Publishing, pp. 389-403, 2017 (preprint available as ZIB-Report 16-48) |
Matthias Noack | OpenCL in Scientific High Performance Computing: The Good, the Bad, and the Ugly | Proceedings of the 5th International Workshop on OpenCL, pp. 12:1-12:3, IWOCL 2017, 2017 |
Zhengji Zhao, Martijn Marsman, Florian Wende, Jeongnim Kim | Performance of Hybrid MPI/OpenMP VASP on Cray XC40 Based on Intel Knights Landing Many Integrated Core Architecture | CUG Conference Proceedings, 2017 |
Florian Wende, Martijn Marsman, Zhengji Zhao, Jeongnim Kim | Porting VASP from MPI to MPI+OpenMP [SIMD] | Scaling OpenMP for Exascale Performance and Portability - 13th International Workshop on OpenMP, IWOMP 2017, Stony Brook, NY, USA, September 20-22, 2017, pp. 107-122, Vol.8766, LNCS, 2017 |
Helge Knoop, Tobias Gronemeier, Matthias Sühring, Peter Steinbach, Matthias Noack, Florian Wende, Thomas Steinke, Christoph Knigge, Siegfried Raasch, Klaus Ketelsen | Porting the MPI-parallelized LES model PALM to multi-GPU systems and many integrated core processors: an experience report | International Journal of Computational Science and Engineering. Special Issue on: Novel Strategies for Programming Accelerators, 2017 (accepted for publication on 2017-04-29) |
2016
Olaf Krzikalla, Florian Wende, Markus Höhnerbach | Dynamic SIMD Vector Lane Scheduling | High Performance Computing, ISC High Performance 2016 International Workshops, ExaComm, E-MuCoCoS, HPC-IODC, IXPUG, IWOPH, P^3MA, VHPC, WOPSSS, pp. 354-365, Vol.9945, LNCS, 2016 |
Alexander Kammeyer | Evaluierung autovektorisierender Compiler anhand von Schleifenklassen | Bachelor's thesis, Freie Universität Berlin, Thomas Steinke, Katinka Wolter, Christian Maurer (Advisors), 2016 |
Florian Wende, Martijn Marsman, Thomas Steinke | On Enhancing 3D-FFT Performance in VASP | CUG Proceedings, 2016 |
Florian Wende, Matthias Noack, Thomas Steinke, Michael Klemm, Georg Zitzlsberger, Chris J. Newburn | Portable SIMD Performance with OpenMP* 4.x Compiler Directives | Euro-Par 2016: Parallel Processing: 22nd International Conference on Parallel and Distributed Computing, Pierre-Francois Dutot, Denis Trystram (Eds.), LNCS, 2016, ISBN: 978-3-319-43659-3 |
2015
Florian Wende, Matthias Noack, Thorsten Schütt, Stephen Sachs, Thomas Steinke | Application Performance on a Cray XC30 Evaluation System with Xeon Phi Coprocessors at HLRN-III | Cray User Group, 2015 |
Stefan Andersson, Stephen Sachs, Christian Tuma, Thorsten Schütt | DataWarp: First Experiences | Cray User Group, 2015 |
Matthias Noack, Florian Wende, Klaus-Dieter Oertel | OpenCL: There and Back Again | High Performance Parallelism Pearls, James Reinders, Jim Jeffers (Eds.), Morgan Kaufmann, Elsevier, pp. 355-378, 2015, ISBN: 978-0-12-803819-2 |
Florian Wende | SIMD Enabled Functions on Intel Xeon CPU and Intel Xeon Phi Coprocessor | ZIB-Report 15-17 |
2014
Matthias Noack, Florian Wende, Thomas Steinke, Frank Cordes | A Unified Programming Model for Intra- and Inter-Node Offloading on Xeon Phi Clusters | SC '14: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis. SC14, November 16-21, 2014, New Orleans, Louisiana, USA, 2014 |
Florian Wende, Frank Cordes, Thomas Steinke | Concurrent Kernel Execution on Xeon Phi within Parallel Heterogeneous Workloads | Euro-Par 2014: Parallel Processing. 20th International Conference, Porto, Portugal, August 25-29, 2014, Proceedings, pp. 788-799, Vol.8632, Lecture Notes in Computer Science, 2014 |
Florian Wende, Thomas Steinke, Michael Klemm, Alexander Reinefeld | Concurrent Kernel Offloading | High Performance Parallelism Pearls, James Reinders, Jim Jeffers (Eds.), Morgan Kaufmann, Elsevier, 2014, ISBN: 978-0128021187 (in press) |
Matthias Noack | HAM - Heterogenous Active Messages for Efficient Offloading on the Intel Xeon Phi | ZIB-Report 14-23 |
Gunnar Ruthenberg | Hybrid Transactions for Intel’s Transactional Synchronization Extensions | Master's thesis, Humboldt-Universität zu Berlin, Thomas Steinke, Alexander Reinefeld (Advisors), 2014 |
Florian Wende, Guido Laubender, Thomas Steinke | Integration of Intel Xeon Phi Servers into the HLRN-III Complex: Experiences, Performance and Lessons Learned | CUG2014 Proceedings, 2014 (preprint available as ZIB-Report 14-15) |
Florian Wende, Thomas Steinke, Frank Cordes | Multi-threaded Kernel Offloading to GPGPU Using Hyper-Q on Kepler Architecture | ZIB-Report 14-19 |
2013
Sebastian Dreßler, Thomas Steinke | An Automated Approach for Estimating the Memory Footprint of Non-Linear Data Objects | ZIB-Report 13-46 (Appeared in: Euro-Par 2013: Parallel Processing Workshops. Lecture Notes in Computer Science 8374. 2014, pp. 249-258) |