Stay up-to-date with the OpenACC and Open Hackathons Monthly Highlights. June’s edition covers the 2022 OpenACC and Hackathons Summit, NSF’s Traineeship Program, NVIDIA’s Academic Hardware Grant program, upcoming Open Hackathons and Bootcamps, recent research, new resources, and more!
2. 2
WHAT IS OPENACC?
main()
{
<serial code>
#pragma acc kernels
{
<parallel code>
}
}
Add Simple Compiler Directive
POWERFUL & PORTABLE
Directives-based
programming model for
parallel
computing
Designed for
performance and
portability on
CPUs and GPUs
SIMPLE
Open Specification Developed by OpenACC Organization
3. 3
silica IFPEN, RMM-DIIS on P100
OPENACC SPECIFICATION MOMENTUM
Wide Adoption Across Key HPC Codes
ANSYS Fluent
Gaussian
VASP
LSDalton
MPAS
GAMERA
GTC
XGC
ACME
FLASH
COSMO
Numeca
400+ APPS* USING OPENACC
Prof. Georg Kresse
Computational Materials Physics
University of Vienna
For VASP, OpenACC is the way forward for GPU
acceleration. Performance is similar to CUDA, and
OpenACC dramatically decreases GPU
development and maintenance efforts. We’re
excited to collaborate with NVIDIA and PGI as an
early adopter of Unified Memory.
“ “
VASP
Top Quantum Chemistry and Material Science Code
* Applications in production and development
4. 4
REGISTER NOW
Our annual OpenACC and Hackathons Summit showcases
leading research advanced through our Open Hackathons or
accelerated by the OpenACC directives-based programming
model.
This year’s event features two keynotes, HPC and AI
tutorials, and invited speakers from premier institutions
worldwide, including:
OPENACC AND HACKATHONS SUMMIT 2022
REGISTRATION OPEN!
• Arizona State University
• Lancaster University
• NASA Langley
• Oak Ridge National Laboratory
• Princeton Plasma Physics
Laboratory
• RIKEN R-CCS
• University of Delaware
• University of Florida
• University of Notre Dame
• University of York
5. 5
UPCOMING OPEN HACKATHONS & BOOTCAMPS
COMPLETE LIST OF EVENTS
Event Call Closes Event Date
CIAIRI AI for Science Bootcamp June 30, 2022 July 11-12, 2022
KISTI GPU Hackathon 2022 July 4, 2022 July 18-22, 2022
UM6P AI for Science Bootcamp July 6, 2022 July 25-26, 2022
NCC Portugal AI for Science Bootcamp September 5, 2022 September 12-13, 2022
NASA GPU Hackathon 2022 July 20, 2022 September 19, 26-28, 2022
CSCS GPU Hackathon 2022 July 11, 2022 September 19, 27-29, 2022
OLCF GPU Hackathon 2022 August 10, 2022 October 17, 25-27, 2022
Digital in 2022: Our virtual events continue to offer the same high-touch training and mentorship without the
hassle of travel!
6. 6
VIEW AWARDEES
The U.S. National Science Foundation announced 22 new
awards through its NSF Research Traineeship (NRT)
program. The $58 million investment, including $6.9 million of
support from the American Rescue Plan Act of 2021,
expands the program to comprise 43 states, the District of
Columbia and the U.S. Virgin Islands. This NSF program
supports graduate students, educates the STEM leaders of
tomorrow and strengthens the national research
infrastructure.
The awardees, including two from the newly added state of
Missouri, will focus on research and development in artificial
intelligence and quantum information science and
engineering — both national priorities of utmost importance.
NSF RESEARCH TRAINEESHIP PROGRAM
EXPANDS TO INCLUDE 43 STATES
7. 7
APPLY NOW
NVIDIA’s Academic Hardware Grant program is not
accepting applications through July 1, 2022. This program
equips researchers and educators with computing resources
and hands-on platforms for teaching AI, deep learning, and
data science across various disciplines.
You can apply for hardware such as:
• NVIDIA RTX™ workstation GPUs powered by NVIDIA
Ampere Architecture
• NVIDIA® BlueField® Data Processing Units (DPUs)
• NVIDIA Jetson Nano Developer Kits
• Remote GPU instances in the cloud with pre-built container
images
NVIDIA ACADEMIC HARDWARE GRANT PROGRAM
NOW ACCEPTING APPLICATIONS
8. 8
LEARN MORE
openSUSE Tumbleweed 20220510 snapshot switch to
GCC 12 compiler brings new features, including
improved C/C++/Ada language support, features on
new/upcoming CPUs like AVX-512 FP16, OpenMP and
OpenACC enhancements, JIT enhancements, useful
debug improvements, and much more. There are also
various performance optimizations from small changes
up to more notable items like vectorization now being
enabled at the "-O2" optimization level with the very-
cheap cost model.
openSUSE TUMBLEWEED’S GCC 12 UPGRADE
HELPING PERFORMANCE
9. 9
RESOURCES
Paper: Optimizing UniFrac with OpenACC Yields
Greater Than One Thousand Times Speed Increase
Igor Sfiligoi, George Armstrong, Antonio Gonzalez,
Daniel McDonald, and Rob Knight
UniFrac is an important tool in microbiome research that is
used for phylogenetically comparing microbiome profiles to one
another (beta diversity). Striped UniFrac recently added the
ability to split the problem into many independent subproblems,
exhibiting nearly linear scaling but suffering from memory
contention. Here, we adapt UniFrac to graphics processing
units using OpenACC, enabling greater than 1,000X
computational improvement, and apply it to 307,237 samples,
the largest 16S rRNA V4 uniformly preprocessed microbiome
data set analyzed to date).
READ PAPER
10. 10
RESOURCES
Paper: Energy Efficiency of Reverse Time Migration on HPC
Architectures
M. Serpa, P. Silva, C. Künas, A. Carissimi, J. Panetta,
and P. Navaux
Reverse Time Migration (RTM) is a preferred solution for modeling and imaging seismic
data in areas of complex wave phenomena. It computes numerical solutions to the
complete wave equation. Historically, it was impractical due to high computational costs.
Nevertheless, developers have been porting their simulations to the new High-
Performance Computing architectures, providing faster and more accurate results with
each new generation. That makes RTM coupled with more precise velocity model-
building workflows, now making RTM a viable option in the imaging portfolio. However,
several challenges arrive when achieving high performance on HPC architectures. The
first one is related to choosing the architecture that best fits your simulation. Moreover,
the API used to implement the simulation affects the performance and portability of the
code. This abstract presents two versions of an RTM simulation, reducing floating-point
operations by changing the equation derivatives and improving the energy efficiency of
the simulation. The paper studies optimizations in different multicore and GPU
architectures, investigating the impact of other APIs on the energy efficiency and
portability of the code. The energy efficiency results showed that the CUDA
implementation running on the Volta architecture has the best energy efficiency for RTM
on GPUs, while for multicore, the OpenACC version was the best.
READ PAPER Figure 1: Data collection in marine seismic survey
11. 11
RESOURCES
GCC Contributors Guide 0.1 documentation: GCC for New Contributors
David Malcolm
VIEW GUIDE
This is an “unofficial” guide to GCC’s internals, aimed at new
developers and at plugin authors.
Contents include:
• Diving into GCC internals
• “Gotchas” and FAQS
• Getting Started
• Debugging GCC
• Working with the testsuite
• How to improve the location of a gcc diagnostic
• Memory Management in GCC
• TODO
12. 12
RESOURCES
Website: OpenHackathons.org
Technical Resources
VISIT SITE
Explore a wealth of resources for parallelization and
accelerated computing across HPC, AI and Big Data.
Review a collection of videos, presentations, GitHub
repos, tutorials, libraries, and more to help you advance
your skills and expand your knowledge.
13. 13
STAY IN THE KNOW: JOIN THE COMMUNITY
OPENACC AND HACKATHON UPDATES
JOIN TODAY
The OpenACC Organization is dedicated to
helping the research and developer community
advance science by expanding their accelerated
and parallel computing skills.
Take an active role in influencing the future of both
the OpenACC specification and the organization
itself by becoming a member of the community.
Keep abreast of the new tools, latest resources,
recent research, and upcoming events.