Published June 24, 2024 | Version v1
Journal article Open

APACE: AlphaFold2 and advanced computing as a service for accelerated discovery in biophysics

  • 1. Argonne National Laboratory
  • 2. University of Illinois at Urbana-Champaign
  • 3. University of Chicago

Description

The prediction of protein 3D structure from amino acid sequence is a computational grand challenge in biophysics and plays a key role in robust protein structure prediction algorithms, from drug discovery to genome interpretation. The advent of AI models, such as AlphaFold, is revolutionizing applications that depend on robust protein structure prediction algorithms. To maximize the impact, and ease the usability, of these AI tools we introduce APACE, AlphaFold2 and advanced computing as a service, a computational framework that effectively handles this AI model and its TB-size database to conduct accelerated protein structure prediction analyses in modern supercomputing environments. We deployed APACE in the Delta and Polaris supercomputers and quantified its performance for accurate protein structure predictions using four exemplar proteins: 6AWO, 6OAN, 7MEZ, and 6D6U. Using up to 300 ensembles, distributed across 200 NVIDIA A100 GPUs, we found that APACE is up to two orders of magnitude faster than off-the-self AlphaFold2 implementations, reducing time-to-solution from weeks to minutes. This computational approach may be readily linked with robotics laboratories to automate and accelerate scientific discovery.

Data availability

The data and scientific software needed to reproduce this work are available at https://github.com/hyunp2/alphafold/tree/main (41).

Files

park-et-al-2024-apace-alphafold2-and-advanced-computing-as-a-service-for-accelerated-discovery-in-biophysics.pdf

Additional details

Identifiers

DOI
10.1073/pnas.2311888121
Other
oai:uchicago.tind.io:12890

Funding

Department of Energy
DE-AC02-06CH11357
National Science Foundation
OAC-2209892
National Science Foundation
OAC 2005572
State of Illinois
National Institute of General Medical Sciences
P41-GM104601
National Institute of General Medical Sciences
R24-GM145965
National Institute of General Medical Sciences
R01-GM123455

UChicago Information

Division(s)
Physical Sciences Division
Department(s)
Computer Science