Published May 30, 2023 | Version v1
Journal article Open

An end-to-end deep learning method for protein side-chain packing and inverse folding

  • 1. University of Chicago
  • 2. Toyota Technical Institute of Chicago

Description

Protein side-chain packing (PSCP), the task of determining amino acid side-chain conformations given only backbone atom positions, has important applications to protein structure prediction, refinement, and design. Many methods have been proposed to tackle this problem, but their speed or accuracy is still unsatisfactory. To address this, we present AttnPacker, a deep learning (DL) method for directly predicting protein side-chain coordinates. Unlike existing methods, AttnPacker directly incorporates backbone 3D geometry to simultaneously compute all side-chain coordinates without delegating to a discrete rotamer library or performing expensive conformational search and sampling steps. This enables a significant increase in computational efficiency, decreasing inference time by over 100× compared to the DL-based method DLPacker and physics-based RosettaPacker. Tested on the CASP13 and CASP14 native and nonnative protein backbones, AttnPacker computes physically realistic side-chain conformations, reducing steric clashes and improving both rmsd and dihedral accuracy compared to state-of-the-art methods SCWRL4, FASPR, RosettaPacker, and DLPacker. Different from traditional PSCP approaches, AttnPacker can also codesign sequences and side chains, producing designs with subnative Rosetta energy and high in silico consistency.

Data availability

Pretrained model, source code, and inference scripts are available at https://github.com/MattMcPartlon/AttnPacker. All study data are included in the article and/or supporting information.

Files

End-to-end-deep-learning-method-for-protein-side-chain-packing-and-inverse-folding.pdf

Files (25.6 MB)

Name Size Download all
Article
md5:4d823901de0a2ac978756816dbdf6aee
15.5 MB Preview Download
Supporting information
md5:03d86e20e865cc6fd9a1c49f62e784fc
10.2 MB Preview Download

Additional details

Identifiers

DOI
10.1073/pnas.2216438120
Other
oai:uchicago.tind.io:6221

Funding

National Institutes of Health
R01 GM089753

UChicago Information

Division(s)
Physical Sciences Division
Department(s)
Computer Science