LOGO

AlphaFold2 Alternative: Faster, Free Protein Folding Model Released

July 15, 2021
AlphaFold2 Alternative: Faster, Free Protein Folding Model Released

A New Leap in Protein Structure Prediction: RoseTTAFold

Late last year, DeepMind’s AlphaFold2 AI model revolutionized biology by accurately predicting protein structures – a long-standing challenge. Now, researchers have developed RoseTTAFold, a system achieving comparable results with significantly reduced computational demands. Notably, RoseTTAFold is freely accessible for use.

AlphaFold2 and the CASP14 Competition

Since November, AlphaFold2 has been a focal point in the scientific community. Its performance at CASP14, a virtual competition assessing protein structure prediction algorithms, was groundbreaking. The model’s accuracy was so substantial that some researchers playfully suggested shifting their focus to new areas of study.

Initial Concerns Regarding DeepMind’s Approach

A primary concern revolved around DeepMind’s plans for the system. The lack of exhaustive public documentation raised worries that the company, owned by Alphabet/Google, might restrict access to the core technology. This would have conflicted with the collaborative spirit prevalent in scientific research.

Update: DeepMind has since published more detailed methodologies in the journal Nature, and the code is available on GitHub. This addresses some initial concerns, but the advancements detailed below remain highly significant.

Introducing RoseTTAFold

Researchers at the University of Washington, led by David Baker and Minkyung Baek, have introduced a new model that addresses these concerns. Published in the journal Science, RoseTTAFold achieves similar accuracy levels to AlphaFold2, but at a fraction of the computational cost.

Inspired by AlphaFold2

Baker acknowledged that RoseTTAFold’s development was directly inspired by the concepts presented by the AlphaFold2 team at CASP14. He praised Minkyung Baek’s rapid progress, stating, “She is amazing!”

Examples of predicted protein structures and their ground truths. A score above 90 is considered extremely good. Image Credits: UW/Baek et al

The “Three-Track” Neural Network

The team developed a “three-track” neural network that simultaneously analyzes the amino acid sequence, distances between residues, and spatial coordinates. This complex implementation results in accuracy levels previously considered unattainable less than a year ago.

Computational Efficiency

RoseTTAFold’s key advantage lies in its efficiency. It achieves comparable accuracy far more quickly, requiring less computing power. As the research paper highlights:

This represents a significant relief for researchers who previously faced challenges securing supercomputer time for protein structure prediction.

Public Accessibility

The reduced computational demands enable public hosting and distribution of RoseTTAFold, a possibility that may not have been feasible for AlphaFold2. A public server is available for anyone to submit protein sequences for structure prediction.

“There have been over 4,500 submissions since we put the server up a few weeks ago,” Baker reported. “We have also made the source code freely available.”

Historical Significance

Protein folding has historically been a major challenge in biology, demanding substantial high-performance computing resources. Projects like Folding@Home relied on distributed computing to tackle this problem. Tasks that once required thousands of computers days or weeks to complete can now be accomplished in minutes on a single desktop.

The Importance of Protein Structure

Understanding protein structure is crucial because proteins perform the majority of functions within our bodies. Modifying, suppressing, or enhancing proteins is essential for therapeutic purposes, but this requires a reliable understanding of their structure. This understanding was previously unattainable computationally until recently.

Limitations and Future Directions

While a significant breakthrough, RoseTTAFold and AlphaFold2 do not represent a complete “solution” to protein folding. Most proteins can now have their structure predicted in neutral conditions, but proteins rarely exist in such a state. Their dynamic interactions with other molecules present a far greater challenge.

“There are many exciting chapters ahead… the story is just beginning,” Baker stated.

DeepMind’s Response

DeepMind acknowledged the advancements made by the Baker Lab, noting that the accuracy difference is not insignificant and the performance gap has narrowed. This highlights the rapid pace of research in this field.

For a more in-depth understanding of the methods and potential future developments, consulting detailed technical accounts and expert analyses is recommended.

#AlphaFold2#protein folding#AI#machine learning#DeepMind#research