Can we predict protein structure directly from sequence w/o MSAs and w/o intermediate steps like distograms? In a new preprint we show that we can, led by @mrprotein24 @NazimBouatta @SurgeBiswas along w/@charochereau @geochurch & Peter Sorger: (1/4) Tweet added by Mohammed AlQuraishi @MoAlQuraishi

Mohammed AlQuraishi

3 years

Can we predict protein structure directly from sequence w/o MSAs and w/o intermediate steps like distograms? In a new preprint we show that we can, led by @mrprotein24 @NazimBouatta @SurgeBiswas along w/ @charochereau @geochurch & Peter Sorger: (1/4)

Single-sequence protein structure prediction using language models from deep learning

AlphaFold2 and related systems use deep learning to predict protein structure from co-evolutionary relationships encoded in multiple sequence alignments (MSAs). Despite dramatic, recent increases in...

www.biorxiv.org

7

95

374

Mohammed AlQuraishi

@MoAlQuraishi

3 years

We combine a new protein language model (AminoBERT) with an improved version of our end-to-end differentiable machinery (RGN2) to directly generate 3D coordinates. On orphan proteins, RGN2 outperforms all major methods, including #AlphaFold , RoseTTAFold, and trRosetta. (2/4)

2

11

64

Mohammed AlQuraishi

@MoAlQuraishi

3 years

On designed proteins RGN2 is close but not yet best accuracy-wise. However, it is orders of magnitude faster; a useful property for exploring new protein sequences. (3/4)

1

2

28

Mohammed AlQuraishi

@MoAlQuraishi

3 years

Comments on manuscript are most welcome of course. For future work, we look to combine ideas from AF2 with language models, without sacrificing speed. (4/4)

3

1

17