AlphaProteo 生成用于生物学和健康研究的新型蛋白质 AlphaProteo generates novel proteins for biology and health research

原始链接: https://deepmind.google/discover/blog/alphaproteo-generates-novel-proteins-for-biology-and-health-research/

科学家开发了一种名为 AlphaProteo 的新型人工智能系统,用于设计新型蛋白质结合剂,可以有效地附着到特定目标分子上,从而可能彻底改变药物设计、疾病调查、诊断和农业应用等各个领域。 通过研究蛋白质数据库 (PDB) 中的大量蛋白质数据和 AlphaFold 中的 1 亿多个预测结构,AlphaProteo 深入了解了复杂的分子键合机制。 它根据输入的目标分子信息生成针对目标相互作用点定制的候选蛋白质结合剂。 通过对包括 VEGF-A 在内的几种关键靶蛋白的测试,证明了其能够产生相对于现有技术更优异的结果,与现有方法相比,其结合率高达 95%,平均强度提高了 10 倍。 此外,AlphaProteo 还实现了前所未有的壮举,为 VEGF-A 创建了蛋白质结合剂,这是之前的人工智能工具无法实现的。 该系统的性能有望大幅减少各种蛋白质结合剂应用的实验时间要求。 尽管取得了一些成就,但研究人员承认 AlphaProteo 的局限性,并计划持续改进,以解决诸如为 TNFα 等靶点设计结合物等挑战,众所周知,由于其独特的形状和功能,该结合物特别困难。 与外部专家、科学界和专业实验室的合作将确保该技术的安全处理和道德使用。 寻求访问或合作的感兴趣者可以联系 [[email protected]](mailto:[email protected])。

A new AI system named AlphaProteo has been developed by scientists to design novel protein binders that can effectively attach to specific target molecules, potentially revolutionizing various areas including drug design, disease investigation, diagnostics, and agricultural applications. By studying vast quantities of protein data from the Protein Data Bank (PDB) and over 100 million predicted structures from AlphaFold, AlphaProteo has gained insight into complex molecular bonding mechanisms. It produces candidate protein binders tailored for targeted interaction points based on inputted target molecule information. Its ability to produce superior results relative to current techniques was demonstrated via tests on several crucial target proteins, including VEGF-A, yielding binding rates up to 95% and average strength improvements of 10x compared to established methods. Furthermore, AlphaProteo achieved the unprecedented feat of creating a protein binder for VEGF-A, something no previous AI tool has accomplished. The system's performance promises substantial reduction in experimental time requirements for a wide array of protein binder applications. Despite its accomplishments, researchers acknowledge AlphaProteo's limitations and plan ongoing improvement to address challenges like designing binders for targets such as TNFα, known to be particularly difficult due to its unique shape and function. Collaborative work with external experts, the scientific community, and specialized labs will ensure safe handling and ethical use of the technology moving forward. Interested parties seeking access or collaboration may contact [[email protected]](mailto:[email protected]).


Research

Published
Authors

Protein Design and Wet Lab teams

The target protein shown here in yellow is the spike protein from SARS-CoV-2 virus, which is involved in COVID-19 infection.

New AI system designs proteins that successfully bind to target molecules, with potential for advancing drug design, disease understanding and more.

Every biological process in the body, from cell growth to immune responses, depends on interactions between molecules called proteins. Like a key to a lock, one protein can bind to another, helping regulate critical cellular processes. Protein structure prediction tools like AlphaFold have already given us tremendous insight into how proteins interact with each other to perform their functions, but these tools cannot create new proteins to directly manipulate those interactions.

Scientists, however, can create novel proteins that successfully bind to target molecules. These binders can help researchers accelerate progress across a broad spectrum of research, including drug development, cell and tissue imaging, disease understanding and diagnosis – even crop resistance to pests. While recent machine learning approaches to protein design have made great strides, the process is still laborious and requires extensive experimental testing.

Today, we introduce AlphaProteo, our first AI system for designing novel, high-strength protein binders to serve as building blocks for biological and health research. This technology has the potential to accelerate our understanding of biological processes, and aid the discovery of new drugs, the development of biosensors and more.

AlphaProteo can generate new protein binders for diverse target proteins, including VEGF-A, which is associated with cancer and complications from diabetes. This is the first time an AI tool has been able to design a successful protein binder for VEGF-A.

AlphaProteo also achieves higher experimental success rates and 3 to 300 times better binding affinities than the best existing methods on seven target proteins we tested.

Learning the intricate ways proteins bind to each other

Protein binders that can bind tightly to a target protein are hard to design. Traditional methods are time intensive, requiring multiple rounds of extensive lab work. After the binders are created, they undergo additional experimental rounds to optimize binding affinity, so they bind tightly enough to be useful.

Trained on large amounts of protein data from the Protein Data Bank (PDB) and more than 100 million predicted structures from AlphaFold, AlphaProteo has learned the myriad ways molecules bind to each other. Given the structure of a target molecule and a set of preferred binding locations on that molecule, AlphaProteo generates a candidate protein that binds to the target at those locations.

Illustration of a predicted protein binder structure interacting with a target protein. Shown in blue is a predicted protein binder structure generated by AlphaProteo, designed for binding to a target protein. Shown in yellow is the target protein, specifically the SARS-CoV-2 spike receptor-binding domain

Demonstrating success on important protein binding targets

To test AlphaProteo, we designed binders for diverse target proteins, including two viral proteins involved in infection, BHRF1 and SARS-CoV-2 spike protein receptor-binding domain, SC2RBD, and five proteins involved in cancer, inflammation and autoimmune diseases, IL-7Rɑ, PD-L1, TrkA, IL-17A and VEGF-A.

Our system has highly-competitive binding success rates and best-in-class binding strengths. For seven targets, AlphaProteo generated candidate proteins in-silico that bound strongly to their intended proteins when tested experimentally.

A grid of illustrations of predicted structures of seven target proteins for which AlphaProteo generated successful binders. Shown in blue are examples of binders tested in the wet lab, shown in yellow are protein targets, and highlighted in dark yellow are intended binding regions.

For one particular target, the viral protein BHRF1, 88% of our candidate molecules bound successfully when tested in the Google DeepMind Wet Lab. Based on the targets tested, AlphaProteo binders also bind 10 times more strongly, on average, than the best existing design methods.

For another target, TrkA, our binders are even stronger than the best prior designed binders to this target that have been through multiple rounds of experimental optimization.

Bar graph showing experimental in vitro success rates of AlphaProteo’s output for each of the seven target proteins, compared to other design methods. Higher success rates mean fewer designs must be tested to find successful binders.

Bar graph showing the best affinity for AlphaProteo’s designs without experimental optimization for each of the seven target proteins, compared to other design methods. Lower affinity means the binder protein binds more tightly to the target protein. Please note the logarithmic scale of the vertical axis.

Validating our results

Beyond in silico validation and testing AlphaProteo in our wet lab, we engaged Peter Cherepanov’s, Katie Bentley’s and David LV Bauer’s research groups from the Francis Crick Institute to validate our protein binders. Across different experiments, they dived deeper into some of our stronger SC2RBD and VEGF-A binders. The research groups confirmed that the binding interactions of these binders were indeed similar to what AlphaProteo had predicted. Additionally, the groups confirmed that the binders have useful biological function. For example, some of our SC2RBD binders were shown to prevent SARS-CoV-2 and some of its variants from infecting cells.

AlphaProteo’s performance indicates that it could drastically reduce the time needed for initial experiments involving protein binders for a broad range of applications. However, we know that our AI system has limitations, as it was unable to design successful binders against an 8th target, TNFɑ, a protein associated with autoimmune diseases like rheumatoid arthritis. We selected TNFɑ to robustly challenge AlphaProteo, as computational analysis showed that it would be extremely difficult to design binders against. We will continue to improve and expand AlphaProteo's capabilities with the goal of eventually addressing such challenging targets.

Achieving strong binding is usually only the first step in designing proteins that might be useful for practical applications, and there are many more bioengineering obstacles to overcome in the research and development process.

Towards responsible development of protein design

Protein design is a fast-evolving technology that holds lots of potential for advancing science in everything from understanding the factors that cause disease, to accelerating diagnostic test development for virus outbreaks, supporting more sustainable manufacturing processes, and even cleaning contaminants from the environment.

To account for potential risks in biosecurity, building on our long-standing approach to responsibility and safety, we’re working with leading external experts to inform our phased approach to sharing this work, and feeding into community efforts to develop best practices, including the NTI’s (Nuclear Threat Initiative) new AI Bio Forum.

Going forward, we’ll be working with the scientific community to leverage AlphaProteo on impactful biology problems and understand its limitations. We've also been exploring its drug design applications at Isomorphic Labs, and are excited for what the future holds.

At the same time, we’re continuing to improve the success rate and affinity of AlphaProteo’s algorithms, expanding the range of design problems it can tackle, and working with researchers in machine learning, structural biology, biochemistry and other disciplines to develop a responsible and more comprehensive protein design offering for the community.

相关文章
联系我们 contact @ memedata.com