Project information
Pushing the limits in automated NMR structure determination using a single 4D NOESY spectrum and machine learning methods
- Project Identification
- MUNI/G/0739/2017
- Project Period
- 3/2018 - 12/2020
- Investor / Pogramme / Project type
-
Masaryk University
- Grant Agency of Masaryk University
- INTERDISCIPLINARY - Interdisciplinary research projects
- MU Faculty or unit
- Central European Institute of Technology
- Other MU Faculty/Unit
- Faculty of Informatics
- Other MU Faculty/Unit
- Institute of Computer Science
The ultrahigh-field NMR spectrometers with magnetic strengths of 1.2 GHz that cost more than 12.5 million Euros are within reach. The more powerful the magnet, the larger the protein structures to be revealed in general. The increases in sensitivity and resolution that arise from higher field strengths benefit the simplest type of experiments (NOESY) that report on interatomic distances and are used for NMR structure determination. The financial and technical investments required for these instruments call for methodological breakthroughs to exploit the better NOE build-up and sensitivity in larger proteins. Here, we propose to combine novel NOESY concepts with artificial intelligence to a streamlined strategy designed to meet the key objectives of NMR structure determination; minimal data collection, least human intervention, and applicability to large proteins. Artificial neural networks are to be trained using the massive amounts of chemical shifts deposited in Biological Magnetic Resonance Bank by human experts (8 million entries) and perform the NMR assignment task using only the 4D NOESY spectrum. Since Rosetta software can deliver accurate structures from a sparse network of distance constraints extracted from the NOESY spectrum, our approach promises to crack the atomic structures of large proteins in a single step. Today, image recognition by machines trained via machine learning in some scenarios is better than humans. The NMR assignment problem, if anything else, is a pattern recognition problem. As a major impulse for biomolecular NMR of large proteins, the present grant integrates the unique experience and knowledge in the analysis of NOESY data with several years of expertise in neural network applications to supersede approaches for protein structure determination in vogue that are highly time-consuming.
Publications
Total number of publications: 6
2021
-
4D-GRAPHS
Year: 2021
-
Distinct EH domains of the endocytic TPLATE complex confer lipid and protein binding
Nature Communications, year: 2021, volume: 12, edition: 1, DOI
-
Phosphorylation-induced changes in the PDZ domain of Dishevelled 3
Scientific Reports, year: 2021, volume: 11, edition: 1, DOI
2020
-
Reinforcement Learning of Risk-Constrained Policies in Markov Decision Processes
The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, year: 2020
2019
-
Dishevelled-3 conformation dynamics analyzed by FRET-based biosensors reveals a key role of casein kinase 1
Nature Communications, year: 2019, volume: 10, edition: APR 18 2019, DOI
2018
-
De novo design of a non-local beta-sheet protein with high stability and accuracy
NATURE STRUCTURAL & MOLECULAR BIOLOGY, year: 2018, volume: 25, edition: 11, DOI