Paper title
Sequence-guided protein structure determination using graph convolutional and recurrent networks
Authors
Abstract
Single-particle cryogenic electron microscopy (cryo-EM) experiments now routinely produce high-resolution data for large proteins and their complexes. Building an atomic model into a cryo-EM density map is challenging, particularly when no structure for the target protein is known a priori. Existing protocols for this task often rely on significant human intervention and can take hours to many days to produce an output. Here, we present a fully automated, template-free model-building approach based entirely on neural networks. We use a graph convolutional network (GCN) to generate an embedding from a set of rotamer-based amino acid identities and candidate three-dimensional C$\alpha$ locations. Starting from this embedding, we use a bidirectional long short-term memory (LSTM) module to order and label the candidate identities and atomic locations consistently with the input protein sequence, yielding a structural model. Our approach paves the way for determining protein structures from cryo-EM densities in a fraction of the time required by existing approaches and without the need for human intervention.
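To make the graph-convolution step concrete, the following is a minimal sketch of a single GCN layer applied to candidate C$\alpha$ positions. It is not the architecture used in the paper, whose details the abstract does not give. The distance cutoff of 4.5 Å, the one-hot residue features over two placeholder residue types, the identity weight matrix, the symmetric adjacency normalization, and all function names are assumptions chosen for illustration. The bidirectional LSTM that orders and labels the candidates is not shown.

```python
import math

def build_adjacency(coords, cutoff=4.5):
    """Link candidate positions within `cutoff` angstroms (assumed value); self-loops included."""
    n = len(coords)
    adj = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j or math.dist(coords[i], coords[j]) <= cutoff:
                adj[i][j] = 1.0
    return adj

def normalize(adj):
    """Symmetric normalization D^(-1/2) A D^(-1/2) of an adjacency matrix with self-loops."""
    deg = [sum(row) for row in adj]
    n = len(adj)
    return [[adj[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]

def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def gcn_layer(adj_norm, features, weights):
    """One graph convolution: ReLU(A_norm @ H @ W)."""
    out = matmul(matmul(adj_norm, features), weights)
    return [[max(0.0, v) for v in row] for row in out]

# Hypothetical input: three candidate C-alpha positions (angstroms).
# The first two are 3.8 apart and become neighbours; the third is isolated.
coords = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (20.0, 0.0, 0.0)]
# Hypothetical one-hot residue-type features over two placeholder types.
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]
# Identity weights stand in for learned parameters.
weights = [[1.0, 0.0], [0.0, 1.0]]

embedding = gcn_layer(normalize(build_adjacency(coords)), features, weights)
print(embedding)
```

In this toy input the two neighbouring candidates end up with identical, averaged feature vectors, while the isolated candidate keeps its own. This shows how a graph convolution mixes residue-type evidence between spatially adjacent candidates before any sequence-aware ordering takes place.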
