
ProteinNet: a standardized data set for machine learning of protein structure


Abstract

Background: Rapid progress in deep learning has spurred its application to bioinformatics problems including protein structure prediction and design. In classic machine learning problems like computer vision, progress has been driven by standardized data sets that facilitate fair assessment of new methods and lower the barrier to entry for non-domain experts. While data sets of protein sequence and structure exist, they lack certain components critical for machine learning, including high-quality multiple sequence alignments and insulated training/validation splits that account for deep but only weakly detectable homology across protein space.


Bibliographic details

Journal: BMC Bioinformatics
Author: Mohammed AlQuraishi
Year (volume): 2019 (20)
Page: 311
Total pages: 10
Original format: PDF
Chinese Library Classification: Applied microbiology; Biochemical genetics; Biochemical pharmacology
Keywords: Proteins; Protein structure; Machine learning; CASP; Protein sequence; Co-evolution; PSSM; Protein structure prediction; Database; Deep learning


Similar literature


1. ProteinNet: a standardized data set for machine learning of protein structure [J]. Mohammed AlQuraishi. BMC Bioinformatics. 2019, No. 1

2. kScore: a novel machine learning approach that is not dependent on the data structure of the training set [J]. Scott Oloff, Ingo Muegge. Journal of Computer-Aided Molecular Design. 2007, No. 1a3

3. A Novel Approach to Standardizing Data & Detecting Duplicates Across Adverse Event Data Sources Using Machine Learning [J]. Desai S., Chan K., Bannout K., Drug safety: An international journal of medical toxicology and drug experience. 2018, No. 11

4. Generating Generic Data Sets for Machine Learning Applications in Building Services Using Standardized Time Series Data [C] . F. Stinner, Y. Yang, T. Schreiber, International Symposium on Automation and Robotics in Construction and Mining . 2019

5. Genome Data Analysis, Protein Function and Structure Prediction by Machine Learning Techniques [D] . Cao, Renzhi 2016

6. Comparative Characterization of Crofelemer Samples Using Data Mining and Machine Learning Approaches With Analytical Stability Data Sets [O] . Maulik K. Nariya, Jae Hyun Kim, Jian Xiong, -1

7. ProteinNet: a standardized data set for machine learning of protein structure [O] . Mohammed AlQuraishi 2019

