...
首页> 外文期刊>Journal of chemical information and modeling >New Workflow for QSAR Model Development from Small Data Sets: Small Dataset Curator and Small Dataset Modeler. Integration of Data Curation, Exhaustive Double Cross-Validation, and a Set of Optimal Model Selection Techniques
【24h】

New Workflow for QSAR Model Development from Small Data Sets: Small Dataset Curator and Small Dataset Modeler. Integration of Data Curation, Exhaustive Double Cross-Validation, and a Set of Optimal Model Selection Techniques

机译:来自小型数据集的QSAR模型开发的新工作流程:小型数据集策划器和小型数据集型号。 数据策择集成,详尽的双交叉验证以及一组最佳模型选择技术

获取原文
获取原文并翻译 | 示例
           

摘要

Quantitative structure activity relationship (QSAR) modeling is a well-known in silico technique with extensive applications in several major fields such as drug design, predictive toxicology, materials science, food science, etc. Handling small-sized datasets due to the lack of experimental data for specialized end points is a crucial task for the QSAR researcher. In the present study, we propose an integrated workflow/scheme capable of dealing with small dataset modeling that integrates dataset curation, "exhaustive" double cross-validation and a set of optimal model selection techniques including consensus predictions. We have developed two software tools, namely, Small Dataset Curator, version 1.0.0, and Small Dataset Modeler, version 1.0.0, to effortlessly execute the proposed workflow. These tools are freely available for download from https://dtclab.webs.com/software-tools. We have performed case studies employing seven diverse datasets to demonstrate the performance of the proposed scheme (including data curation) for small dataset QSAR modeling. The case studies also confirm the usability and stability of the developed software tools.
机译:定量结构活动关系(QSAR)建模是众所周知的硅技术,具有广泛的应用,如药物设计,预测毒理学,材料科学,食品科学等。由于缺乏实验,处理小型数据集。专业终点的数据是QSAR研究人员的关键任务。在本研究中,我们提出了一种能够处理集成数据集策策的小型数据集建模的集成工作流程/方案,“详尽的”双交叉验证和一组最佳模型选择技术,包括共识预测。我们开发了两个软件工具,即小型数据集策展程序,版本1.0.0和小型数据集更愿意的1.0.0,以毫不费力地执行所提出的工作流程。这些工具可自由地从https://dtclab.webs.com/software-tools下载。我们已经进行了采用七种不同数据集的案例研究,以展示所提出的方案(包括数据策委)进行小型数据集QSAR建模的性能。案例研究还确认了开发软件工具的可用性和稳定性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号