High-Throughput Screening Assay Datasets from the PubChem Database

Mariusz Butkiewicz; Yanli Wang; Stephen H Bryant; Edward W Lowe Jr; David Weaver C; Jens Meiler

首页> 外文期刊>Chemical informatics >High-Throughput Screening Assay Datasets from the PubChem Database

【24h】

High-Throughput Screening Assay Datasets from the PubChem Database

机译：PubChem数据库中的高通量筛选分析数据集

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Availability of high-throughput screening (HTS) data in the public domain offers great potential to foster development of ligand-based computer-aided drug discovery (LB-CADD) methods crucial for drug discovery efforts in academia and industry. LB-CADD method development depends on high-quality HTS assay data, i.e., datasets that contain both active and inactive compounds. These active compounds are hits from primary screens that have been tested in concentrationresponse experiments and where the target-specificity of the hits has been validated through suitable secondary screening experiments. Publicly available HTS repositories such as PubChem often provide such data in a convoluted way: compounds that are classified as inactive need to be extracted from the primary screening record. However, compounds classified as active in the primary screening record are not suitable as a set of active compounds for LB-CADD experiments due to high false-positive rate. A suitable set of actives can be derived by carefully analysing results in often up to five or more assays that are used to confirm and classify the activity of compounds. These assays, in part, build on each other. However, often not all hit compounds from the previous screen have been tested. Sometimes a compound can be classified as ‘active’, though its meaning is ‘inactive’ on the target of interest as it is ‘active’ on a different target protein. Here, a curation process of hierarchically related confirmatory screens is illustrated based on two specifically chosen protein use-cases.The subsequent re-upload procedure into PubChem is described for the findings of those two scenarios. Further, we provide nine publicly accessible high quality datasets for future LB-CADD method development that provide a common baseline for comparison of future methods to the scientific community. We also provide a protocol researchers can follow to upload additional datasets for benchmarking. Keywords: HTS; PubChem; Datasets; LB-CADD

机译：高通量筛选（HTS）数据在公共领域的可用性为促进基于配体的计算机辅助药物发现（LB-CADD）方法的发展提供了巨大潜力，这些方法对于学术界和工业界的药物发现工作至关重要。 LB-CADD方法的开发取决于高质量的HTS分析数据，即包含活性和非活性化合物的数据集。这些活性化合物是来自初次筛选的命中物，这些筛选物已在浓度响应实验中进行了测试，并且这些命中物的靶标特异性已通过合适的二次筛选实验进行了验证。公众可获得的HTS储存库（例如PubChem）通常会以令人费解的方式提供此类数据：归类为非活性的化合物需要从初步筛选记录中提取。但是，由于高假阳性率，在初次筛选记录中被分类为有活性的化合物不适合作为用于LB-CADD实验的一组活性化合物。一组合适的活性成分可以通过经常在多达五个或更多个用于确认和分类化合物活性的测定中仔细分析结果来得出。这些检测在某种程度上是相互依存的。但是，通常并非所有来自先前筛选的命中化合物都经过了测试。有时，化合物的含义是在目标靶标上是“无活性”，因为它在不同的靶蛋白上具有“活性”，尽管它的含义是“无活性”。在此，基于两个特定选择的蛋白质用例说明了与层次相关的确认性筛选的策划过程。随后针对这两种情况的发现描述了随后重新上传到PubChem中的过程。此外，我们为将来的LB-CADD方法开发提供了九个可公开访问的高质量数据集，这些数据集为将未来方法与科学界进行比较提供了共同的基准。我们还提供了研究人员可以遵循的协议，以上传其他数据集进行基准测试。关键字：HTS; PubChem;数据集磅CAD

著录项

来源
《Chemical informatics》 |2017年第1期|共7页
作者
Mariusz Butkiewicz; Yanli Wang; Stephen H Bryant; Edward W Lowe Jr; David Weaver C; Jens Meiler;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类化学;
关键词

相似文献

外文文献
中文文献
专利

1. Benchmarking Ligand-Based Virtual High-Throughput Screening with the PubChem Database [J] . Mariusz Butkiewicz, Edward W. Lowe Jr., Ralf Mueller Molecules . 2013,第1期

机译：使用PubChem数据库对基于配体的虚拟高通量筛选进行基准测试
2. QSAR Modeling of Imbalanced High-Throughput Screening Data in PubChem [J] . Alexey V.Zakharov, Megan L.Peach, Markus Sitzmann, Journal of chemical information and modeling . 2014,第3期

机译：PubChem中不平衡高通量筛选数据的QSAR建模
3. A novel method for mining highly imbalanced high-throughput screening data in PubChem [J] . Li Qingliang, Wang Yanli, Bryant Stephen H. Bioinformatics . 2009,第24期

机译：在PubChem中挖掘高度不平衡的高通量筛选数据的新方法
4. MICRO-UPLC-MS HIGH-THROUGHPUT SCREENING ASSAY FOR SPHINGOLIPIDS PATHWAY ANALYSIS [C] . Kristen Randall, Helen Klodnitsky, Drew Tietz, International Mass Spectrometry Conference . 2018

机译：微型UPLC-MS高通量筛选测定用于鞘磷脂途径分析
5. A High-Throughput Screening Assay Based on TRFRET to Identify Inhibitors for T. brucei Kinases [D] . Islam, Zeba. 2018

机译：基于TRFRET的高通量筛选测定法鉴定T.Brucei激酶的抑制剂
6. High-Throughput Screening Assay Datasets from the PubChem Database [O] . Mariusz Butkiewicz, Yanli Wang, Stephen H Bryant, -1

机译：PubChem数据库中的高通量筛选分析数据集
7. Benchmarking Ligand-Based Virtual High-Throughput Screening with the PubChem Database [O] . Mariusz Butkiewicz, Edward W. Lowe, Ralf Mueller, 2013

机译：使用pubChem数据库对基于配体的虚拟高通量筛选进行基准测试

High-Throughput Screening Assay Datasets from the PubChem Database

摘要

著录项

相似文献

相关主题

期刊订阅