论文标题
恢复:联合药物重新利用的顺序模型优化平台,可以在体外识别新型协同化合物
RECOVER: sequential model optimization platform for combination drug repurposing identifies novel synergistic compounds in vitro
论文作者
论文摘要
对于大型小分子的大型文库,在考虑各种疾病模型,测定条件和剂量范围时,详尽的组合化学筛选变得不可行。深度学习模型已经实现了技术的最终结果,以预测协同得分。但是,药物组合的数据库对协同剂有偏见,这些结果不一定会概括分布不足。我们采用了使用深度学习模型的顺序模型优化搜索,以快速发现与癌细胞系相比筛查的协同药物组合,而不是详尽的评估。我们的小规模湿实验室实验仅占总搜索空间的约5%的评估。在仅3轮ML引导的体外实验(包括校准圈)之后,我们发现,对高度协同组合的征询药物对组合的集合;进行了另外两轮ML引导实验,以确保趋势的可重复性。值得注意的是,我们重新发现药物组合后来证实将在临床试验中研究。此外,我们发现仅使用结构信息生成的药物嵌入开始反映作用机制。在计算机基准测试中的先验表明,与随机选择相比,通过使用顺序评估的顺序评估,我们可以通过使用顺序的评估来富集搜索查询,或者在使用预审预周倍的模型在单个时间点选择所有药物组合时使用> 3X。
For large libraries of small molecules, exhaustive combinatorial chemical screens become infeasible to perform when considering a range of disease models, assay conditions, and dose ranges. Deep learning models have achieved state of the art results in silico for the prediction of synergy scores. However, databases of drug combinations are biased towards synergistic agents and these results do not necessarily generalise out of distribution. We employ a sequential model optimization search utilising a deep learning model to quickly discover synergistic drug combinations active against a cancer cell line, requiring substantially less screening than an exhaustive evaluation. Our small scale wet lab experiments only account for evaluation of ~5% of the total search space. After only 3 rounds of ML-guided in vitro experimentation (including a calibration round), we find that the set of drug pairs queried is enriched for highly synergistic combinations; two additional rounds of ML-guided experiments were performed to ensure reproducibility of trends. Remarkably, we rediscover drug combinations later confirmed to be under study within clinical trials. Moreover, we find that drug embeddings generated using only structural information begin to reflect mechanisms of action. Prior in silico benchmarking suggests we can enrich search queries by a factor of ~5-10x for highly synergistic drug combinations by using sequential rounds of evaluation when compared to random selection, or by a factor of >3x when using a pretrained model selecting all drug combinations at a single time point.