Sample Efficiency Matters: A Benchmark for Practical Molecular Optimization | Tianfan Fu

Sample Efficiency Matters: A Benchmark for Practical Molecular Optimization | Tianfan Fu

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our YouTube channel to see the presented slides.

If you enjoyed this talk, consider joining the Molecular Modeling and Drug Discovery (M2D2) talks live: https://m2d2.io/talks/m2d2/about/

Also consider joining the M2D2 Slack: https://m2d2group.slack.com/join/shar...

Abstract: Molecular optimization is a fundamental goal in the chemical sciences and is of central interest to drug and material design. In recent years, significant progress has been made in solving challenging problems across various aspects of computational molecular optimizations, emphasizing high validity, diversity, and, most recently, synthesizability. Despite this progress, many papers report results on trivial or self-designed tasks, bringing additional challenges to directly assessing the performance of new methods. Moreover, the sample efficiency of the optimization--the number of molecules evaluated by the oracle--is rarely discussed, despite being an essential consideration for realistic discovery applications. To fill this gap, we have created an open-source benchmark for practical molecular optimization, PMO, to facilitate the transparent and reproducible evaluation of algorithmic advances in molecular optimization. This paper thoroughly investigates the performance of 25 molecular design algorithms on 23 tasks with a particular focus on sample efficiency. Our results show that most "state-of-the-art" methods fail to outperform their predecessors under a limited oracle budget allowing 10K queries and that no existing algorithm can efficiently solve certain molecular optimization problems in this setting. We analyze the influence of the optimization algorithm choices, molecular assembly strategies, and oracle landscapes on the optimization performance to inform future algorithm development and benchmarking. PMO provides a standardized experimental setup to comprehensively evaluate and compare new molecule optimization methods with existing ones.

Speakers: Tianfan Fu

Twitter Prudencio

Twitter Therence

Twitter Cas

Twitter Valence Discovery

Tämä jakso on lisätty Podme-palveluun avoimen RSS-syötteen kautta eikä se ole Podmen omaa tuotantoa. Siksi jakso saattaa sisältää mainontaa.

Jaksot(60)

Structure-Independent Peptide Binder Design via Generative Language Models | Pranam Chatterjee

Structure-Independent Peptide Binder Design via Generative Language Models | Pranam Chatterjee

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifie...

20 Kesä 20231h

Learning Local Equivariant Representations for Large-Scale Atomistic Dynamics | Albert Musaelian

Learning Local Equivariant Representations for Large-Scale Atomistic Dynamics | Albert Musaelian

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplif...

13 Kesä 20231h 9min

Multimodal Deep Learning for Protein Engineering | Kevin K. Yang

Multimodal Deep Learning for Protein Engineering | Kevin K. Yang

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifie...

7 Kesä 20231h 2min

Systematic Analysis of Biomolecular Conformational Ensembles with PENSA | Martin Vögele

Systematic Analysis of Biomolecular Conformational Ensembles with PENSA | Martin Vögele

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies ...

30 Touko 202350min

Training Neural Network Potentials: Bayesian and Simulation-based Approaches | Stephan Thaler

Training Neural Network Potentials: Bayesian and Simulation-based Approaches | Stephan Thaler

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies mo...

16 Touko 20231h 3min

Accelerating Cryptic Pocket Discovery Using Alphafold and Markov State Modelling | Soumendranath Bhakat

Accelerating Cryptic Pocket Discovery Using Alphafold and Markov State Modelling | Soumendranath Bhakat

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies mole...

9 Touko 202331min

Machine Learning Molecules | Gianni De Fabritiis

Machine Learning Molecules | Gianni De Fabritiis

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies mole...

25 Huhti 202357min

Protein Representation Learning by Geometric Structure Pretraining | Zuobai Zhang

Protein Representation Learning by Geometric Structure Pretraining | Zuobai Zhang

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies molecu...

19 Huhti 202353min

Suosittua kategoriassa Tiede

rss-mita-tulisi-tietaa
rss-hereilla
rss-duodecim-lehti
rss-poliisin-mieli
tiedekulma-podcast
rss-ilmasto-kriisissa
docemilia
hippokrateen-vastaanotolla
sotataidon-ytimessa
filocast-filosofian-perusteet
rss-radplus
rss-tiedetta-vai-tarinaa
rss-opeklubi
rss-kasvikutsut
rss-totuuden-liepeilla