Sample Efficiency Matters: A Benchmark for Practical Molecular Optimization | Tianfan Fu

Sample Efficiency Matters: A Benchmark for Practical Molecular Optimization | Tianfan Fu

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our YouTube channel to see the presented slides.

If you enjoyed this talk, consider joining the Molecular Modeling and Drug Discovery (M2D2) talks live: https://m2d2.io/talks/m2d2/about/

Also consider joining the M2D2 Slack: https://m2d2group.slack.com/join/shar...

Abstract: Molecular optimization is a fundamental goal in the chemical sciences and is of central interest to drug and material design. In recent years, significant progress has been made in solving challenging problems across various aspects of computational molecular optimizations, emphasizing high validity, diversity, and, most recently, synthesizability. Despite this progress, many papers report results on trivial or self-designed tasks, bringing additional challenges to directly assessing the performance of new methods. Moreover, the sample efficiency of the optimization--the number of molecules evaluated by the oracle--is rarely discussed, despite being an essential consideration for realistic discovery applications. To fill this gap, we have created an open-source benchmark for practical molecular optimization, PMO, to facilitate the transparent and reproducible evaluation of algorithmic advances in molecular optimization. This paper thoroughly investigates the performance of 25 molecular design algorithms on 23 tasks with a particular focus on sample efficiency. Our results show that most "state-of-the-art" methods fail to outperform their predecessors under a limited oracle budget allowing 10K queries and that no existing algorithm can efficiently solve certain molecular optimization problems in this setting. We analyze the influence of the optimization algorithm choices, molecular assembly strategies, and oracle landscapes on the optimization performance to inform future algorithm development and benchmarking. PMO provides a standardized experimental setup to comprehensively evaluate and compare new molecule optimization methods with existing ones.

Speakers: Tianfan Fu

Twitter Prudencio

Twitter Therence

Twitter Cas

Twitter Valence Discovery

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(60)

Structure-Independent Peptide Binder Design via Generative Language Models | Pranam Chatterjee

Structure-Independent Peptide Binder Design via Generative Language Models | Pranam Chatterjee

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifie...

20 Jun 20231h

Learning Local Equivariant Representations for Large-Scale Atomistic Dynamics | Albert Musaelian

Learning Local Equivariant Representations for Large-Scale Atomistic Dynamics | Albert Musaelian

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplif...

13 Jun 20231h 9min

Multimodal Deep Learning for Protein Engineering | Kevin K. Yang

Multimodal Deep Learning for Protein Engineering | Kevin K. Yang

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifie...

7 Jun 20231h 2min

Systematic Analysis of Biomolecular Conformational Ensembles with PENSA | Martin Vögele

Systematic Analysis of Biomolecular Conformational Ensembles with PENSA | Martin Vögele

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies ...

30 Mai 202350min

Training Neural Network Potentials: Bayesian and Simulation-based Approaches | Stephan Thaler

Training Neural Network Potentials: Bayesian and Simulation-based Approaches | Stephan Thaler

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies mo...

16 Mai 20231h 3min

Accelerating Cryptic Pocket Discovery Using Alphafold and Markov State Modelling | Soumendranath Bhakat

Accelerating Cryptic Pocket Discovery Using Alphafold and Markov State Modelling | Soumendranath Bhakat

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies mole...

9 Mai 202331min

Machine Learning Molecules | Gianni De Fabritiis

Machine Learning Molecules | Gianni De Fabritiis

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies mole...

25 Apr 202357min

Protein Representation Learning by Geometric Structure Pretraining | Zuobai Zhang

Protein Representation Learning by Geometric Structure Pretraining | Zuobai Zhang

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies molecu...

19 Apr 202353min

Populært innen Vitenskap

fastlegen
tingenes-tilstand
sinnsyn
forskningno
liberal-halvtime
vett-og-vitenskap-med-gaute-einevoll
villmarksliv
rekommandert
rss-paradigmepodden
jss
fjellsportpodden
rss-rekommandert
rss-inn-til-kjernen-med-sunniva-rose
kvinnehelsepodden
tomprat-med-gunnar-tjomlid
grunnstoffene
aldring-og-helse-podden
hva-er-greia-med
diagnose
dekodet-2