Sample Efficiency Matters: A Benchmark for Practical Molecular Optimization | Tianfan Fu

Sample Efficiency Matters: A Benchmark for Practical Molecular Optimization | Tianfan Fu

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our YouTube channel to see the presented slides.

If you enjoyed this talk, consider joining the Molecular Modeling and Drug Discovery (M2D2) talks live: https://m2d2.io/talks/m2d2/about/

Also consider joining the M2D2 Slack: https://m2d2group.slack.com/join/shar...

Abstract: Molecular optimization is a fundamental goal in the chemical sciences and is of central interest to drug and material design. In recent years, significant progress has been made in solving challenging problems across various aspects of computational molecular optimizations, emphasizing high validity, diversity, and, most recently, synthesizability. Despite this progress, many papers report results on trivial or self-designed tasks, bringing additional challenges to directly assessing the performance of new methods. Moreover, the sample efficiency of the optimization--the number of molecules evaluated by the oracle--is rarely discussed, despite being an essential consideration for realistic discovery applications. To fill this gap, we have created an open-source benchmark for practical molecular optimization, PMO, to facilitate the transparent and reproducible evaluation of algorithmic advances in molecular optimization. This paper thoroughly investigates the performance of 25 molecular design algorithms on 23 tasks with a particular focus on sample efficiency. Our results show that most "state-of-the-art" methods fail to outperform their predecessors under a limited oracle budget allowing 10K queries and that no existing algorithm can efficiently solve certain molecular optimization problems in this setting. We analyze the influence of the optimization algorithm choices, molecular assembly strategies, and oracle landscapes on the optimization performance to inform future algorithm development and benchmarking. PMO provides a standardized experimental setup to comprehensively evaluate and compare new molecule optimization methods with existing ones.

Speakers: Tianfan Fu

Twitter Prudencio

Twitter Therence

Twitter Cas

Twitter Valence Discovery

Det här avsnittet är hämtat från ett öppet RSS-flöde och publiceras inte av Podme. Det kan innehålla reklam.

Avsnitt(60)

Structure-Independent Peptide Binder Design via Generative Language Models | Pranam Chatterjee

Structure-Independent Peptide Binder Design via Generative Language Models | Pranam Chatterjee

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifie...

20 Juni 20231h

Learning Local Equivariant Representations for Large-Scale Atomistic Dynamics | Albert Musaelian

Learning Local Equivariant Representations for Large-Scale Atomistic Dynamics | Albert Musaelian

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplif...

13 Juni 20231h 9min

Multimodal Deep Learning for Protein Engineering | Kevin K. Yang

Multimodal Deep Learning for Protein Engineering | Kevin K. Yang

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifie...

7 Juni 20231h 2min

Systematic Analysis of Biomolecular Conformational Ensembles with PENSA | Martin Vögele

Systematic Analysis of Biomolecular Conformational Ensembles with PENSA | Martin Vögele

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies ...

30 Maj 202350min

Training Neural Network Potentials: Bayesian and Simulation-based Approaches | Stephan Thaler

Training Neural Network Potentials: Bayesian and Simulation-based Approaches | Stephan Thaler

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies mo...

16 Maj 20231h 3min

Accelerating Cryptic Pocket Discovery Using Alphafold and Markov State Modelling | Soumendranath Bhakat

Accelerating Cryptic Pocket Discovery Using Alphafold and Markov State Modelling | Soumendranath Bhakat

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies mole...

9 Maj 202331min

Machine Learning Molecules | Gianni De Fabritiis

Machine Learning Molecules | Gianni De Fabritiis

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies mole...

25 Apr 202357min

Protein Representation Learning by Geometric Structure Pretraining | Zuobai Zhang

Protein Representation Learning by Geometric Structure Pretraining | Zuobai Zhang

[DISCLAIMER] - For the full visual experience, we recommend you tune in through our ⁠⁠⁠⁠YouTube channel ⁠⁠⁠⁠to see the presented slides. Try datamol.io - the open source toolkit that simplifies molecu...

19 Apr 202353min

Populärt inom Vetenskap

p3-dystopia
dumma-manniskor
allt-du-velat-veta
kapitalet-en-podd-om-ekonomi
rss-ufobortom-rimligt-tvivel
medicinvetarna
rss-vetenskapsradion
svd-nyhetsartiklar
bildningspodden
rss-vetenskapsradion-2
vetenskapsradion
paranormalt-med-caroline-giertz
halsorevolutionen
det-morka-psyket
dumforklarat
rss-spraket
sexet
rss-ronden
rss-geopodden-2
rss-broccolipodden-en-podcast-som-inte-handlar-om-broccoli