Molecular Machine Learning Foundation

5

(18 reviews)

COHORT-BASED COURSE

Current cohort in progress.
Join the waitlist for future dates!
Molecular Machine Learning Foundation
Self-paced
NEXT COHORT

Dec 6, 2025-Dec 27, 2026

DURATION
Self-paced (see below)
LEVEL

Beginner

HOSTED BY
Pankaj Mishra, PhD

Pankaj Mishra, PhD

Industrial Molecular AI Builder, Co-founder and CTO at Future Therapeutics, Co-founder of Neovarsity

Most Complete

Unlock Lifetime Access with a One-Time Payment

€299.00
Billed once

Lifetime access to lectures, recordings, and community from day one

Perfect for professionals and employer-supported learners seeking instant, complete access

Don't Miss Out!
Be the first to know when new cohorts open

About the Course

SELF-PACED | ONLINE | BEGINNER

🚨 NOTE 🚨: This course is offered only in self-paced mode. To access the course, enroll using the Enroll Now button and you will immediately get access to all course materials in your dashboard. There are no live cohort sessions. Please ignore any live start dates shown on the website. Support is available via Slack and email.

This course is your structured entry point into machine learning for small-molecule drug discovery. It teaches the fundamentals that most people skip, which is exactly why their models fail later.

You will learn the full workflow behind molecular ML, including the software stack, molecular data collection, preprocessing, feature engineering, and basic model building with proper quality control. The goal is not to memorize algorithms, the goal is to become competent at building reliable pipelines on chemical datasets.

This course is designed to prepare you for advanced predictive modeling and later generative AI work. If you want to build molecule generation systems in the future, this is where you start, because generative AI is useless without clean data, correct representations, and correct evaluation.

WHO THIS IS FOR

This course is for you if you are serious about drug discovery ML but you are not yet confident with the basics of molecular data workflows and model building.

Good fit if you are:

  • Student, researcher, or professional entering molecular ML
  • Chemist moving into data-driven discovery
  • Computational chemist who wants ML foundations tailored to molecules
  • Early-stage ML engineer entering biotech or cheminformatics

This course is also the right starting point if your end goal is generative AI for molecules, because it builds the foundations that generative modeling depends on: chemical data preparation, molecular representations, and model quality control.

Not for you if you want:

Deep learning architectures, advanced bias handling, or explainability tooling, that is covered in the Advanced MLDD course.

WHAT YOU WILL BE ABLE TO DO

By the end of the program, you will be able to build a correct beginner-to-intermediate molecular ML workflow instead of copying random notebooks that break in real use.

You will be able to:

  • Set up the molecular machine learning software stack
  • Collect and preprocess drug discovery datasets
  • Engineer molecular features for ML, with practical feature choices for small molecules
  • Train basic machine learning models for drug discovery tasks
  • Apply quality control so you know when your model is real versus overfitting

Flexible learning options

  • Attend live (virtual) lectures
  • Access recorded lectures in your private dashboard

Practical application

  • Apply your skills through hands-on projects
  • Engage in real-world case studies

Personalized learning experiences

  • Tailored support and guidance
  • 24x7 support by our dedicated support team

Specialized community access

  • To our Members-only Slack community
  • To our invite-only deep tech global Slack community

Syllabus Overview

  • Reading

    Working with Python Virtual Environments
  • Reading

    Molecular ML Software Installation (Windows)
  • Reading

    Molecular ML Software Installation (macOS)
  • Reading

    Molecular ML Software Installation (Linux)
  • Video

    Introduction to Machine Learning in Drug Discovery
  • Video

    Molecular ML Software Stack
  • Video

    Dataset and Literature
  • Video

    Computational Molecular Representations
  • Video

    Molecular Data Files Handling & Conversions
  • Video

    Molecular Standardization
  • Video

    Molecular Descriptors
  • Video

    Molecular Diversity Set Selection
  • Video

    Molecular Similarity Analysis
Meet your Instructors
Pankaj Mishra, PhD
Pankaj Mishra, PhD
Instructor
Industrial Molecular AI Builder, Co-founder and CTO at Future Therapeutics, Co-founder of Neovarsity
I’m the Co-founder and CTO of Future Therapeutics, an AI-native biotech based in Berlin, where we build proprietary AI systems for drug discovery. I hold a PhD from the University of Freiburg, specializing in small molecule AI, and I’m trained in building models for low-data simulation and modeling, the reality most discovery teams operate in. I’ve been doing this long before it became mainstream, back in 2018 I was already building deep learning systems to explore “ultra-large chemical space”. Over the past few years, my focus has been generative molecular design. I’m also a Co-founder of Neovarsity, and since 2021 I’ve taught scientists and engineers across biopharma how to apply AI in real R&D workflows, including teams from J&J, Bayer, Takeda, Novartis, and others.

Earn Certificate of Achievement

Get certified and add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review

certificate

What Our Learners Say

My Neovarsity coursework gave me the expertise needed
I have been actively applying my newly acquired skills to the medicinal chemistry projects I am involved in [at Rutgers]. This involved creating targeted libraries and generating various substituents for my lead compounds. Fu
Anastasiia Tsymbal
Anastasiia Tsymbal
Research Associate
Rutgers University, New Jersey, USA
The instructor has done an excellent job!
The instructor has done an excellent job! The course quickly brought me up to speed not only on how to manipulate, prepare, and analyze chemical data but also on how to use them in building machine-learning models for drug di
Fabio Esposito
Fabio Esposito
Research Fellow
Sassari University

Enroll Now

Molecular Machine Learning Foundation
For customised payments options, contact us

06 Dec, 2025

One-Time Payment
€299.00

All taxes included

  • One year complete access
  • Shareable certificate on completion
  • Career guidance from instructors
Filling Fast!
Frequently Asked Questions

This course is 100% self-paced. There are no live sessions and no live cohort.


No. Ignore any live start date shown on the website. You can start immediately after enrolling.


Click the Enroll Now button. After enrollment, you will get immediate access to all course materials in your dashboard.


Yes. Support is available via Slack and email.


You do not need professional ML experience, but basic familiarity helps. Strong comfort with chemical data is more important. The course itself recommends taking cheminformatics first.


Strong understanding of chemical data is essential, and Neovarsity recommends completing the Cheminformatics course first.


No. This course builds the foundation for molecular ML. It prepares you for advanced predictive modeling and later generative AI, but it does not teach molecular generation itself.


If you want to build real predictive models used in industry workflows, take Advanced Machine Learning for Drug Discovery next.


Start an online chat with Catherine or email [email protected]. We are here to help.