🔬 Training Transformers to solve 95% failure rate of Cancer Trials — Ron Alfa & Daniel Bear, Noetik

Latent Space: The AI Engineer Podcast1h 25mApril 20, 2026

Get the full intelligence

Search transcripts, export clips, track mentions, and explore all topics from “🔬 Training Transformers to solve 95% failure rate of Cancer Trials — Ron Alfa & Daniel Bear, Noetik” inside PodZeus.

AI-Generated Summary

Ron Alfa and Dan Baer of Noetic join the Latent Space podcast to discuss their mission of using AI to solve the 95% failure rate of cancer drug trials by fundamentally rethinking patient selection. They argue that most drug failures stem not from poor pharmacology, but from flawed patient cohort selection—driven by outdated models based on immortalized cell lines and animal studies that poorly reflect human biology. Noetic’s solution is to build a human-centric, multimodal foundation model trained on massive, high-quality data generated in-house, including spatial transcriptomics, H&E pathology, protein stains, and DNA data. Their approach uses self-supervised learning on over 100 million spatially resolved cells to discover biologically meaningful patient subtypes invisible to traditional biomarkers. The company has developed novel transformer architectures like Tario and OctoVirtualCell to simulate drug responses in silico, enabling virtual clinical trials and repurposing of existing drugs. A landmark $50M deal with GSK validates their model licensing strategy, marking a shift in pharma toward broad access to foundational AI models. The episode underscores the necessity of large-scale, intentional data generation—comparable to PDB or ImageNet—as the bedrock for transformative progress in AI for biology.

Key Takeaways
1

95% of cancer drugs fail in the clinic due to poor patient selection, not flawed drug design.

2

Noetic generates its own multimodal data (H&E, spatial transcriptomics, protein stains) at scale to train human-centric AI models.

3

Their foundation models discover biologically meaningful patient subtypes invisible to traditional biomarkers.

4

Virtual cell simulations allow in silico testing of drug responses without wet lab experiments.

5

A $50M deal with GSK marks the first major foundational model licensing deal in biopharma.

…and 1 more takeaway available in PodZeus

Chapters
0:00
10 min

The 95% Failure Rate of Cancer Trials

95% of cancer drugs fail in the clinic. Why do they fail? Not because we're bad at pharmacology... we're bad at selecting which patients those drugs are in.

Highlight
10:00
10 min

Building a Human-Centric Data Foundation

Noetic's core strategy is generating its own high-quality, multimodal data in-house. This includes sourcing human tumor samples, building custom processing pipelines, and generating paired data across H&E, spatial transcriptomics, and protein stains at scale.

20:00
10 min

The Power of Multimodal Data and Self-Supervised Learning

We want the model to learn... how many different therapeutically relevant subtypes of lung cancers are just from self-supervised learning from the data.

Highlight
30:00
10 min

Virtual Cell Simulations and In Silico Trials

You can simulate this sort of counterfactual perturbation idea without even having to collect the data to do that.

Highlight
40:00
10 min

Validating Models with In Vivo Perturbations

Noetic uses a mouse platform called PerturbMap to validate their models. By barcoding hundreds of genetically perturbed tumors in mice, they can test predictions about immune response and drug efficacy in a living system.

High-Impact Quotes
95% of cancer drugs fail in the clinic. Why do they fail? Not because we're bad at pharmacology... we're bad at selecting which patients those drugs are in.
Ron Alfa1:55
Viral: 90.0
It was the first announced foundational licensing deal in the space. The substrate of the deal is not a molecule. It's a model.
Ron Alfa104:00
Viral: 88.0
We want the model to learn... how many different therapeutically relevant subtypes of lung cancers are just from self-supervised learning from the data.
Ron Alfa9:40
Viral: 85.0
Speakers

Hosts

RJ HanekeBrandon Anderson

Guests

Ron AlfaDan Baer
Topics Discussed
Cancer Drug Trial Failure95%Foundation Models for Medicine92%Patient Selection in Oncology90%AI Model Licensing in Pharma89%Multimodal Data Generation88%Virtual Cell Simulations87%Self-Supervised Learning in Biology85%In Vivo Validation of AI Models80%
People & Brands

Noetic

organization

25xNeutral

Ron Alfa

person

12xPositive

H&E

other

12xPositive

Dan Baer

person

11xPositive

Spatial Transcriptomics

other

10xPositive

GSK

organization

8xPositive

PDB

other

6xPositive

OctoVirtualCell

other

6xPositive

Recursion

organization

5xPositive

Tario

other

4xPositive

Get the full intelligence

Search transcripts, export clips, track mentions, and explore all topics from “🔬 Training Transformers to solve 95% failure rate of Cancer Trials — Ron Alfa & Daniel Bear, Noetik” inside PodZeus.

Start discovering podcast insights today

Start with a 7-day trial and explore a growing catalog of popular podcasts. No credit card required.

No credit card required • 7-day trial • Cancel anytime