Danielle Belgrave on Generative AI in Pharma and Drugs – O’Reilly

Generative AI within the Actual World

Generative AI within the Actual World: Danielle Belgrave on Generative AI in Pharma and Drugs

00:00
/
31m 32s

Be a part of Danielle Belgrave and Ben Lorica for a dialogue of AI in healthcare. Danielle is VP of AI and machine studying at GSK (previously GlaxoSmithKline). She and Ben focus on utilizing AI and machine studying to get higher diagnoses that mirror the variations between sufferers. Pay attention in to study concerning the challenges of working with well being knowledge—a subject the place there’s each an excessive amount of knowledge and too little, and the place hallucinations have severe penalties. And when you’re enthusiastic about healthcare, you’ll additionally learn the way AI builders can get into the sphere.

Try different episodes of this podcast on the O’Reilly studying platform.

Concerning the Generative AI within the Actual World podcast: In 2023, ChatGPT put AI on everybody’s agenda. In 2025, the problem can be turning these agendas into actuality. In Generative AI within the Actual World, Ben Lorica interviews leaders who’re constructing with AI. Study from their expertise to assist put AI to work in your enterprise.

Factors of Curiosity

0:00: Introduction to Danielle Belgrave, VP of AI and machine studying at GSK. Danielle is our first visitor representing Large Pharma. It is going to be attention-grabbing to see how folks in pharma are utilizing AI applied sciences.
0:49: My curiosity in machine studying for healthcare started 15 years in the past. My PhD was on understanding affected person heterogeneity in asthma-related illness. This was earlier than digital healthcare data. By leveraging totally different sorts of information, genomics knowledge and biomarkers from kids, and seeing how they developed bronchial asthma and allergic illnesses, I developed causal modeling frameworks and graphical fashions to see if we may determine who would reply to what remedies. This was fairly novel on the time. We recognized 5 various kinds of bronchial asthma. If we will perceive heterogeneity in bronchial asthma, an even bigger problem is knowing heterogeneity in psychological well being. The thought was making an attempt to grasp heterogeneity over time in sufferers with anxiousness.
4:12: Once I went to DeepMind, I labored on the healthcare portfolio. I turned very interested in how you can perceive issues like MIMIC, which had digital healthcare data, and picture knowledge. The thought was to leverage instruments like energetic studying to attenuate the quantity of information you’re taking from sufferers. We additionally revealed work on bettering the range of datasets.
5:19: Once I got here to GSK, it was an thrilling alternative to do each tech and well being. Well being is without doubt one of the most difficult landscapes we will work on. Human biology could be very sophisticated. There may be a lot random variation. To know biology, genomics, illness development, and have an effect on how medication are given to sufferers is superb.
6:15: My position is main AI/ML for medical growth. How can we perceive heterogeneity in sufferers to optimize medical trial recruitment and ensure the correct sufferers have the correct therapy?
6:56: The place does AI create probably the most worth throughout GSK right now? That may be each conventional AI and generative AI.
7:23: I exploit all the things interchangeably, although there are distinctions. The true essential factor is specializing in the issue we are attempting to resolve, and specializing in the information. How can we generate knowledge that’s significant? How can we take into consideration deployment?
8:07: And all of the Q&A and purple teaming.
8:20: It’s exhausting to place my finger on what’s probably the most impactful use case. Once I consider the issues I care about, I take into consideration oncology, pulmonary illness, hepatitis—these are all very impactful issues, and so they’re issues that we actively work on. If I have been to focus on one factor, it’s the interaction between after we are entire genome sequencing knowledge and molecular knowledge and making an attempt to translate that into computational pathology. By these knowledge sorts and understanding heterogeneity at that stage, we get a deeper organic illustration of various subgroups and perceive mechanisms of motion for response to medication.
9:35: It’s not scalable doing that for people, so I’m excited about how we translate throughout differing types or modalities of information. Taking a biopsy—that’s the place we’re getting into the sphere of synthetic intelligence. How can we translate between genomics and a tissue pattern?
10:25: If we consider the affect of the medical pipeline, the second instance can be utilizing generative AI to find medication, goal identification. These are sometimes in silico experiments. Now we have perturbation fashions. Can we perturb the cells? Can we create embeddings that may give us representations of affected person response?
11:13: We’re producing knowledge at scale. We wish to determine targets extra shortly for experimentation by rating likelihood of success.
11:36: You’ve talked about multimodality so much. This consists of laptop imaginative and prescient, pictures. What different modalities?
11:53: Textual content knowledge, well being data, responses over time, blood biomarkers, RNA-Seq knowledge. The quantity of information that has been generated is sort of unbelievable. These are all totally different knowledge modalities with totally different buildings, other ways of correcting for noise, batch results, and understanding human techniques.
12:51: If you run into your former colleagues at DeepMind, what sorts of requests do you give them?
13:14: Overlook concerning the chatbots. A variety of the work that’s occurring round massive language fashions—considering of LLMs as productiveness instruments that may assist. However there has additionally been numerous exploration round constructing bigger frameworks the place we will do inference. The problem is round knowledge. Well being knowledge could be very sparse. That’s one of many challenges. How can we fine-tune fashions to particular options or particular illness areas or particular modalities of information? There’s been numerous work on basis fashions for computational pathology or foundations for single cell construction. If I had one want, it might be small knowledge and the way do you’ve gotten sturdy affected person representations when you’ve gotten small datasets? We’re producing massive quantities of information on small numbers of sufferers. It is a large methodological problem. That’s the North Star.
15:12: If you describe utilizing these basis fashions to generate artificial knowledge, what guardrails do you place in place to stop hallucination?
15:30: We’ve had a accountable AI group since 2019. It’s essential to consider these guardrails particularly in well being, the place the rewards are excessive however so are the stakes. One of many issues the group has carried out is AI rules, however we additionally use mannequin playing cards. Now we have policymakers understanding the results of the work; we even have engineering groups. There’s a group that appears exactly at understanding hallucinations with the language mannequin we’ve constructed internally, referred to as Jules.¹ There’s been numerous work metrics of hallucination and accuracy for these fashions. We additionally collaborate on issues like interpretability and constructing reusable pipelines for accountable AI. How can we determine the blind spots in our evaluation?
17:42: Final 12 months, lots of people began doing fine-tuning, RAG, and GraphRAG; I assume you do all of those?
18:05: RAG occurs so much within the accountable AI group. Now we have constructed a data graph. That was one of many earliest data graphs—earlier than I joined. It’s maintained by one other group for the time being. Now we have a platforms group that offers with all of the scaling and deploying throughout the corporate. Instruments like data graph aren’t simply AI/ML. Additionally Jules—it’s maintained outdoors AI/ML. It’s thrilling once you see these options scale.
20:02: The buzzy time period this 12 months is brokers and even multi-agents. What’s the state of agentic AI inside GSK?
20:18: We’ve been engaged on this for fairly some time, particularly inside the context of huge language fashions. It permits us to leverage numerous the information that we now have internally, like medical knowledge. Brokers are constructed round these datatypes and the totally different modalities of questions that we now have. We’ve constructed brokers for genetic knowledge or lab experimental knowledge. An orchestral agent in Jules can mix these totally different brokers in an effort to draw inferences. That panorama of brokers is admittedly essential and related. It offers us refined fashions on particular person questions and sorts of modalities.
21:28: You alluded to personalised drugs. We’ve been speaking about that for a very long time. Are you able to give us an replace? How will AI speed up that?
21:54: It is a subject I’m actually optimistic about. Now we have had numerous affect; typically when you’ve gotten your nostril to the glass, you don’t see it. However we’ve come a good distance. First, via knowledge: Now we have exponentially extra knowledge than we had 15 years in the past. Second, compute energy: Once I began my PhD, the truth that I had a GPU was superb. The size of computation has accelerated. And there was numerous affect from science as nicely. There was a Nobel Prize for protein folding. Understanding of human biology is one thing we’ve pushed the needle on. A variety of the Nobel Prizes have been about understanding organic mechanisms, understanding fundamental science. We’re presently on constructing blocks in the direction of that. It took years to get from understanding the ribosome to understanding the mechanism for HIV.
23:55: In AI for healthcare, we’ve seen extra rapid impacts. Simply the actual fact of understanding one thing heterogeneous: If we each get a prognosis of bronchial asthma, that may have totally different manifestations, totally different triggers. That understanding of heterogeneity in issues like psychological well being: We’re totally different; issues must be handled in a different way. We even have the ecosystem, the place we will have an effect. We are able to affect medical trials. We’re within the pipeline for medication.
25:39: One of many items of labor we’ve revealed has been round understanding variations in response to the drug for hepatitis B.
26:01: You’re within the UK, you’ve gotten the NHS. Within the US, we nonetheless have the information silo downside: You go to your main care, after which a specialist, and so they have to speak utilizing data and fax. How can I be optimistic when techniques don’t even speak to one another?
26:36: That’s an space the place AI may also help. It’s not an issue I work on, however how can we optimize workflow? It’s a techniques downside.
26:59: All of us affiliate knowledge privateness with healthcare. When folks speak about knowledge privateness, they get sci-fi, with homomorphic encryption and federated studying. What’s actuality? What’s in your every day toolbox?
27:34: These instruments will not be essentially in my every day toolbox. Pharma is closely regulated; there’s numerous transparency across the knowledge we gather, the fashions we constructed. There are platforms and techniques and methods of ingesting knowledge. If in case you have a collaboration, you usually work with a trusted analysis surroundings. Information doesn’t essentially go away. We do evaluation of information of their trusted analysis surroundings, we be sure that all the things is privateness preserving and we’re respecting the guardrails.
29:11: Our listeners are primarily software program builders. They might marvel how they enter this subject with none background in science. Can they only use LLMs to hurry up studying? In the event you have been making an attempt to promote an ML developer on becoming a member of your group, what sort of background do they want?
29:51: You want a ardour for the issues that you just’re fixing. That’s one of many issues I like about GSK. We don’t know all the things about biology, however we now have excellent collaborators.
30:20: Do our listeners have to take biochemistry? Natural chemistry?
30:24: No, you simply want to speak to scientists. Get to know the scientists, hear their issues. We don’t work in silos as AI researchers. We work with the scientists. A variety of our collaborators are docs, and have joined GSK as a result of they wish to have an even bigger affect.

Footnotes

To not be confused with Google’s latest agentic coding announcement.

Danielle Belgrave on Generative AI in Pharma and Drugs – O’Reilly

Factors of Curiosity

Footnotes

Underdog buys Aristotle change licenses to launch US sports activities prediction market

Is the Pentagon allowed to surveil People with AI?

Taiwan Excellence showcases AI breakthroughs at Embedded World 2026

A Companion Robotic That Roams Your Desk Till You Pet It

The info behind the win: How Catapult and AWS IoT are reworking professional sports activities

IoT Now Contract Win Listing – February 2026

Vector Publicizes Warfighter-Centered Hammer F1 Drone

British startup Skycutter stuns US military-industrial advanced in Pentagon’s ‘Drone Dominance’ trials – sUAS Information

Zero-ETL integrations with Amazon OpenSearch Service

Constructing a close to real-time software with Zerobus Ingest and Lakebase

Immediate injection is the brand new SQL injection, and guardrails aren’t sufficient

Vector Publicizes Warfighter-Centered Hammer F1 Drone