back to index

Regina Barzilay: Deep Learning for Cancer Diagnosis and Treatment | Lex Fridman Podcast #40


small model | large model

link |
00:00:00.000
The following is a conversation with Regina Barsley.
link |
00:00:03.240
She's a professor at MIT and a world class researcher
link |
00:00:06.680
in natural language processing and applications
link |
00:00:09.120
of deep learning to chemistry and oncology,
link |
00:00:12.440
or the use of deep learning for early diagnosis, prevention,
link |
00:00:16.240
and treatment of cancer.
link |
00:00:18.320
She has also been recognized for a teaching
link |
00:00:21.000
of several successful AI related courses at MIT,
link |
00:00:24.680
including the popular introduction to machine
link |
00:00:27.360
learning course.
link |
00:00:28.920
This is the Artificial Intelligence Podcast.
link |
00:00:32.160
If you enjoy it, subscribe on YouTube,
link |
00:00:34.560
give it 5,000 iTunes, support it on Patreon,
link |
00:00:37.840
or simply connect with me on Twitter
link |
00:00:39.840
at Lex Freedman, spelled F R I D M A N.
link |
00:00:43.760
And now here's my conversation with Regina Barsley.
link |
00:00:48.800
In an interview you've mentioned
link |
00:00:50.320
that if there's one course you would take,
link |
00:00:51.960
it would be a literature course with a friend of yours
link |
00:00:54.600
that a friend of yours teaches just out of curiosity
link |
00:00:57.920
because I couldn't find anything on it.
link |
00:01:00.320
Are there books or ideas that had profound impact
link |
00:01:04.480
on your life journey, books and ideas perhaps outside
link |
00:01:07.760
of computer science and the technical fields?
link |
00:01:11.840
I think because I'm spending a lot of my time at MIT
link |
00:01:14.800
and previously in other institutions where I was a student,
link |
00:01:18.360
I have a limited ability to interact with people.
link |
00:01:21.080
So a lot of what I know about the world
link |
00:01:22.720
actually comes from books.
link |
00:01:25.320
And there were quite a number of books
link |
00:01:27.280
that had profound impact on me and how I view the world.
link |
00:01:31.400
Let me just give you one example of such a book.
link |
00:01:35.840
I've maybe a year ago read a book
link |
00:01:39.680
called The Emperor of All Melodies.
link |
00:01:42.520
It's a book about, it's kind of a history of science book
link |
00:01:45.760
on how the treatments and drugs for cancer were developed.
link |
00:01:50.760
And that book, despite the fact that I am
link |
00:01:54.040
in the business of science, really opened my eyes
link |
00:01:57.040
on how imprecise and imperfect the discovery process is
link |
00:02:03.080
and how imperfect our current solutions.
link |
00:02:07.000
And what makes science succeed and be implemented
link |
00:02:11.080
and sometimes it's actually not the strengths of the idea
link |
00:02:14.120
but devotion of the person who wants to see it implemented.
link |
00:02:17.480
So this is one of the books that, you know,
link |
00:02:19.800
at least for the last year quite changed the way
link |
00:02:22.320
I'm thinking about scientific process
link |
00:02:24.920
just from the historical perspective.
link |
00:02:26.720
And what do I need to do to make my ideas really implemented?
link |
00:02:33.440
Let me give you an example of a book
link |
00:02:36.040
which is not kind of, which is a fiction book.
link |
00:02:40.600
It's a book called Americana.
link |
00:02:44.440
And this is a book about a young female student
link |
00:02:48.760
who comes from Africa to study in the United States.
link |
00:02:53.240
And it describes her past, you know, within her studies
link |
00:02:57.720
and her life transformation that, you know,
link |
00:03:02.000
in a new country and kind of adaptation to a new culture.
link |
00:03:06.560
And when I read this book,
link |
00:03:09.520
I saw myself in many different points of it.
link |
00:03:14.600
But it also kind of gave me the lens on different events
link |
00:03:20.160
and some events that I never actually paid attention
link |
00:03:22.040
to one of the funny stories in this book
link |
00:03:24.720
is how she arrives to her new college
link |
00:03:30.400
and she starts speaking in English
link |
00:03:32.880
and she had this beautiful British accent
link |
00:03:35.680
because that's how she was educated in her country.
link |
00:03:39.840
This is not my case.
link |
00:03:40.960
And then she notices that the person who talks to her,
link |
00:03:45.400
you know, talks to her in a very funny way,
link |
00:03:47.160
in a very slow way.
link |
00:03:48.280
And she's thinking that this woman is disabled
link |
00:03:50.800
and she's also trying to kind of to accommodate her.
link |
00:03:54.480
And then after a while,
link |
00:03:55.400
when she finishes her discussion with this officer
link |
00:03:57.560
from her college, she sees how she interacts
link |
00:04:01.040
with the other students, with American students
link |
00:04:03.000
and she discovers that actually she talked to her this way
link |
00:04:09.720
because she saw that she doesn't understand English.
link |
00:04:12.200
And he thought, wow, this is a funny experience.
link |
00:04:15.080
And literally within few weeks,
link |
00:04:18.000
I went to LA to a conference
link |
00:04:21.880
and they asked somebody in an airport, you know,
link |
00:04:24.320
how to find like a cab or something.
link |
00:04:26.560
And then I noticed that this person is talking
link |
00:04:29.360
in a very strange way.
link |
00:04:30.280
And my first thought was that this person
link |
00:04:32.080
have some, you know, pronunciation issues or something.
link |
00:04:35.480
And I'm trying to talk very slowly to him
link |
00:04:37.160
and I was with another professor, Ernst Frankel.
link |
00:04:39.640
And he's like laughing because it's funny
link |
00:04:43.120
that I don't get that the guy is talking in this way
link |
00:04:45.720
because he thinks that I cannot speak.
link |
00:04:47.000
So it was really kind of mirroring experience
link |
00:04:50.120
and it led me think a lot about my own experiences
link |
00:04:54.320
moving, you know, from different countries.
link |
00:04:57.000
So I think that books play a big role
link |
00:05:00.280
in my understanding of the world.
link |
00:05:02.720
On the science question, you mentioned that
link |
00:05:06.440
it made you discover that personalities
link |
00:05:08.760
of human beings are more important than perhaps ideas.
link |
00:05:12.440
Is that what I heard?
link |
00:05:13.680
It's not necessarily that they are more important
link |
00:05:15.760
than ideas, but I think that ideas on their own
link |
00:05:19.200
are not sufficient.
link |
00:05:20.520
And many times at least at the local horizon
link |
00:05:24.720
is the personalities and their devotion to their ideas
link |
00:05:29.200
is really that locally changes the landscape.
link |
00:05:33.000
Now, if you're looking at AI, like let's say 30 years ago,
link |
00:05:37.520
you know, dark ages of AI or whatever,
link |
00:05:39.240
what the symbolic times you can use any word,
link |
00:05:42.480
you know, there were some people,
link |
00:05:44.720
now we are looking at a lot of that work
link |
00:05:46.680
and we are kind of thinking this was not really
link |
00:05:48.840
maybe a relevant work.
link |
00:05:50.760
But you can see that some people managed to take it
link |
00:05:53.160
and to make it so shiny and dominate the, you know,
link |
00:05:58.000
the academic world and make it to be the standard.
link |
00:06:02.440
If you look at the area of natural language processing,
link |
00:06:06.520
it is well known fact that the reason the statistics
link |
00:06:09.240
in NLP took such a long time to become mainstream
link |
00:06:14.080
because there were quite a number of personalities
link |
00:06:16.880
which didn't believe in this idea
link |
00:06:18.480
and then stop research progress in this area.
link |
00:06:22.080
So I do not think that, you know, kind of asymptotically
link |
00:06:27.360
maybe personalities matters,
link |
00:06:28.960
but I think locally it does make quite a bit of impact
link |
00:06:33.960
and it's generally, you know, speed up,
link |
00:06:38.080
speeds up the rate of adoption of the new ideas.
link |
00:06:41.440
Yeah, and the other interesting question is in the early days
link |
00:06:45.480
of particular discipline, I think you mentioned in that book
link |
00:06:50.000
was, is ultimately a book of cancer?
link |
00:06:52.400
It's called The Emperor of All Melodies.
link |
00:06:55.160
Yeah, and those melodies included the trying to,
link |
00:06:58.640
the medicine, was it centered around?
link |
00:07:00.680
So it was actually centered on, you know,
link |
00:07:04.960
how people thought of curing cancer.
link |
00:07:07.240
Like for me, it was really a discovery how people,
link |
00:07:10.720
what was the science of chemistry behind drug development
link |
00:07:14.200
that it actually grew up out of dyeing,
link |
00:07:17.280
like coloring industry that people who develop chemistry
link |
00:07:21.960
in 19th century in Germany and Britain to do,
link |
00:07:25.840
you know, the really new dyes,
link |
00:07:28.160
they looked at the molecules and identified
link |
00:07:30.200
that they do certain things to cells.
link |
00:07:32.160
And from there, the process started and, you know,
link |
00:07:34.960
like historians think, yeah, this is fascinating
link |
00:07:36.920
that they managed to make the connection
link |
00:07:38.720
and look under the microscope and do all this discovery.
link |
00:07:42.280
But as you continue reading about it
link |
00:07:44.440
and you read about how chemotherapy drugs
link |
00:07:48.760
were actually developed in Boston
link |
00:07:50.520
and some of them were developed
link |
00:07:52.480
and Dr. Farber from Dana Farber,
link |
00:07:57.480
you know, how the experiments were done,
link |
00:08:00.480
that, you know, there was some miscalculation.
link |
00:08:03.360
Let's put it this way.
link |
00:08:04.520
And they tried it on the patients
link |
00:08:05.960
and they just, and those were children with leukemia
link |
00:08:08.480
and they died and they tried another modification.
link |
00:08:11.640
You look at the process, how imperfect is this process?
link |
00:08:15.040
And, you know, like, if we're again looking back
link |
00:08:17.520
like 60 years ago, 70 years ago,
link |
00:08:19.200
you can kind of understand it.
link |
00:08:20.800
But some of the stories in this book,
link |
00:08:23.040
which were really shocking to me,
link |
00:08:24.640
were really happening, you know, maybe decades ago.
link |
00:08:28.040
And we still don't have a vehicle
link |
00:08:30.680
to do it much more fast and effective and, you know,
link |
00:08:35.120
scientific the way I'm thinking,
link |
00:08:36.280
computer science scientific.
link |
00:08:38.240
So from the perspective of computer science,
link |
00:08:40.480
you've got a chance to work the application to cancer
link |
00:08:43.800
and to medicine in general.
link |
00:08:44.920
From a perspective of an engineer and a computer scientist,
link |
00:08:48.480
how far along are we from understanding the human body,
link |
00:08:51.840
biology, of being able to manipulate it in a way
link |
00:08:55.600
we can cure some of the maladies, some of the diseases?
link |
00:08:59.760
So this is a very interesting question.
link |
00:09:03.520
And if you're thinking as a computer scientist
link |
00:09:06.080
about this problem, I think one of the reasons
link |
00:09:09.880
that we succeeded in the areas
link |
00:09:11.920
we as a computer scientist succeeded
link |
00:09:14.040
is because we don't have,
link |
00:09:16.320
we are not trying to understand in some ways.
link |
00:09:19.040
Like if you're thinking about like eCommerce, Amazon,
link |
00:09:22.280
Amazon doesn't really understand you.
link |
00:09:24.280
And that's why it recommends you certain books
link |
00:09:27.760
or certain products, correct?
link |
00:09:30.720
And in, you know, traditionally,
link |
00:09:34.280
when people were thinking about marketing, you know,
link |
00:09:36.440
they divided the population
link |
00:09:37.840
to different kind of subgroups,
link |
00:09:39.840
identify the features of the subgroup
link |
00:09:41.800
and come up with a strategy
link |
00:09:43.160
which is specific to that subgroup.
link |
00:09:45.640
If you're looking about recommendations,
link |
00:09:47.120
they're not claiming that they're understanding somebody,
link |
00:09:50.640
they're just managing from the patterns of your behavior
link |
00:09:54.800
to recommend you a product.
link |
00:09:57.600
Now, if you look at the traditional biology,
link |
00:09:59.600
obviously I wouldn't say that I am at any way,
link |
00:10:04.920
you know, educated in this field.
link |
00:10:06.200
But, you know, what I see,
link |
00:10:07.120
there is really a lot of emphasis
link |
00:10:09.320
on mechanistic understanding.
link |
00:10:10.720
And it was very surprising to me coming from computer science
link |
00:10:13.840
how much emphasis is on this understanding.
link |
00:10:17.640
And given the complexity of the system,
link |
00:10:20.760
maybe the deterministic full understanding
link |
00:10:23.240
of this process is, you know, beyond our capacity.
link |
00:10:27.400
And the same way as in computer science,
link |
00:10:29.480
when we're doing recognition,
link |
00:10:30.520
when we're doing recommendation in many other areas,
link |
00:10:32.760
it's just probabilistic matching process.
link |
00:10:35.960
And in some way, maybe in certain cases,
link |
00:10:40.080
we shouldn't even attempt to understand
link |
00:10:42.960
or we can attempt to understand, but in parallel,
link |
00:10:45.760
we can actually do this kind of matching
link |
00:10:48.040
that would help us to find Qo to do early diagnostics
link |
00:10:52.600
and so on.
link |
00:10:54.120
And I know that in these communities,
link |
00:10:55.840
it's really important to understand,
link |
00:10:59.040
but I am sometimes wondering,
link |
00:11:00.680
what exactly does it mean to understand here?
link |
00:11:02.920
Well, there's stuff that works,
link |
00:11:04.440
and, but that can be, like you said,
link |
00:11:07.600
separate from this deep human desire
link |
00:11:10.320
to uncover the mysteries of the universe,
link |
00:11:12.720
of science, of the way the body works,
link |
00:11:16.160
the way the mind works.
link |
00:11:17.600
It's the dream of symbolic AI,
link |
00:11:19.560
of being able to reduce human knowledge into logic
link |
00:11:25.200
and be able to play with that logic
link |
00:11:26.880
in a way that's very explainable
link |
00:11:28.680
and understandable for us humans.
link |
00:11:30.280
I mean, that's a beautiful dream.
link |
00:11:31.760
So I understand it, but it seems that
link |
00:11:34.840
what seems to work today, and we'll talk about it more,
link |
00:11:37.880
is as much as possible, reduce stuff into data,
link |
00:11:40.760
reduce whatever problem you're interested in to data
link |
00:11:43.880
and try to apply statistical methods,
link |
00:11:47.040
apply machine learning to that.
link |
00:11:49.080
On a personal note,
link |
00:11:51.120
you were diagnosed with breast cancer in 2014.
link |
00:11:55.400
Would it facing your mortality make you think about
link |
00:11:58.400
how did it change you?
link |
00:12:00.200
You know, this is a great question.
link |
00:12:01.800
And I think that I was interviewed many times
link |
00:12:03.800
and nobody actually asked me this question.
link |
00:12:05.680
I think I was 43 at a time.
link |
00:12:09.640
And the first time I realized in my life that I may die.
link |
00:12:12.800
And I never thought about it before.
link |
00:12:14.400
And yeah, and there was a long time
link |
00:12:16.200
since you diagnosed until you actually know
link |
00:12:17.920
what you have and how severe is your disease.
link |
00:12:20.120
For me, it was like maybe two and a half months.
link |
00:12:23.480
And I didn't know where I am during this time
link |
00:12:28.280
because I was getting different tests.
link |
00:12:30.640
And one would say, it's bad.
link |
00:12:32.200
And I would say, no, it is not.
link |
00:12:33.360
So until I knew where I am,
link |
00:12:34.840
I really was thinking about
link |
00:12:36.280
all these different possible outcomes.
link |
00:12:38.200
Were you imagining the worst?
link |
00:12:39.680
Or were you trying to be optimistic?
link |
00:12:41.920
It would be really, I don't remember, you know,
link |
00:12:45.600
what was my thinking?
link |
00:12:47.360
It was really a mixture with many components at the time
link |
00:12:50.880
at speaking, you know, in our terms.
link |
00:12:54.080
And one thing that I remember,
link |
00:12:59.320
and you know, every test comes and then you think,
link |
00:13:01.480
oh, it could be this or it may not be this.
link |
00:13:03.280
And you're hopeful and then you're desperate.
link |
00:13:04.680
So it's like, there is a whole, you know,
link |
00:13:06.600
slew of emotions that goes through you.
link |
00:13:09.800
But what I remember is that when I came back to MIT,
link |
00:13:15.120
I was kind of going the whole time
link |
00:13:17.160
through the treatment to MIT,
link |
00:13:18.280
but my brain was not really there.
link |
00:13:19.760
But when I came back really, finished my treatment
link |
00:13:21.800
and I was here teaching and everything.
link |
00:13:24.560
You know, I look back at what my group was doing,
link |
00:13:27.080
what other groups was doing,
link |
00:13:28.840
and I saw these three realities.
link |
00:13:30.840
It's like people are building their careers
link |
00:13:33.240
on improving some parts around two or three percent
link |
00:13:35.520
or whatever.
link |
00:13:36.880
I was, it's like, seriously,
link |
00:13:38.400
I did a work on how to decipher eukaryotic,
link |
00:13:40.760
like a language that nobody speak and whatever,
link |
00:13:42.880
like what is significance?
link |
00:13:46.160
When I was sad, you know, I walked out of MIT,
link |
00:13:49.000
which is, you know, when people really do care,
link |
00:13:51.600
you know, what happened to your iClear paper,
link |
00:13:54.280
you know, what is your next publication,
link |
00:13:56.680
to ACL, to the world where people, you know,
link |
00:13:59.960
people, you see a lot of suffering
link |
00:14:01.880
that I'm kind of totally shielded on it on a daily basis.
link |
00:14:04.920
And it's like the first time I've seen like real life
link |
00:14:07.520
and real suffering.
link |
00:14:09.760
And I was thinking, why are we trying to improve the parser
link |
00:14:13.280
or deal with some trivialities
link |
00:14:16.120
when we have capacity to really make a change?
link |
00:14:20.760
And it was really challenging to me
link |
00:14:23.520
because on one hand, you know,
link |
00:14:24.640
I have my graduate students who really want to do
link |
00:14:26.720
their papers and their work
link |
00:14:28.760
and they want to continue to do what they were doing,
link |
00:14:30.880
which was great.
link |
00:14:31.960
And then it was me who really kind of reevaluated
link |
00:14:36.360
what is importance and also at that point
link |
00:14:38.600
because I had to take some break.
link |
00:14:42.560
I look back into like my years in science
link |
00:14:47.760
and I was thinking, you know,
link |
00:14:49.080
like 10 years ago, this was the biggest thing.
link |
00:14:51.720
I don't know, topic models.
link |
00:14:53.000
We have like millions of papers on topic models
link |
00:14:55.400
and variation of topics models now.
link |
00:14:56.720
It's totally like irrelevant.
link |
00:14:58.640
And you start looking at this, you know,
link |
00:15:01.520
what do you perceive as important
link |
00:15:03.280
at different point of time
link |
00:15:04.560
and how, you know, it fades over time.
link |
00:15:08.960
And since we have a limited time,
link |
00:15:13.040
all of us have limited time on us.
link |
00:15:14.960
It's really important to prioritize things
link |
00:15:18.440
that really matter to you,
link |
00:15:19.760
maybe matter to you at that particular point,
link |
00:15:22.040
but it's important to take some time
link |
00:15:24.400
and understand what matters to you,
link |
00:15:26.960
which may not necessarily be the same
link |
00:15:28.880
as what matters to the rest of your scientific community
link |
00:15:31.720
and pursue that vision.
link |
00:15:34.600
And so though that moment, did it make you cognizant?
link |
00:15:38.480
You mentioned suffering of just the general amount
link |
00:15:42.520
of suffering in the world.
link |
00:15:44.360
Is that what you're referring to?
link |
00:15:45.640
So as opposed to topic models and specific detail problems
link |
00:15:49.480
in NLP, did you start to think about other people
link |
00:15:54.480
who have been diagnosed with cancer?
link |
00:15:57.040
Is that the way you saw the,
link |
00:15:58.360
started to see the world perhaps?
link |
00:16:00.040
Oh, absolutely.
link |
00:16:00.880
And it actually creates because like, for instance,
link |
00:16:04.480
you know, there's parts of the treatment
link |
00:16:06.080
where you need to go to the hospital every day
link |
00:16:08.520
and you see, you know, the community of people that you see
link |
00:16:11.640
and many of them are much worse than I was at a time
link |
00:16:16.080
and you're all of a sudden see it all.
link |
00:16:20.480
And people who are happier someday
link |
00:16:23.920
just because they feel better.
link |
00:16:25.320
And for people who are in our normal realm,
link |
00:16:28.480
you take it totally for granted that you feel well,
link |
00:16:30.800
that if you decide to go running, you can go running
link |
00:16:32.920
and you can, you know, you're pretty much free to do
link |
00:16:36.120
whatever you want with your body.
link |
00:16:37.600
Like I saw like a community,
link |
00:16:40.200
my community became those people.
link |
00:16:42.840
And I remember one of my friends,
link |
00:16:46.760
Dina Katabi took me to Prudential
link |
00:16:48.920
to buy me a gift for my birthday.
link |
00:16:50.480
And it was like the first time in months
link |
00:16:52.360
that I went to kind of to see other people.
link |
00:16:55.000
And I was like, wow.
link |
00:16:56.640
First of all, these people, you know,
link |
00:16:58.200
they're happy and they're laughing
link |
00:16:59.880
and they're very different from this other my people.
link |
00:17:02.680
And second of thing, are they totally crazy?
link |
00:17:04.680
They're like laughing and wasting their money
link |
00:17:06.400
on some stupid gifts.
link |
00:17:08.480
And, you know, they may die.
link |
00:17:12.560
They already may have cancer
link |
00:17:14.280
and they don't understand it.
link |
00:17:16.000
So you can really see how the mind changes
link |
00:17:20.120
that you can see that, you know,
link |
00:17:22.400
before that you can ask,
link |
00:17:23.240
didn't you know that you're gonna die?
link |
00:17:24.400
Of course I knew, but it was kind of a theoretical notion.
link |
00:17:28.360
It wasn't something which was concrete.
link |
00:17:31.080
And at that point when you really see it
link |
00:17:33.920
and see how little means sometime the system
link |
00:17:37.680
has to help them, you really feel
link |
00:17:40.480
that we need to take a lot of our brilliance
link |
00:17:43.960
that we have here at MIT
link |
00:17:45.480
and translate it into something useful.
link |
00:17:48.040
Yeah.
link |
00:17:48.880
And you still couldn't have a lot of definitions,
link |
00:17:50.560
but of course alleviating, suffering, alleviating,
link |
00:17:53.640
trying to cure cancer is a beautiful mission.
link |
00:17:57.480
So I, of course, know the theoretically the notion
link |
00:18:01.320
of cancer, but just reading more and more
link |
00:18:04.200
about it's the 1.7 million new cancer cases
link |
00:18:08.040
in the United States every year,
link |
00:18:09.880
600,000 cancer related deaths every year.
link |
00:18:13.520
So this has a huge impact, United States globally.
link |
00:18:19.360
When broadly, before we talk about how machine learning,
link |
00:18:24.400
how MIT can help, when do you think
link |
00:18:28.680
we as a civilization will cure cancer?
link |
00:18:32.160
How hard of a problem is it from everything
link |
00:18:34.640
you've learned from it recently?
link |
00:18:37.280
I cannot really assess it.
link |
00:18:39.320
What I do believe will happen with the advancement
link |
00:18:42.120
in machine learning that a lot of types of cancer
link |
00:18:45.960
we will be able to predict way early
link |
00:18:48.480
and more effectively utilize existing treatments.
link |
00:18:53.400
I think, I hope at least that with all the advancements
link |
00:18:57.520
in AI and drug discovery, we would be able
link |
00:19:01.200
to much faster find relevant molecules.
link |
00:19:04.680
What I'm not sure about is how long it will take
link |
00:19:08.240
the medical establishment and regulatory bodies
link |
00:19:11.920
to kind of catch up and to implement it.
link |
00:19:14.800
And I think this is a very big piece of puzzle
link |
00:19:17.400
that is currently not addressed.
link |
00:19:20.440
That's the really interesting question.
link |
00:19:21.800
So first, a small detail that I think the answer is yes,
link |
00:19:25.480
but is cancer one of the diseases
link |
00:19:30.480
that when detected earlier,
link |
00:19:33.400
that's a significantly improves the outcomes.
link |
00:19:38.320
Cause we will talk about, there's the cure
link |
00:19:40.720
and then there is detection.
link |
00:19:42.720
And I think one machine learning can really help
link |
00:19:44.880
is earlier detection.
link |
00:19:46.360
So does detection help?
link |
00:19:48.280
Detection is crucial.
link |
00:19:49.400
For instance, the vast majority of pancreatic cancer patients
link |
00:19:53.640
are detected at the stage that they are incurable.
link |
00:19:57.040
That's why they have such a terrible survival rate.
link |
00:20:03.680
It's like just a few percent over five years.
link |
00:20:07.200
It's pretty much today a death sentence.
link |
00:20:09.640
But if you can discover this disease early,
link |
00:20:14.320
there are mechanisms to treat it.
link |
00:20:16.600
And in fact, I know a number of people who were diagnosed
link |
00:20:20.560
and saved just because they had food poisoning.
link |
00:20:23.440
They had terrible food poisoning.
link |
00:20:24.840
They went to ER, they got scan.
link |
00:20:28.480
There were early signs on the scan
link |
00:20:30.560
and that would save their lives.
link |
00:20:33.440
But this wasn't really an accidental case.
link |
00:20:35.720
So as we become better,
link |
00:20:38.520
we would be able to help too many more people
link |
00:20:42.720
that are likely to develop diseases.
link |
00:20:46.400
And I just want to say that as I got more into this field,
link |
00:20:50.840
I realized that cancer is of course a terrible disease
link |
00:20:53.440
when there are really the whole slew
link |
00:20:55.600
of terrible diseases out there,
link |
00:20:58.240
like neurodegenerative diseases and others.
link |
00:21:01.560
So we, of course, a lot of us are fixated on cancer
link |
00:21:04.480
just because it's so prevalent in our society.
link |
00:21:06.440
And you see these people when there are a lot of patients
link |
00:21:08.560
with neurodegenerative diseases
link |
00:21:10.320
and the kind of aging diseases
link |
00:21:12.520
that we still don't have a good solution for.
link |
00:21:17.120
And I felt as a computer scientist,
link |
00:21:22.120
we kind of decided that it's other people's job
link |
00:21:24.920
to treat these diseases
link |
00:21:27.360
because it's like traditionally people in biology
link |
00:21:29.760
or in chemistry or MDs are the ones who are thinking about it.
link |
00:21:34.720
And after kind of start paying attention,
link |
00:21:36.720
I think that it's really a wrong assumption
link |
00:21:39.720
and we all need to join the battle.
link |
00:21:42.280
So how it seems like in cancer specifically
link |
00:21:45.880
that there's a lot of ways that machine learning can help.
link |
00:21:48.480
So what's the role of machine learning
link |
00:21:51.240
and machine learning in the diagnosis of cancer?
link |
00:21:55.320
So for many cancers today,
link |
00:21:57.280
we really don't know what is your likelihood to get cancer.
link |
00:22:03.520
And for the vast majority of patients,
link |
00:22:06.360
especially on the younger patients,
link |
00:22:08.000
it really comes as a surprise.
link |
00:22:09.640
Like for instance, for breast cancer,
link |
00:22:11.200
80% of the patients are first in their families,
link |
00:22:13.920
it's like me.
link |
00:22:15.440
And I never saw that I had any increased risk
link |
00:22:18.520
because nobody had it in my family
link |
00:22:20.880
and for some reason in my head,
link |
00:22:22.360
it was kind of inherited disease.
link |
00:22:26.640
But even if I would pay attention,
link |
00:22:28.440
the models that currently,
link |
00:22:30.280
there's very simplistic statistical models
link |
00:22:32.440
that are currently used in clinical practice
link |
00:22:34.600
that really don't give you an answer, so you don't know.
link |
00:22:37.520
And the same true for pancreatic cancer,
link |
00:22:40.440
the same true for non smoking lung cancer and many others.
link |
00:22:45.440
So what machine learning can do here
link |
00:22:47.400
is utilize all this data to tell us Ellie,
link |
00:22:51.680
who is likely to be susceptible
link |
00:22:53.200
and using all the information that is already there,
link |
00:22:56.040
be it imaging, be it your other tests
link |
00:23:00.040
and eventually liquid biopsies and others,
link |
00:23:04.880
where the signal itself is not sufficiently strong
link |
00:23:08.240
for human eye to do good discrimination
link |
00:23:11.360
because the signal may be weak,
link |
00:23:12.960
but by combining many sources,
link |
00:23:15.680
machine which is trained on large volumes of data
link |
00:23:18.120
can really detect it Ellie
link |
00:23:20.680
and that's what we've seen with breast cancer
link |
00:23:22.480
and people are reporting it in other diseases as well.
link |
00:23:25.920
That really boils down to data, right?
link |
00:23:28.240
And in the different kinds of sources of data.
link |
00:23:30.960
And you mentioned regulatory challenges.
link |
00:23:33.720
So what are the challenges
link |
00:23:35.160
in gathering large data sets in the space?
link |
00:23:40.840
Again, another great question.
link |
00:23:42.640
So it took me after I decided
link |
00:23:44.320
that I want to work on it two years to get access to data.
link |
00:23:48.720
And you did, like any significant amount.
link |
00:23:51.360
Like right now in this country,
link |
00:23:53.560
there is no publicly available data set of modern mammograms
link |
00:23:58.040
that you can just go on your computer,
link |
00:23:59.560
sign a document and get it.
link |
00:24:01.840
It just doesn't exist.
link |
00:24:03.320
I mean, obviously every hospital has its own collection
link |
00:24:06.880
of mammograms, there are data that came out
link |
00:24:10.160
of clinical trials.
link |
00:24:11.320
What we're talking about here is a computer scientist
link |
00:24:13.200
who just want to run his or her model
link |
00:24:17.120
and see how it works.
link |
00:24:19.040
This data, like ImageNet, doesn't exist.
link |
00:24:22.880
And there is an set which is called like Florid data set
link |
00:24:28.640
which is a film mammogram from 90s
link |
00:24:30.880
which is totally not representative
link |
00:24:32.440
of the current developments,
link |
00:24:33.880
whatever you're learning on them doesn't scale up.
link |
00:24:35.800
This is the only resource that is available.
link |
00:24:39.320
And today there are many agencies that govern access to data,
link |
00:24:44.440
like the hospital holds your data
link |
00:24:46.280
and the hospital decides whether they would give it
link |
00:24:49.240
to the researcher to work with this data or not.
link |
00:24:52.320
In the individual hospital?
link |
00:24:54.160
Yeah, I mean, the hospital may, you know,
link |
00:24:57.160
assuming that you're doing research collaboration,
link |
00:24:59.200
you can submit, you know,
link |
00:25:01.960
there is a proper approval process guided by IRB
link |
00:25:05.040
and if you go through all the processes,
link |
00:25:07.800
you can eventually get access to the data.
link |
00:25:10.120
But if you yourself know our AI community,
link |
00:25:13.520
there are not that many people
link |
00:25:14.680
who actually ever got access to data
link |
00:25:16.560
because it's very challenging process.
link |
00:25:20.200
And sorry, just in a quick comment,
link |
00:25:22.720
MGH or any kind of hospital,
link |
00:25:25.760
are they scanning the data?
link |
00:25:28.280
Are they digitally storing it?
link |
00:25:29.680
Oh, it is already digitally stored.
link |
00:25:31.560
You don't need to do any extra processing steps.
link |
00:25:34.120
It's already there in the right format.
link |
00:25:36.320
Is that right now there are a lot of issues
link |
00:25:39.800
that govern access to the data
link |
00:25:41.200
because the hospital is legally responsible for the data.
link |
00:25:46.280
And, you know, they have a lot to lose
link |
00:25:51.120
if they give the data to the wrong person,
link |
00:25:53.200
but they may not have a lot to gain
link |
00:25:55.360
if they give it as a hospital, as a legal entity
link |
00:25:59.920
as given it to you.
link |
00:26:00.760
And the way, you know, what I would mention
link |
00:26:02.800
happening in the future is the same thing
link |
00:26:04.840
that happens when you're getting your driving license.
link |
00:26:06.840
You can decide whether you want to donate your organs.
link |
00:26:09.880
So you can imagine that whenever a person
link |
00:26:11.600
goes to the hospital,
link |
00:26:14.280
it should be easy for them to donate their data for research
link |
00:26:18.080
and it can be different kind of,
link |
00:26:19.480
do they only give you a test results
link |
00:26:21.320
or only imaging data or the whole medical record?
link |
00:26:27.080
Because at the end,
link |
00:26:30.560
we all will benefit from all this insights.
link |
00:26:33.880
And it's only gonna say, I want to keep my data private,
link |
00:26:36.080
but I would really love to get it from other people
link |
00:26:38.800
because other people are thinking the same way.
link |
00:26:40.760
So if there is a mechanism to do this donation
link |
00:26:45.760
and the patient has an ability to say
link |
00:26:48.040
how they want to use their data for research,
link |
00:26:50.840
it would be really a game changer.
link |
00:26:54.120
People, when they think about this problem,
link |
00:26:56.480
there's a depends on the population,
link |
00:26:58.480
depends on the demographics,
link |
00:27:00.160
but there's some privacy concerns.
link |
00:27:02.400
Generally, not just medical data,
link |
00:27:04.440
just any kind of data.
link |
00:27:05.880
It's what you said, my data,
link |
00:27:07.840
it should belong kinda to me,
link |
00:27:09.600
I'm worried how it's gonna be misused.
link |
00:27:12.520
How do we alleviate those concerns?
link |
00:27:17.080
Because that seems like a problem that needs to be,
link |
00:27:19.440
that problem of trust, of transparency needs to be solved
link |
00:27:23.000
before we build large data sets that help detect cancer,
link |
00:27:27.240
help save those very people in the future.
link |
00:27:30.160
So seeing that two things that could be done,
link |
00:27:31.920
there is a technical solutions
link |
00:27:34.480
and there are societal solutions.
link |
00:27:38.240
So on the technical end,
link |
00:27:41.440
we today have ability to improve disambiguation,
link |
00:27:48.120
like for instance, for imaging,
link |
00:27:49.720
it's, you know, for imaging, you can do it pretty well.
link |
00:27:55.600
What's disambiguation?
link |
00:27:56.760
And disambiguation, sorry, disambiguation,
link |
00:27:58.520
removing the identification,
link |
00:27:59.840
removing the names of the people.
link |
00:28:02.200
There are other data, like if it is a raw text,
link |
00:28:04.840
you cannot really achieve 99.9%
link |
00:28:08.200
but there are all these techniques
link |
00:28:10.080
that actually some of them are developed at MIT,
link |
00:28:12.480
how you can do learning on the encoded data
link |
00:28:15.440
where you locally encode the image,
link |
00:28:17.400
you train on network,
link |
00:28:19.040
which only works on the encoded images
link |
00:28:22.440
and then you send the outcome back to the hospital
link |
00:28:24.960
and you can open it up.
link |
00:28:26.600
So those are the technical solutions.
link |
00:28:28.040
There are a lot of people who are working in this space
link |
00:28:30.720
where the learning happens in the encoded form.
link |
00:28:33.320
We are still early,
link |
00:28:36.160
but this is an interesting research area
link |
00:28:39.240
where I think we'll make more progress.
link |
00:28:43.360
There is a lot of work in natural language processing
link |
00:28:45.640
community, how to do the identification better.
link |
00:28:50.400
But even today, there are already a lot of data
link |
00:28:54.040
which can be identified perfectly,
link |
00:28:55.920
like your test data, for instance, correct,
link |
00:28:58.760
where you can just, you know,
link |
00:29:00.040
the name of the patient,
link |
00:29:01.000
you just want to extract the part with the numbers.
link |
00:29:04.320
The big problem here is again,
link |
00:29:08.440
hospitals don't see much incentive
link |
00:29:10.440
to give this data away on one hand
link |
00:29:12.640
and then there is general concern.
link |
00:29:14.200
Now, when I'm talking about societal benefits
link |
00:29:17.720
and about the education,
link |
00:29:19.640
the public needs to understand
link |
00:29:23.640
and I think that there are situations
link |
00:29:27.800
that I still remember myself
link |
00:29:29.360
when I really needed an answer.
link |
00:29:31.520
I had to make a choice
link |
00:29:33.280
and there was no information to make a choice.
link |
00:29:35.200
You're just guessing.
link |
00:29:36.640
And at that moment,
link |
00:29:38.760
you feel that your life is at the stake,
link |
00:29:41.040
but you just don't have information to make the choice.
link |
00:29:44.760
And many times when I give talks,
link |
00:29:48.680
I get emails from women who say,
link |
00:29:51.240
you know, I'm in this situation,
link |
00:29:52.760
can you please run statistic
link |
00:29:54.160
and see what are the outcomes?
link |
00:29:57.040
We get almost every week a mammogram
link |
00:30:00.000
that comes by mail to my office at MIT.
link |
00:30:02.520
I'm serious that people ask to run
link |
00:30:06.200
because they need to make, you know,
link |
00:30:07.840
life changing decisions.
link |
00:30:10.000
And of course, you know,
link |
00:30:11.320
I'm not planning to open a clinic here,
link |
00:30:12.920
but we do run and give them the results for their doctors.
link |
00:30:16.600
But the point that I'm trying to make
link |
00:30:20.040
that we all at some point or our loved ones
link |
00:30:23.760
will be in the situation where you need information
link |
00:30:26.600
to make the best choice.
link |
00:30:28.840
And if this information is not available,
link |
00:30:31.840
you would feel vulnerable and unprotected.
link |
00:30:35.080
And then the question is, you know,
link |
00:30:36.880
what do I care more?
link |
00:30:37.840
Because at the end everything is a trade off, correct?
link |
00:30:40.320
Yeah, exactly.
link |
00:30:41.640
Just out of curiosity,
link |
00:30:43.080
what it seems like one possible solution,
link |
00:30:45.560
I'd like to see what you think of it
link |
00:30:47.160
based on what you just said,
link |
00:30:50.680
based on wanting to know answers
link |
00:30:52.480
for when you're yourself in that situation.
link |
00:30:55.040
Is it possible for patients to own their data
link |
00:30:58.400
as opposed to hospitals owning their data?
link |
00:31:01.040
Of course, theoretically,
link |
00:31:02.280
I guess patients own their data,
link |
00:31:04.120
but can you walk out there with a USB stick
link |
00:31:07.600
containing everything or upload it to the cloud
link |
00:31:10.600
where a company, you know,
link |
00:31:13.400
I remember Microsoft had a service,
link |
00:31:15.680
like I try, I was really excited about
link |
00:31:17.760
and Google Health was there.
link |
00:31:19.240
I tried to give, I was excited about it.
link |
00:31:21.880
Basically companies helping you upload your data
link |
00:31:24.760
to the cloud so that you can move from hospital to hospital
link |
00:31:27.920
from doctor to doctor.
link |
00:31:29.200
Do you see a promise of that kind of possibility?
link |
00:31:32.680
I absolutely think this is, you know,
link |
00:31:34.640
the right way to exchange the data.
link |
00:31:38.160
I don't know now who's the biggest player in this field,
link |
00:31:41.680
but I can clearly see that even for totally selfish health
link |
00:31:46.280
reasons, when you are going to a new facility
link |
00:31:49.280
and many of us are sent to some specialized treatment,
link |
00:31:52.600
they don't easily have access to your data.
link |
00:31:55.720
And today, you know, we would want to send us
link |
00:31:58.960
Mammogram need to go to their hospital,
link |
00:32:00.680
find some small office,
link |
00:32:01.760
which gives them the CD and they ship as a CD.
link |
00:32:04.760
So you can imagine we're looking at kind of decades old
link |
00:32:08.280
mechanism of data exchange.
link |
00:32:10.080
So I definitely think this is an area where hopefully
link |
00:32:15.600
all the right regulatory and technical forces will align
link |
00:32:20.360
and we will see it actually implemented.
link |
00:32:23.200
It's sad because unfortunately,
link |
00:32:25.720
and I have, I need to research why that happened,
link |
00:32:28.400
but I'm pretty sure Google Health and Microsoft Health Vault
link |
00:32:32.080
or whatever it's called, both closed down,
link |
00:32:34.640
which means that there was either regulatory pressure
link |
00:32:37.560
or there's not a business case
link |
00:32:39.080
or there's challenges from hospitals,
link |
00:32:41.760
which is very disappointing.
link |
00:32:43.240
So when you say, you don't know what the biggest players are,
link |
00:32:46.480
the two biggest that I was aware of closed their doors.
link |
00:32:50.520
So I'm hoping I'd love to see why
link |
00:32:53.120
and I'd love to see who else can come up.
link |
00:32:54.760
It seems like one of those Elon Musk style problems
link |
00:32:59.600
that are obvious needs to be solved
link |
00:33:01.280
and somebody needs to step up
link |
00:33:02.360
and actually do this large scale data collection.
link |
00:33:07.360
So I know there is an initiative in Massachusetts,
link |
00:33:09.600
a thing actually led by the governor
link |
00:33:11.720
to try to create this kind of health exchange system
link |
00:33:15.440
where at least to help people who are kind of when you show up
link |
00:33:17.840
in emergency room and there is no information
link |
00:33:20.160
about what are your allergies and other things.
link |
00:33:23.480
So I don't know how far it will go,
link |
00:33:26.080
but another thing that you said and I find it very interesting
link |
00:33:30.280
is actually who are the successful players in this space
link |
00:33:33.760
and the whole implementation.
link |
00:33:36.080
How does it go?
link |
00:33:37.240
To me, it is from the anthropological perspective,
link |
00:33:40.280
it's more fascinating that AI that today goes in health care.
link |
00:33:44.640
We've seen so many attempts and so very little successes
link |
00:33:50.360
and it's interesting to understand that I by no means
link |
00:33:54.200
have knowledge to assess why we are in the position
link |
00:33:58.280
where we are.
link |
00:33:59.600
Yeah, it's interesting because data is really fuel
link |
00:34:02.920
for a lot of successful applications
link |
00:34:04.960
and when that data requires regulatory approval
link |
00:34:08.480
like the FDA or any kind of approval,
link |
00:34:14.160
it seems that the computer scientists are not quite there yet
link |
00:34:16.920
in being able to play the regulatory game,
link |
00:34:18.840
understanding the fundamentals of it.
link |
00:34:21.200
I think that in many cases when even people do have data,
link |
00:34:26.480
we still don't know what exactly do you need to demonstrate
link |
00:34:31.480
to change the standard of care.
link |
00:34:35.040
Let me give you an example related to my breast cancer research.
link |
00:34:41.000
So in traditional breast cancer risk assessment,
link |
00:34:45.400
there is something called density
link |
00:34:47.040
which determines the likelihood of a woman to get cancer
link |
00:34:50.400
and this is pretty much says how much white
link |
00:34:52.680
do you see on the mammogram?
link |
00:34:54.120
The whiter it is, the more likely the tissue is dense.
link |
00:34:58.840
And the idea behind density,
link |
00:35:02.640
it's not a bad idea,
link |
00:35:03.560
in 1967 a radiologist called Wolf decided to look back
link |
00:35:07.960
at women who were diagnosed
link |
00:35:09.680
and see what is special in their images.
link |
00:35:12.320
Can we look back and say that they're likely to develop?
link |
00:35:14.600
So he come up with some patterns
link |
00:35:16.080
and it was the best that his human eye can identify
link |
00:35:20.520
then it was kind of formalized and coded into four categories
link |
00:35:24.160
and that's what we are using today.
link |
00:35:26.840
And today this density assessment is actually a federal law
link |
00:35:32.240
from 2019 approved by President Trump
link |
00:35:36.120
and for the previous FDA commissioner
link |
00:35:40.040
where women are supposed to be advised by their providers
link |
00:35:43.560
if they have high density,
link |
00:35:45.040
putting them into higher risk category
link |
00:35:47.200
and in some states you can actually get supplementary screening
link |
00:35:51.240
paid by your insurance because you are in this category.
link |
00:35:53.640
Now you can say how much science do we have behind it?
link |
00:35:56.720
Whatever biological science or epidemiological evidence.
link |
00:36:00.800
So it turns out that between 40 and 50% of women
link |
00:36:05.080
have dense breast.
link |
00:36:06.600
So above 40% of patients are coming out of their screening
link |
00:36:11.040
and somebody tells them you are in high risk.
link |
00:36:14.960
Now what exactly does it mean
link |
00:36:16.800
if you as half of the population in high risk
link |
00:36:19.520
gets from saying maybe I'm not,
link |
00:36:21.960
or what do I really need to do with it?
link |
00:36:23.600
Because the system doesn't provide me a lot of the solutions
link |
00:36:28.280
because there are so many people like me,
link |
00:36:30.080
we cannot really provide very expensive solutions for them.
link |
00:36:34.560
And the reason this whole density became this big deal
link |
00:36:38.680
it's actually advocated by the patients
link |
00:36:40.720
who felt very unprotected because many women
link |
00:36:43.560
when did the mammograms which were normal
link |
00:36:46.200
and then it turns out that they already had cancer,
link |
00:36:49.400
quite developed cancer.
link |
00:36:50.520
So they didn't have a way to know who is really at risk
link |
00:36:54.320
and what is the likelihood that when the doctor tells you
link |
00:36:56.240
you're okay, you are not okay.
link |
00:36:58.000
So at the time and it was 15 years ago,
link |
00:37:02.080
this maybe was the best piece of science that we had
link |
00:37:06.760
and it took quite 15, 16 years to make it federal law.
link |
00:37:12.120
But now this is a standard.
link |
00:37:15.600
Now with a deep learning model
link |
00:37:17.560
we can so much more accurately predict
link |
00:37:19.600
who is gonna develop breast cancer
link |
00:37:21.560
just because you're trained on a logical thing.
link |
00:37:23.680
And instead of describing how much white
link |
00:37:26.040
and what kind of white machine
link |
00:37:27.360
can systematically identify the patterns
link |
00:37:30.120
which was the original idea behind the sort
link |
00:37:32.760
of the tradiologist,
link |
00:37:33.680
machines can do it much more systematically
link |
00:37:35.680
and predict the risk when you're training the machine
link |
00:37:38.240
to look at the image and to say the risk in one to five years.
link |
00:37:42.080
Now you can ask me how long it will take
link |
00:37:45.000
to substitute this density
link |
00:37:46.400
which is broadly used across the country
link |
00:37:48.560
and really it's not helping to bring this new models.
link |
00:37:54.320
And I would say it's not a matter of the algorithm.
link |
00:37:56.640
Algorithm is already orders of magnitude better
link |
00:37:58.720
than what is currently in practice.
link |
00:38:00.360
I think it's really the question,
link |
00:38:02.440
who do you need to convince?
link |
00:38:04.320
How many hospitals do you need to run the experiment?
link |
00:38:07.400
What, you know, all this mechanism of adoption
link |
00:38:11.560
and how do you explain to patients
link |
00:38:15.120
and to women across the country
link |
00:38:17.520
that this is really a better measure?
link |
00:38:20.400
And again, I don't think it's an AI question.
link |
00:38:22.680
We can walk more and make the algorithm even better
link |
00:38:25.880
but I don't think that this is the current, you know,
link |
00:38:29.240
the barrier, the barrier is really this other piece
link |
00:38:32.000
that for some reason is not really explored.
link |
00:38:35.200
It's like anthropological piece.
link |
00:38:36.800
And coming back to your question about books,
link |
00:38:39.760
there is a book that I'm reading.
link |
00:38:42.920
It's called American Sickness by Elizabeth Rosenthal
link |
00:38:48.240
and I got this book from my clinical collaborator,
link |
00:38:51.560
Dr. Kony Lehman.
link |
00:38:53.080
And I said, I know everything that I need to know
link |
00:38:54.800
about American health system,
link |
00:38:56.000
but you know, every page doesn't fail to surprise me.
link |
00:38:59.200
And I think that there is a lot of interesting
link |
00:39:03.080
and really deep lessons for people like us
link |
00:39:06.840
from computer science who are coming into this field
link |
00:39:09.600
to really understand how complex is the system of incentives
link |
00:39:13.640
in the system to understand how you really need
link |
00:39:17.160
to play to drive adoption.
link |
00:39:19.720
You just said it's complex,
link |
00:39:21.120
but if we're trying to simplify it,
link |
00:39:23.960
who do you think most likely would be successful
link |
00:39:27.360
if we push on this group of people?
link |
00:39:29.480
Is it the doctors?
link |
00:39:30.720
Is it the hospitals?
link |
00:39:31.760
Is it the governments or policy makers?
link |
00:39:34.240
Is it the individual patients, consumers
link |
00:39:37.240
who needs to be inspired to most likely lead to adoption?
link |
00:39:45.200
Or is there no simple answer?
link |
00:39:47.120
There's no simple answer,
link |
00:39:48.280
but I think there is a lot of good people in medical system
link |
00:39:52.000
who do want to make a change.
link |
00:39:56.520
And I think a lot of power will come from us as a consumers
link |
00:40:01.600
because we all are consumers or future consumers
link |
00:40:04.320
of healthcare services.
link |
00:40:06.560
And I think we can do so much more
link |
00:40:12.080
in explaining the potential and not in the hype terms
link |
00:40:15.560
and not saying that we're now cured or Alzheimer
link |
00:40:17.920
and I'm really sick of reading this kind of articles
link |
00:40:20.560
which make these claims.
link |
00:40:22.120
But really to show with some examples
link |
00:40:24.800
what this implementation does
link |
00:40:26.520
and how it changes the care.
link |
00:40:29.080
Because I can't imagine,
link |
00:40:30.040
it doesn't matter what kind of politician it is,
link |
00:40:33.240
we all are susceptible to these diseases.
link |
00:40:35.240
There is no one who is free.
link |
00:40:37.800
And eventually, we all are humans
link |
00:40:41.080
and we are looking for a way to alleviate the suffering.
link |
00:40:44.880
And this is one possible way
link |
00:40:47.320
where we currently are underutilizing,
link |
00:40:49.360
which I think can help.
link |
00:40:51.880
So it sounds like the biggest problems are outside of AI
link |
00:40:55.120
in terms of the biggest impact at this point.
link |
00:40:58.000
But are there any open problems
link |
00:41:00.440
in the application of ML to oncology in general?
link |
00:41:03.800
So improving the detection
link |
00:41:05.400
or any other creative methods,
link |
00:41:07.600
whether it's on the detection segmentations
link |
00:41:09.640
or the vision perception side
link |
00:41:11.800
or some other clever of inference.
link |
00:41:16.320
Yeah, what in general in your view
link |
00:41:18.560
are the open problems in this space?
link |
00:41:20.320
So I just want to mention that beside detection,
link |
00:41:22.480
another area where I am kind of quite active
link |
00:41:24.880
and I think it's really an increasingly important area
link |
00:41:28.640
in healthcare is drug design.
link |
00:41:30.980
Absolutely.
link |
00:41:32.820
Because it's fine if you detect something early,
link |
00:41:36.940
but you still need to get drugs
link |
00:41:41.140
and new drugs for these conditions.
link |
00:41:43.900
And today, all of the drug design, ML is non existent there.
link |
00:41:48.300
We don't have any drug that was developed by the ML model
link |
00:41:53.020
or even not developed,
link |
00:41:54.940
but at least even you,
link |
00:41:56.220
that ML model plays some significant role.
link |
00:41:59.300
I think this area with all the new ability
link |
00:42:03.300
to generate molecules with desired properties
link |
00:42:05.820
to do in silica screening is really a big open area.
link |
00:42:11.300
It to be totally honest with you,
link |
00:42:12.660
when we are doing diagnostics and imaging,
link |
00:42:14.940
primarily taking the ideas that were developed
link |
00:42:17.300
for other areas and you're applying them with some adaptation.
link |
00:42:20.500
The area of drug design
link |
00:42:24.700
is really technically interesting and exciting area.
link |
00:42:27.980
You need to work a lot with graphs
link |
00:42:30.380
and capture various 3D properties.
link |
00:42:34.620
There are lots and lots of opportunities
link |
00:42:37.420
to be technically creative.
link |
00:42:39.820
And I think there are a lot of open questions in this area.
link |
00:42:46.780
We're already getting a lot of successes
link |
00:42:48.820
even with the kind of the first generation of this models,
link |
00:42:52.700
but there is much more new creative things that you can do.
link |
00:42:56.540
And what's very nice to see is actually the more powerful,
link |
00:43:03.060
the more interesting models actually do better.
link |
00:43:05.420
So there is a place to innovate in machine learning
link |
00:43:11.300
in this area.
link |
00:43:13.900
And some of these techniques are really unique too,
link |
00:43:16.820
let's say to graph generation and other things.
link |
00:43:19.620
So...
link |
00:43:20.820
What just to interrupt really quick, I'm sorry.
link |
00:43:23.980
Graph generation or graphs, drug discovery in general.
link |
00:43:30.660
How do you discover a drug?
link |
00:43:31.980
Is this chemistry?
link |
00:43:33.340
Is this trying to predict different chemical reactions?
link |
00:43:37.500
Or is it some kind of...
link |
00:43:39.620
What do graphs even represent in this space?
link |
00:43:42.100
Oh, sorry.
link |
00:43:43.940
And what's a drug?
link |
00:43:45.300
Okay, so let's say you think that there are many different
link |
00:43:47.820
types of drugs, but let's say you're going to talk
link |
00:43:49.580
about small molecules because I think today,
link |
00:43:51.900
the majority of drugs are small molecules.
link |
00:43:53.580
So small molecule is a graph.
link |
00:43:55.020
The molecule is just where the node in the graph is an atom
link |
00:44:00.060
and then you have the bond.
link |
00:44:01.460
So it's really a graph representation
link |
00:44:03.220
if you look at it in 2D, correct?
link |
00:44:05.540
You can do it 3D, but let's say, well,
link |
00:44:07.460
let's keep it simple and stick in 2D.
link |
00:44:11.500
So pretty much my understanding today,
link |
00:44:14.740
how it is done at scale in the companies,
link |
00:44:17.740
you're without machine learning,
link |
00:44:20.220
you have high throughput screening.
link |
00:44:22.100
So you know that you are interested to get certain
link |
00:44:24.540
biological activity of the compounds.
link |
00:44:26.580
So you scan a lot of compounds,
link |
00:44:28.860
like maybe hundreds of thousands,
link |
00:44:30.700
some really big number of compounds.
link |
00:44:32.980
You identify some compounds which have the right activity
link |
00:44:36.100
and then at this point, the chemists come
link |
00:44:39.260
and they're trying to now to optimize this original heat
link |
00:44:44.340
to different properties that you want it to be,
link |
00:44:46.340
maybe soluble, you want to decrease toxicity,
link |
00:44:49.100
you want to decrease the side effects.
link |
00:44:51.660
Are those, sorry, again to the drop,
link |
00:44:54.060
can that be done in simulation
link |
00:44:55.540
or just by looking at the molecules
link |
00:44:57.700
or do you need to actually run reactions
link |
00:44:59.860
in real labs with lab posts and stuff?
link |
00:45:02.180
So when you do high throughput screening,
link |
00:45:04.060
you really do screening, it's in the lab.
link |
00:45:07.060
It's really the lab screening,
link |
00:45:09.180
you screen the molecules, correct?
link |
00:45:10.980
I don't know what screening is.
link |
00:45:12.540
The screening, you just check them for certain property.
link |
00:45:15.100
Like in the physical space, in the physical world,
link |
00:45:17.340
like actually there's a machine probably
link |
00:45:18.780
that's actually running the reaction.
link |
00:45:21.460
Actually running the reactions, yeah.
link |
00:45:22.900
So there is a process where you can run
link |
00:45:25.420
and that's why it's called high throughput,
link |
00:45:26.860
you know, it becomes cheaper and faster
link |
00:45:29.580
to do it on very big number of molecules.
link |
00:45:33.820
You run the screening, you identify potential,
link |
00:45:38.340
you know, potential good starts
link |
00:45:40.300
and then where the chemists come in
link |
00:45:42.340
who, you know, have done it many times
link |
00:45:44.060
and then they can try to look at it
link |
00:45:45.900
and say, how can you change the molecule
link |
00:45:48.260
to get the desired profile in terms of all other properties?
link |
00:45:53.460
So maybe how do I make it more bioactive and so on?
link |
00:45:56.500
And there, you know, the creativity of the chemists
link |
00:45:59.460
really is the one that determines the success
link |
00:46:03.980
of this design because again,
link |
00:46:07.460
they have a lot of domain knowledge of, you know,
link |
00:46:10.180
what works, how do you decrease the CCT and so on?
link |
00:46:12.900
And that's what they do.
link |
00:46:15.020
So all the drugs that are currently, you know,
link |
00:46:17.860
in the FDA approved drugs or even drugs
link |
00:46:20.820
that are in clinical trials,
link |
00:46:22.140
they are designed using these domain experts
link |
00:46:27.140
which goes through this combinatorial space
link |
00:46:30.060
of molecules or graphs or whatever
link |
00:46:31.980
and find the right one or adjust it to be the right ones.
link |
00:46:35.180
Sounds like the breast density heuristic from 67,
link |
00:46:39.260
the same echoes.
link |
00:46:40.500
It's not necessarily that.
link |
00:46:41.820
It's really, you know, it's really driven by deep understanding.
link |
00:46:45.380
It's not like they just observe it.
link |
00:46:46.820
I mean, they do deeply understand chemistry
link |
00:46:48.540
and they do understand how different groups
link |
00:46:50.460
and how does it change the properties.
link |
00:46:53.140
So there is a lot of science that gets into it
link |
00:46:56.660
and a lot of kind of simulation,
link |
00:46:58.740
how do you want it to behave?
link |
00:47:01.900
It's very, very complex.
link |
00:47:03.900
So they're quite effective at this design, obviously.
link |
00:47:06.140
Now, effective, yeah, we have drugs.
link |
00:47:08.420
Like depending on how do you measure effective?
link |
00:47:10.780
If you measure, it's in terms of cost, it's prohibitive.
link |
00:47:13.940
If you measure it in terms of times, you know,
link |
00:47:15.820
we have lots of diseases for which we don't have any drugs
link |
00:47:18.420
and we don't even know how to approach.
link |
00:47:20.100
I don't need to mention few drugs
link |
00:47:23.460
or degenerative disease drugs that fail, you know.
link |
00:47:26.980
So there are lots of, you know, trials that fail,
link |
00:47:30.900
you know, in later stages,
link |
00:47:32.180
which is really catastrophic from the financial perspective.
link |
00:47:35.180
So, you know, is it the effective,
link |
00:47:38.260
the most effective mechanism?
link |
00:47:39.540
Absolutely no, but this is the only one that currently works.
link |
00:47:42.740
And I would, you know, I was closely interacting
link |
00:47:46.660
with people in pharmaceutical industry.
link |
00:47:48.020
I was really fascinating on how sharp
link |
00:47:50.020
and what a deep understanding of the domain do they have.
link |
00:47:53.860
It's not observation driven.
link |
00:47:55.500
There is really a lot of science behind what they do.
link |
00:47:58.660
But if you ask me, can machine learning change it?
link |
00:48:00.940
I firmly believe yes,
link |
00:48:03.460
because even the most experienced chemists cannot, you know,
link |
00:48:07.020
hold in their memory and understanding
link |
00:48:09.460
everything that you can learn, you know,
link |
00:48:11.020
from millions of molecules and reactions.
link |
00:48:14.140
And the space of graphs is a totally new space.
link |
00:48:18.380
I mean, it's a really interesting space
link |
00:48:20.460
for machine learning to explore, graph generation.
link |
00:48:22.540
Yeah, so there are a lot of things that you can do here.
link |
00:48:24.740
So we do a lot of work.
link |
00:48:27.140
So the first tool that we started with
link |
00:48:29.940
was the tool that can predict properties of the molecules.
link |
00:48:34.940
So you can just give the molecule and the property.
link |
00:48:37.820
It can be bioactivity properties.
link |
00:48:39.900
Or it can be some other property.
link |
00:48:42.700
And you train the molecules and you can now take a new molecule
link |
00:48:48.580
and predict this property.
link |
00:48:50.620
Now, when people started working in this area,
link |
00:48:53.420
it is something very simple.
link |
00:48:54.620
They do kind of existing, you know, fingerprints,
link |
00:48:57.220
which is kind of handcrafted features of the molecule
link |
00:48:59.380
when you break the graph to substructures
link |
00:49:01.420
and then you run, I don't know, a feedforward neural network.
link |
00:49:04.420
And what was interesting to see that clearly, you know,
link |
00:49:07.100
this was not the most effective way to proceed.
link |
00:49:09.980
And you need to have much more complex models
link |
00:49:12.900
that can induce a representation
link |
00:49:15.140
which can translate this graph into the embeddings
link |
00:49:18.220
and do these predictions.
link |
00:49:20.100
So this is one direction.
link |
00:49:22.020
And another direction, which is kind of related,
link |
00:49:24.180
is not only to stop by looking at the embedding itself,
link |
00:49:27.940
but actually modify it to produce better molecules.
link |
00:49:31.580
So you can think about it as the machine translation
link |
00:49:34.780
that you can start with a molecule
link |
00:49:37.220
and then there is an improved version of molecule.
link |
00:49:39.500
And you can again, with encoder,
link |
00:49:41.380
translate it into the hidden space
link |
00:49:42.820
and then learn how to modify it to improve
link |
00:49:45.020
the in some ways version of the molecules.
link |
00:49:48.220
So that's, it's kind of really exciting.
link |
00:49:51.540
We already have seen that the property prediction works
link |
00:49:54.220
pretty well and now we are generating molecules
link |
00:49:58.740
and there is actually labs
link |
00:50:00.780
which are manufacturing this molecule.
link |
00:50:03.140
So we'll see why it will get to us.
link |
00:50:05.180
Okay, that's really exciting.
link |
00:50:06.540
There's a lot of problems.
link |
00:50:07.980
Speaking of machine translation and embeddings,
link |
00:50:10.740
you have done a lot of really great research in NLP,
link |
00:50:15.180
natural language processing.
link |
00:50:18.020
Can you tell me your journey through NLP,
link |
00:50:20.380
what ideas, problems, approaches were you working on
link |
00:50:23.860
were you fascinated with, did you explore
link |
00:50:26.820
before this magic of deep learning reemerged and after?
link |
00:50:31.820
So when I started my work in NLP, it was in 97.
link |
00:50:35.740
This was a very interesting time.
link |
00:50:37.260
It was exactly the time that I came to ACL
link |
00:50:40.780
and the dynamic would barely understand English.
link |
00:50:43.540
But it was exactly like the transition point
link |
00:50:46.140
because half of the papers were really rule based approaches
link |
00:50:51.140
where people took more kind of heavy linguistic approaches
link |
00:50:53.820
for small domains and try to build up from there.
link |
00:50:57.820
And then there were the first generation of papers
link |
00:50:59.980
which were corpus based papers.
link |
00:51:01.980
And they were very simple in our terms
link |
00:51:03.900
when you collect some statistics
link |
00:51:05.420
and do prediction based on them.
link |
00:51:07.300
But I found it really fascinating that one community
link |
00:51:10.700
can think so very differently about the problem.
link |
00:51:16.700
And I remember my first papers that I wrote,
link |
00:51:20.260
it didn't have a single formula,
link |
00:51:21.940
it didn't have evaluation, it just had examples of outputs.
link |
00:51:25.700
And this was a standard of the first generation
link |
00:51:29.500
of the field at a time.
link |
00:51:32.020
In some ways, I mean, people maybe just started emphasizing
link |
00:51:35.820
the empirical evaluation,
link |
00:51:37.820
but for many applications like summarization,
link |
00:51:39.780
you just wrote some examples of outputs.
link |
00:51:42.780
And then increasingly you can see
link |
00:51:44.460
that how the statistical approach has dominated the field.
link |
00:51:48.300
And we've seen increased performance
link |
00:51:52.060
across many basic tasks.
link |
00:51:56.020
The sad part of the story may be that if you look
link |
00:52:00.020
again through this journey,
link |
00:52:01.580
we see that the role of linguistics
link |
00:52:05.060
in some ways greatly diminishes.
link |
00:52:07.420
And I think that you really need to look
link |
00:52:11.580
through the whole proceeding to find one or two papers
link |
00:52:14.540
which make some interesting linguistic references.
link |
00:52:17.260
It's really big.
link |
00:52:18.100
You mean today?
link |
00:52:18.940
Today.
link |
00:52:19.780
Today.
link |
00:52:20.620
This was definitely...
link |
00:52:21.460
Things like syntactic trees,
link |
00:52:22.300
just even basically against our conversation
link |
00:52:24.380
about human understanding of language,
link |
00:52:27.500
which I guess what linguistics would be,
link |
00:52:30.260
structured hierarchical representing language
link |
00:52:34.260
in a way that's human explainable,
link |
00:52:35.700
understandable is missing today.
link |
00:52:39.420
I don't know if it is,
link |
00:52:41.100
what is explainable and understandable.
link |
00:52:43.580
At the end, we perform functions
link |
00:52:45.900
and it's okay to have machine which performs a function.
link |
00:52:50.100
Like when you're thinking about your calculator, correct?
link |
00:52:53.180
Your calculator can do calculation
link |
00:52:55.420
very different from you would do the calculation,
link |
00:52:57.580
but it's very effective in it.
link |
00:52:58.820
And this is fine.
link |
00:52:59.700
If we can achieve certain tasks with high accuracy,
link |
00:53:04.420
it doesn't necessarily mean that it has to understand
link |
00:53:07.100
in the same way as we understand.
link |
00:53:09.260
In some ways, it's even naive to request
link |
00:53:11.220
because you have so many other sources of information
link |
00:53:14.900
that are absent when you are training your system.
link |
00:53:17.860
So it's okay.
link |
00:53:19.180
Is it delivered?
link |
00:53:20.020
And I would tell you one application
link |
00:53:21.460
that's just really fascinating.
link |
00:53:22.780
In 97, when it came to ACL,
link |
00:53:24.300
there were some papers on machine translation.
link |
00:53:25.900
They were like primitive,
link |
00:53:27.460
like people were trying really, really simple.
link |
00:53:31.100
And the feeling, my feeling was that,
link |
00:53:34.300
to make real machine translation system,
link |
00:53:36.300
it's like to fly at the moon and build a house there
link |
00:53:39.580
and the garden and live happily ever after.
link |
00:53:41.620
I mean, it's like impossible.
link |
00:53:42.620
I never could imagine that within 10 years,
link |
00:53:46.740
we would already see the system working.
link |
00:53:48.580
And now nobody is even surprised
link |
00:53:51.620
to utilize the system on daily basis.
link |
00:53:54.460
So this was like a huge, huge progress,
link |
00:53:56.260
saying that people for very long time
link |
00:53:57.900
tried to solve using other mechanisms
link |
00:54:00.820
and they were unable to solve it.
link |
00:54:03.220
That's why I'm coming back to a question about biology,
link |
00:54:06.180
that in linguistics, people try to go this way
link |
00:54:10.820
and try to write the syntactic trees
link |
00:54:13.540
and try to obstruct it
link |
00:54:14.860
and to find the right representation.
link |
00:54:17.060
And, you know, they couldn't get very far
link |
00:54:22.220
with this understanding while these models,
link |
00:54:25.940
using, you know, other sources actually capable
link |
00:54:29.620
to make a lot of progress.
link |
00:54:31.660
Now, I'm not naive to think
link |
00:54:33.940
that we are in this paradise space in NLP
link |
00:54:36.860
and I'm sure as you know,
link |
00:54:38.540
that when we slightly change the domain
link |
00:54:40.860
and when we decrease the amount of training,
link |
00:54:42.580
it can do like really bizarre and funny thing.
link |
00:54:44.740
But I think it's just a matter of improving
link |
00:54:47.140
generalization capacity,
link |
00:54:48.540
which is just a technical question.
link |
00:54:51.500
Well, so that's the question.
link |
00:54:54.300
How much of language understanding
link |
00:54:57.020
can be solved with deep neural networks?
link |
00:54:59.180
In your intuition, I mean, it's unknown, I suppose.
link |
00:55:03.740
But as we start to creep towards romantic notions
link |
00:55:07.660
of the spirit of the Turing test
link |
00:55:10.620
and conversation and dialogue
link |
00:55:14.220
and something that may be to me or to us,
link |
00:55:18.300
so the humans feels like it needs real understanding.
link |
00:55:21.620
How much can I be achieved
link |
00:55:23.500
with these neural networks or statistical methods?
link |
00:55:28.060
So I guess I am very much driven by the outcomes.
link |
00:55:33.340
Can we achieve the performance,
link |
00:55:35.420
which would be satisfactory for us for different tasks.
link |
00:55:40.700
Now, if you again look at machine translation system,
link |
00:55:43.020
which are trained on large amounts of data,
link |
00:55:46.060
they really can do a remarkable job
link |
00:55:48.820
relatively to where they've been a few years ago.
link |
00:55:51.380
And if you project into the future,
link |
00:55:54.660
if it will be the same speed of improvement,
link |
00:55:59.380
this is great.
link |
00:56:00.220
Now, does it bother me
link |
00:56:01.060
that it's not doing the same translation as we are doing?
link |
00:56:04.900
Now, if you go to cognitive science,
link |
00:56:06.660
we still don't really understand what we are doing.
link |
00:56:10.460
I mean, there are a lot of theories
link |
00:56:11.900
and there is obviously a lot of progress and studying,
link |
00:56:13.860
but our understanding what exactly goes on in our brains
link |
00:56:17.580
when we process language is still not crystal clear
link |
00:56:21.060
and precise that we can translate it into machines.
link |
00:56:25.500
What does bother me is that, again,
link |
00:56:29.820
that machines can be extremely brittle
link |
00:56:31.740
when you go out of your comfort zone of there,
link |
00:56:34.540
when there is a distributional shift
link |
00:56:36.100
between training and testing.
link |
00:56:37.340
And it have been years and years,
link |
00:56:39.060
every year when they teach a NLP class,
link |
00:56:41.540
show them some examples of translation
link |
00:56:43.580
from some newspaper in Hebrew,
link |
00:56:45.740
whatever, it was perfect.
link |
00:56:47.340
And then they have a recipe
link |
00:56:48.860
that Tomi Akala's system sent me a while ago
link |
00:56:52.620
and it was written in Finnish of Carillian pies.
link |
00:56:55.740
And it's just a terrible translation.
link |
00:56:59.340
You cannot understand anything what it does.
link |
00:57:01.500
It's not like some syntactic mistakes.
link |
00:57:03.180
It's just terrible.
link |
00:57:04.340
And year after year, I tried it and will translate it.
link |
00:57:07.140
And year after year, it does this terrible work
link |
00:57:09.020
because I guess the recipes are not big part
link |
00:57:12.060
of their training repertoire.
link |
00:57:15.500
So, but in terms of outcomes,
link |
00:57:17.780
that's a really clean, good way to look at it.
link |
00:57:21.140
I guess the question I was asking is,
link |
00:57:24.100
do you think, imagine a future,
link |
00:57:27.740
do you think the current approaches
link |
00:57:29.820
can pass the Turing test in the way,
link |
00:57:34.740
in the best possible formulation of the Turing test?
link |
00:57:37.060
Which is, would you want to have a conversation
link |
00:57:39.500
with a neural network for an hour?
link |
00:57:42.380
Oh God, no, no, there are not that many people
link |
00:57:45.860
that I would want to talk for an hour.
link |
00:57:48.140
But there are some people in this world, alive or not,
link |
00:57:51.540
that you would like to talk to for an hour,
link |
00:57:53.300
could a neural network achieve that outcome?
link |
00:57:56.740
So I think it would be really hard
link |
00:57:58.220
to create a successful training set,
link |
00:58:01.180
which would enable it to have a conversation
link |
00:58:03.380
for an intercontextual conversation for an hour.
link |
00:58:07.140
So you think it's a problem of data, perhaps?
link |
00:58:08.180
I think in some ways it's an important data.
link |
00:58:09.980
It's a problem both of data and the problem
link |
00:58:12.500
of the way we are training our systems,
link |
00:58:15.780
their ability to truly to generalize,
link |
00:58:18.100
to be very compositional, in some ways, it's limited,
link |
00:58:21.300
in the current capacity, at least.
link |
00:58:25.580
You know, we can translate well,
link |
00:58:28.020
we can find information well, we can extract information.
link |
00:58:32.540
So there are many capacities in which it's doing very well.
link |
00:58:35.220
And you can ask me, would you trust the machine
link |
00:58:38.020
to translate for you and use it as a source?
link |
00:58:39.900
I would say absolutely, especially if we're talking
link |
00:58:41.900
about newspaper data or other data,
link |
00:58:44.180
which is in the realm of its own training set,
link |
00:58:46.780
I would say yes.
link |
00:58:48.940
But, you know, having conversations with the machine,
link |
00:58:52.940
it's not something that I would choose to do.
link |
00:58:56.500
But you know, I would tell you something,
link |
00:58:58.180
talking about Turing tests
link |
00:58:59.460
and about all this kind of ELISA conversations.
link |
00:59:02.980
I remember visiting Tencent in China
link |
00:59:05.580
and they have this chat board.
link |
00:59:06.980
And they claim that it is like really humongous amount
link |
00:59:09.540
of the local population,
link |
00:59:10.820
which like for hours talks to the chat board,
link |
00:59:12.980
to me it was, I cannot believe it,
link |
00:59:15.380
but apparently it's like documented
link |
00:59:17.140
that there are some people who enjoy this conversation.
link |
00:59:20.820
And you know, it brought to me another MIT story
link |
00:59:24.580
about ELISA and Weizimbau.
link |
00:59:26.940
I don't know if you're familiar with the story.
link |
00:59:29.380
So Weizimbau was a professor at MIT
link |
00:59:31.060
and when he developed this ELISA,
link |
00:59:32.620
which was just doing string matching, very trivial,
link |
00:59:36.740
like restating of what you said,
link |
00:59:38.580
with very few rules, no syntax.
link |
00:59:41.300
Apparently there were secretaries at MIT
link |
00:59:43.780
that would sit for hours and converse with this trivial thing.
link |
00:59:48.220
And at the time there was no beautiful interfaces.
link |
00:59:50.220
So you actually need to go through the pain of communicating.
link |
00:59:53.580
And Weizimbau himself was so horrified by this phenomenon
link |
00:59:56.980
that people can believe enough to the machine
link |
00:59:59.300
that you just need to give them the hint
link |
01:00:00.860
that machine understands you
link |
01:00:02.060
and you can complete the rest.
link |
01:00:03.980
So he kind of stopped this research
link |
01:00:05.460
and went into kind of trying to understand
link |
01:00:08.700
what this artificial intelligence can do to our brains.
link |
01:00:12.780
So my point is, you know, how much,
link |
01:00:15.580
it's not how good is the technology,
link |
01:00:19.380
it's how ready we are to believe
link |
01:00:22.660
that it delivers the good that we are trying to get.
link |
01:00:25.620
That's a really beautiful way to put it.
link |
01:00:27.220
I, by the way, I'm not horrified by that possibility
link |
01:00:29.780
but inspired by it because, I mean, human connection,
link |
01:00:35.940
whether it's through language or through love,
link |
01:00:39.180
it seems like it's very amenable to machine learning
link |
01:00:44.900
and the rest is just challenges of psychology.
link |
01:00:49.340
Like you said, the secretaries who enjoy spending hours,
link |
01:00:52.460
I would say I would describe most of our lives
link |
01:00:55.020
as enjoying spending hours with those we love
link |
01:00:58.060
for very silly reasons.
link |
01:01:00.860
All we're doing is keyword matching as well.
link |
01:01:02.820
So I'm not sure how much intelligence
link |
01:01:05.140
we exhibit to each other with the people we love
link |
01:01:08.180
that we're close with.
link |
01:01:09.860
So it's a very interesting point
link |
01:01:12.700
of what it means to pass the Turing test with language.
link |
01:01:16.060
I think you're right.
link |
01:01:16.900
In terms of conversation,
link |
01:01:18.260
I think machine translation has very clear performance
link |
01:01:23.140
and improvement, right?
link |
01:01:24.460
What it means to have a fulfilling conversation
link |
01:01:28.060
is very, very person dependent
link |
01:01:31.060
and context dependent and so on.
link |
01:01:33.580
That's, yeah, it's very well put.
link |
01:01:36.060
So, but in your view,
link |
01:01:38.340
what's a benchmark in natural language, a test,
link |
01:01:41.940
that's just out of reach right now,
link |
01:01:43.700
but we might be able to, that's exciting.
link |
01:01:46.060
Is it in machine, isn't perfecting machine translation
link |
01:01:49.140
or is there other, is it summarization?
link |
01:01:51.940
What's out there just out of reach?
link |
01:01:53.340
It goes across specific application.
link |
01:01:55.860
It's more about the ability to learn
link |
01:01:58.300
from few examples for real,
link |
01:02:00.100
what we call future planning and all these cases.
link |
01:02:03.340
Because, you know, the way we publish these papers today,
link |
01:02:05.940
we say, if we have like naively, we get 55,
link |
01:02:09.940
but now we had a few example and we can move to 65.
link |
01:02:12.500
None of these methods actually
link |
01:02:14.020
realistically doing anything useful.
link |
01:02:15.980
You cannot use them today.
link |
01:02:18.540
And the ability to be able to generalize and to move
link |
01:02:25.460
or to be autonomous in finding the data
link |
01:02:28.980
that you need to learn,
link |
01:02:31.340
to be able to perfect new tasks or new language.
link |
01:02:35.300
This is an area where I think we really need
link |
01:02:39.220
to move forward to and we are not yet there.
link |
01:02:43.060
Are you at all excited,
link |
01:02:45.060
curious by the possibility of creating human level intelligence?
link |
01:02:49.900
Is this, because you've been very in your discussion.
link |
01:02:52.540
So if we look at oncology,
link |
01:02:54.340
you're trying to use machine learning to help the world
link |
01:02:58.100
in terms of alleviating suffering.
link |
01:02:59.700
If you look at natural language processing,
link |
01:03:02.340
you're focused on the outcomes of improving practical things
link |
01:03:05.300
like machine translation.
link |
01:03:06.820
But, you know, human level intelligence is a thing
link |
01:03:09.860
that our civilizations dream about creating
link |
01:03:13.100
super human level intelligence.
link |
01:03:15.740
Do you think about this?
link |
01:03:16.940
Do you think it's at all within our reach?
link |
01:03:20.420
So as you said yourself earlier,
link |
01:03:22.660
talking about, you know, how do you perceive,
link |
01:03:26.660
you know, our communications with each other
link |
01:03:28.980
that, you know, we're matching keywords
link |
01:03:30.700
and certain behaviors and so on.
link |
01:03:33.020
So at the end, whenever one assesses,
link |
01:03:36.860
let's say relations with another person,
link |
01:03:38.660
you have separate kind of measurements and outcomes
link |
01:03:41.460
inside your head that determine, you know,
link |
01:03:43.620
what is the status of the relation.
link |
01:03:45.860
So one way, this is this classical level.
link |
01:03:48.580
What is the intelligence?
link |
01:03:49.580
Is it the fact that now we are going to do
link |
01:03:51.260
the same way as human is doing
link |
01:03:52.940
when we don't even understand what the human is doing?
link |
01:03:55.500
Or we now have an ability to deliver these outcomes,
link |
01:03:59.100
but not in one area, not in an LPL,
link |
01:04:01.300
not just to translate or just to answer questions,
link |
01:04:03.940
but across many, many areas that we can achieve
link |
01:04:06.900
the functionalities that humans can achieve
link |
01:04:09.740
with their ability to learn and do other things.
link |
01:04:12.380
I think this is, and this we can actually measure
link |
01:04:15.500
how far we are, and that's what makes me excited
link |
01:04:20.340
that we, you know, in my lifetime,
link |
01:04:22.420
at least so far what we've seen,
link |
01:04:23.780
it's like tremendous progress across
link |
01:04:26.260
with these different functionalities.
link |
01:04:28.700
And I think it will be really exciting
link |
01:04:32.260
to see where we will be.
link |
01:04:35.540
And again, one way to think about is there are machines
link |
01:04:40.020
which are improving their functionality.
link |
01:04:41.820
Another one is to think about us with our brains,
link |
01:04:44.900
which are imperfect, how they can be accelerated
link |
01:04:49.020
by this technology as it becomes stronger and stronger.
link |
01:04:55.860
Coming back to another book that I love,
link |
01:04:58.580
Flowers for Algernon, have you read this book?
link |
01:05:02.060
Yes.
link |
01:05:02.900
You know, there is this point that the patient gets
link |
01:05:05.740
this miracle cure which changes his brain
link |
01:05:08.020
and all of a sudden they see life in a different way
link |
01:05:11.060
and can do certain things better,
link |
01:05:13.340
but certain things much worse.
link |
01:05:16.540
So you can imagine this kind of computer augmented cognition
link |
01:05:22.420
where it can bring you that now in the same way
link |
01:05:24.820
as, you know, the cars enable us to get to places
link |
01:05:28.140
where we've never been before.
link |
01:05:30.100
Can we think differently?
link |
01:05:31.620
Can we think faster?
link |
01:05:32.820
So, and we already see a lot of it happening
link |
01:05:36.700
in how it impacts us,
link |
01:05:38.260
but I think we have a long way to go there.
link |
01:05:42.180
So that's sort of artificial intelligence
link |
01:05:45.020
and technology affecting our,
link |
01:05:47.260
augmenting our intelligence as humans.
link |
01:05:50.500
Yesterday, a company called Neuralink announced
link |
01:05:55.540
they did this whole demonstration.
link |
01:05:56.820
I don't know if you saw it.
link |
01:05:57.980
It's, they demonstrated brain, computer,
link |
01:06:00.980
brain machine interface where there's like a sewing machine
link |
01:06:05.260
for the brain.
link |
01:06:06.340
Do you, you know, a lot of that is quite out there
link |
01:06:11.140
in terms of things that some people would say are impossible,
link |
01:06:15.300
but they're dreamers and want to engineer systems like that.
link |
01:06:18.100
Do you see, based on what you just said,
link |
01:06:20.380
a hope for that more direct interaction with the brain?
link |
01:06:25.140
I think there are different ways.
link |
01:06:27.020
One is a direct interaction with the brain.
link |
01:06:28.980
And again, there are lots of companies
link |
01:06:30.900
that work in this space.
link |
01:06:32.220
And I think there will be a lot of developments.
link |
01:06:35.060
When I'm just thinking that many times
link |
01:06:36.540
we are not aware of our feelings
link |
01:06:39.020
of motivation, what drives us.
link |
01:06:41.420
Like let me give you a trivial example, our attention.
link |
01:06:45.500
There are a lot of studies that demonstrate
link |
01:06:47.260
that it takes a while to a person to understand
link |
01:06:49.220
that they are not attentive anymore.
link |
01:06:51.100
And we know that there are people
link |
01:06:52.180
who really have strong capacity to hold attention.
link |
01:06:54.540
There are another end of the spectrum,
link |
01:06:55.980
people with ADD and other issues
link |
01:06:57.980
that they have problem to regulate their attention.
link |
01:07:00.740
Imagine to yourself that you have like a cognitive aid
link |
01:07:03.540
that just alerts you based on your gaze.
link |
01:07:06.260
That your attention is now not on what you are doing.
link |
01:07:09.300
And instead of writing a paper, you're now dreaming
link |
01:07:11.460
of what you're gonna do in the evening.
link |
01:07:12.740
So even this kind of simple measurement things,
link |
01:07:16.340
how they can change us.
link |
01:07:18.020
And I see it even in the simple ways with myself.
link |
01:07:22.380
I have my zone up from that I got in MIT gym.
link |
01:07:26.460
It kind of records how much did you run
link |
01:07:28.780
and you have some points and you can get some status,
link |
01:07:31.940
whatever.
link |
01:07:32.900
Like I said, what is this ridiculous thing?
link |
01:07:35.860
Who would ever care about some status in some arm?
link |
01:07:38.820
Guess what?
link |
01:07:39.660
So to maintain the status,
link |
01:07:41.580
you have to set a number of points every month.
link |
01:07:44.660
And not only is that they do it every single month
link |
01:07:48.060
for the last 18 months,
link |
01:07:50.580
it went to the point that I was injured.
link |
01:07:54.180
And when I could run again,
link |
01:07:56.180
I in two days, I did like some humongous amount
link |
01:08:01.860
of writing just to complete the points.
link |
01:08:04.020
It was like really not safe.
link |
01:08:05.820
It's like, I'm not gonna lose my status
link |
01:08:08.340
because I want to get there.
link |
01:08:10.100
So you can already see that this direct measurement
link |
01:08:13.180
and the feedback, we're looking at video games
link |
01:08:16.180
and see why the addiction aspect of it,
link |
01:08:18.540
but you can imagine that the same idea
link |
01:08:20.340
can be expanded to many other areas of our life
link |
01:08:23.500
when we really can get feedback
link |
01:08:25.820
and imagine in your case in relations
link |
01:08:29.740
when we are doing keyword matching,
link |
01:08:31.220
imagine that the person who is generating the key ones,
link |
01:08:36.100
that person gets direct feedback
link |
01:08:37.700
before the whole thing explodes.
link |
01:08:39.540
Is it maybe at this happy point,
link |
01:08:41.940
we are going in the wrong direction?
link |
01:08:43.980
Maybe it will be really a behavior modifying moment.
link |
01:08:47.980
So yeah, it's a relationship management too.
link |
01:08:51.300
So yeah, that's a fascinating whole area
link |
01:08:54.180
of psychology actually as well,
link |
01:08:56.100
of seeing how our behavior has changed
link |
01:08:58.220
with basically all human relations
link |
01:09:00.820
now have other non human entities helping us out.
link |
01:09:06.180
So you've, you teach a large,
link |
01:09:09.420
a huge machine learning course here at MIT.
link |
01:09:13.620
I can ask you a million questions,
link |
01:09:15.340
but you've seen a lot of students.
link |
01:09:17.580
What ideas do students struggle with the most
link |
01:09:20.900
as they first enter this world of machine learning?
link |
01:09:25.700
Actually, this year was the first time
link |
01:09:28.020
I started teaching a small machine learning class
link |
01:09:30.060
and it came as a result of what I saw
link |
01:09:32.860
in my big machine learning class that Tommy Ackle
link |
01:09:35.620
and I built maybe six years ago.
link |
01:09:39.660
What we've seen that as this area become more and more popular,
link |
01:09:42.820
more and more people at MIT want to take this class.
link |
01:09:46.940
And while we designed it for computer science majors,
link |
01:09:49.940
there were a lot of people who really are interested
link |
01:09:52.340
to learn it, but unfortunately,
link |
01:09:54.220
their background was not enabling them
link |
01:09:57.380
to do well in the class.
link |
01:09:58.780
And many of them associated machine learning
link |
01:10:01.000
with a world struggle and failure,
link |
01:10:04.380
primarily for non majors.
link |
01:10:06.380
And that's why we actually started a new class
link |
01:10:08.660
which we call machine learning from algorithms to modeling,
link |
01:10:12.620
which emphasizes more the modeling aspects of it
link |
01:10:16.740
and focuses on, it has majors and non majors.
link |
01:10:21.700
So we kind of try to extract the relevant parts
link |
01:10:25.300
and make it more accessible
link |
01:10:27.340
because the fact that we're teaching 20 classifiers
link |
01:10:29.620
in standard machine learning class
link |
01:10:31.020
is really a big question we really needed.
link |
01:10:34.100
But it was interesting to see this
link |
01:10:36.380
from first generation of students,
link |
01:10:38.300
when they came back from their internships
link |
01:10:40.900
and from their jobs,
link |
01:10:43.940
what different and exciting things they can do
link |
01:10:47.460
is that they would never think
link |
01:10:48.380
that you can even apply machine learning to.
link |
01:10:51.100
Some of them are like matching their relations
link |
01:10:53.740
and other things like variety of different applications.
link |
01:10:56.020
Everything is amenable to machine learning.
link |
01:10:58.060
That actually brings up an interesting point
link |
01:11:00.260
of computer science in general.
link |
01:11:02.580
It almost seems, maybe I'm crazy,
link |
01:11:05.420
but it almost seems like everybody needs to learn
link |
01:11:08.420
how to program these days.
link |
01:11:10.060
If you're 20 years old or if you're starting school,
link |
01:11:13.340
even if you're an English major,
link |
01:11:15.900
it seems like programming
link |
01:11:18.980
unlocks so much possibility in this world.
link |
01:11:21.860
So when you interact with those non majors,
link |
01:11:24.980
is there skills that they were simply lacking at the time
link |
01:11:30.220
that you wish they had
link |
01:11:31.980
and that they learned in high school and so on?
link |
01:11:34.620
Like how should education change
link |
01:11:37.460
in this computerized world that we live in?
link |
01:11:41.260
So seeing because they knew that there is a Python component
link |
01:11:43.500
in the class,
link |
01:11:44.820
their Python skills were okay
link |
01:11:47.020
and the class is not really heavy on programming.
link |
01:11:49.140
They primarily kind of add parts to the programs.
link |
01:11:52.420
I think it was more of their mathematical barriers
link |
01:11:55.420
and the class, again, with the design on the majors
link |
01:11:58.220
was using the notation like big O for complexity
link |
01:12:01.220
and others, people who come from different backgrounds
link |
01:12:04.540
just don't have it in the lexical.
link |
01:12:05.820
So necessarily very challenging notion,
link |
01:12:09.140
but they were just not aware.
link |
01:12:12.380
So I think that, you know, kind of linear algebra
link |
01:12:15.340
and probability, the basics, the calculus,
link |
01:12:17.660
want to vary the calculus, things that can help.
link |
01:12:20.860
What advice would you give to students
link |
01:12:23.580
interested in machine learning, interested,
link |
01:12:26.620
if you've talked about detecting curing cancer,
link |
01:12:30.100
drug design, if they want to get into that field,
link |
01:12:33.140
what should they do?
link |
01:12:36.380
Get into it and succeed as researchers
link |
01:12:39.100
and entrepreneurs.
link |
01:12:43.300
The first good piece of news is that right now
link |
01:12:45.260
there are lots of resources
link |
01:12:47.420
that are created at different levels
link |
01:12:50.180
and you can find online
link |
01:12:51.860
on your school classes,
link |
01:12:54.820
which are more mathematical or more applied and so on.
link |
01:12:57.580
So you can find a kind of a preacher
link |
01:13:01.340
which preaches your own language
link |
01:13:02.780
where you can enter the field
link |
01:13:04.580
and you can make many different types of contribution
link |
01:13:06.740
depending of what is your strengths.
link |
01:13:10.740
And the second point,
link |
01:13:12.020
I think it's really important to find some area
link |
01:13:15.300
which you really care about
link |
01:13:18.140
and it can motivate your learning
link |
01:13:20.220
and it can be for somebody curing cancer
link |
01:13:22.620
or doing cell driving cars or whatever,
link |
01:13:25.380
but to find an area where there is data
link |
01:13:29.660
where you believe there are strong patterns
link |
01:13:31.300
and we should be doing it
link |
01:13:32.340
and we're still not doing it
link |
01:13:33.580
or you can do it better
link |
01:13:35.260
and just start there
link |
01:13:37.860
and see a way it can bring you.
link |
01:13:40.780
So you've been very successful
link |
01:13:44.060
in many directions in life,
link |
01:13:46.460
but you also mentioned Flowers of Argonaut.
link |
01:13:51.020
And I think I've read or listened to you mention somewhere
link |
01:13:53.820
that researchers often get lost
link |
01:13:55.340
in the details of their work.
link |
01:13:56.740
This is per our original discussion with cancer and so on
link |
01:14:00.220
and don't look at the bigger picture,
link |
01:14:02.180
the bigger questions of meaning and so on.
link |
01:14:05.340
So let me ask you the impossible question
link |
01:14:09.900
of what's the meaning of this thing,
link |
01:14:11.620
of life, of your life, of research.
link |
01:14:16.740
Why do you think we descendant of great apes
link |
01:14:21.460
are here on this spinning ball?
link |
01:14:26.820
You know, I don't think that I have really a global answer
link |
01:14:30.300
you know, maybe that's why I didn't go to humanities
link |
01:14:33.780
and I didn't take humanities classes in my undergrad.
link |
01:14:39.500
But the way I am thinking about it,
link |
01:14:43.580
each one of us inside of them have their own set of,
link |
01:14:48.220
you know, things that we believe are important.
link |
01:14:51.140
And it just happens that we are busy
link |
01:14:53.380
with achieving various goals,
link |
01:14:54.820
busy listening to others
link |
01:14:56.260
and to kind of try to conform
link |
01:14:58.100
to be part of the crowd that we don't listen to that part.
link |
01:15:04.580
And, you know, we all should find some time to understand
link |
01:15:09.580
what is our own individual missions
link |
01:15:11.820
and we may have very different missions
link |
01:15:14.060
and to make sure that while we are running 10,000 things,
link |
01:15:18.180
we are not, you know, missing out
link |
01:15:21.900
and we're putting all the resources
link |
01:15:24.420
to satisfy our own mission.
link |
01:15:28.500
And if I look over my time,
link |
01:15:31.500
when I was younger, most of these missions,
link |
01:15:34.820
you know, I was primarily driven by the external stimulus,
link |
01:15:38.620
you know, to achieve this or to be that.
link |
01:15:41.540
And now a lot of what I do is driven by really thinking
link |
01:15:47.660
what is important for me to achieve independently
link |
01:15:51.380
of the external recognition.
link |
01:15:55.140
And, you know, I don't mind to be viewed in certain ways.
link |
01:16:01.380
The most important thing for me is to be true to myself,
link |
01:16:05.740
to what I think is right.
link |
01:16:07.500
How long did it take?
link |
01:16:08.700
How hard was it to find the you that you have to be true to?
link |
01:16:14.180
So it takes time and even now sometimes, you know,
link |
01:16:17.740
the vanity and the triviality can take, you know.
link |
01:16:20.860
Yeah, it can everywhere, you know, it's just the vanity.
link |
01:16:26.060
The vanity is different, the vanity in different places,
link |
01:16:28.140
but we all have our piece of vanity.
link |
01:16:30.940
But I think actually for me,
link |
01:16:34.700
the many times the place to get back to it is, you know,
link |
01:16:41.700
when I'm alone and also when I read.
link |
01:16:45.820
And I think by selecting the right books,
link |
01:16:47.740
you can get the right questions and learn from what you read.
link |
01:16:54.900
So, but again, it's not perfect,
link |
01:16:58.060
like vanity sometimes dominates.
link |
01:17:02.020
Well, that's a beautiful way to end.
link |
01:17:04.780
Thank you so much for talking today.
link |
01:17:06.380
Thank you. That was fun.
link |
01:17:07.820
It was fun.