back to index

Vijay Kumar: Flying Robots | Lex Fridman Podcast #37


small model | large model

link |
00:00:00.000
The following is a conversation with Vijay Kumar.
link |
00:00:03.080
He's one of the top roboticists in the world,
link |
00:00:05.760
a professor at the University of Pennsylvania,
link |
00:00:08.760
a dean of pen engineering, former director of Grasp Lab,
link |
00:00:12.880
or the General Robotics Automation Sensing
link |
00:00:15.300
and Perception Laboratory at Penn,
link |
00:00:17.560
that was established back in 1979, that's 40 years ago.
link |
00:00:22.600
Vijay is perhaps best known for his work
link |
00:00:25.280
in multi robot systems, robot swarms,
link |
00:00:28.520
and micro aerial vehicles,
link |
00:00:30.880
robots that elegantly cooperate in flight
link |
00:00:34.020
under all the uncertainty and challenges
link |
00:00:36.200
that the real world conditions present.
link |
00:00:38.760
This is the Artificial Intelligence Podcast.
link |
00:00:41.920
If you enjoy it, subscribe on YouTube,
link |
00:00:44.320
give it five stars on iTunes, support on Patreon,
link |
00:00:47.560
or simply connect with me on Twitter
link |
00:00:49.500
at Lex Friedman, spelled F R I D M A N.
link |
00:00:53.280
And now, here's my conversation with Vijay Kumar.
link |
00:00:58.700
What is the first robot you've ever built
link |
00:01:01.080
or were a part of building?
link |
00:01:02.840
Way back when I was in graduate school,
link |
00:01:04.760
I was part of a fairly big project
link |
00:01:06.760
that involved building a very large hexapod.
link |
00:01:12.040
It's weighed close to 7,000 pounds,
link |
00:01:17.520
and it was powered by hydraulic actuation,
link |
00:01:21.620
or it was actuated by hydraulics with 18 motors,
link |
00:01:27.720
hydraulic motors, each controlled by an Intel 8085 processor
link |
00:01:34.160
and an 8086 co processor.
link |
00:01:38.120
And so imagine this huge monster that had 18 joints,
link |
00:01:44.800
each controlled by an independent computer,
link |
00:01:46.960
and there was a 19th computer that actually did
link |
00:01:49.320
the coordination between these 18 joints.
link |
00:01:52.320
So I was part of this project,
link |
00:01:53.720
and my thesis work was how do you coordinate the 18 legs?
link |
00:02:02.080
And in particular, the pressures in the hydraulic cylinders
link |
00:02:06.320
to get efficient locomotion.
link |
00:02:09.200
It sounds like a giant mess.
link |
00:02:11.640
So how difficult is it to make all the motors communicate?
link |
00:02:14.440
Presumably, you have to send signals hundreds of times
link |
00:02:17.600
a second, or at least.
link |
00:02:18.440
So this was not my work,
link |
00:02:19.880
but the folks who worked on this wrote what I believe
link |
00:02:23.960
to be the first multiprocessor operating system.
link |
00:02:26.640
This was in the 80s, and you had to make sure
link |
00:02:30.320
that obviously messages got across
link |
00:02:32.800
from one joint to another.
link |
00:02:34.640
You have to remember the clock speeds on those computers
link |
00:02:37.960
were about half a megahertz.
link |
00:02:39.660
Right, the 80s.
link |
00:02:42.180
So not to romanticize the notion,
link |
00:02:45.320
but how did it make you feel to see that robot move?
link |
00:02:51.080
It was amazing.
link |
00:02:52.280
In hindsight, it looks like, well, we built this thing
link |
00:02:55.280
which really should have been much smaller.
link |
00:02:57.320
And of course, today's robots are much smaller.
link |
00:02:59.160
You look at Boston Dynamics or Ghost Robotics,
link |
00:03:03.120
a spinoff from Penn.
link |
00:03:06.080
But back then, you were stuck with the substrate you had,
link |
00:03:10.080
the compute you had, so things were unnecessarily big.
link |
00:03:13.720
But at the same time, and this is just human psychology,
link |
00:03:18.040
somehow bigger means grander.
link |
00:03:21.600
People never had the same appreciation
link |
00:03:23.640
for nanotechnology or nanodevices
link |
00:03:26.360
as they do for the Space Shuttle or the Boeing 747.
link |
00:03:30.160
Yeah, you've actually done quite a good job
link |
00:03:32.760
at illustrating that small is beautiful
link |
00:03:36.000
in terms of robotics.
link |
00:03:37.760
So what is on that topic is the most beautiful
link |
00:03:42.600
or elegant robot in motion that you've ever seen?
link |
00:03:46.200
Not to pick favorites or whatever,
link |
00:03:47.880
but something that just inspires you that you remember.
link |
00:03:51.000
Well, I think the thing that I'm most proud of
link |
00:03:54.000
that my students have done is really think about
link |
00:03:57.200
small UAVs that can maneuver in constrained spaces
link |
00:04:00.360
and in particular, their ability to coordinate
link |
00:04:03.640
with each other and form three dimensional patterns.
link |
00:04:06.760
So once you can do that,
link |
00:04:08.920
you can essentially create 3D objects in the sky
link |
00:04:14.960
and you can deform these objects on the fly.
link |
00:04:17.680
So in some sense, your toolbox of what you can create
link |
00:04:21.560
has suddenly got enhanced.
link |
00:04:25.240
And before that, we did the two dimensional version of this.
link |
00:04:27.800
So we had ground robots forming patterns and so on.
link |
00:04:31.680
So that was not as impressive, that was not as beautiful.
link |
00:04:34.960
But if you do it in 3D,
link |
00:04:36.560
suspended in midair, and you've got to go back to 2011
link |
00:04:40.240
when we did this, now it's actually pretty standard
link |
00:04:43.040
to do these things eight years later.
link |
00:04:45.600
But back then it was a big accomplishment.
link |
00:04:47.680
So the distributed cooperation
link |
00:04:50.280
is where beauty emerges in your eyes?
link |
00:04:53.480
Well, I think beauty to an engineer is very different
link |
00:04:55.800
from beauty to someone who's looking at robots
link |
00:04:59.400
from the outside, if you will.
link |
00:05:01.240
But what I meant there, so before we said that grand,
link |
00:05:04.800
so before we said that grand is associated with size.
link |
00:05:10.520
And another way of thinking about this
link |
00:05:13.720
is just the physical shape
link |
00:05:15.600
and the idea that you can get physical shapes in midair
link |
00:05:18.400
and have them deform, that's beautiful.
link |
00:05:21.560
But the individual components,
link |
00:05:23.040
the agility is beautiful too, right?
link |
00:05:24.880
That is true too.
link |
00:05:25.720
So then how quickly can you actually manipulate
link |
00:05:28.480
these three dimensional shapes
link |
00:05:29.560
and the individual components?
link |
00:05:31.280
Yes, you're right.
link |
00:05:32.240
But by the way, you said UAV, unmanned aerial vehicle.
link |
00:05:36.760
What's a good term for drones, UAVs, quad copters?
link |
00:05:41.840
Is there a term that's being standardized?
link |
00:05:44.560
I don't know if there is.
link |
00:05:45.440
Everybody wants to use the word drones.
link |
00:05:47.920
And I've often said this, drones to me is a pejorative word.
link |
00:05:51.080
It signifies something that's dumb,
link |
00:05:53.960
that's pre programmed, that does one little thing
link |
00:05:56.360
and robots are anything but drones.
link |
00:05:58.600
So I actually don't like that word,
link |
00:06:00.680
but that's what everybody uses.
link |
00:06:02.960
You could call it unpiloted.
link |
00:06:04.880
Unpiloted.
link |
00:06:05.800
But even unpiloted could be radio controlled,
link |
00:06:08.120
could be remotely controlled in many different ways.
link |
00:06:11.560
And I think the right word is,
link |
00:06:12.960
thinking about it as an aerial robot.
link |
00:06:15.040
You also say agile, autonomous, aerial robot, right?
link |
00:06:19.080
Yeah, so agility is an attribute, but they don't have to be.
link |
00:06:23.080
So what biological system,
link |
00:06:24.800
because you've also drawn a lot of inspiration with those.
link |
00:06:27.200
I've seen bees and ants that you've talked about.
link |
00:06:30.360
What living creatures have you found to be most inspiring
link |
00:06:35.240
as an engineer, instructive in your work in robotics?
link |
00:06:38.520
To me, so ants are really quite incredible creatures, right?
link |
00:06:43.440
So you, I mean, the individuals arguably are very simple
link |
00:06:47.880
in how they're built and yet they're incredibly resilient
link |
00:06:52.360
as a population.
link |
00:06:53.960
And as individuals, they're incredibly robust.
link |
00:06:56.760
So, if you take an ant, it's six legs,
link |
00:07:00.600
you remove one leg, it still works just fine.
link |
00:07:04.120
And it moves along.
link |
00:07:05.760
And I don't know that he even realizes it's lost a leg.
link |
00:07:09.760
So that's the robustness at the individual ant level.
link |
00:07:13.400
But then you look about this instinct
link |
00:07:15.360
for self preservation of the colonies
link |
00:07:17.680
and they adapt in so many amazing ways.
link |
00:07:20.400
You know, transcending gaps by just chaining themselves
link |
00:07:26.800
together when you have a flood,
link |
00:07:29.600
being able to recruit other teammates
link |
00:07:32.360
to carry big morsels of food,
link |
00:07:35.760
and then going out in different directions looking for food,
link |
00:07:38.760
and then being able to demonstrate consensus,
link |
00:07:43.160
even though they don't communicate directly with each other
link |
00:07:47.040
the way we communicate with each other.
link |
00:07:49.080
In some sense, they also know how to do democracy,
link |
00:07:51.880
probably better than what we do.
link |
00:07:53.640
Yeah, somehow it's even democracy is emergent.
link |
00:07:57.000
It seems like all of the phenomena that we see
link |
00:07:59.120
is all emergent.
link |
00:08:00.480
It seems like there's no centralized communicator.
link |
00:08:03.560
There is, so I think a lot is made about that word,
link |
00:08:06.520
emergent, and it means lots of things to different people.
link |
00:08:09.640
But you're absolutely right.
link |
00:08:10.680
I think as an engineer, you think about
link |
00:08:13.040
what element, elemental behaviors
link |
00:08:17.720
were primitives you could synthesize
link |
00:08:21.320
so that the whole looks incredibly powerful,
link |
00:08:25.240
incredibly synergistic,
link |
00:08:26.520
the whole definitely being greater than some of the parts,
link |
00:08:29.520
and ants are living proof of that.
link |
00:08:32.480
So when you see these beautiful swarms
link |
00:08:34.960
where there's biological systems of robots,
link |
00:08:38.520
do you sometimes think of them
link |
00:08:40.200
as a single individual living intelligent organism?
link |
00:08:44.640
So it's the same as thinking of our human beings
link |
00:08:47.400
are human civilization as one organism,
link |
00:08:51.160
or do you still, as an engineer,
link |
00:08:52.960
think about the individual components
link |
00:08:54.600
and all the engineering
link |
00:08:55.440
that went into the individual components?
link |
00:08:57.320
Well, that's very interesting.
link |
00:08:58.640
So again, philosophically as engineers,
link |
00:09:01.480
what we wanna do is to go beyond
link |
00:09:05.400
the individual components, the individual units,
link |
00:09:08.280
and think about it as a unit, as a cohesive unit,
link |
00:09:11.520
without worrying about the individual components.
link |
00:09:15.120
If you start obsessing about
link |
00:09:17.760
the individual building blocks and what they do,
link |
00:09:23.320
you inevitably will find it hard to scale up.
link |
00:09:27.960
Just mathematically,
link |
00:09:29.000
just think about individual things you wanna model,
link |
00:09:31.600
and if you want to have 10 of those,
link |
00:09:34.040
then you essentially are taking Cartesian products
link |
00:09:36.440
of 10 things, and that makes it really complicated.
link |
00:09:39.320
Then to do any kind of synthesis or design
link |
00:09:41.840
in that high dimension space is really hard.
link |
00:09:44.200
So the right way to do this
link |
00:09:45.800
is to think about the individuals in a clever way
link |
00:09:49.040
so that at the higher level,
link |
00:09:51.120
when you look at lots and lots of them,
link |
00:09:53.400
abstractly, you can think of them
link |
00:09:55.320
in some low dimensional space.
link |
00:09:57.120
So what does that involve?
link |
00:09:58.680
For the individual, do you have to try to make
link |
00:10:02.160
the way they see the world as local as possible?
link |
00:10:05.160
And the other thing,
link |
00:10:06.440
do you just have to make them robust to collisions?
link |
00:10:09.560
Like you said with the ants,
link |
00:10:10.880
if something fails, the whole swarm doesn't fail.
link |
00:10:15.320
Right, I think as engineers, we do this.
link |
00:10:17.760
I mean, you think about, we build planes,
link |
00:10:19.760
or we build iPhones,
link |
00:10:22.240
and we know that by taking individual components,
link |
00:10:26.280
well engineered components with well specified interfaces
link |
00:10:30.080
that behave in a predictable way,
link |
00:10:31.680
you can build complex systems.
link |
00:10:34.440
So that's ingrained, I would claim,
link |
00:10:36.880
in most engineers thinking,
link |
00:10:39.400
and it's true for computer scientists as well.
link |
00:10:41.600
I think what's different here is that you want
link |
00:10:44.760
the individuals to be robust in some sense,
link |
00:10:49.480
as we do in these other settings,
link |
00:10:52.000
but you also want some degree of resiliency
link |
00:10:54.480
for the population.
link |
00:10:56.320
And so you really want them to be able to reestablish
link |
00:11:02.040
communication with their neighbors.
link |
00:11:03.840
You want them to rethink their strategy for group behavior.
link |
00:11:08.840
You want them to reorganize.
link |
00:11:12.200
And that's where I think a lot of the challenges lie.
link |
00:11:15.920
So just at a high level,
link |
00:11:18.160
what does it take for a bunch of,
link |
00:11:22.200
what should we call them, flying robots,
link |
00:11:24.440
to create a formation?
link |
00:11:26.680
Just for people who are not familiar
link |
00:11:28.680
with robotics in general, how much information is needed?
link |
00:11:32.760
How do you even make it happen
link |
00:11:35.840
without a centralized controller?
link |
00:11:39.520
So, I mean, there are a couple of different ways
link |
00:11:41.080
of looking at this.
link |
00:11:43.160
If you are a purist,
link |
00:11:45.680
you think of it as a way of recreating what nature does.
link |
00:11:53.560
So nature forms groups for several reasons,
link |
00:11:58.440
but mostly it's because of this instinct
link |
00:12:02.000
that organisms have of preserving their colonies,
link |
00:12:05.680
their population, which means what?
link |
00:12:09.520
You need shelter, you need food, you need to procreate,
link |
00:12:12.920
and that's basically it.
link |
00:12:14.760
So the kinds of interactions you see are all organic.
link |
00:12:18.440
They're all local.
link |
00:12:20.760
And the only information that they share,
link |
00:12:24.080
and mostly it's indirectly, is to, again,
link |
00:12:27.520
preserve the herd or the flock,
link |
00:12:30.000
or the swarm, and either by looking for new sources of food
link |
00:12:37.480
or looking for new shelters, right?
link |
00:12:39.440
Right.
link |
00:12:41.240
As engineers, when we build swarms, we have a mission.
link |
00:12:46.560
And when you think of a mission, and it involves mobility,
link |
00:12:52.480
most often it's described in some kind
link |
00:12:55.000
of a global coordinate system.
link |
00:12:56.880
As a human, as an operator, as a commander,
link |
00:12:59.440
or as a collaborator, I have my coordinate system,
link |
00:13:03.560
and I want the robots to be consistent with that.
link |
00:13:07.600
So I might think of it slightly differently.
link |
00:13:11.240
I might want the robots to recognize that coordinate system,
link |
00:13:15.440
which means not only do they have to think locally
link |
00:13:17.720
in terms of who their immediate neighbors are,
link |
00:13:19.600
but they have to be cognizant
link |
00:13:20.920
of what the global environment is.
link |
00:13:24.040
They have to be cognizant of what the global environment
link |
00:13:27.040
looks like.
link |
00:13:28.280
So if I say, surround this building
link |
00:13:31.040
and protect this from intruders,
link |
00:13:33.240
well, they're immediately in a building centered
link |
00:13:35.600
coordinate system, and I have to tell them
link |
00:13:37.040
where the building is.
link |
00:13:38.680
And they're globally collaborating
link |
00:13:40.040
on the map of that building.
link |
00:13:41.280
They're maintaining some kind of global,
link |
00:13:44.160
not just in the frame of the building,
link |
00:13:45.480
but there's information that's ultimately being built up
link |
00:13:49.000
explicitly as opposed to kind of implicitly,
link |
00:13:53.280
like nature might.
link |
00:13:54.360
Correct, correct.
link |
00:13:55.200
So in some sense, nature is very, very sophisticated,
link |
00:13:57.680
but the tasks that nature solves or needs to solve
link |
00:14:01.880
are very different from the kind of engineered tasks,
link |
00:14:05.160
artificial tasks that we are forced to address.
link |
00:14:09.760
And again, there's nothing preventing us
link |
00:14:12.520
from solving these other problems,
link |
00:14:15.160
but ultimately it's about impact.
link |
00:14:16.600
You want these swarms to do something useful.
link |
00:14:19.360
And so you're kind of driven into this very unnatural,
link |
00:14:24.640
if you will.
link |
00:14:25.480
Unnatural, meaning not like how nature does, setting.
link |
00:14:29.160
And it's probably a little bit more expensive
link |
00:14:31.920
to do it the way nature does,
link |
00:14:33.760
because nature is less sensitive
link |
00:14:37.560
to the loss of the individual.
link |
00:14:39.480
And cost wise in robotics,
link |
00:14:42.280
I think you're more sensitive to losing individuals.
link |
00:14:45.480
I think that's true, although if you look at the price
link |
00:14:49.000
to performance ratio of robotic components,
link |
00:14:51.520
it's coming down dramatically, right?
link |
00:14:54.720
It continues to come down.
link |
00:14:56.040
So I think we're asymptotically approaching the point
link |
00:14:58.920
where we would get, yeah,
link |
00:14:59.960
the cost of individuals would really become insignificant.
link |
00:15:05.040
So let's step back at a high level view,
link |
00:15:07.640
the impossible question of what kind of, as an overview,
link |
00:15:12.480
what kind of autonomous flying vehicles
link |
00:15:14.400
are there in general?
link |
00:15:16.200
I think the ones that receive a lot of notoriety
link |
00:15:19.720
are obviously the military vehicles.
link |
00:15:22.560
Military vehicles are controlled by a base station,
link |
00:15:26.280
but have a lot of human supervision.
link |
00:15:29.640
But they have limited autonomy,
link |
00:15:31.800
which is the ability to go from point A to point B.
link |
00:15:34.760
And even the more sophisticated now,
link |
00:15:37.080
sophisticated vehicles can do autonomous takeoff
link |
00:15:40.400
and landing.
link |
00:15:41.760
And those usually have wings and they're heavy.
link |
00:15:44.360
Usually they're wings,
link |
00:15:45.360
but then there's nothing preventing us from doing this
link |
00:15:47.440
for helicopters as well.
link |
00:15:49.000
There are many military organizations
link |
00:15:52.480
that have autonomous helicopters in the same vein.
link |
00:15:56.560
And by the way, you look at autopilots and airplanes
link |
00:16:00.080
and it's actually very similar.
link |
00:16:02.840
In fact, one interesting question we can ask is,
link |
00:16:07.160
if you look at all the air safety violations,
link |
00:16:12.120
all the crashes that occurred,
link |
00:16:14.080
would they have happened if the plane were truly autonomous?
link |
00:16:18.640
And I think you'll find that in many of the cases,
link |
00:16:21.960
because of pilot error, we made silly decisions.
link |
00:16:24.600
And so in some sense, even in air traffic,
link |
00:16:26.960
commercial air traffic, there's a lot of applications,
link |
00:16:29.800
although we only see autonomy being enabled
link |
00:16:33.960
at very high altitudes when the plane is an autopilot.
link |
00:16:38.960
The plane is an autopilot.
link |
00:16:41.960
There's still a role for the human
link |
00:16:42.800
and that kind of autonomy is, you're kind of implying,
link |
00:16:47.640
I don't know what the right word is,
link |
00:16:48.680
but it's a little dumber than it could be.
link |
00:16:53.480
Right, so in the lab, of course,
link |
00:16:55.720
we can afford to be a lot more aggressive.
link |
00:16:59.200
And the question we try to ask is,
link |
00:17:04.200
can we make robots that will be able to make decisions
link |
00:17:10.360
without any kind of external infrastructure?
link |
00:17:13.680
So what does that mean?
link |
00:17:14.880
So the most common piece of infrastructure
link |
00:17:16.960
that airplanes use today is GPS.
link |
00:17:20.560
GPS is also the most brittle form of information.
link |
00:17:26.680
If you have driven in a city, try to use GPS navigation,
link |
00:17:30.480
in tall buildings, you immediately lose GPS.
link |
00:17:32.760
And so that's not a very sophisticated way
link |
00:17:36.280
of building autonomy.
link |
00:17:37.840
I think the second piece of infrastructure
link |
00:17:39.560
they rely on is communications.
link |
00:17:41.920
Again, it's very easy to jam communications.
link |
00:17:47.360
In fact, if you use wifi, you know that wifi signals
link |
00:17:51.320
drop out, cell signals drop out.
link |
00:17:53.520
So to rely on something like that is not good.
link |
00:17:58.560
The third form of infrastructure we use,
link |
00:18:01.200
and I hate to call it infrastructure,
link |
00:18:02.920
but it is that, in the sense of robots, is people.
link |
00:18:06.360
So you could rely on somebody to pilot you.
link |
00:18:09.960
And so the question you wanna ask is,
link |
00:18:11.600
if there are no pilots, there's no communications
link |
00:18:14.760
with any base station, if there's no knowledge of position,
link |
00:18:18.720
and if there's no a priori map,
link |
00:18:21.640
a priori knowledge of what the environment looks like,
link |
00:18:24.880
a priori model of what might happen in the future,
link |
00:18:28.240
can robots navigate?
link |
00:18:29.560
So that is true autonomy.
link |
00:18:31.480
So that's true autonomy, and we're talking about,
link |
00:18:34.160
you mentioned like military application of drones.
link |
00:18:36.880
Okay, so what else is there?
link |
00:18:38.320
You talk about agile, autonomous flying robots,
link |
00:18:42.080
aerial robots, so that's a different kind of,
link |
00:18:45.680
it's not winged, it's not big, at least it's small.
link |
00:18:48.160
So I use the word agility mostly,
link |
00:18:50.840
or at least we're motivated to do agile robots,
link |
00:18:53.520
mostly because robots can operate
link |
00:18:58.000
and should be operating in constrained environments.
link |
00:19:02.120
And if you want to operate the way a global hawk operates,
link |
00:19:06.960
I mean, the kinds of conditions in which you operate
link |
00:19:09.120
are very, very restrictive.
link |
00:19:11.760
If you wanna go inside a building,
link |
00:19:13.720
for example, for search and rescue,
link |
00:19:15.600
or to locate an active shooter,
link |
00:19:18.120
or you wanna navigate under the canopy in an orchard
link |
00:19:22.120
to look at health of plants,
link |
00:19:23.880
or to look for, to count fruits,
link |
00:19:28.240
to measure the tree trunks.
link |
00:19:31.240
These are things we do, by the way.
link |
00:19:33.240
There's some cool agriculture stuff you've shown
link |
00:19:35.400
in the past, it's really awesome.
link |
00:19:37.080
So in those kinds of settings, you do need that agility.
link |
00:19:40.360
Agility does not necessarily mean
link |
00:19:42.560
you break records for the 100 meters dash.
link |
00:19:45.440
What it really means is you see the unexpected
link |
00:19:48.000
and you're able to maneuver in a safe way,
link |
00:19:51.480
and in a way that gets you the most information
link |
00:19:55.400
about the thing you're trying to do.
link |
00:19:57.640
By the way, you may be the only person
link |
00:20:00.440
who, in a TED Talk, has used a math equation,
link |
00:20:04.200
which is amazing, people should go see one of your TED Talks.
link |
00:20:07.600
Actually, it's very interesting,
link |
00:20:08.800
because the TED curator, Chris Anderson,
link |
00:20:12.400
told me, you can't show math.
link |
00:20:15.360
And I thought about it, but that's who I am.
link |
00:20:18.200
I mean, that's our work.
link |
00:20:20.760
And so I felt compelled to give the audience a taste
link |
00:20:25.760
for at least some math.
link |
00:20:27.640
So on that point, simply, what does it take
link |
00:20:32.880
to make a thing with four motors fly, a quadcopter,
link |
00:20:37.360
one of these little flying robots?
link |
00:20:41.760
How hard is it to make it fly?
link |
00:20:43.960
How do you coordinate the four motors?
link |
00:20:46.560
How do you convert those motors into actual movement?
link |
00:20:52.600
So this is an interesting question.
link |
00:20:54.800
We've been trying to do this since 2000.
link |
00:20:58.080
It is a commentary on the sensors
link |
00:21:00.560
that were available back then,
link |
00:21:02.080
the computers that were available back then.
link |
00:21:05.560
And a number of things happened between 2000 and 2007.
link |
00:21:11.520
One is the advances in computing,
link |
00:21:14.120
which is, so we all know about Moore's Law,
link |
00:21:16.760
but I think 2007 was a tipping point,
link |
00:21:19.680
the year of the iPhone, the year of the cloud.
link |
00:21:22.720
Lots of things happened in 2007.
link |
00:21:25.600
But going back even further,
link |
00:21:27.600
inertial measurement units as a sensor really matured.
link |
00:21:31.360
Again, lots of reasons for that.
link |
00:21:33.920
Certainly, there's a lot of federal funding,
link |
00:21:35.400
particularly DARPA in the US,
link |
00:21:38.320
but they didn't anticipate this boom in IMUs.
link |
00:21:42.760
But if you look, subsequently what happened
link |
00:21:46.560
is that every car manufacturer had to put an airbag in,
link |
00:21:50.040
which meant you had to have an accelerometer on board.
link |
00:21:52.600
And so that drove down the price to performance ratio.
link |
00:21:55.000
Wow, I should know this.
link |
00:21:56.880
That's very interesting.
link |
00:21:57.960
That's very interesting, the connection there.
link |
00:21:59.360
And that's why research is very,
link |
00:22:01.320
it's very hard to predict the outcomes.
link |
00:22:04.840
And again, the federal government spent a ton of money
link |
00:22:07.640
on things that they thought were useful for resonators,
link |
00:22:12.280
but it ended up enabling these small UAVs, which is great,
link |
00:22:16.840
because I could have never raised that much money
link |
00:22:18.520
and sold this project,
link |
00:22:20.760
hey, we want to build these small UAVs.
link |
00:22:22.200
Can you actually fund the development of low cost IMUs?
link |
00:22:25.440
So why do you need an IMU on an IMU?
link |
00:22:27.600
So I'll come back to that.
link |
00:22:31.000
So in 2007, 2008, we were able to build these.
link |
00:22:33.320
And then the question you're asking was a good one.
link |
00:22:35.200
How do you coordinate the motors to develop this?
link |
00:22:40.240
But over the last 10 years, everything is commoditized.
link |
00:22:43.880
A high school kid today can pick up
link |
00:22:46.240
a Raspberry Pi kit and build this.
link |
00:22:50.560
All the low levels functionality is all automated.
link |
00:22:54.160
But basically at some level,
link |
00:22:56.360
you have to drive the motors at the right RPMs,
link |
00:23:01.360
the right velocity,
link |
00:23:04.560
in order to generate the right amount of thrust,
link |
00:23:07.480
in order to position it and orient it in a way
link |
00:23:10.360
that you need to in order to fly.
link |
00:23:13.800
The feedback that you get is from onboard sensors,
link |
00:23:16.680
and the IMU is an important part of it.
link |
00:23:18.400
The IMU tells you what the acceleration is,
link |
00:23:23.840
as well as what the angular velocity is.
link |
00:23:26.400
And those are important pieces of information.
link |
00:23:30.440
In addition to that, you need some kind of local position
link |
00:23:34.200
or velocity information.
link |
00:23:37.480
For example, when we walk,
link |
00:23:39.360
we implicitly have this information
link |
00:23:41.560
because we kind of know what our stride length is.
link |
00:23:46.720
We also are looking at images fly past our retina,
link |
00:23:51.480
if you will, and so we can estimate velocity.
link |
00:23:54.280
We also have accelerometers in our head,
link |
00:23:56.360
and we're able to integrate all these pieces of information
link |
00:23:59.160
to determine where we are as we walk.
link |
00:24:02.360
And so robots have to do something very similar.
link |
00:24:04.320
You need an IMU, you need some kind of a camera
link |
00:24:08.160
or other sensor that's measuring velocity,
link |
00:24:12.560
and then you need some kind of a global reference frame
link |
00:24:15.800
if you really want to think about doing something
link |
00:24:19.520
in a world coordinate system.
link |
00:24:21.280
And so how do you estimate your position
link |
00:24:23.680
with respect to that global reference frame?
link |
00:24:25.160
That's important as well.
link |
00:24:26.560
So coordinating the RPMs of the four motors
link |
00:24:29.520
is what allows you to, first of all, fly and hover,
link |
00:24:32.640
and then you can change the orientation
link |
00:24:35.600
and the velocity and so on.
link |
00:24:37.600
Exactly, exactly.
link |
00:24:38.440
So it's a bunch of degrees of freedom
link |
00:24:40.320
that you're complaining about.
link |
00:24:41.160
There's six degrees of freedom,
link |
00:24:42.200
but you only have four inputs, the four motors.
link |
00:24:44.920
And it turns out to be a remarkably versatile configuration.
link |
00:24:50.920
You think at first, well, I only have four motors,
link |
00:24:53.080
how do I go sideways?
link |
00:24:55.000
But it's not too hard to say, well, if I tilt myself,
link |
00:24:57.280
I can go sideways, and then you have four motors
link |
00:25:00.440
pointing up, how do I rotate in place
link |
00:25:03.320
about a vertical axis?
link |
00:25:05.360
Well, you rotate them at different speeds
link |
00:25:07.800
and that generates reaction moments
link |
00:25:09.720
and that allows you to turn.
link |
00:25:11.520
So it's actually a pretty, it's an optimal configuration
link |
00:25:14.960
from an engineer standpoint.
link |
00:25:18.360
It's very simple, very cleverly done, and very versatile.
link |
00:25:23.360
So if you could step back to a time,
link |
00:25:27.240
so I've always known flying robots as,
link |
00:25:31.040
to me, it was natural that a quadcopter should fly.
link |
00:25:35.760
But when you first started working with it,
link |
00:25:38.800
how surprised are you that you can make,
link |
00:25:42.000
do so much with the four motors?
link |
00:25:45.520
How surprising is it that you can make this thing fly,
link |
00:25:47.600
first of all, that you can make it hover,
link |
00:25:49.760
that you can add control to it?
link |
00:25:52.000
Firstly, this is not, the four motor configuration
link |
00:25:55.080
is not ours.
link |
00:25:56.400
You can, it has at least a hundred year history.
link |
00:26:00.320
And various people, various people try to get quadrotors
link |
00:26:04.160
to fly without much success.
link |
00:26:08.480
As I said, we've been working on this since 2000.
link |
00:26:10.760
Our first designs were, well, this is way too complicated.
link |
00:26:14.400
Why not we try to get an omnidirectional flying robot?
link |
00:26:18.480
So our early designs, we had eight rotors.
link |
00:26:21.760
And so these eight rotors were arranged uniformly
link |
00:26:26.600
on a sphere, if you will.
link |
00:26:28.000
So you can imagine a symmetric configuration.
link |
00:26:30.440
And so you should be able to fly anywhere.
link |
00:26:33.280
But the real challenge we had is the strength to weight ratio
link |
00:26:36.240
is not enough.
link |
00:26:37.080
And of course, we didn't have the sensors and so on.
link |
00:26:40.520
So everybody knew, or at least the people
link |
00:26:43.040
who worked with rotorcrafts knew,
link |
00:26:44.800
four rotors will get it done.
link |
00:26:47.520
So that was not our idea.
link |
00:26:49.400
But it took a while before we could actually do
link |
00:26:52.800
the onboard sensing and the computation that was needed
link |
00:26:56.920
for the kinds of agile maneuvering that we wanted to do
link |
00:27:01.000
in our little aerial robots.
link |
00:27:03.000
And that only happened between 2007 and 2009 in our lab.
link |
00:27:07.560
Yeah, and you have to send the signal
link |
00:27:09.960
maybe a hundred times a second.
link |
00:27:12.480
So the compute there, everything has to come down in price.
link |
00:27:15.960
And what are the steps of getting from point A to point B?
link |
00:27:21.720
So we just talked about like local control.
link |
00:27:25.200
But if all the kind of cool dancing in the air
link |
00:27:30.840
that I've seen you show, how do you make it happen?
link |
00:27:34.520
How do you make a trajectory?
link |
00:27:37.360
First of all, okay, figure out a trajectory.
link |
00:27:40.520
So plan a trajectory.
link |
00:27:41.680
And then how do you make that trajectory happen?
link |
00:27:44.400
Yeah, I think planning is a very fundamental problem
link |
00:27:47.280
in robotics.
link |
00:27:48.120
I think 10 years ago it was an esoteric thing,
link |
00:27:50.800
but today with self driving cars,
link |
00:27:53.040
everybody can understand this basic idea
link |
00:27:55.840
that a car sees a whole bunch of things
link |
00:27:57.920
and it has to keep a lane or maybe make a right turn
link |
00:28:00.320
or switch lanes.
link |
00:28:01.280
It has to plan a trajectory.
link |
00:28:02.680
It has to be safe.
link |
00:28:03.560
It has to be efficient.
link |
00:28:04.840
So everybody's familiar with that.
link |
00:28:06.640
That's kind of the first step that you have to think about
link |
00:28:10.240
when you say autonomy.
link |
00:28:14.800
And so for us, it's about finding smooth motions,
link |
00:28:19.120
motions that are safe.
link |
00:28:21.320
So we think about these two things.
link |
00:28:22.880
One is optimality, one is safety.
link |
00:28:24.680
Clearly you cannot compromise safety.
link |
00:28:28.440
So you're looking for safe, optimal motions.
link |
00:28:31.360
The other thing you have to think about is
link |
00:28:34.480
can you actually compute a reasonable trajectory
link |
00:28:38.160
in a small amount of time?
link |
00:28:40.760
Cause you have a time budget.
link |
00:28:42.280
So the optimal becomes suboptimal,
link |
00:28:45.160
but in our lab we focus on synthesizing smooth trajectory
link |
00:28:51.160
that satisfy all the constraints.
link |
00:28:53.000
In other words, don't violate any safety constraints
link |
00:28:58.440
and is as efficient as possible.
link |
00:29:02.880
And when I say efficient,
link |
00:29:04.360
it could mean I want to get from point A to point B
link |
00:29:06.600
as quickly as possible,
link |
00:29:08.360
or I want to get to it as gracefully as possible,
link |
00:29:12.840
or I want to consume as little energy as possible.
link |
00:29:15.960
But always staying within the safety constraints.
link |
00:29:18.240
But yes, always finding a safe trajectory.
link |
00:29:22.800
So there's a lot of excitement and progress
link |
00:29:25.040
in the field of machine learning
link |
00:29:27.360
and reinforcement learning
link |
00:29:29.360
and the neural network variant of that
link |
00:29:32.200
with deep reinforcement learning.
link |
00:29:33.920
Do you see a role of machine learning
link |
00:29:36.360
in, so a lot of the success of flying robots
link |
00:29:40.560
did not rely on machine learning,
link |
00:29:42.320
except for maybe a little bit of the perception
link |
00:29:45.040
on the computer vision side.
link |
00:29:46.600
On the control side and the planning,
link |
00:29:48.440
do you see there's a role in the future
link |
00:29:50.400
for machine learning?
link |
00:29:51.680
So let me disagree a little bit with you.
link |
00:29:53.800
I think we never perhaps called out in my work,
link |
00:29:56.800
called out learning,
link |
00:29:57.720
but even this very simple idea of being able to fly
link |
00:30:00.600
through a constrained space.
link |
00:30:02.200
The first time you try it, you'll invariably,
link |
00:30:05.680
you might get it wrong if the task is challenging.
link |
00:30:08.440
And the reason is to get it perfectly right,
link |
00:30:12.200
you have to model everything in the environment.
link |
00:30:15.600
And flying is notoriously hard to model.
link |
00:30:19.960
There are aerodynamic effects that we constantly discover.
link |
00:30:26.520
Even just before I was talking to you,
link |
00:30:29.440
I was talking to a student about how blades flap
link |
00:30:33.440
when they fly.
link |
00:30:35.320
And that ends up changing how a rotorcraft
link |
00:30:40.880
is accelerated in the angular direction.
link |
00:30:43.960
Does he use like micro flaps or something?
link |
00:30:46.360
It's not micro flaps.
link |
00:30:47.280
So we assume that each blade is rigid,
link |
00:30:49.640
but actually it flaps a little bit.
link |
00:30:51.720
It bends.
link |
00:30:52.880
Interesting, yeah.
link |
00:30:53.720
And so the models rely on the fact,
link |
00:30:56.040
on the assumption that they're not rigid.
link |
00:30:58.640
On the assumption that they're actually rigid,
link |
00:31:00.640
but that's not true.
link |
00:31:02.240
If you're flying really quickly,
link |
00:31:03.720
these effects become significant.
link |
00:31:06.920
If you're flying close to the ground,
link |
00:31:09.240
you get pushed off by the ground, right?
link |
00:31:12.160
Something which every pilot knows when he tries to land
link |
00:31:14.920
or she tries to land, this is called a ground effect.
link |
00:31:18.920
Something very few pilots think about
link |
00:31:21.000
is what happens when you go close to a ceiling
link |
00:31:23.040
or you get sucked into a ceiling.
link |
00:31:25.320
There are very few aircrafts
link |
00:31:26.880
that fly close to any kind of ceiling.
link |
00:31:29.520
Likewise, when you go close to a wall,
link |
00:31:33.520
there are these wall effects.
link |
00:31:35.720
And if you've gone on a train
link |
00:31:37.680
and you pass another train that's traveling
link |
00:31:39.600
in the opposite direction, you feel the buffeting.
link |
00:31:42.400
And so these kinds of microclimates
link |
00:31:45.400
affect our UAV significantly.
link |
00:31:47.880
So if you want...
link |
00:31:48.720
And they're impossible to model, essentially.
link |
00:31:50.640
I wouldn't say they're impossible to model,
link |
00:31:52.480
but the level of sophistication you would need
link |
00:31:54.880
in the model and the software would be tremendous.
link |
00:32:00.000
Plus, to get everything right would be awfully tedious.
link |
00:32:02.920
So the way we do this is over time,
link |
00:32:05.080
we figure out how to adapt to these conditions.
link |
00:32:10.360
So early on, we use the form of learning
link |
00:32:13.160
that we call iterative learning.
link |
00:32:15.760
So this idea, if you want to perform a task,
link |
00:32:18.600
there are a few things that you need to change
link |
00:32:22.120
and iterate over a few parameters
link |
00:32:24.960
that over time you can figure out.
link |
00:32:29.280
So I could call it policy gradient reinforcement learning,
link |
00:32:33.400
but actually it was just iterative learning.
link |
00:32:34.920
Iterative learning.
link |
00:32:36.000
And so this was there way back.
link |
00:32:37.800
I think what's interesting is,
link |
00:32:39.440
if you look at autonomous vehicles today,
link |
00:32:43.120
learning occurs, could occur in two pieces.
link |
00:32:45.680
One is perception, understanding the world.
link |
00:32:47.960
Second is action, taking actions.
link |
00:32:50.080
Everything that I've seen that is successful
link |
00:32:52.240
is on the perception side of things.
link |
00:32:54.360
So in computer vision,
link |
00:32:55.400
we've made amazing strides in the last 10 years.
link |
00:32:57.840
So recognizing objects, actually detecting objects,
link |
00:33:01.640
classifying them and tagging them in some sense,
link |
00:33:06.400
annotating them.
link |
00:33:07.440
This is all done through machine learning.
link |
00:33:09.640
On the action side, on the other hand,
link |
00:33:12.160
I don't know of any examples
link |
00:33:13.720
where there are fielded systems
link |
00:33:15.560
where we actually learn
link |
00:33:17.560
the right behavior.
link |
00:33:20.560
Outside of single demonstration is successful.
link |
00:33:22.760
In the laboratory, this is the holy grail.
link |
00:33:24.640
Can you do end to end learning?
link |
00:33:26.040
Can you go from pixels to motor currents?
link |
00:33:30.200
This is really, really hard.
link |
00:33:32.800
And I think if you go forward,
link |
00:33:35.080
the right way to think about these things
link |
00:33:37.600
is data driven approaches,
link |
00:33:40.720
learning based approaches,
link |
00:33:42.400
in concert with model based approaches,
link |
00:33:45.280
which is the traditional way of doing things.
link |
00:33:47.320
So I think there's a piece,
link |
00:33:48.720
there's a role for each of these methodologies.
link |
00:33:51.400
So what do you think,
link |
00:33:52.440
just jumping out on topic
link |
00:33:53.880
since you mentioned autonomous vehicles,
link |
00:33:56.200
what do you think are the limits on the perception side?
link |
00:33:58.480
So I've talked to Elon Musk
link |
00:34:01.080
and there on the perception side,
link |
00:34:03.320
they're using primarily computer vision
link |
00:34:05.960
to perceive the environment.
link |
00:34:08.080
In your work with,
link |
00:34:09.760
because you work with the real world a lot
link |
00:34:12.560
and the physical world,
link |
00:34:13.720
what are the limits of computer vision?
link |
00:34:15.800
Do you think we can solve autonomous vehicles
link |
00:34:19.160
on the perception side,
link |
00:34:20.880
focusing on vision alone and machine learning?
link |
00:34:24.240
So, we also have a spinoff company,
link |
00:34:27.480
Exxon Technologies that works underground in mines.
link |
00:34:31.840
So you go into mines, they're dark, they're dirty.
link |
00:34:36.480
You fly in a dirty area,
link |
00:34:38.600
there's stuff you kick up from by the propellers,
link |
00:34:41.120
the downwash kicks up dust.
link |
00:34:42.720
I challenge you to get a computer vision algorithm
link |
00:34:45.520
to work there.
link |
00:34:46.680
So we use LIDARs in that setting.
link |
00:34:51.200
Indoors and even outdoors when we fly through fields,
link |
00:34:55.360
I think there's a lot of potential
link |
00:34:57.120
for just solving the problem using computer vision alone.
link |
00:35:01.240
But I think the bigger question is,
link |
00:35:02.760
can you actually solve
link |
00:35:06.160
or can you actually identify all the corner cases
link |
00:35:09.440
using a single sensing modality and using learning alone?
link |
00:35:13.920
So what's your intuition there?
link |
00:35:15.400
So look, if you have a corner case
link |
00:35:17.920
and your algorithm doesn't work,
link |
00:35:20.000
your instinct is to go get data about the corner case
link |
00:35:23.200
and patch it up, learn how to deal with that corner case.
link |
00:35:27.640
But at some point, this is gonna saturate,
link |
00:35:32.040
this approach is not viable.
link |
00:35:34.200
So today, computer vision algorithms can detect
link |
00:35:38.000
90% of the objects or can detect objects 90% of the time,
link |
00:35:41.360
classify them 90% of the time.
link |
00:35:43.920
Cats on the internet probably can do 95%, I don't know.
link |
00:35:47.960
But to get from 90% to 99%, you need a lot more data.
link |
00:35:52.520
And then I tell you, well, that's not enough
link |
00:35:54.480
because I have a safety critical application,
link |
00:35:56.680
I wanna go from 99% to 99.9%.
link |
00:36:00.160
That's even more data.
link |
00:36:01.600
So I think if you look at wanting accuracy on the X axis
link |
00:36:09.600
and look at the amount of data on the Y axis,
link |
00:36:14.080
I believe that curve is an exponential curve.
link |
00:36:16.440
Wow, okay, it's even hard if it's linear.
link |
00:36:19.480
It's hard if it's linear, totally,
link |
00:36:20.800
but I think it's exponential.
link |
00:36:22.560
And the other thing you have to think about
link |
00:36:24.120
is that this process is a very, very power hungry process
link |
00:36:29.600
to run data farms or servers.
link |
00:36:32.880
Power, do you mean literally power?
link |
00:36:34.600
Literally power, literally power.
link |
00:36:36.600
So in 2014, five years ago, and I don't have more recent data,
link |
00:36:41.760
2% of US electricity consumption was from data farms.
link |
00:36:48.360
So we think about this as an information science
link |
00:36:52.080
and information processing problem.
link |
00:36:54.240
Actually, it is an energy processing problem.
link |
00:36:57.840
And so unless we figured out better ways of doing this,
link |
00:37:00.440
I don't think this is viable.
link |
00:37:02.440
So talking about driving, which is a safety critical application
link |
00:37:06.600
and some aspect of flight is safety critical,
link |
00:37:10.440
maybe philosophical question, maybe an engineering one,
link |
00:37:12.960
what problem do you think is harder to solve,
link |
00:37:15.000
autonomous driving or autonomous flight?
link |
00:37:18.120
That's a really interesting question.
link |
00:37:19.920
I think autonomous flight has several advantages
link |
00:37:25.440
that autonomous driving doesn't have.
link |
00:37:29.360
So look, if I want to go from point A to point B,
link |
00:37:32.400
I have a very, very safe trajectory.
link |
00:37:34.320
Go vertically up to a maximum altitude,
link |
00:37:36.800
fly horizontally to just about the destination,
link |
00:37:39.480
and then come down vertically.
link |
00:37:42.400
This is preprogrammed.
link |
00:37:45.400
The equivalent of that is very hard to find
link |
00:37:48.040
in the self driving car world because you're on the ground,
link |
00:37:51.560
you're in a two dimensional surface,
link |
00:37:53.560
and the trajectories on the two dimensional surface
link |
00:37:56.680
are more likely to encounter obstacles.
link |
00:38:00.200
I mean this in an intuitive sense, but mathematically true.
link |
00:38:03.280
That's mathematically as well, that's true.
link |
00:38:06.360
There's other option on the 2G space of platooning,
link |
00:38:10.040
or because there's so many obstacles,
link |
00:38:11.640
you can connect with those obstacles
link |
00:38:13.280
and all these kind of options.
link |
00:38:14.560
Sure, but those exist in the three dimensional space as well.
link |
00:38:16.560
So they do.
link |
00:38:17.560
So the question also implies how difficult are obstacles
link |
00:38:21.800
in the three dimensional space in flight?
link |
00:38:23.800
So that's the downside.
link |
00:38:25.600
I think in three dimensional space,
link |
00:38:26.920
you're modeling three dimensional world,
link |
00:38:29.080
not just because you want to avoid it,
link |
00:38:31.280
but you want to reason about it,
link |
00:38:33.040
and you want to work in the three dimensional environment,
link |
00:38:35.360
and that's significantly harder.
link |
00:38:37.480
So that's one disadvantage.
link |
00:38:38.920
I think the second disadvantage is of course,
link |
00:38:41.040
anytime you fly, you have to put up
link |
00:38:43.200
with the peculiarities of aerodynamics
link |
00:38:46.560
and their complicated environments.
link |
00:38:48.720
How do you negotiate that?
link |
00:38:49.800
So that's always a problem.
link |
00:38:51.880
Do you see a time in the future where there is,
link |
00:38:55.240
you mentioned there's agriculture applications.
link |
00:38:58.720
So there's a lot of applications of flying robots,
link |
00:39:01.680
but do you see a time in the future
link |
00:39:03.040
where there's tens of thousands,
link |
00:39:05.360
or maybe hundreds of thousands of delivery drones
link |
00:39:08.160
that fill the sky, delivery flying robots?
link |
00:39:12.160
I think there's a lot of potential
link |
00:39:14.200
for the last mile delivery.
link |
00:39:15.920
And so in crowded cities, I don't know,
link |
00:39:19.240
if you go to a place like Hong Kong,
link |
00:39:21.400
just crossing the river can take half an hour,
link |
00:39:24.400
and while a drone can just do it in five minutes at most.
link |
00:39:29.400
I think you look at delivery of supplies to remote villages.
link |
00:39:35.800
I work with a nonprofit called Weave Robotics.
link |
00:39:38.680
So they work in the Peruvian Amazon,
link |
00:39:40.920
where the only highways that are available
link |
00:39:44.680
are the only highways or rivers.
link |
00:39:47.440
And to get from point A to point B may take five hours,
link |
00:39:52.960
while with a drone, you can get there in 30 minutes.
link |
00:39:56.680
So just delivering drugs,
link |
00:39:59.880
retrieving samples for testing vaccines,
link |
00:40:05.160
I think there's huge potential here.
link |
00:40:07.120
So I think the challenges are not technological,
link |
00:40:09.960
but the challenge is economical.
link |
00:40:12.040
The one thing I'll tell you that nobody thinks about
link |
00:40:15.560
is the fact that we've not made huge strides
link |
00:40:18.920
in battery technology.
link |
00:40:20.840
Yes, it's true, batteries are becoming less expensive
link |
00:40:23.520
because we have these mega factories that are coming up,
link |
00:40:26.240
but they're all based on lithium based technologies.
link |
00:40:28.800
And if you look at the energy density
link |
00:40:31.480
and the power density,
link |
00:40:33.240
those are two fundamentally limiting numbers.
link |
00:40:38.000
So power density is important
link |
00:40:39.680
because for a UAV to take off vertically into the air,
link |
00:40:42.480
which most drones do, they don't have a runway,
link |
00:40:46.360
you consume roughly 200 watts per kilo at the small size.
link |
00:40:51.560
That's a lot, right?
link |
00:40:53.920
In contrast, the human brain consumes less than 80 watts,
link |
00:40:57.520
the whole of the human brain.
link |
00:40:59.920
So just imagine just lifting yourself into the air
link |
00:41:03.600
is like two or three light bulbs,
link |
00:41:06.000
which makes no sense to me.
link |
00:41:07.840
Yeah, so you're going to have to at scale
link |
00:41:10.440
solve the energy problem then,
link |
00:41:12.880
charging the batteries, storing the energy and so on.
link |
00:41:18.920
And then the storage is the second problem,
link |
00:41:20.680
but storage limits the range.
link |
00:41:22.960
But you have to remember that you have to burn
link |
00:41:28.680
a lot of it per given time.
link |
00:41:31.600
So the burning is another problem.
link |
00:41:32.920
Which is a power question.
link |
00:41:34.640
Yes, and do you think just your intuition,
link |
00:41:38.640
there are breakthroughs in batteries on the horizon?
link |
00:41:44.960
How hard is that problem?
link |
00:41:46.440
Look, there are a lot of companies
link |
00:41:47.600
that are promising flying cars that are autonomous
link |
00:41:53.880
and that are clean.
link |
00:41:59.400
I think they're over promising.
link |
00:42:01.680
The autonomy piece is doable.
link |
00:42:04.800
The clean piece, I don't think so.
link |
00:42:08.000
There's another company that I work with called JetOptra.
link |
00:42:11.840
They make small jet engines.
link |
00:42:15.760
And they can get up to 50 miles an hour very easily
link |
00:42:18.080
and lift 50 kilos.
link |
00:42:19.960
But they're jet engines, they're efficient,
link |
00:42:23.920
they're a little louder than electric vehicles,
link |
00:42:26.320
but they can build flying cars.
link |
00:42:28.960
So your sense is that there's a lot of pieces
link |
00:42:32.440
that have come together.
link |
00:42:33.520
So on this crazy question,
link |
00:42:37.360
if you look at companies like Kitty Hawk,
link |
00:42:39.720
working on electric, so the clean,
link |
00:42:43.880
talking to Sebastian Thrun, right?
link |
00:42:45.840
It's a crazy dream, you know?
link |
00:42:48.840
But you work with flight a lot.
link |
00:42:52.080
You've mentioned before that manned flights
link |
00:42:55.760
or carrying a human body is very difficult to do.
link |
00:43:01.640
So how crazy is flying cars?
link |
00:43:04.240
Do you think there'll be a day
link |
00:43:05.400
when we have vertical takeoff and landing vehicles
link |
00:43:11.080
that are sufficiently affordable
link |
00:43:14.960
that we're going to see a huge amount of them?
link |
00:43:17.440
And they would look like something like we dream of
link |
00:43:19.680
when we think about flying cars.
link |
00:43:21.080
Yeah, like the Jetsons.
link |
00:43:22.200
The Jetsons, yeah.
link |
00:43:23.160
So look, there are a lot of smart people working on this
link |
00:43:25.560
and you never say something is not possible
link |
00:43:29.640
when you have people like Sebastian Thrun working on it.
link |
00:43:32.200
So I totally think it's viable.
link |
00:43:35.160
I question, again, the electric piece.
link |
00:43:38.240
The electric piece, yeah.
link |
00:43:39.520
And again, for short distances, you can do it.
link |
00:43:41.680
And there's no reason to suggest
link |
00:43:43.640
that these all just have to be rotorcrafts.
link |
00:43:45.840
You take off vertically,
link |
00:43:46.920
but then you morph into a forward flight.
link |
00:43:49.680
I think there are a lot of interesting designs.
link |
00:43:51.600
The question to me is, are these economically viable?
link |
00:43:56.040
And if you agree to do this with fossil fuels,
link |
00:43:59.160
it instantly immediately becomes viable.
link |
00:44:01.960
That's a real challenge.
link |
00:44:03.480
Do you think it's possible for robots and humans
link |
00:44:06.560
to collaborate successfully on tasks?
link |
00:44:08.840
So a lot of robotics folks that I talk to and work with,
link |
00:44:13.640
I mean, humans just add a giant mess to the picture.
link |
00:44:18.000
So it's best to remove them from consideration
link |
00:44:20.320
when solving specific tasks.
link |
00:44:22.400
It's very difficult to model.
link |
00:44:23.600
There's just a source of uncertainty.
link |
00:44:26.000
In your work with these agile flying robots,
link |
00:44:32.560
do you think there's a role for collaboration with humans?
link |
00:44:35.680
Or is it best to model tasks in a way
link |
00:44:38.600
that doesn't have a human in the picture?
link |
00:44:43.400
Well, I don't think we should ever think about robots
link |
00:44:46.760
without human in the picture.
link |
00:44:48.120
Ultimately, robots are there because we want them
link |
00:44:50.960
to solve problems for humans.
link |
00:44:54.360
But there's no general solution to this problem.
link |
00:44:58.280
I think if you look at human interaction
link |
00:45:00.000
and how humans interact with robots,
link |
00:45:02.400
you know, we think of these in sort of three different ways.
link |
00:45:05.280
One is the human commanding the robot.
link |
00:45:08.880
The second is the human collaborating with the robot.
link |
00:45:12.880
So for example, we work on how a robot
link |
00:45:15.520
can actually pick up things with a human and carry things.
link |
00:45:18.720
That's like true collaboration.
link |
00:45:20.880
And third, we think about humans as bystanders,
link |
00:45:25.000
self driving cars, what's the human's role
link |
00:45:27.240
and how do self driving cars
link |
00:45:30.320
acknowledge the presence of humans?
link |
00:45:32.920
So I think all of these things are different scenarios.
link |
00:45:35.840
It depends on what kind of humans, what kind of task.
link |
00:45:39.640
And I think it's very difficult to say
link |
00:45:41.840
that there's a general theory that we all have for this.
link |
00:45:45.520
But at the same time, it's also silly to say
link |
00:45:48.440
that we should think about robots independent of humans.
link |
00:45:52.000
So to me, human robot interaction
link |
00:45:55.760
is almost a mandatory aspect of everything we do.
link |
00:45:59.760
Yes, but to which degree, so your thoughts,
link |
00:46:02.440
if we jump to autonomous vehicles, for example,
link |
00:46:05.240
there's a big debate between what's called
link |
00:46:08.680
level two and level four.
link |
00:46:10.640
So semi autonomous and autonomous vehicles.
link |
00:46:13.680
And so the Tesla approach currently at least
link |
00:46:16.440
has a lot of collaboration between human and machine.
link |
00:46:18.960
So the human is supposed to actively supervise
link |
00:46:22.040
the operation of the robot.
link |
00:46:23.880
Part of the safety definition of how safe a robot is
link |
00:46:29.160
in that case is how effective is the human in monitoring it.
link |
00:46:32.880
Do you think that's ultimately not a good approach
link |
00:46:37.880
in sort of having a human in the picture,
link |
00:46:42.360
not as a bystander or part of the infrastructure,
link |
00:46:47.400
but really as part of what's required
link |
00:46:50.000
to make the system safe?
link |
00:46:51.560
This is harder than it sounds.
link |
00:46:53.720
I think, you know, if you, I mean,
link |
00:46:58.200
I'm sure you've driven before in highways and so on.
link |
00:47:01.360
It's really very hard to have to relinquish control
link |
00:47:06.120
to a machine and then take over when needed.
link |
00:47:10.440
So I think Tesla's approach is interesting
link |
00:47:12.280
because it allows you to periodically establish
link |
00:47:14.800
some kind of contact with the car.
link |
00:47:18.520
Toyota, on the other hand, is thinking about
link |
00:47:20.640
shared autonomy or collaborative autonomy as a paradigm.
link |
00:47:24.800
If I may argue, these are very, very simple ways
link |
00:47:27.480
of human robot collaboration,
link |
00:47:29.680
because the task is pretty boring.
link |
00:47:31.880
You sit in a vehicle, you go from point A to point B.
link |
00:47:35.000
I think the more interesting thing to me is,
link |
00:47:37.360
for example, search and rescue.
link |
00:47:38.760
I've got a human first responder, robot first responders.
link |
00:47:43.160
I gotta do something.
link |
00:47:45.120
It's important.
link |
00:47:46.000
I have to do it in two minutes.
link |
00:47:47.800
The building is burning.
link |
00:47:49.240
There's been an explosion.
link |
00:47:50.440
It's collapsed.
link |
00:47:51.360
How do I do it?
link |
00:47:52.800
I think to me, those are the interesting things
link |
00:47:54.740
where it's very, very unstructured.
link |
00:47:57.160
And what's the role of the human?
link |
00:47:58.480
What's the role of the robot?
link |
00:48:00.200
Clearly, there's lots of interesting challenges
link |
00:48:02.440
and there's a field.
link |
00:48:03.440
I think we're gonna make a lot of progress in this area.
link |
00:48:05.760
Yeah, it's an exciting form of collaboration.
link |
00:48:07.600
You're right.
link |
00:48:08.440
In autonomous driving, the main enemy
link |
00:48:11.120
is just boredom of the human.
link |
00:48:13.120
Yes.
link |
00:48:13.960
As opposed to in rescue operations,
link |
00:48:15.680
it's literally life and death.
link |
00:48:18.360
And the collaboration enables
link |
00:48:22.080
the effective completion of the mission.
link |
00:48:23.820
So it's exciting.
link |
00:48:24.760
In some sense, we're also doing this.
link |
00:48:27.400
You think about the human driving a car
link |
00:48:30.520
and almost invariably, the human's trying
link |
00:48:33.800
to estimate the state of the car,
link |
00:48:35.000
they estimate the state of the environment and so on.
link |
00:48:37.280
But what if the car were to estimate the state of the human?
link |
00:48:40.120
So for example, I'm sure you have a smartphone
link |
00:48:41.960
and the smartphone tries to figure out what you're doing
link |
00:48:44.580
and send you reminders and oftentimes telling you
link |
00:48:48.320
to drive to a certain place,
link |
00:48:49.540
although you have no intention of going there
link |
00:48:51.400
because it thinks that that's where you should be
link |
00:48:53.880
because of some Gmail calendar entry
link |
00:48:57.520
or something like that.
link |
00:48:58.960
And it's trying to constantly figure out who you are,
link |
00:49:01.600
what you're doing.
link |
00:49:02.740
If a car were to do that,
link |
00:49:04.200
maybe that would make the driver safer
link |
00:49:06.840
because the car is trying to figure out
link |
00:49:08.160
is the driver paying attention,
link |
00:49:09.760
looking at his or her eyes,
link |
00:49:12.480
looking at circadian movements.
link |
00:49:14.400
So I think the potential is there,
link |
00:49:16.480
but from the reverse side,
link |
00:49:18.600
it's not robot modeling, but it's human modeling.
link |
00:49:21.640
It's more on the human, right.
link |
00:49:22.880
And I think the robots can do a very good job
link |
00:49:25.320
of modeling humans if you really think about the framework
link |
00:49:29.120
that you have a human sitting in a cockpit,
link |
00:49:32.640
surrounded by sensors, all staring at him,
link |
00:49:35.820
in addition to be staring outside,
link |
00:49:37.860
but also staring at him.
link |
00:49:39.160
I think there's a real synergy there.
link |
00:49:40.960
Yeah, I love that problem
link |
00:49:42.360
because it's the new 21st century form of psychology,
link |
00:49:45.560
actually AI enabled psychology.
link |
00:49:48.520
A lot of people have sci fi inspired fears
link |
00:49:51.280
of walking robots like those from Boston Dynamics.
link |
00:49:54.080
If you just look at shows on Netflix and so on,
link |
00:49:56.480
or flying robots like those you work with,
link |
00:49:59.920
how would you, how do you think about those fears?
link |
00:50:03.160
How would you alleviate those fears?
link |
00:50:05.040
Do you have inklings, echoes of those same concerns?
link |
00:50:09.040
You know, anytime we develop a technology
link |
00:50:11.760
meaning to have positive impact in the world,
link |
00:50:14.160
there's always the worry that,
link |
00:50:17.440
you know, somebody could subvert those technologies
link |
00:50:21.000
and use it in an adversarial setting.
link |
00:50:23.280
And robotics is no exception, right?
link |
00:50:25.280
So I think it's very easy to weaponize robots.
link |
00:50:29.280
I think we talk about swarms.
link |
00:50:31.720
One thing I worry a lot about is,
link |
00:50:33.960
so, you know, for us to get swarms to work
link |
00:50:35.880
and do something reliably, it's really hard.
link |
00:50:38.280
But suppose I have this challenge
link |
00:50:42.040
of trying to destroy something,
link |
00:50:44.360
and I have a swarm of robots,
link |
00:50:45.720
where only one out of the swarm
link |
00:50:47.280
needs to get to its destination.
link |
00:50:48.920
So that suddenly becomes a lot more doable.
link |
00:50:52.640
And so I worry about, you know,
link |
00:50:54.720
this general idea of using autonomy
link |
00:50:56.920
with lots and lots of agents.
link |
00:51:00.040
I mean, having said that, look,
link |
00:51:01.320
a lot of this technology is not very mature.
link |
00:51:03.760
My favorite saying is that
link |
00:51:06.560
if somebody had to develop this technology,
link |
00:51:10.520
wouldn't you rather the good guys do it?
link |
00:51:12.320
So the good guys have a good understanding
link |
00:51:13.880
of the technology, so they can figure out
link |
00:51:15.560
how this technology is being used in a bad way,
link |
00:51:18.320
or could be used in a bad way and try to defend against it.
link |
00:51:21.360
So we think a lot about that.
link |
00:51:22.760
So we have, we're doing research
link |
00:51:25.400
on how to defend against swarms, for example.
link |
00:51:28.240
That's interesting.
link |
00:51:29.600
There's in fact a report by the National Academies
link |
00:51:32.960
on counter UAS technologies.
link |
00:51:36.680
This is a real threat,
link |
00:51:38.200
but we're also thinking about how to defend against this
link |
00:51:40.320
and knowing how swarms work.
link |
00:51:42.920
Knowing how autonomy works is, I think, very important.
link |
00:51:47.160
So it's not just politicians?
link |
00:51:49.280
Do you think engineers have a role in this discussion?
link |
00:51:51.640
Absolutely.
link |
00:51:52.480
I think the days where politicians
link |
00:51:55.280
can be agnostic to technology are gone.
link |
00:51:59.200
I think every politician needs to be
link |
00:52:03.840
literate in technology.
link |
00:52:05.680
And I often say technology is the new liberal art.
link |
00:52:09.800
Understanding how technology will change your life,
link |
00:52:12.920
I think is important.
link |
00:52:14.480
And every human being needs to understand that.
link |
00:52:18.080
And maybe we can elect some engineers
link |
00:52:20.160
to office as well on the other side.
link |
00:52:22.720
What are the biggest open problems in robotics?
link |
00:52:24.840
And you said we're in the early days in some sense.
link |
00:52:27.760
What are the problems we would like to solve in robotics?
link |
00:52:31.040
I think there are lots of problems, right?
link |
00:52:32.520
But I would phrase it in the following way.
link |
00:52:36.440
If you look at the robots we're building,
link |
00:52:39.520
they're still very much tailored towards
link |
00:52:43.160
doing specific tasks and specific settings.
link |
00:52:46.520
I think the question of how do you get them to operate
link |
00:52:49.480
in much broader settings
link |
00:52:53.560
where things can change in unstructured environments
link |
00:52:58.040
is up in the air.
link |
00:52:59.160
So think of self driving cars.
link |
00:53:02.920
Today, we can build a self driving car in a parking lot.
link |
00:53:05.680
We can do level five autonomy in a parking lot.
link |
00:53:10.040
But can you do a level five autonomy
link |
00:53:13.240
in the streets of Napoli in Italy or Mumbai in India?
link |
00:53:16.840
No.
link |
00:53:17.760
So in some sense, when we think about robotics,
link |
00:53:22.400
we have to think about where they're functioning,
link |
00:53:25.120
what kind of environment, what kind of a task.
link |
00:53:27.760
We have no understanding
link |
00:53:29.800
of how to put both those things together.
link |
00:53:32.800
So we're in the very early days
link |
00:53:34.000
of applying it to the physical world.
link |
00:53:35.920
And I was just in Naples actually.
link |
00:53:38.800
And there's levels of difficulty and complexity
link |
00:53:42.200
depending on which area you're applying it to.
link |
00:53:45.880
I think so.
link |
00:53:46.720
And we don't have a systematic way of understanding that.
link |
00:53:51.040
Everybody says, just because a computer
link |
00:53:53.800
can now beat a human at any board game,
link |
00:53:56.520
we certainly know something about intelligence.
link |
00:53:59.920
That's not true.
link |
00:54:01.360
A computer board game is very, very structured.
link |
00:54:04.400
It is the equivalent of working in a Henry Ford factory
link |
00:54:08.480
where things, parts come, you assemble, move on.
link |
00:54:11.680
It's a very, very, very structured setting.
link |
00:54:14.120
That's the easiest thing.
link |
00:54:15.680
And we know how to do that.
link |
00:54:18.400
So you've done a lot of incredible work
link |
00:54:20.400
at the UPenn, University of Pennsylvania, GraspLab.
link |
00:54:23.720
You're now Dean of Engineering at UPenn.
link |
00:54:26.560
What advice do you have for a new bright eyed undergrad
link |
00:54:31.320
interested in robotics or AI or engineering?
link |
00:54:34.640
Well, I think there's really three things.
link |
00:54:36.560
One is you have to get used to the idea
link |
00:54:40.600
that the world will not be the same in five years
link |
00:54:42.840
or four years whenever you graduate, right?
link |
00:54:45.160
Which is really hard to do.
link |
00:54:46.120
So this thing about predicting the future,
link |
00:54:48.960
every one of us needs to be trying
link |
00:54:50.520
to predict the future always.
link |
00:54:53.280
Not because you'll be any good at it,
link |
00:54:54.960
but by thinking about it,
link |
00:54:56.440
I think you sharpen your senses and you become smarter.
link |
00:55:00.880
So that's number one.
link |
00:55:02.080
Number two, it's a corollary of the first piece,
link |
00:55:05.760
which is you really don't know what's gonna be important.
link |
00:55:09.360
So this idea that I'm gonna specialize in something
link |
00:55:12.080
which will allow me to go in a particular direction,
link |
00:55:15.320
it may be interesting,
link |
00:55:16.480
but it's important also to have this breadth
link |
00:55:18.480
so you have this jumping off point.
link |
00:55:22.000
I think the third thing,
link |
00:55:23.000
and this is where I think Penn excels.
link |
00:55:25.360
I mean, we teach engineering,
link |
00:55:27.240
but it's always in the context of the liberal arts.
link |
00:55:29.960
It's always in the context of society.
link |
00:55:32.360
As engineers, we cannot afford to lose sight of that.
link |
00:55:35.840
So I think that's important.
link |
00:55:37.640
But I think one thing that people underestimate
link |
00:55:39.960
when they do robotics
link |
00:55:40.920
is the importance of mathematical foundations,
link |
00:55:43.440
the importance of representations.
link |
00:55:47.720
Not everything can just be solved
link |
00:55:50.040
by looking for Ross packages on the internet
link |
00:55:52.440
or to find a deep neural network that works.
link |
00:55:56.280
I think the representation question is key,
link |
00:55:59.080
even to machine learning,
link |
00:56:00.400
where if you ever hope to achieve or get to explainable AI,
link |
00:56:05.400
somehow there need to be representations
link |
00:56:07.760
that you can understand.
link |
00:56:09.080
So if you wanna do robotics,
link |
00:56:11.120
you should also do mathematics.
link |
00:56:12.680
And you said liberal arts, a little literature.
link |
00:56:16.160
If you wanna build a robot,
link |
00:56:17.200
it should be reading Dostoyevsky.
link |
00:56:19.320
I agree with that.
link |
00:56:20.360
Very good.
link |
00:56:21.200
So Vijay, thank you so much for talking today.
link |
00:56:23.560
It was an honor.
link |
00:56:24.400
Thank you.
link |
00:56:25.240
It was just a very exciting conversation.
link |
00:56:26.200
Thank you.