back to index

Luís and João Batalha: Fermat's Library and the Art of Studying Papers | Lex Fridman Podcast #209


small model | large model

link |
00:00:00.000
The following is a conversation with Luiz and Joao Batala, brothers and cofounders of Fermat's
link |
00:00:06.320
library, which is an incredible platform for annotating papers. Is there right on the Fermat's
link |
00:00:12.800
library website? Justice Pierre de Fermat scribbled his famous last theorem in the margins.
link |
00:00:18.640
Professional scientists, academics, and citizen scientists can annotate equations, figures,
link |
00:00:23.600
ideas, and write in the margins. Fermat's library is also a really good Twitter account to follow.
link |
00:00:30.640
I highly recommend it. They post little visual factoids and explorations that reveal the beauty of
link |
00:00:37.120
mathematics. I love it. Quick mention of our sponsors. Skiff, Simply Safe, Indeed, Nutsuite,
link |
00:00:46.000
and FourSigmatic. Check them out in the description to support this podcast.
link |
00:00:50.880
As a side note, let me say a few words about the dissemination of scientific ideas.
link |
00:00:56.000
I believe that all scientific articles should be freely accessible to the public.
link |
00:01:01.440
They currently are not. In one analysis I saw, more than 70% of published research articles are
link |
00:01:07.520
behind a paywall. In case you don't know, the funders of the research, whether that's government
link |
00:01:13.280
or industry, aren't the ones putting up the paywall. The journals are the ones putting up the
link |
00:01:19.200
paywall, while using unpaid labor from researchers for the peer review process.
link |
00:01:25.520
Where is all that money from the paywall going? In this digital age, the costs here should be
link |
00:01:31.520
minimal. This cost can easily be covered through donation, advertisement, or public funding of
link |
00:01:37.520
science. The benefit versus the cost of all papers being free to read is obvious, and the fact that
link |
00:01:43.120
they're not free goes against everything science should stand for, which is the free dissemination
link |
00:01:48.640
of ideas that educate and inspire. Science cannot be a gated institution. The more people can freely
link |
00:01:56.560
learn and collaborate on ideas, the more problems we can solve in the world together, and the faster
link |
00:02:02.240
we can drive old ideas out and bring new, better ideas in. Science is beautiful and powerful,
link |
00:02:10.160
and its dissemination in this digital age should be free. This is the Lex Friedman podcast,
link |
00:02:17.200
and here's my conversation with Louise and Joao Batalla.
link |
00:02:22.320
Louise, you suggested an interesting idea. Imagine if most papers had a backstory section,
link |
00:02:29.760
the same way that they have an abstract. Knowing more about how the authors ended up
link |
00:02:36.080
working on a paper can be extremely insightful, and then you went on to give a backstory for
link |
00:02:40.880
the Feynman QED paper. This is all on the tweet, by the way. We're doing tweet analysis today.
link |
00:02:45.520
Okay. How much of the human backstory do you think is important in understanding
link |
00:02:50.400
the idea itself that's presented in the paper or in general?
link |
00:02:54.240
I think this gives way more context to the work of scientists. A lot of people have this
link |
00:03:00.800
almost kind of romantic misconception that the way a lot of scientists work is almost as the sum
link |
00:03:07.680
of eureka moments where all of a sudden they sit down and start writing two papers in a row,
link |
00:03:13.200
and the papers are usually isolated. When you actually look at it, the papers are
link |
00:03:18.640
chapters of a way more complex story. The Feynman QED paper is a good example. Feynman was actually
link |
00:03:26.480
going through a pretty dark phase before writing that paper. He lost enthusiasm with physics and
link |
00:03:33.600
doing physics problems. There was one time when he was in the cafeteria of Cornell and he saw a
link |
00:03:39.440
guy that was throwing plates in the air. He noticed that when the plate was in the air,
link |
00:03:44.640
there were two movements there. The plate was wobbling, but he also noticed that the Cornell
link |
00:03:49.680
symbol was rotating. He was able to figure out the equations of motions, the equations of motions
link |
00:03:55.600
of those plates. That led him to kind of think a little bit about electron orbits in relativity,
link |
00:04:04.720
which led to the paper about quantum electrodynamics. That kind of reignited his interest in physics
link |
00:04:15.040
and ended up publishing the paper that led to his Nobel Prize, basically. I think there are a lot
link |
00:04:22.480
of really interesting backstories about papers that readers never get to know. For instance,
link |
00:04:26.480
we did a couple of months ago an AMA around a pretty famous paper, The Gams Paper with Ian
link |
00:04:35.920
Goodfellow. We did an AMA where everyone could ask questions about the paper and Ian was
link |
00:04:41.840
responding to those questions. He was also telling the story of how he got the idea for that paper
link |
00:04:47.680
in a bar. There was also an interesting backstory. I also read a book by Cedric Vellani. Cedric
link |
00:04:57.600
Vellani is this mathematician, a Fields Medalist. In his book, he tries to explain how he got from
link |
00:05:04.160
a PhD student to the Fields Medal. He tries to be as descriptive as possible about every single
link |
00:05:09.440
step how he got to the Fields Medal. It's interesting also to see just the amount of
link |
00:05:14.480
random interactions and discussions with other researchers sometimes over coffee and how it
link |
00:05:19.600
led to fundamental breakthroughs and some of his most important papers. I think it's super
link |
00:05:24.960
interesting to have that context of the backstory. Well, the Ian Goodfellow story is kind of interesting
link |
00:05:30.000
and perhaps that's true for Feynman as well. I don't know if it's romanticizing the thing, but
link |
00:05:34.720
it seems like just a few little insights and a little bit of work does most of the leap required.
link |
00:05:41.520
Do you have a sense for a lot of the stuff you've looked at just looking back through history?
link |
00:05:48.560
It wasn't necessarily the grind of like Andrew Wiles or the Fermat's last theorem,
link |
00:05:53.760
for example. It was more like a brilliant moment of insight. In fact, Ian Goodfellow
link |
00:05:59.760
has a kind of sadness to him almost in that at that time in machine learning,
link |
00:06:04.960
like at that time, especially in for GANs, you could code something up really quickly in a single
link |
00:06:12.880
machine and almost do the invention, go from idea to experimental validation in like a single night,
link |
00:06:20.800
a single person could do it. And now there's kind of a sadness that a lot of the breakthroughs you
link |
00:06:25.440
might have in machine learning kind of require large scale experiments. So it was almost like
link |
00:06:30.640
the early days. So I wonder how many low hanging fruit there are in science and mathematics and
link |
00:06:39.520
even engineering where it's like you could do that little experiment quickly. Like you have an
link |
00:06:44.960
insight in a bar. Why is it always a bar? But you have an insight at a bar and then just implement
link |
00:06:51.040
and the world changes. It's a good point. I think it also depends a lot on the maturity of the field.
link |
00:06:57.760
When you look at a field like mathematics, like it's a pretty mature field, a field like machine
link |
00:07:03.520
learning, it's growing pretty fast. And it's actually pretty interesting. I looked up like the
link |
00:07:11.360
number of new papers on archive with the keyword machine learning. And like 50% of those papers
link |
00:07:19.120
have been published in the last 12 months. So you can see just the sense, 50, 50%. So you can see
link |
00:07:26.320
the magnitude of growth in that field. And so I think like as fields mature, like those types of
link |
00:07:33.920
moments, I think naturally are less frequent. It's just a consequence of that. The other point
link |
00:07:41.840
that is interesting about the backstory is that it can really make it more memorable in a way.
link |
00:07:46.640
And by making it more memorable, it kind of sediments the knowledge more in your mind. I
link |
00:07:51.520
remember also reading the backstory to Dijkstra's shortest path algorithm, where he came up with it
link |
00:08:01.200
essentially while he was sitting down at a coffee shop in Amsterdam. And he came up with that
link |
00:08:08.000
algorithm over 20 minutes. And one interesting aspect is that he didn't have any pen or paper
link |
00:08:12.800
at the time. And so he had to do it all in his mind. And so there's only so much complexity
link |
00:08:17.200
that he can handle if you're just thinking about it in your mind. And that when you think about
link |
00:08:22.000
the simplicity of Dijkstra's shortest path finding algorithm, it's knowing that backstory helps
link |
00:08:29.200
sediment that algorithm in your mind so that you don't forget about it as easily.
link |
00:08:33.280
It might be from you that I saw a meme about Dijkstra. It's like he's trying to solve it and
link |
00:08:41.280
he comes up with some kind of random path. And then it's like my parents aren't home. And then he
link |
00:08:45.840
does, he figures out the algorithm for the shortest path. It's right through words to convey memes,
link |
00:08:54.160
but it's hilarious. I don't know if it's in post that we construct stories that romanticize it.
link |
00:09:00.000
Apparently with Newton, there was no Apple, especially when you're working on problems that
link |
00:09:05.440
have a physical manifestation or a visual manifestation. It feels like the world could be
link |
00:09:12.560
an inspiration to you. So it doesn't have to be completely on paper. Like you could be sitting
link |
00:09:20.000
at a bar and all of a sudden see something and a pattern will spark another pattern and you can
link |
00:09:26.160
visualize it and rethink a problem in a particular way. Of course, you can also load the math that
link |
00:09:32.320
you have on paper and always carry that with you. So when you show up to the bar, some little
link |
00:09:36.240
inspiration could be the thing that changes it. Is there any other people almost on the human side,
link |
00:09:42.960
whether it's physics with Feynman, Dirac, Einstein, or computer science, touring anybody else,
link |
00:09:50.240
any backstories that you remember that jump out? Because I'm also referring to
link |
00:09:55.760
not necessarily these stories where something magical happens, but these are personalities.
link |
00:10:01.040
They have big egos. Some of them are super friendly. Some of them are self obsessed. Some
link |
00:10:07.200
of them have anger issues. Some of them, how do I describe Feynman? But he appears to have
link |
00:10:15.280
appreciation of the beautiful in all its forms. He has a wit and a cleverness and a humor about
link |
00:10:20.560
him. Does that come into play in terms of the construction of the science?
link |
00:10:25.920
I think you brought up Newton. Newton is a good example also to think about his back story,
link |
00:10:30.480
because there's a certain back story of Newton that people always talk about. But then there's a
link |
00:10:36.000
whole another aspect of him that is also a big part of the person that he was, but he was really
link |
00:10:42.240
into alchemy. He spent a lot of time thinking about that and writing about it. He took it very
link |
00:10:49.280
seriously. He was really into Bible interpretation, trying to predict things based on the Bible.
link |
00:10:55.120
And so there's also a whole back story then. And of course, you need to look at it in the context
link |
00:10:59.520
that and the time that when Newton lived, but it adds to his personality. And it's important to
link |
00:11:07.280
also understand those aspects that maybe people are not as proud to teach to little kids, but
link |
00:11:14.800
it's important. It was part of who he was. And maybe without those, who knows what he would
link |
00:11:19.600
have done otherwise. Well, the cool thing about alchemy, I don't know how it was viewed at the
link |
00:11:26.080
time, but it almost like to me symbolizes dreaming of the impossible. Most of the breakthrough ideas
link |
00:11:35.120
kind of seem impossible until they're actually done. It's like achieving human flight. It's not
link |
00:11:39.920
completely obvious to me that alchemy is impossible or like putting myself in the mindset of the time.
link |
00:11:47.120
And perhaps even still, everything that some of the most incredible breakthroughs
link |
00:11:53.440
are would seem impossible. And I wonder the value of believing, almost like focusing and dreaming
link |
00:12:03.200
of the impossible such that it is actually is possible in your mind and that in itself manifests
link |
00:12:10.400
whether the accomplishing that goal or making progress in some unexpected direction. So alchemy
link |
00:12:16.800
almost symbolizes that for me. I distinctly remember having the same thought of thinking,
link |
00:12:21.680
you know, when I learned about atoms and that they have protons and electrons, I was like,
link |
00:12:25.600
okay, to make gold, you just take whatever has an atomic weight below it and then shove another
link |
00:12:30.160
proton in there, and then you have a bunch of gold. So like, why don't people do that?
link |
00:12:34.640
It seemed like conceptually is like, you know, this sounds feasible. You might be able to do it.
link |
00:12:39.280
And you can actually just very, very expensive. Yeah, yeah, exactly. Exactly. So in a sense,
link |
00:12:44.240
we do have alchemy. And maybe even back then, it wasn't as crazy that he was so into it.
link |
00:12:49.520
But good people just don't like to talk about that as much. Yeah, but Newton in general was a very
link |
00:12:55.120
interesting fella. Anybody else come to mind? In terms of people that inspire you, in terms of
link |
00:13:03.200
people that you just are happy that they have once or still exist on this earth?
link |
00:13:09.680
I think, I mean, Freeman Dyson for me. Yeah, Freeman Dyson was, I've had a chance to actually
link |
00:13:16.960
exchange a couple of emails with him. He was probably one of the most humble scientists that
link |
00:13:21.200
I've ever met. And that had a big impact on me. We're actually trying to convince him to annotate
link |
00:13:29.040
a paper on Fermat's library. And I sent him an email asking him if he could annotate a paper.
link |
00:13:36.560
And his response was something like, I have very limited knowledge. I just know a couple of things
link |
00:13:42.240
about certain fields. I'm not sure if I'm qualified to do that. That was his first response.
link |
00:13:47.760
And this was someone that should have won a Nobel Prize and worked on a bunch of different fields,
link |
00:13:53.600
did some really, really great work. And then just the interactions that I had with him,
link |
00:13:58.880
every time I asked him a couple of questions about his papers, and he always responded saying,
link |
00:14:05.200
I'm not here to answer your questions. I just want to open up more questions. And so that had a big
link |
00:14:13.040
impact on me. It's like just an example of an extremely humble yet accomplished scientist.
link |
00:14:21.600
And Feynman was also a big, big inspiration in the sense that he was able to be, you know,
link |
00:14:27.600
again, extremely talented and scientist. But at the same time, socially, he was able to,
link |
00:14:33.200
he was also really smart from a social perspective. And he was able to interact with people. It was
link |
00:14:39.520
also a really good teacher. And it was also to did a awesome work in terms of explaining physics
link |
00:14:48.160
to the masses and motivating and getting people interested in physics. And that for me was also
link |
00:14:53.920
big inspiration. Yeah, I like the child like curiosity. Some of those folks like you mentioned
link |
00:14:58.880
Feynman. I've, Daniel Kahneman, I got a chance to meet and interact with some of these truly
link |
00:15:04.560
special scientists. What makes them special is that even in older age, they're still like,
link |
00:15:12.000
there's still that fire of childlike curiosity that burns. And some of that is like, not taking
link |
00:15:18.560
yourself so seriously that you think you've figured it all out, but almost like thinking that you
link |
00:15:23.200
don't know much of it. And that's like step one in having a great conversation or collaboration
link |
00:15:32.880
or exploring a scientific question. And it's cool how the very thing that probably earned people the
link |
00:15:38.640
Nobel Prize or work that's seminal in some way is the very thing that still burns even after
link |
00:15:47.760
they've won the prize. It's cool to see. And they're rare humans, it seems.
link |
00:15:52.560
And to that point, I remember like the last email that I sent to Freeman Dyson was like in his
link |
00:15:56.800
last birthday, he was really into number theory and primes. So what I did is I took like a photo
link |
00:16:03.280
of him picture and then I turned that into like a giant prime number. So I converted the picture
link |
00:16:10.320
into a bunch of one and eights. And then I moved some numbers around until it was a prime. And then
link |
00:16:16.000
I sent him that. Oh, so the the visual like it still looked like the picture. It's made up of a
link |
00:16:21.280
prime. That's tricky to do. It's hard to do. It looks harder than it actually is. So the way you
link |
00:16:27.280
do it is like you convert the darker regions into eights and the lighter regions in once.
link |
00:16:33.040
And then there's just keep flipping. Yeah. But there's like some primality tests that are cheaper
link |
00:16:39.680
from a computational standpoint. But what he tells you is it excludes numbers that are not prime.
link |
00:16:46.240
Then you end up with a set of numbers that you don't know if they are prime or not.
link |
00:16:49.600
And then you run the full primality test on that. So you just have to keep iterating on that. And
link |
00:16:54.960
it was it was it's funny because when he got the picture, he was like, how did you do that? It was
link |
00:16:59.440
super curious too. And then we got into the details. And again, these was he was already 90,
link |
00:17:04.320
I think 92 or something. And that curiosity was still there. So you can really see that in some
link |
00:17:12.400
of these scientists. So could we talk about Fermat's library? Yeah, absolutely. What is it?
link |
00:17:19.920
What's the main goal? What's the dream? It is a platform for annotating papers in its essence.
link |
00:17:26.160
Right. And so academic papers can be one of the densest forms of content out there. And generally
link |
00:17:33.040
pretty hard to understand at times. And the idea is that you can make them more accessible and
link |
00:17:39.040
easier to understand by adding these rich annotations to the site, right? And so we can just imagine
link |
00:17:44.400
a PDF view on your browser, and then you have annotations on each side. And then when you click
link |
00:17:48.240
on them, a sidebar expands, and then you have annotations that support late tech and markdown.
link |
00:17:54.160
And so the idea is that you can, say, explain a tougher part of a paper where there's a step
link |
00:17:59.680
that is not completely obvious, or you can add more context to it. And then over time, papers can
link |
00:18:06.800
become easier and easier to understand and can evolve in a way. But it really came from
link |
00:18:13.600
myself, Luis and two other friends. We've had this long running habit of kind of running a journal
link |
00:18:20.000
club amongst us. We come from different backgrounds, right? I studied CS, we studied physics. And so
link |
00:18:25.120
we read papers and present them to each other. And then we try to bring some of that online.
link |
00:18:32.320
And that's when we decided to build Fermat's library. Then over time, it kind of grew into
link |
00:18:42.080
something with a broader goal. And really, what we're trying to do is trying to help
link |
00:18:49.360
move science in the right direction. That's really the ultimate goal and where we want to take it
link |
00:18:55.200
now. So there's a lot to be said. So first of all, for people who haven't seen it, the interface
link |
00:19:01.600
is exceptionally well done. That's like execution is really important here. Absolutely. The other
link |
00:19:06.880
thing is just to mention, for a large number of people, apparently, which is new to me,
link |
00:19:12.960
don't know what latex is. So it's spelled like latex. So be careful Googling it if you haven't
link |
00:19:19.600
before. It's a, sorry, I don't even know the correct terminology setting language. It's a
link |
00:19:25.520
typesetting language where it's you're basically program writing a program that then generates
link |
00:19:32.960
something that looks from a typography perspective beautiful. Absolutely. And so a lot of academics
link |
00:19:40.000
use it to write papers. I think there's like a bunch of communities that use it to write papers.
link |
00:19:46.000
I would say it's mathematics, physics, computer science. Yeah, that's yeah. That's because I'm
link |
00:19:52.880
collaborating currently on a paper with two neuroscientists from Stanford. And they don't know
link |
00:19:58.000
what. So I'm using Microsoft Word and Mendeley and like all of those kinds of things. And I'm
link |
00:20:08.880
being very zen like about the whole process. But it's fascinating. It's a little heartbreaking,
link |
00:20:15.040
actually, because it actually, it's funny to say, but we'll talk about open science, actually,
link |
00:20:23.040
the bigger mission behind Fermat's library is like really opening up the world of science to
link |
00:20:28.000
everybody. Is these silly two facts of like one community uses latex and another uses word
link |
00:20:37.600
is actually a barrier between them. That's like, it's like boring and practical in a sense, but
link |
00:20:43.680
this makes it very difficult to collaborate. Just on that, like I think that if there are some people
link |
00:20:49.040
that should have received like a Nobel Prize that but will never get it. And I think one of those
link |
00:20:53.600
is like Donald Knuth because of tech and latex and then because it had a huge impact in terms of
link |
00:20:59.840
like just making it easier for researchers to put their content out there, like making it uniform
link |
00:21:06.480
as much as possible. Oh, you mean like a Nobel Peace Prize? Maybe a Nobel Peace Prize.
link |
00:21:11.760
Maybe a Nobel Peace Prize. I think so. I mean, he at a very young age got the Turing Award for
link |
00:21:19.920
his work in algorithms and so on. So like an incredibly, I think it's in, it might be even
link |
00:21:25.760
the 60s, but I think it's the 70s. So when he was really young, and then he went on to do like
link |
00:21:30.800
incredible work with his book and yeah, with tech that people don't know. And going back just on
link |
00:21:38.400
the reason why we ended up, because I think this is interesting, the reason why we ended up using
link |
00:21:42.800
the name Fermat's Library, this was because of Fermat's last theorem. And Fermat's last theorem
link |
00:21:48.560
is actually a funny story. So Pierre de Fermat, he was like a lawyer and he wrote like on a book
link |
00:21:56.480
that he had a solution to Fermat's last theorem, which but that didn't fit the margin of that book.
link |
00:22:03.520
And so Fermat's last theorem basically states that there's no solution. If you have integers a,
link |
00:22:09.600
b and c, there's no solution to a to the power of n plus b to the power of n equals to c to the
link |
00:22:14.960
power of n if n is bigger than 2. So there's no solutions. And he said that and that problem
link |
00:22:24.560
remained open for almost 300 years, I believe. And a lot of the most famous mathematicians
link |
00:22:30.880
tried to tackle that problem. No one was able to figure that out until Andrew Wiles,
link |
00:22:36.560
I think was in the 90s, was able to publish the solution, which was I believe almost 300 pages
link |
00:22:41.760
long. And so it's kind of an anecdote that there's a lot of knowledge and insights that can be
link |
00:22:49.920
trapped in the margins, that there's a lot of potential energy that you can release if you
link |
00:22:55.680
can release, if you actually spend some time trying to digest that. And that was the origin
link |
00:23:02.080
story for the name. Yeah, you can share the contents of the margins with the world that could
link |
00:23:07.840
inspire a solution or communication that then leads to a solution. And if you think about
link |
00:23:14.080
papers, like papers are as Jean was saying, probably one of the densest pieces of text that
link |
00:23:18.960
any human can read. And you have these researchers, like some of the brightest minds in this field
link |
00:23:24.480
working on like new discoveries and publishing these work on journals that are imposing them
link |
00:23:30.400
restrictions in terms of the number of pages that they can have to explain a new scientific
link |
00:23:35.440
breakthrough. So at the end of the day, papers are not optimized for clarity and for a proper
link |
00:23:42.720
explanation of that content, because there are so many restrictions. So there's, as I mentioned,
link |
00:23:48.560
there's a lot of potential energy that can be freed if you actually try to digest a lot of the
link |
00:23:54.800
contents of papers. Can you explain some of the other things, so margins, librarian, journal club?
link |
00:24:01.600
So journal club is what a lot of people know us for, where every week we release an annotated
link |
00:24:07.360
paper in all sorts of different fields, but physics, CS, math. Margins is kind of the same
link |
00:24:13.520
software that we use to run the journal club and to host the annotations. But we've made that available
link |
00:24:19.200
for free to anybody that wants to use it. And so folks use it at universities and for running
link |
00:24:27.440
journal clubs. And so we've just made that freely available. And then librarian is a browser extension
link |
00:24:33.760
that we developed that is sort of an overlay on top of archive. So it's about bringing some of the
link |
00:24:38.880
same functionality around comments, plus adding some extra niceties to archive, like being able
link |
00:24:46.720
to very easily extract the references of a paper that you're looking at or being able to extract
link |
00:24:51.680
the BibDec in order to cite that paper yourself. So it's an overlay on top of archive.
link |
00:24:56.480
Yeah, the idea is that you can have that commenting interface without having to leave archive.
link |
00:25:01.120
It's kind of incredible. I didn't know about it. And once I've learned of it, it's like, holy shit.
link |
00:25:07.760
Why isn't it more popular given how popular archive is? Like everybody should be using it.
link |
00:25:14.880
Archive sucks in terms of its interface. Or let me rephrase that. It's limited in terms of its
link |
00:25:21.360
interface. Archive is a pretty incredible project. And in a way, the growth has been completely
link |
00:25:30.720
linear over time. If you look at like number of papers published on archive, it's pretty much
link |
00:25:36.080
a straight line for the past 20 years. Especially if you're coming from a startup background,
link |
00:25:41.760
and then you were trying to do archive, you'd probably try all sorts of growth acts and try to
link |
00:25:47.840
then maybe have paid features and things like that. And that would kind of maybe ruin it.
link |
00:25:53.120
And so there's a subtle balance there. And I don't know what aspects you can change about it.
link |
00:25:59.360
Yeah. For some tools in science, it just takes time for them to grow. Archive is just turned 30,
link |
00:26:06.000
I believe. And for people that don't know, archive is these kind of online repository where people
link |
00:26:11.280
put preprints, which are versions of the papers before they actually make it to journals.
link |
00:26:17.520
ARXIV, for people who don't know. And it's actually a really vibrant place to publish your
link |
00:26:24.720
papers in the aforementioned communities of mathematics, physics, and computer science.
link |
00:26:30.960
It started with mathematics and physics, and then over the last 30 years, it evolved. And now,
link |
00:26:36.400
actually computer science now, it's a more popular category than physics and math on archive.
link |
00:26:42.240
And there's also, which I don't know very much about, like biology, medical version of that as
link |
00:26:47.680
bioarchive. Yeah, bioarchive. It's interesting because if you look at these platforms for
link |
00:26:55.680
preprints, they actually play a super important role. Because if you look at a category like math,
link |
00:27:03.840
for some papers in math, it might take close to three years after you click upload paper
link |
00:27:11.120
on the journal website, and the paper gets published on the website of the journal.
link |
00:27:15.520
So this is literally the longest upload period on the internet. And during those three years,
link |
00:27:25.280
that content is just locked. And so that's why it's so important for people to have websites like
link |
00:27:31.760
archive so that you can share that before it goes to the journal with the rest of the world.
link |
00:27:36.160
That was actually on archive that Perlman published the three papers that led to the proof of the
link |
00:27:42.160
Poincaré conjecture. And then you have other fields like machine learning, for instance,
link |
00:27:47.920
where the field is evolving at such a high rate that people don't even wait before the papers
link |
00:27:53.920
go to journals before they start working on top of those papers. So they publish them on archive,
link |
00:27:58.560
then other people see them, they start working on that. And archive did a really good job at
link |
00:28:03.360
like building that core platform to host papers. But I think there's a really, really big opportunity
link |
00:28:09.440
in building more features on top of that platform apart from just hosting papers. So
link |
00:28:13.920
collaboration, annotations, and like having other things apart from papers like code
link |
00:28:20.560
and other things. Because for instance, in the field like machine learning,
link |
00:28:24.960
there's a really big, you know, as I mentioned, people start working on on top of preprints,
link |
00:28:29.520
and they are assuming that that that preprint is correct. But you really need a way, for instance,
link |
00:28:35.840
to maybe it's not peer review, but distinguish what is good work from bad work on archive. How
link |
00:28:42.560
do you do that? So like a commenting interface like librarian, it's useful for that so that
link |
00:28:47.760
you can distinguish that in a field that is growing so fast as machine learning.
link |
00:28:53.440
And then you have platforms that focus on just biology, bio archive is a good example.
link |
00:28:59.520
Bio archive is also super interesting because there's actually an interesting experiment that
link |
00:29:07.920
was run in the 60s. So in the 60s, the NIH supported this experiment called information
link |
00:29:16.400
exchange group, which at the time was a way for researchers to share biology preprints via mail
link |
00:29:24.320
or using libraries. And that project in the 1960s got canceled six years after it started.
link |
00:29:30.320
And it was due to intense pressure from the journals to kill that project because they
link |
00:29:35.520
were fearing competition from the preprints for the journal industry.
link |
00:29:43.120
Creek was also one of the famous scientists that opposed to the information exchange group.
link |
00:29:50.960
And it's interesting because right now, if you analyze the number of biology papers
link |
00:29:55.120
that appear first as preprints, it's only 2% of the papers. And this was almost 50 years after
link |
00:30:03.600
that first experiment. So you can see like that pressure from the journals to cancel that initial
link |
00:30:08.800
version of a preprint repo added tremendous impact on the number of papers that are showing up in
link |
00:30:15.840
biology as preprints. So it delayed a lot that revolution. But now platforms like bio archive
link |
00:30:23.920
are doing that work. But there's still a lot of room for growth there. And I think it's super
link |
00:30:28.320
important because those are the papers that are open that everyone can read.
link |
00:30:31.840
Okay. So if we just look at the entire process of science as a big system, can we just talk about
link |
00:30:37.120
how it can be revolutionized? So you have an idea, depending on the field,
link |
00:30:44.320
you want to make that idea concrete, you want to run a few experiments in computer size,
link |
00:30:49.040
there might be some code, there'd be a data set for, you know, some of the more sort of biology,
link |
00:30:57.280
psychology, you might be collecting the data set that's called, you know, a study, right?
link |
00:31:05.440
So that's part of that. That's part of the methodology. And so you are putting all that
link |
00:31:10.320
into a paper form. And then you have some results. And then you submit that to a place for review
link |
00:31:20.000
through the peer review process. And there's a process where how would you summarize the peer
link |
00:31:24.880
review process? But it's really just like a handful of people look over your paper and comment and
link |
00:31:30.400
based on that decide whether your paper is good or not. So there's a whole broken nature to it.
link |
00:31:36.320
At the same time, I love the peer review process. When I buy stuff on Amazon,
link |
00:31:44.960
for like the commenting system, whatever that is. So okay, so there's a bunch of possibilities for
link |
00:31:49.920
revolutions there. And then there's the other side, which is the collaborative aspect of the
link |
00:31:54.640
science, which is people annotating, people commenting, sort of the low effort collaboration,
link |
00:32:00.640
which is a comment. Sometimes, as you've talked about, a comment can change everything.
link |
00:32:05.680
But, you know, or a higher effort collaboration, like more like maybe annotations or even like
link |
00:32:11.840
contributing to the paper, you can think of like collaborative updating of the paper over time.
link |
00:32:20.080
So there's all these possibilities for doing things better than they've been done.
link |
00:32:27.040
Can we talk about some ideas in this space? Some ideas that you're working on,
link |
00:32:31.040
some ideas that you're not yet working on, but should be revolutionized. Because it does seem
link |
00:32:37.120
that archive and like open review, for example, are like the craigslist of science. Like,
link |
00:32:46.320
yeah, okay, I'm very grateful that we have it. But it just feels like it's like 10 to 20 years.
link |
00:32:53.600
Like it doesn't feel like that's a feature. The simplicity of it is a feature. It feels like
link |
00:32:58.240
it's a bug. But then again, the pushback there is Wikipedia has the same kind of simplicity to it.
link |
00:33:08.320
And it seems to work exceptionally well in the crowdsourcing aspect of it. So, sorry,
link |
00:33:14.320
there's a bunch of stuff going on the table. Let's just pick random things that we can talk about.
link |
00:33:20.160
Wikipedia, you know, for me, it's the cosmological constant of the internet.
link |
00:33:23.920
It's like, I think we are lucky to live in the parallel universe where Wikipedia exists.
link |
00:33:28.400
Yes. Because if someone had pitched me Wikipedia, like a publicly edited
link |
00:33:33.760
encyclopedia, like a couple of years ago, it would be, I don't know how many people would have said
link |
00:33:39.120
that that would have survived. I mean, it makes almost no sense. It's like having a Google
link |
00:33:44.480
doc that everybody on the internet can edit. And like, that will be like the most reliable
link |
00:33:48.720
source for knowledge. And I don't know how many, but hundreds of thousands of topics.
link |
00:33:54.320
Yeah. It's insane. It's insane. And like you have, and then you have users, like there's one,
link |
00:34:00.320
a single user that edited one third of the articles on Wikipedia. So we have these really,
link |
00:34:05.920
really big power users. There are a substantial part of like what makes Wikipedia successful.
link |
00:34:13.600
And so like, no one would have ever imagined that that could happen. And so that, that's,
link |
00:34:21.120
that's one thing I completely agree with what you just said. I also started to interrupt briefly.
link |
00:34:25.920
Maybe let's inject that into the discussion of everything else. I also believe I've seen
link |
00:34:31.280
that with Stack Overflow, that one individual or a small collection of individuals contribute or
link |
00:34:37.520
revolutionize most of the community. Like if you create a really powerful system for archive or
link |
00:34:44.640
like open review and made it really easy and compelling and exciting for one person who is in
link |
00:34:53.040
like a 10x contributor to do their thing, that's going to change everything. It seems like that
link |
00:34:58.640
was the mechanism that changed everything for Wikipedia. And that's the mechanism that changed
link |
00:35:02.800
everything for Stack Overflow is gamifying or making it exciting or just making it fun or
link |
00:35:08.800
pleasant or fulfilling in some way for those people who are insane enough to like answer
link |
00:35:15.280
thousands of questions or write thousands of factoids and like research them and check them,
link |
00:35:21.920
all those kinds of things, or read thousands of papers. Yeah. No, Stack Overflow is another
link |
00:35:27.200
great example of that. And those are both two incredibly productive communities that generate
link |
00:35:34.800
a ton of value and capture almost none of it. In a way, it's almost like counter,
link |
00:35:45.440
it's very counterintuitive that these communities would exist and thrive.
link |
00:35:51.120
And it's really hard to, there aren't that many communities like that.
link |
00:35:57.840
So how do we do that for science? Do you have ideas there? Like what are the
link |
00:36:02.400
biggest problems that you see? You're working on some of them?
link |
00:36:04.880
Like just on that, there are a couple of really interesting experiments that people are running.
link |
00:36:09.120
An example would be like the Polymath Projects. So this is kind of a social experiment that was
link |
00:36:15.440
created by Tim Gowers, Fields Medalist. And his idea was to try to prove that is it possible to do
link |
00:36:22.880
mathematics in a massively collaborative way on the internet? So he decided to pick a couple of
link |
00:36:29.520
problems and test that. And they found out that it actually is possible for specific types of
link |
00:36:36.880
problems, namely problems that you're able to break down in little pieces and go step by step.
link |
00:36:42.800
You might need, as with open source, you might need people that are just kind of reorganizing
link |
00:36:49.200
the house every once in a while. And then people throw a bunch of ideas and then you make some
link |
00:36:55.040
progress, then you reorganize, you reframe the problem, you go step by step. But they were actually
link |
00:37:00.480
able to prove that it is possible to collaborate online and do progress in terms of mathematics.
link |
00:37:08.080
And so I'm confident that there are other avenues that could be explored here.
link |
00:37:14.400
Can we talk about peer review, for example? Absolutely. I think in terms of the peer review,
link |
00:37:19.760
I think it's important to look at the bigger picture here of what the scientific publishing
link |
00:37:28.800
ecosystem looks like. Because for me, there are a lot of things that are wrong about that
link |
00:37:34.880
entire process. So if you look at what publishing means in a traditional journal, you have journals
link |
00:37:43.200
that pay authors for their articles. And then they might pay reviewers to review those articles.
link |
00:37:52.320
And finally, they pay people or distributors to distribute the content. In the scientific
link |
00:37:59.680
publishing world, you have scientists that are usually backed by government grants.
link |
00:38:04.480
They are giving away their work for free in the form of papers. And then you have other
link |
00:38:09.280
scientists that are reviewing their work. This process is known as the peer review process,
link |
00:38:15.200
again for free. And then finally, we have government backed universities and libraries
link |
00:38:23.120
that are buying back all that work so that other scientists can read. So this is, for me,
link |
00:38:30.400
it's bizarre. You have the government that is funding the research, is paying the salaries
link |
00:38:34.640
of the scientists, is paying the salaries of the reviewers, and is buying back all that
link |
00:38:39.680
product of their work, again. And I think the problem with this system, and it's why it's so
link |
00:38:45.760
difficult to break this suboptimal equilibrium, is because of the way academia works right now,
link |
00:38:53.920
and the way you can progress in your academic life. And so in a lot of fields, the competition in
link |
00:39:01.680
academia is really insane. So you have hundreds of PhD students, they are trying to get to a
link |
00:39:10.080
professor position, and it's hyper competitive. And the only way for you to get there is if you
link |
00:39:18.160
publish papers, ideally in journals, with a high impact factor. In computer science, it's often
link |
00:39:25.760
conferences are also very prestigious, or actually more prestigious than journals now.
link |
00:39:30.320
Interesting. So that's the one discipline where, I mean, that has to do with the thing we've discussed
link |
00:39:35.600
in terms of how quickly the field turns around. But like, NeurIPS, CVPR, those conferences are
link |
00:39:42.880
more prestigious, or at the very least, as prestigious as the journals. But yeah, but doesn't
link |
00:39:48.320
matter. The process is what it is. And so for people that don't know, the impact factor of a
link |
00:39:54.240
journal is basically the average number of citations that a paper would get if it gets
link |
00:39:58.880
published on that journal. But so you can really think that the problem with the impact factor
link |
00:40:07.600
is that it's a way to turn papers into accounting units. And let me unpack this, because it's the
link |
00:40:15.920
impact factor is almost like a nobility title. So because papers are born with impact, even before
link |
00:40:22.240
anyone reads them. So the researchers, they don't have the incentive to care about if this paper
link |
00:40:28.240
is going to ever a long term impact on the world, what they care their goal, their end goal is the
link |
00:40:34.560
paper to get published. So that they get that value upfront. So for me, that is one of the
link |
00:40:40.640
problems of that. And that really creates a tyranny of metrics. Because at the end of the day, if
link |
00:40:46.960
you are a dean, what you want to hire is people, researchers that publish papers on journals with
link |
00:40:53.040
high impact factors, because that will increase the ranking of your university and will allow you
link |
00:40:58.000
to charge more for tuition, so on and so forth. And especially when you are in super competitive
link |
00:41:06.320
areas, that people will try to gamify that system and misconduct starts showing up. There's a really
link |
00:41:14.880
interesting book on this topic called Gaming the Metrics. It's a book by a researcher called
link |
00:41:21.200
Mario Biajoli. It goes a lot into the impact factor and metrics affect science negatively.
link |
00:41:29.280
And it's interesting to think, especially in terms of citations, if you look at the early
link |
00:41:33.600
work of looking at citations, there was a lot of work that was done by a guy called Eugene Garfield.
link |
00:41:39.680
And this guy, the early work in terms of citation, they wanted to use citations as
link |
00:41:44.800
a descriptive point of view. So what they wanted to create was a map. And that map would create a
link |
00:41:52.560
visual representation of influence. So citations would be links between papers. And ideally,
link |
00:41:59.680
what they would show, they would represent is that you read someone else's paper and it had an
link |
00:42:04.960
impact on your research. They weren't supposed to be counted. I think this inspired Larry and
link |
00:42:10.080
Sergei's work for Google. Exactly. I think they even mentioned that. But what happens is, as you
link |
00:42:15.920
start counting citations, you create a market. And the same way, and the work of Eugene Garfield
link |
00:42:23.760
was a big inspiration for Larry and Sergei and for the page rank algorithm that led to the
link |
00:42:29.440
creation of Google. And they even recognized that. And if you think about it, it's the same way
link |
00:42:34.960
there's a gigantic market for search engine optimization, SEO, where people try to optimize
link |
00:42:42.480
the page rank and how a web page will rank on Google, the same will happen for papers.
link |
00:42:49.040
People will try to optimize the impact factors and the citations that they get. And that creates
link |
00:42:56.160
a really big problem. And it's super interesting to actually analyze the, if you look at the
link |
00:43:01.760
distribution of the impact factors of journals, you have like nature with,
link |
00:43:07.520
nature, I believe it's like in the low 40s. And then you have, I believe science is high 30s.
link |
00:43:12.880
And then you have a really good set of good journals that will fall between 10 and 30.
link |
00:43:20.240
And then you have a gigantic tell of journals that have impact factor below two. And you can
link |
00:43:26.480
really see two economies here. You see the universities that are maybe less prestigious,
link |
00:43:34.320
less known, that where the faculty are pressured to just publish papers, regardless of the journal,
link |
00:43:40.560
what I want to do is increase the ranking of my university. And so they end up publishing
link |
00:43:45.520
as many papers as they can in like journals with low impact factor. And unfortunately,
link |
00:43:50.960
this represents a lot of the global south. And then you have the luxury good economy.
link |
00:43:59.440
So for instance, for, and there are also problems here in the luxury good economy. So if you look
link |
00:44:03.920
at the journal like nature, so with impact factor of like in the low 40s, there's no way
link |
00:44:10.720
that you're going to be able to sustain that level of impact factor by just grabbing the
link |
00:44:15.920
attention of scientists. What I mean by that is like, for the journals, the articles that
link |
00:44:22.880
get published in nature, they need to be New York Times great. So they need to make it to the,
link |
00:44:30.240
you know, to the, to the big media, they need to be captured by the big media. And because
link |
00:44:34.160
that's the only way for you to capture enough attention to sustain that level of citations.
link |
00:44:39.520
Yes. And that of course creates problems because people then will try to again gamify the system
link |
00:44:45.760
and have like titles or abstracts or that are bigger claim, make claims that are bigger than
link |
00:44:53.040
what is actually can be sustained by the data or the content of the paper. And you will have
link |
00:44:59.440
clickbait titles or clickbait abstracts. And again, this is all a consequence of metrics and
link |
00:45:06.880
science or metrics. And, and this is a very dangerous cycle that I think it's very hard to
link |
00:45:12.480
break, but it's happening in academia in a lot of fields right now. Is it fundamentally the
link |
00:45:18.640
existence of metrics or the metrics just need to be significantly improved? Because like I said,
link |
00:45:25.280
the metrics used for Amazon for purchasing, I don't know, computer parts is pretty damn good
link |
00:45:32.560
in terms of selecting which are the good ones, which are not. In that same way, if we had Amazon
link |
00:45:39.040
type of review system in the space of ideas, in the space of science, it feels like that those
link |
00:45:45.040
metrics would be a little bit better. Sort of when it's significantly more open to the crowdsourced
link |
00:45:53.920
nature of the internet, of the scientific internet, meaning as opposed to like my biggest problem
link |
00:46:01.040
with peer review has always been that it's like five, six, seven people, usually even less. And
link |
00:46:09.120
it's often nobody's incentivized to do a good job in the whole process, meaning it's anonymous
link |
00:46:17.680
in a way that doesn't incentivize, like doesn't gamify or incentivize great work. And also,
link |
00:46:26.160
it doesn't necessarily have to be anonymous. Like there has to be, the entire system doesn't
link |
00:46:34.880
encourage actual sort of rigorous review. For example, like open review does kind of incentivize
link |
00:46:43.440
that kind of process of collaborative review, but it's also imperfect. But it just feels like
link |
00:46:49.120
the thing that Amazon has, which is like thousands of people contributing their reviews to a product.
link |
00:46:54.960
It feels like that could be applied to science, where the same kind of thing you're doing with
link |
00:47:04.640
from Oz Library, but doing at a scale that's much larger. It feels like that should be possible,
link |
00:47:11.920
given the number of grad students, given the number of general public that's, for example,
link |
00:47:18.160
I personally, as a person who got an education in mathematics and computer science, I can
link |
00:47:27.680
be a quote unquote reviewer on a lot bigger set of things than is my exact expertise. If I'm one
link |
00:47:38.400
of thousands of reviewers, if I'm the only reviewer, one of five, then I better be like an expert
link |
00:47:44.880
in the thing. And I've learned this with COVID, which is like, you can just use your basic skills
link |
00:47:52.640
as a data analyst to contribute to the review process on a particular little aspect of a paper
link |
00:47:58.560
and be able to comment, be able to sort of draw in some references that challenge the ideas presented
link |
00:48:05.680
or to enrich the ideas that are presented. It just feels like crowdsourcing the review process
link |
00:48:14.480
would be able to allow you to have metrics in terms of how good a paper is that are much
link |
00:48:21.440
better representative of its actual impact in the world, of its actual value to the world,
link |
00:48:26.720
as opposed to some kind of arbitrary gamified version of its impact.
link |
00:48:34.880
I agree with that. I think there's definitely the possibility, at least for a more resilient
link |
00:48:40.320
system than what we have today. And I think that's kind of what you're describing. To an extent,
link |
00:48:46.000
we kind of have a little bit of a Heisenberg uncertainty principle. When you pick a metric,
link |
00:48:52.400
as soon as you do it, then maybe it works as a good heuristic for a short amount of time,
link |
00:48:56.480
but soon enough, people start gamifying. But then you can definitely have metrics that are more
link |
00:49:02.560
resilient to gamification, and they'll work as a better heuristic to try to push you in the best
link |
00:49:09.200
direction. But I guess the underlying problem you're saying is there's just shortage of positions
link |
00:49:15.120
in academia. That's a big problem for me. Yeah. And so they're going to be constantly gamifying
link |
00:49:21.360
the metrics. It's a very competitive field. And that's what usually happens in very competitive
link |
00:49:29.680
fields. But I think some of the peer review problems, scale helps, I think. And it's
link |
00:49:36.480
interesting to look at what you're mentioning, breaking it down, maybe in smaller parts and
link |
00:49:41.040
having more people jumping in. But this is definitely a problem. And the peer review
link |
00:49:48.080
problem, as I mentioned, is correlated with the problem of academic career progression. And it's
link |
00:49:53.520
all intertwined. And that's why I think it's so hard to break it. There are a couple of
link |
00:50:00.320
really interesting things that are being done right now. There are a couple of, for instance,
link |
00:50:04.000
journals that are overlay journals on top of platforms like archive and bio archive that
link |
00:50:11.040
want to remove like the more traditional journals from the equation. So essentially, a journal
link |
00:50:16.080
is just a collection of links to papers. And what they're trying to do is like removing that middle
link |
00:50:23.440
man and trying to make the review process a little bit more transparent. And not charging
link |
00:50:30.320
universities like there's there's a couple of there are a couple of more famous ones. There's
link |
00:50:35.280
one discrete analysis in mathematics. There's one called the quantum journal, which we're actually
link |
00:50:40.720
working with them. We have a partnership with them for the purpose that cat published in quantum
link |
00:50:45.360
journal, they also get the annotations on formats. And they're doing pretty well, they've been able
link |
00:50:50.320
to grow substantially, the problem there is getting to critical mass. So it's again, convincing the
link |
00:50:55.200
researchers and especially the young researchers that need need that impact factor need those
link |
00:51:01.120
publications to have citations to not publish on the traditional journal and go on an open journal
link |
00:51:07.520
and publish their work there. There I think there are a couple of really high profiled
link |
00:51:12.240
scientists of people like team Gowers that are trying to incentivize like famous scientists
link |
00:51:17.680
that already have tenure and that don't need that to publish that to increase the reputation
link |
00:51:22.960
of those journals. So that other maybe younger scientists can start publishing on on those as
link |
00:51:28.240
well. And so they can try to break that vicious cycle of of the more traditional journals.
link |
00:51:35.040
I mean, another possible way to break this cycle is to like raise public awareness and just by force
link |
00:51:42.320
like ban paid journals. Like what exactly are they contributing to the world? Like basically
link |
00:51:49.600
making it illegal to forget the fact that it's mostly federally funded. So that's that's a super
link |
00:51:57.840
ugly picture too. But like, why should knowledge be so expensive? Like where everyone is working
link |
00:52:06.320
for the public good. And then there's these gatekeepers that you know, most people can't read
link |
00:52:12.560
most papers without having to pay money. And that's that doesn't make any sense. That's like
link |
00:52:20.400
that that should be illegal. I mean, that's what you're saying is exactly right. I mean, for instance,
link |
00:52:25.600
right, I went to school here in the US, we studied in Europe. And you would sit like you'd
link |
00:52:30.960
ask me all the time to download papers and send it to him because he just couldn't get it and like
link |
00:52:34.720
papers that he needed for his research. And so but he's a student like he's a grad student.
link |
00:52:39.920
He was a grad student. But that you know, I'm even referring to just regular people. Oh yeah,
link |
00:52:44.480
okay, that too. And I think during 2020, because of COVID, a lot of journals put down the
link |
00:52:53.520
walls for certain kind of coronavirus related papers. But like that just gave me an indication
link |
00:52:59.440
that like, this should be done for everything. It's absurd. Like people should be outraged
link |
00:53:06.080
that there is these gates. Because so the moment you dissolve the journals,
link |
00:53:12.640
then there will be an opportunity for startups to build stuff on top of archive. It'll be an
link |
00:53:18.960
opportunity for like, for my library to step up to scale up to something much even larger. I mean,
link |
00:53:25.600
that was the original dream of Google, which I've always admired, which is make the world's
link |
00:53:31.360
information accessible. Actually, it's interesting that Google hasn't, maybe you guys can correct me,
link |
00:53:36.640
but they have put together Google Scholar, which is incredible. But they, and they've the scanning
link |
00:53:42.640
of books, but they haven't really tried to make science accessible in the in the following way.
link |
00:53:50.160
Like, besides doing Google Scholar, they haven't like, delved into the papers, right?
link |
00:53:57.200
Which is especially curious, given what we was saying, right? That it's kind of in their genesis.
link |
00:54:02.160
There's this, you know, research that was very connected with our papers reference each other
link |
00:54:06.960
and like building a network out of that. Interesting enough, like Google, I think there was a,
link |
00:54:12.320
there was not intent Google plus was like the Google social network that caught counsel was
link |
00:54:17.040
used by a lot of researchers. Yes, it was. Whether I think was just a, you know, side,
link |
00:54:21.440
kind of the side effect, but then a lot of people ended up migrating to Twitter,
link |
00:54:24.880
but it was not on purpose. But yeah, I agree with you, like they haven't gone past Google Scholar.
link |
00:54:30.800
Well, that said, Google Scholar is incredible. For people who are not familiar, it's one of the
link |
00:54:36.320
best aggregation of all the scientific work that's out there, and especially the network
link |
00:54:42.240
that connects all of them, what sites, what, and also trying to aggregate all of the versions of
link |
00:54:47.280
the papers that are available there and trying to merge them in a way that one particular work,
link |
00:54:52.800
even though it's available in a bunch of places, counts as, you know, like a central hub of what
link |
00:54:58.000
that work is across the multiple versions. But that almost seems like a fun head project of a
link |
00:55:04.240
couple of engineers within Google as opposed to a serious effort to make the world science
link |
00:55:10.160
accessible. But going back to just the journals when you're talking about that Lex, I believe that
link |
00:55:16.240
in that front, I think we might be past the event horizon. So I think the model, the business model
link |
00:55:24.960
for the journals doesn't make sense. They are a middle layer that is not adding a lot of value.
link |
00:55:29.760
And you see a lot of motions, whereas like in Europe, a lot of the papers that are get
link |
00:55:36.720
are funded by the European Union, they will have to be open to the public. And I think there's a
link |
00:55:43.440
lot of the gates to like what the Gates Foundation funds, like the demand that it's accessible to
link |
00:55:51.760
everybody. So I think it's the question of time before that wall kind of falls. And that is going
link |
00:55:57.920
to open a lot of possibilities. Because, you know, imagine if you had like the layer of that
link |
00:56:04.480
gigantic layer of papers all available online, you know, that unlocks a lot of potential as a
link |
00:56:11.360
platform for people to build things on top of that. But to what you're saying, it is weird,
link |
00:56:15.840
like you can literally go and listen to any song that was ever made on your phone, right? You open
link |
00:56:23.440
Spotify and you might not even pay for it. You might be on the free version and you can listen to
link |
00:56:28.000
any song that was ever made pretty much. But there's like you don't have access to a huge
link |
00:56:35.280
percentage of academic papers, which is just like this fundamental knowledge that we're all funding,
link |
00:56:41.040
but you as an individual don't have access to it. And somehow, you know, like the problem for music
link |
00:56:46.080
got solved. But for papers, it's still like... It's just not yet. It could be ad supported,
link |
00:56:51.920
all those kinds of things. And then hopefully that would change the way we do science. That's the
link |
00:56:56.400
most exciting thing for me is especially once I started like making videos and this silly podcast
link |
00:57:03.280
thing, I started to realize like that if you want to do science, one of the most effective ways is
link |
00:57:09.920
to do a like couple the paper with a set of YouTube videos, like explaining it.
link |
00:57:19.280
That also seems like there's a lot of room for disruption there. What is the paper 2.0
link |
00:57:23.920
going to look like? I think like LaTeX and the PDF seems like if you look at the first
link |
00:57:31.360
paper that got published in Nature and if you look at the paper that got published in Nature
link |
00:57:35.760
today, if you look at the two side by side, they are fundamentally the same. And even though like
link |
00:57:41.280
the paper that gets published today, you know, you get even code like right now, people put like
link |
00:57:47.840
code on a PDF. And there are so many things that are related to papers today. You have data,
link |
00:57:56.240
you have code, you might need videos to better explain the concepts. So I think for me it's
link |
00:58:03.840
natural that there's going to be also an evolution there, that papers are not going to be just the
link |
00:58:08.000
static PDFs or LaTeX. There's going to be a next interface.
link |
00:58:14.000
So in academia, a lot of things that are judged, your judge by is often quantity, not quality.
link |
00:58:20.400
I wonder if there's an opportunity to have like, I tend to judge people by the best work they've
link |
00:58:26.480
ever done as opposed to, I wonder if there's a possibility for that to encourage sort of
link |
00:58:31.920
focusing on the quality and not necessarily in paper form, but maybe a subset of a paper,
link |
00:58:38.240
a subset of idea, almost even a blog post or an experiment. Like why does it have to be
link |
00:58:42.880
published in a journal to be legitimate? And it's interesting that you mentioned that I also think
link |
00:58:50.400
like, yeah, it's why is that the only format? Why can't a blog post or we were even experimenting
link |
00:58:59.040
with these a few months ago, can you actually like publish something or like a new scientific
link |
00:59:07.600
breakthrough or something that you've discovered in the form of like a set of tweets,
link |
00:59:14.320
a Twitter thread? Why can't that be possible? And we were experimenting with that idea.
link |
00:59:21.840
We even, yeah, we ran a couple of, like some people submitted a couple of those,
link |
00:59:27.440
like I think the limit was three or four tweets. Maybe it's a new way to look at a,
link |
00:59:32.000
you know, a proof or something. But I think it just serves to show that there should be other
link |
00:59:36.720
ways to publish like scientific discoveries that don't fit the paper format. Well, but so even
link |
00:59:43.200
with the Twitter thread, it would be, it would be nice to have some mechanism of formalizing it
link |
00:59:49.680
and making it into an NFT. Maybe. Like a concrete thing that you can reference with a link that's
link |
00:59:58.560
unique. Because I mean, everything we've been saying, all of that while being true,
link |
01:00:08.080
it's also true that the constraints and the formalism of a paper works well. It like forces you,
link |
01:00:16.480
constraints forces you to narrow down your thing and literally put it on paper. But, you know,
link |
01:00:24.960
I agree. Make concrete. And that's why, I mean, it's not broken. It just could be better.
link |
01:00:32.800
I agree. And that's the main idea. I think there's something about writing,
link |
01:00:36.560
whether it's a blog post or Twitter thread or a paper, that's really nice to, to concretize
link |
01:00:43.920
a particular little idea that they can then be referenced by other ideas, then it can be built
link |
01:00:51.760
on top of with other ideas. So let me ask, you've read quite a few papers. You've annotated quite
link |
01:01:02.240
a few papers. Let's talk about the process itself. How do you advise people read papers?
link |
01:01:08.160
Or maybe you want to broaden it beyond just papers, but just read concrete pieces of information to
link |
01:01:15.040
understand the insights that lay within. I would say for papers specifically, I would bring back
link |
01:01:21.120
kind of what Louise was talking about, is that it's important to keep in mind that papers are not
link |
01:01:27.120
optimized for ease of understanding. And so, right, there's all sorts of restrictions and size
link |
01:01:33.200
and format and language that they can use. And so it's important to keep that in mind. And so
link |
01:01:39.680
that if you're struggling to read a paper, that might not mean that the underlying material is
link |
01:01:45.920
actually that hard. And so that's definitely something that, especially for us, that we
link |
01:01:52.880
read papers and most of the times, the papers are completely outside of our comfort zone,
link |
01:01:58.720
I guess, and so it'd be completely new areas to us. So I always try to keep that in mind.
link |
01:02:05.360
So there's usually a certain kind of structure, like abstract introductions, methodology,
link |
01:02:11.600
depending on the community and so on. Is there something about the process of like
link |
01:02:17.520
how to read it, whether you want to skim it to try to find the parts that are easy to understand
link |
01:02:22.080
and not reading it multiple times? Is there any kind of hacks that you can comment on?
link |
01:02:29.120
I remember like Feynman had this kind of hack when he was reading papers where he would basically,
link |
01:02:38.000
I think I believe he would read the conclusion of the paper, and we would try to just see if he
link |
01:02:44.880
would be able to figure out how to get to the conclusion in like a couple of minutes by himself.
link |
01:02:50.080
And he would read a lot of papers that way. And I think Fermi also did that, almost. And Fermi
link |
01:02:58.480
was known for doing a lot of back of the envelope calculations. So he was a master at doing that.
link |
01:03:05.040
In terms of like, especially when reading a paper, I think a lot of times people might
link |
01:03:11.440
feel discouraged about the first time you read it, you know, it's very hard to grasp or you don't
link |
01:03:17.600
understand a huge fraction of the paper. And I think it's having read a lot of papers in my life.
link |
01:03:23.680
I think I've in peace with like the fact that you might spend hours where you're just reading a
link |
01:03:29.040
paper and jumping from paper to paper, reading citations, and like your level of understanding
link |
01:03:36.560
sometimes of the paper is very close to 0%. And all of a sudden, you know, everything kind of
link |
01:03:41.840
makes sense and in your mind. And then, you know, you have this quantum jump where all of a sudden
link |
01:03:48.240
you understand the big picture of the paper. And this is an exercise that I have to when
link |
01:03:56.480
reading papers and especially like more complex papers, like, okay, you don't understand because
link |
01:04:01.040
you're just going through the process and just keep going. And like, it might feel super chaotic,
link |
01:04:06.240
especially if you're jumping from reference to reference, you know, you might end up with like
link |
01:04:10.480
20 tabs opened and you're reading a ton of other papers, but it's just trusting that process
link |
01:04:15.920
that at the end, like you'll find light. And I think for me, that's a good framework
link |
01:04:21.840
when reading a paper. It's hard because, you know, you might end up spending a lot of time and it
link |
01:04:27.520
looks like you're lost. But that's the process to actually understand what they're talking about
link |
01:04:35.120
in the paper. Yeah, I think that process, I enjoy, I've found a lot of value in the process,
link |
01:04:40.560
especially for things outside my field, reading a lot of related work sections and kind of
link |
01:04:48.000
going down that path of getting a big context of the field because what's, especially when
link |
01:04:53.120
they're well written, there's opinions injected into the really related work, like what work is
link |
01:04:58.720
important, what is not. And if you read multiple related work sections that cite or don't cite
link |
01:05:04.080
each other, like the papers, you get a sense of where the field, where the tensions of the field
link |
01:05:10.080
are, where the field is striving. And that helps you put into context, like whether the work is
link |
01:05:16.880
radical, whether it's overselling itself, whether it's underselling itself, all those things.
link |
01:05:23.760
And added on top of that, I find that often the related work section is the most kind of
link |
01:05:31.840
accessible and readable part of a paper because it's kind of, it's brief to the point, it's trying
link |
01:05:38.720
to, like summarizing, it's almost like a Wikipedia style article. The introduction is supposed to
link |
01:05:43.840
be a compelling story or whatever, but it's often like overselling, there's like an agenda
link |
01:05:49.760
introduction. The related work usually has the least amount of agenda, except for the few
link |
01:05:55.360
like elements where you're trying to talk shit about previous work where you're trying to sell
link |
01:05:59.600
that you're doing much better. But other than that, when you're just painting where the field
link |
01:06:05.520
came from or where the field stands, that's really valuable. And also, again, just to agree
link |
01:06:11.120
with fine on the conclusion. But I get a lot of value from the breadth first search, kind of
link |
01:06:17.040
read the conclusion, then read the related work, and then go through the references in the related
link |
01:06:23.440
work, read the conclusion, read the related work, and just go down the tree until you like hit dead
link |
01:06:30.320
ends or run out of coffee. And then through that process, you go back up the tree, and now you can
link |
01:06:36.080
see the results in their proper, in their proper context. Unless, of course, the paper is truly
link |
01:06:41.040
revolutionary, which even that process will help you understand that is, in fact, truly revolutionary.
link |
01:06:48.000
Right. You've also, you talked about just following your Twitter thread in a depth first search.
link |
01:06:55.920
You talked about that you read the book on Grisha Perlman, Grigore Perlman, and then you had a
link |
01:07:03.360
really nice Twitter thread on it, and you were taking notes throughout. So, at a high level,
link |
01:07:09.840
is there suggestions you can give on how to take good notes, whether we're talking about annotations
link |
01:07:15.520
or just for yourself to try to put on paper ideas as you progress through the work in order to then
link |
01:07:22.720
like understand the work better? For me, I always try not to underestimate how much you can forget
link |
01:07:29.920
within six months after you've read something. Oh, I thought you were going to say five minutes,
link |
01:07:33.600
but yeah, six months is good. Yeah, or even shorter. And so, that's something that I always try to
link |
01:07:39.040
keep in mind, and it's often, I mean, every once in a while, I'll read back a paper that I annotated
link |
01:07:46.640
on Fermat, and I'll read through my own annotations, and I have completely forgotten what I had written.
link |
01:07:58.080
It also, it's interesting because in a way, after you just understood something, you're kind of the
link |
01:08:02.880
best possible teacher that can teach your future self after you've forgotten it, you're kind of
link |
01:08:11.920
your own best possible teacher at that moment. And so, it can be great to try to capture that.
link |
01:08:19.520
It's brilliant. It just made me kind of realize it's really nice to put yourself in the position
link |
01:08:27.280
of teaching an older version of yourself that returns to this paper, almost like thinking of
link |
01:08:32.320
literally, that's underexplored. But it's super powerful because you were the person that you
link |
01:08:38.320
can, like, if you look at the scale from like one, not knowing anything about the topic in 10,
link |
01:08:43.840
like you are the one that progressed from one to 10, and you know which steps you struggled with.
link |
01:08:48.320
So, you're really the best person to help yourself make that transition from one to 10.
link |
01:08:53.680
And a lot of the times, like, and I really believe that the framework that we have to,
link |
01:09:00.320
like, expose ourselves to, like, be talking to us when we were an expert, when we were taking
link |
01:09:05.920
that class, and we knew everything about quantum mechanics. And then six months later, you don't
link |
01:09:10.480
remember half of the things. How could we make it easier for, like, to have those conversations
link |
01:09:16.880
between you and your past self, past expert self? I think there might be, it's an underexplored
link |
01:09:24.640
idea. I think notes on paper are probably not the best way. I'm not sure if it's a combination of,
link |
01:09:29.360
like, video, audio, where it's like you have a guided framework that you follow to extract
link |
01:09:35.920
information from yourself so that you can later kind of revisit to make it easier to remember.
link |
01:09:42.240
But that's, I think, it's an interesting idea worth exploring. I haven't seen a lot of people kind
link |
01:09:50.000
of trying to distill that problem. Yeah, I'm creating the kind of tools. I find if I record,
link |
01:09:56.800
it sounds weird, but I'll take notes, but if I record audio, like little clips of thoughts,
link |
01:10:05.520
like rants, that's really effective at capturing something that notes can't. Because when I replay
link |
01:10:13.680
them, for some reason, it loads my brain back into where I was when I was reading that in a way
link |
01:10:20.160
that notes don't. When I read notes, I'll often be like, what? What was I thinking there? But
link |
01:10:28.480
when I listen to the audio, it brings you right back to that place. And maybe with video, with
link |
01:10:34.800
visual, that might be even more powerful. I think so. Yeah. And I think just the process of verbalizing
link |
01:10:41.440
it, that alone kind of makes you have to structure your thought and put it in a way that somebody
link |
01:10:48.560
else could come and understand it. And just the process of that is useful to organize your thoughts.
link |
01:10:56.800
Yeah, just that alone. Does the Fermat Library Journal Club have a video component or no?
link |
01:11:03.680
Not natively. Sometimes we'll include videos, but it's always embedded. Do people build videos
link |
01:11:09.600
on top of it to explain the paper? Because you're doing all the hard work of understanding deeply
link |
01:11:14.320
the paper. Not, we haven't seen that happening too much. But we were actually playing around with
link |
01:11:21.680
the idea of creating some sort of podcast version where we try to distill the paper on an audio
link |
01:11:28.240
format that not maybe you could have access to. It might be tricky. Might be trickier, but there
link |
01:11:32.720
are definitely people that could be interested in the paper and their topic, but are not willing to
link |
01:11:36.800
read it. But they might listen to a 30 minute episode on that paper. Yes. You could reach more
link |
01:11:42.240
people and you might even bring the authors to the conversation. But it's tricky and especially
link |
01:11:46.800
for more technical papers. We've thought about doing that, but we haven't converged. I'm sure if
link |
01:11:54.000
you have any tips. Well, I'm going to take that as a small project to take one a year. One of the
link |
01:11:59.840
Fermat is almost like half advertisement and half as a challenge for myself to take one of the
link |
01:12:04.480
annotated papers and use it as a basis for creating a quick video. I've seen, hopefully I'm saying
link |
01:12:13.840
the name correctly, but machine learning street talk. I think that's the name of the show that I
link |
01:12:20.720
recommend highly. That's the right name. But they do exactly that, which is multiple hour breakdown
link |
01:12:26.800
of a paper with video component. Sometimes with authors, people love it. It's very effective.
link |
01:12:33.520
There's also, I haven't seen the entire, in its entirety, but I've seen the founder of
link |
01:12:40.000
comma.ai George. I've seen him just taking a paper and then distilling the paper and coding it,
link |
01:12:48.960
coding it sometimes during 10 hours. He was able to get a lot of people interested in that and
link |
01:12:55.760
viewing him. I'm a huge fan of that. George is a personality. I think a lot of people,
link |
01:13:03.120
like, listen to this podcast for the same reason. It's not necessarily the contents. They like to
link |
01:13:08.560
listen to a silly Russian who has a childlike brain and mumbles and all those, like, struggle with
link |
01:13:16.560
ideas. George is a madman. People just enjoy, like, how is he going to struggle in implementing
link |
01:13:22.880
this particular paper? How is he going to struggle with this idea? It's fun to watch and that actually
link |
01:13:27.120
pulls you in. The personality is important there. True, but I agree with you, but
link |
01:13:32.480
they're also, it's visible. There's an extraordinary ability that is there. It's
link |
01:13:39.200
talented and you need to have, there's a craft. This guy definitely has talent and he's doing
link |
01:13:44.960
something that is not easy. I think that also draws the attention of people. Oh yeah. The other
link |
01:13:50.480
day we were actually, we ran into this YouTube channel of this guy that was restoring art.
link |
01:13:55.840
Right. It was basically just a video of him. The production is really well done.
link |
01:14:05.040
It's just him taking really old pieces of art and then paintings and then restoring them,
link |
01:14:11.760
but he's really good at that and he describes that process. That draws the attention of people,
link |
01:14:18.080
regardless of your craft. Be it, like, annotating a paper or restoring it.
link |
01:14:22.800
Craftsmanship, excellence, yeah. George is incredibly good at programming.
link |
01:14:29.520
You know those competitive programmers like Top Boat and all those kinds of stuff? He has the
link |
01:14:33.840
same kind of element where the brain just jumps around really quickly and that's, yeah, just like
link |
01:14:40.080
with art and motivation. It's motivating, but in your right, in watching people who are good
link |
01:14:47.120
at what they do, it's motivating even if the thing you're trying to do is not what they're doing.
link |
01:14:51.600
It's just contagious when they're really good at it. And the same kind of analysis with the paper,
link |
01:14:56.880
I think, not just like the final result, but the process of struggling with it. That's really
link |
01:15:02.800
interesting. Yeah. I think Twitch proved that there's really a market for that, for watching
link |
01:15:10.000
people do things that they're really good at and you'll just watch it. You will enjoy that,
link |
01:15:15.600
that might even spike your interest in that specific topic and yeah, and people will enjoy
link |
01:15:23.040
watching sometimes hours on end of great craftsmen. Do you mind if we talk about some of the papers?
link |
01:15:29.760
Do any papers come to mind that have been annotated on Formaz Library? The papers that we annotated
link |
01:15:36.160
can be about completely random topics, but that's part of what we enjoy as well. It forces you to
link |
01:15:40.960
explore these topics that otherwise maybe you'd never run into. And so the ones that come to mind
link |
01:15:47.920
to me are fairly random, but one that I really enjoyed learning more about is a paper written
link |
01:15:57.760
by a mathematician actually, Tom Apostol, and about a tunnel in a Greek island off the coast
link |
01:16:07.520
of Turkey. It's very random. Okay, so what's interesting about this tunnel? So this tunnel
link |
01:16:16.480
was built in the 6th century BC and it was built in the island of Samos, which is, as I said,
link |
01:16:26.000
off the coast of Turkey. They had the city on one side, then the other mountain, and then they had
link |
01:16:31.920
a bunch of springs on the other side, and they wanted to bring water into the city.
link |
01:16:39.040
Building an aqueduct would be pretty hard because of the way the mountain was shaped,
link |
01:16:42.800
and it would also, if they were under a siege, they could just easily destroy that aqueduct,
link |
01:16:50.160
and then the city wouldn't have any water supply. And so they decided to build a tunnel,
link |
01:16:56.480
and they decided to try to do it quickly. And so they started digging from both ends at the same
link |
01:17:07.280
time through the mountain. And so when you start thinking about this, it's a fairly difficult
link |
01:17:14.400
problem, and this is like 6th century BC, so you had very limited access to the mathematical tools
link |
01:17:22.640
that you had at the time were very limited. And so what this paper is about is about the story
link |
01:17:27.680
of how they built it, and about the fact that for about 2000 years, kind of the accepted explanation
link |
01:17:35.440
of how they built it was actually wrong. And so this tunnel has been famous for a while. There
link |
01:17:40.240
are a number of historians that talked about it since ancient Egypt, and the method that they
link |
01:17:46.240
described for building it was just wrong. And so these researchers went there and were able to
link |
01:17:56.800
figure that out. And so basically, kind of the way that they thought they had built it was,
link |
01:18:02.240
basically, if you can imagine looking at the mountain from the top, and you have the mountain,
link |
01:18:07.200
then you have both entrances. And so what they thought, and this is what the ancient historians
link |
01:18:14.160
described is that they effectively tried to draw a right angle triangle with the two entrances
link |
01:18:23.600
at each end of the hypotenuse. And the way they did it is like they would go around the mountain
link |
01:18:29.040
and kind of walking in a grid fashion, and then you can figure out the two sides of the triangle.
link |
01:18:36.080
And then after you have that triangle, you can effectively draw two smaller triangles at each
link |
01:18:42.880
entrance that are proportional to that big triangle. And then you kind of have arrows pointing in each
link |
01:18:49.120
way. And then you know at least that you have a line going through the mountain that connects
link |
01:18:56.400
both entrances. The issue with that is once you go to this mountain and you start thinking of
link |
01:19:03.680
doing this, you realize that especially given that the tools that they had at the time,
link |
01:19:08.240
that your error margin would be too small, you wouldn't be able to do it. Just the fact of
link |
01:19:16.480
trying to build this triangle in that fashion, the error would accumulate and you would end
link |
01:19:21.200
up missing. You'd start building these tunnels and they would miss each other.
link |
01:19:24.320
So the task ultimately is to figure out really perfectly as close as possible the direction
link |
01:19:29.920
you should be digging. First of all, that it's possible they have a straight line through
link |
01:19:33.840
and then what that direction would be. And then you are trying to infer that by constructing a
link |
01:19:40.240
right triangle. I'm not exactly sure about how to do that rigorously by tracing the mountain,
link |
01:19:49.360
by walking along the mountain. You said grids. Yeah, you kind of walk as if you were in a grid.
link |
01:19:55.040
And so you just walk in right angles. But then you have to walk really precisely then.
link |
01:20:00.560
You have to use tools to measure this. And then the terrain is probably a mess. So this
link |
01:20:06.880
makes more sense than 2D and 3D gets even weirder. So okay, gotcha.
link |
01:20:11.760
But so this method was described by like an ancient Egyptian historian, I think, hero of
link |
01:20:17.440
Alexandria. And then for about, yeah, for about 2000 years, that's how we thought that they had
link |
01:20:24.160
built this tunnel. And then these researchers went there and found out that actually they
link |
01:20:32.400
must have had to use other methods. And then in this paper, they describe these other methods.
link |
01:20:38.880
And of course, they can't know for sure, but they present a bunch of plausible alternatives.
link |
01:20:45.520
The one that for me is the most plausible is that what they probably must have done is to
link |
01:20:50.320
use something that is similar to an iron sight on a rifle, the way you can line up
link |
01:20:57.280
your rifle with a target off in the distance by having an iron sight.
link |
01:21:03.840
And they must have done something similar to that, effectively with three sticks.
link |
01:21:11.360
And that way they were able to line up sticks along the side of the mountain that were all
link |
01:21:18.640
on the same height. And so that then you could get to the other side. And you could and then you
link |
01:21:23.920
could draw that line. So this for me is the most plausible way that they might have done that.
link |
01:21:29.920
And they but then they they they describe this in detail and other possible approaches in this
link |
01:21:35.920
paper. So this is a mathematician doing this? Yeah, this is a mathematician that did this.
link |
01:21:41.920
Which I suppose is the right mindset instead of skills required to solve an ancient problem,
link |
01:21:47.680
right? Yeah. Mathematicians and engineers, a lot of things. Because they didn't have computers or
link |
01:21:54.960
drones or LiDAR back then or whatever technology you would use modern day for civil engineering.
link |
01:21:59.920
Yeah. And another fascinating thing is that like, you know, after effectively after the
link |
01:22:06.160
downfall of the Roman civilization, people didn't build tunnels for about a thousand years. We go
link |
01:22:11.600
a thousand years without tunnels. And then like only in late middle ages that we start doing them
link |
01:22:16.880
again. But but here is the tunnel like 6th century BC, like incredibly limited mathematics.
link |
01:22:22.400
And they and they build it in this way. And for and it was a mystery for a long time exactly how
link |
01:22:29.760
they did it. And then these mathematicians went there. And basically with no archaeology kind
link |
01:22:36.000
of background, we're able to figure it out. How do annotations for this paper look like?
link |
01:22:40.720
What is it? What's a successful annotation for paper like this?
link |
01:22:44.800
Yes. So sometimes you're for this paper, sometimes adding some more context on a specific
link |
01:22:54.320
part like sometimes they mentioned, for instance, these instruments that were common in ancient Greece
link |
01:23:01.040
and ancient Rome for for building things. And and so in some of those annotations,
link |
01:23:08.240
I described these instruments in more detail and how they worked. Because sometimes it can be
link |
01:23:12.960
hard to visualize these. Then this paper, I forget exactly when this was published,
link |
01:23:22.480
I believe maybe maybe the 70s. But then there was further research into this tunnel and more
link |
01:23:29.120
interesting other interesting aspects about it. I add those to that paper as well. There's historical
link |
01:23:35.040
context that I also go into there. For instance, the fact that as I said that effectively after
link |
01:23:42.320
the downfall of the Roman Empire, no tunnels were built. That's something that I added to the paper
link |
01:23:47.840
as well. So when other people look at the paper, how do they usually consume the annotations?
link |
01:23:55.200
Is there a commenting feature? This is a really enriching experience the way you read a paper.
link |
01:24:03.360
What aspects do people usually talk about that they value from this?
link |
01:24:07.840
So yeah, so anybody can just go on there and and either add a new annotation or
link |
01:24:14.160
other comments to an existing annotation. And so you can start kind of a thread within an
link |
01:24:20.400
existing annotation. And that's something that happens relative frequency. And then
link |
01:24:25.680
because I was the original author of the initial annotation, I get pinged. And so oftentimes I'll
link |
01:24:31.120
go back and add on to that thread. How'd you pick the paper? I mean, first of all, this whole process
link |
01:24:38.320
is really exciting. I'm gonna, especially after this conversation, I'm gonna make sure I participate
link |
01:24:43.440
much more actively on papers that I know a lot about. And on paper, I know nothing about. I should
link |
01:24:49.120
get the annotated paper. I would love to. I also, I mean, I realized that there's like it's an
link |
01:24:58.320
opportunity for people like me to publicly annotate a paper, like, or do an AMA around the
link |
01:25:06.880
paper. Yeah, exactly. But yeah, but like be, be in the conversation about a paper. It's like a
link |
01:25:14.160
place to have a conversation about an idea. You get the other way to do it that's much more at
link |
01:25:19.280
hawk is on Twitter, right? But this is more like formal. And you could actually probably integrate
link |
01:25:24.000
the two have a conversation about the conversation. So the Twitter is the conversation about a
link |
01:25:29.360
conversation and the main conversation is in the space of annotations. There's an interesting
link |
01:25:34.400
effect that we see sometimes with the annotations on our papers is that a lot of people, especially if
link |
01:25:39.440
we the annotations are really well done, people sometimes are afraid of adding more annotations
link |
01:25:46.960
because they see that as a kind of a finished work. Yes. And so they don't want to pollute that or
link |
01:25:52.000
and especially if it's like a silly question, this is, I don't think that's good. I think, you know,
link |
01:25:58.560
we should as much as possible try to lower the barrier for someone to jump in and ask questions.
link |
01:26:03.120
I think it only like most of the times it adds value, but it's some feedback that we got from
link |
01:26:08.640
users and readers. Not exactly sure how to, to kind of fight that. But well, I think I,
link |
01:26:18.000
I think if I serve as an inspiration in any way is by asking a lot of dumb questions and saying a
link |
01:26:26.160
bunch of dumb shit all the time. And hopefully that inspires the rest of the other folks to
link |
01:26:31.040
do the same because that's the only way to knowledge, I think, is to be willing to ask the
link |
01:26:36.560
dumb questions. And there are papers that are like, you know, we have a lot of papers on Formaz
link |
01:26:41.360
where it's just one page or really short papers. And we have like the shortest paper ever published
link |
01:26:47.920
in a math journal. Like we'd like just a couple of words. One of my favorite papers on the platform
link |
01:26:53.200
is actually a paper written by Enrico Fermi. And the title of the paper is My Observation,
link |
01:26:59.280
I think is My Observations at Trinity. So basically Fermi was part of the Manhattan Project. So
link |
01:27:05.440
he was in New Mexico when they exploded the first atomic bomb. And so he was a couple of miles away
link |
01:27:12.960
from the explosion. And he was probably one of the first persons to calculate the energy of the
link |
01:27:17.920
explosion. And so the way he did that was he took a piece of paper and he tore down a piece of
link |
01:27:24.400
paper in little pieces. And when the bomb exploded, the Trinity bomb was the name of the bomb,
link |
01:27:30.240
like he waited for the blast to arrive at where he was. And then he threw those pieces of paper in
link |
01:27:36.880
the air. And he calculated the energy based on the displacement of the paper, the pieces of paper.
link |
01:27:42.800
And then he wrote a report which was classified until like a couple of years ago, one page report,
link |
01:27:47.920
like calculating the energy of the explosion. So badass. And we actually went there and kind of
link |
01:27:54.160
unpacked and I think he just mentions basically the energy and we actually went and one of the
link |
01:28:00.160
annotations is like explaining how he did that. I wonder how accurate he was. It was maybe I
link |
01:28:06.000
think like 20 or 25% off. Then there was another person that actually calculated the energy based
link |
01:28:12.560
on images after the explosion at the rate and the rate at which the like the mushroom of the
link |
01:28:20.240
explosion expanded. And it's more accurate to calculate the energy based on that. And I think
link |
01:28:25.360
it was like 20% off. But it's really interesting because you know, Fermi was known for all these
link |
01:28:31.040
being a master at these back of the envelope calculations. Like the Fermi problems are well
link |
01:28:36.320
known for that. And it's super interesting to see like that just one page report that was also
link |
01:28:42.720
actually classified. And it's interesting because a couple of months ago, when the Beirut explosion
link |
01:28:48.240
happened, there was a video circulating of these a bride that was doing a photo shoot
link |
01:28:54.080
when the explosion in Beirut happened. And so you can see a video of her with the wedding dress
link |
01:28:58.800
and then the explosion happens and the blast arrives at where she was. She was a couple of
link |
01:29:02.640
miles away from the blast. And you can see like the displacement of the dress as well. And I
link |
01:29:09.440
actually looked and that video went viral on Twitter. And I actually looked at that video and
link |
01:29:13.520
based I use the same techniques that Fermi used to calculate the energy of the explosion
link |
01:29:19.520
based on the displacement of the dress. And you could actually see where she was at the
link |
01:29:24.000
the distance from the explosion because there was a store behind her and you could look the name of
link |
01:29:28.080
the store. And so I calculated that it was the distance and then you can based on the distance
link |
01:29:33.680
where she was from the explosion and also on the displacement of the dress. Like because you can
link |
01:29:38.960
when the blast happens, like you can see the dress going back and then going back to the
link |
01:29:44.400
original position and like by just looking at like how much the dress moved, you can
link |
01:29:49.520
estimate the explosion, the energy of the explosion. I assume you published this
link |
01:29:54.400
on Twitter. It was just a Twitter thread. But it actually like a lot of people share that and
link |
01:29:59.840
it was picked up by a couple of news outlets. But I was hoping it would be like a formal title
link |
01:30:05.920
and it would be an archive. No, no, no, it may be submitted just to Twitter, Twitter thread.
link |
01:30:10.080
But it was interesting because it was exactly the same math that Fermi used.
link |
01:30:14.480
Is there something else that jumps to mind? Like what is there something I know like in terms of
link |
01:30:20.560
papers, like I know the Bitcoin paper is super popular. Is there something interesting to be
link |
01:30:25.280
said about any of the white papers in the cryptocurrency space?
link |
01:30:29.600
Yeah, the Bitcoin paper was the first paper that we put on Fermat's. Why that choice as the first
link |
01:30:38.080
paper? This was a while ago. And it was one of the papers that I read and then kind of explained
link |
01:30:45.840
it to Luiz and there are two other friends that do this journal club with us. And I did some research
link |
01:30:52.400
in cryptography as an undergrad. And so it was a topic that I was interested in. But even for me,
link |
01:30:59.040
that I had that background, but reading the Bitcoin paper, it took me a few reads to really kind of
link |
01:31:06.640
wrap my head around it. It uses very Spartan, precise language in a way. You feel like you
link |
01:31:13.760
can't take any word out of it without something falling apart. And it's all there. I think it's
link |
01:31:19.760
a beautiful paper and it's very well written, of course. But we wanted to try to make it accessible
link |
01:31:29.120
so that anybody that maybe is an undergrad and computer science could go on there and know that
link |
01:31:35.840
you have all the information in that page that you're going to need to understand the mechanics
link |
01:31:41.360
of Bitcoin. And so I explain the basic public key cryptography that you need to know in order to
link |
01:31:49.760
understand it. You can explain, okay, what are the properties of a hash function and how they
link |
01:31:54.720
are useful in this context? Explain what a Merkle tree is. So a bunch of those basic concepts that
link |
01:32:01.200
maybe if you're reading it for a first time and you're an undergrad and you don't know those
link |
01:32:05.040
terms, you're going to be discouraged because maybe, okay, now I have to go and Google around
link |
01:32:09.440
until I understand these before I can make progress in the paper. And this way, it's all there.
link |
01:32:15.600
So there's a magic to also to the fact that over time, more people went on there and added
link |
01:32:23.120
further annotations. So the idea that the paper gets easier and more accessible over time, but
link |
01:32:28.720
that's still you're still looking at the original content, the way the author intended it to be.
link |
01:32:35.280
But there's just more context and the toughest bits have more in depth explanations.
link |
01:32:42.400
Okay, I think like there's just so many interesting papers there. I remember reading the paper that
link |
01:32:50.160
was written by Freeman Dyson on the like the first time that he explained or he came up with a concept
link |
01:32:56.800
of the Dyson sphere. And he put that out like it's again, it's one page paper. And what he explained
link |
01:33:04.400
was that eventually, if civilization develops and grows, there's going to be a point where
link |
01:33:11.920
when the resources on the planet are not enough for the energy requirements of that civilization.
link |
01:33:18.560
So if you want to go, the next step is you need to go to the next star and extract energy from
link |
01:33:24.000
that star. And the way to do it is you need to build some sort of cap around the star that
link |
01:33:30.000
extracts the energy. So he theorized this idea of the Dyson sphere. And he went on to kind of
link |
01:33:37.040
analyze how he would build that, the stability of that sphere, like if something happens, like
link |
01:33:42.000
if there's like a small oscillation with that fear collapsing to the star or no, what would happen.
link |
01:33:48.000
And he even went on to kind of say that a good way for us to look for signs of intelligent life out
link |
01:33:54.960
there is to look for signals of these Dyson spheres. And because, you know, according to the law of
link |
01:34:01.520
second law of thermodynamics, like there's going to be some a lot of infrared radiation,
link |
01:34:05.520
there is going to be emitted as a consequence of extracting energy from the star. And we should
link |
01:34:10.480
be able to see those signals of like infrared if we look at the sky. But all these like from
link |
01:34:16.400
the introduction of the concept, like how to build a Dyson sphere, the problems of like having a
link |
01:34:21.200
Dyson sphere, how to detect how that could be used as a signal for intelligence life.
link |
01:34:25.600
Wait, really? That's all in the paper?
link |
01:34:26.960
All in one, like one page paper. And it's like, it's, for me, it's beautiful.
link |
01:34:30.720
It's like...
link |
01:34:31.040
What was this published?
link |
01:34:32.720
Don't remember.
link |
01:34:34.000
It's fascinating that papers like that could be, I mean, the guts it takes to put that all
link |
01:34:40.240
together in a paper form. You know, that kind of challenges our previous discussion of paper.
link |
01:34:46.160
I mean, papers can be beautiful. You can play with the format, right?
link |
01:34:50.400
But there's a lot to unpack there. That's like the starting point. But it's beautiful that
link |
01:34:56.400
you're able to put that in one page. And then people can build on top of that.
link |
01:35:01.520
But the key ideas are there.
link |
01:35:02.960
Yeah, exactly.
link |
01:35:04.640
What about, have you looked at any of the big seminal papers throughout the history of science?
link |
01:35:09.680
Like you look at simple like Einstein papers.
link |
01:35:14.160
Have any of those been annotated?
link |
01:35:15.760
Yeah, we have some more seminal papers that people have heard about.
link |
01:35:22.000
We have the DNA double elix paper on there. We have the Higgs boson paper.
link |
01:35:31.440
Yeah, there's papers that we know that they're not going to be finding out about them because
link |
01:35:38.720
of us. But it's papers that we think should be more widely read and that folks would benefit
link |
01:35:44.880
from having some annotations there. And so we also have a number of those.
link |
01:35:48.960
A lot of discovery papers for fundamental party calls and all that. We have a lot of those on
link |
01:35:55.120
from us library. We haven't annotated that one, but I'd like to on the Riemann hypotheses.
link |
01:36:02.160
That's a really interesting paper as well. But we haven't annotated that one.
link |
01:36:07.280
But there's a lot of like more historical landmark papers on the platform.
link |
01:36:12.880
Have you done Poincaré conjecture with Perlman?
link |
01:36:17.120
That's too much for me. But it's interesting that in going back to our discussion,
link |
01:36:25.040
like the Poincaré paper was like published on archive and it was not on a journal,
link |
01:36:29.840
like the three papers.
link |
01:36:30.800
Yeah, what do you make of that? I mean, he's such a fascinating human being.
link |
01:36:34.880
I mentioned to you offline that I'm going to Russia. He's somebody I'm really...
link |
01:36:38.800
She tried to interview. Yeah. Well, so I definitely will interview him.
link |
01:36:45.920
And I believe I will. I believe I can. I just don't know how to... I know where you live.
link |
01:36:52.880
Okay. My hope is, my conjecture is that if I just show up to the house and look desperate enough
link |
01:37:01.280
that or threatening enough for some combination of both, that like the only way to get rid of me
link |
01:37:06.880
is to just get the thing done. That's the hope.
link |
01:37:09.040
It's sexually interesting that you mentioned that because after...
link |
01:37:12.800
So a couple of weeks ago, I was searching for stuff about Perlman online and I ended up on
link |
01:37:18.480
this Twitter account of this guy that claims to be Perlman's assistant.
link |
01:37:24.320
And he has been posting a bunch of pictures next to Perlman.
link |
01:37:28.320
You can see Perlman in a library and he's next to him taking a selfie or Perlman walking
link |
01:37:34.400
on the street and like, maybe you could reach out to his assistant and I'll send you...
link |
01:37:40.080
I'll send you this Twitter account. Maybe you're onto something.
link |
01:37:43.680
No, but going back to Perlman is super interesting because the fact that he published the proofs
link |
01:37:49.840
on archive was also like a way for him to... Because he really didn't like the scientific
link |
01:37:55.920
publishing industry and the fact that you had to pay to get access to articles.
link |
01:38:02.080
And that was a form of protest and that's why he published those papers there.
link |
01:38:07.200
I mean, I think Perlman is just a fascinating character and for me, it's this kind of ideal of...
link |
01:38:14.240
Platonic ideal of what a mathematician should be. It's someone that is...
link |
01:38:19.040
It just cares about... Deeply cares about mathematics.
link |
01:38:22.320
It cares about fair attribution of disregard's money.
link |
01:38:27.280
And the fact that he published on archive is a good example of that.
link |
01:38:33.600
What about the Fields Medal that he turned down the Fields Medal? What do you make of that?
link |
01:38:40.560
If you look at the reasons why he rejected the Fields Medal, so Perlman did a post talk in the
link |
01:38:45.920
US and when he came back to Russia... Do you know how good his English is?
link |
01:38:51.680
I think it's fairly good.
link |
01:38:53.040
I think it's pretty good, right?
link |
01:38:53.840
I think it's really good.
link |
01:38:54.880
Especially in these given lectures in the American Union.
link |
01:38:57.520
But I haven't been able to...
link |
01:38:58.880
Listen to anything.
link |
01:38:59.760
Well, certainly not listen, but I haven't been able to get anybody,
link |
01:39:02.800
because I know a lot of people have been to those lectures.
link |
01:39:05.520
I'm not able to get a sense of like, yeah, but how strong is the accent?
link |
01:39:10.320
What are we talking about here? Is this going to have to be in Russian?
link |
01:39:12.720
Is it going to have to be in English? It's fascinating.
link |
01:39:14.640
But he writes the papers in English, so...
link |
01:39:16.320
True. But there's so many... It's such a fascinating character and
link |
01:39:20.560
there are a couple of examples like him, like at I think 28 or 29, he proved like a really
link |
01:39:26.400
famous conjecture called the Soul Conjecture.
link |
01:39:29.040
I believe it was like in a very short four page proof of that.
link |
01:39:32.160
It was a really big breakthrough.
link |
01:39:33.600
Then he went to Princeton to give a lecture on that.
link |
01:39:37.120
And after the lecture, the chair of the math department at Princeton, a guy called Peter
link |
01:39:42.480
Sarnak, went up to Perlman was trying to recruit him.
link |
01:39:46.640
Trying to offer him a position at Princeton.
link |
01:39:49.440
And at some point, he asked for Perlman's resume.
link |
01:39:53.840
And Perlman responded saying, just give a lecture on like this really tough problem.
link |
01:39:59.280
Why do you need my resume? I'm not going to send you.
link |
01:40:02.400
Like I just proved my value.
link |
01:40:05.360
But going back to the Fields Medal, like when Perlman went back to Russia,
link |
01:40:11.760
he arrived at a time where the salary of postdocs was so much off in regards to
link |
01:40:18.800
inflation that they were not making any money.
link |
01:40:21.760
Like people didn't even bother to pick up the checks at the end of the month
link |
01:40:26.480
because it was like ridiculous.
link |
01:40:28.560
But thankfully he had some money that he had gained while he was doing his postdocs.
link |
01:40:33.120
So he just concentrated on like the Pankara conjecture problem.
link |
01:40:38.000
Which he, when he took that, it took it after it was reframed by this
link |
01:40:44.000
mathematician called Richard Hamilton, which posed the problem in a way that it turned into
link |
01:40:48.960
this super math Olympiad problem with perfect boundaries well defined.
link |
01:40:54.320
And that was perfect for Perlman to attack.
link |
01:40:56.880
And so he spent like seven years working on that.
link |
01:40:59.280
And then in 2002, he started publishing those papers on archive.
link |
01:41:04.560
And people started jumping on that, reading those papers.
link |
01:41:07.840
And there was like a lot of excitement around that.
link |
01:41:10.880
A couple of years later, there were two researchers, I believe it was,
link |
01:41:14.320
they were from Harvard that took Perlman's work.
link |
01:41:18.720
They sanded some of the edges.
link |
01:41:20.960
And they republished that saying that based on Perlman's work,
link |
01:41:26.080
they were able to figure out the Pankara conjecture.
link |
01:41:29.840
And then there was at the time at the International Conference of Mathematics in 2006,
link |
01:41:38.480
I believe that's when they were going to give out the Fields Medal.
link |
01:41:41.600
There was a lot of debate of who should get the credit for solving this big problem.
link |
01:41:49.280
And for Perlman, it felt really sad that people were even considering that
link |
01:41:55.520
he was not the person that solved that.
link |
01:41:58.160
And the claims that those researchers, when they published after Perlman,
link |
01:42:03.200
they were false claims that they were the ones, they just sanded a couple of edges.
link |
01:42:06.560
Like Perlman did all the really hard work.
link |
01:42:09.120
And so just the fact that they doubted that Perlman had done that was enough for him to say,
link |
01:42:16.000
I'm not interested in this prize.
link |
01:42:18.320
And that was one of the reasons why he rejected the Fields Medal.
link |
01:42:21.440
Then he also rejected the Clay Prize.
link |
01:42:24.400
So the Pankara conjecture was one of the Millennium Prizes.
link |
01:42:28.080
There was a million dollar prize associated with that problem.
link |
01:42:31.280
And that has to do with the fact that for them to attribute that prize,
link |
01:42:35.440
I think it had to be published on a journal, the proof.
link |
01:42:38.880
And again, Perlman's principles have interfered here.
link |
01:42:43.840
And he also just didn't care about the money.
link |
01:42:45.920
He was like, Clay, I think was a businessman.
link |
01:42:48.800
And he's like, doesn't have to do anything with mathematics.
link |
01:42:51.680
I don't care about this.
link |
01:42:53.920
That's one of the reasons why he rejected the Prize.
link |
01:42:56.320
Yeah, it's hard to convert into words, but at MIT,
link |
01:43:00.800
I'm distinctly aware of the distinction between when I enter a room,
link |
01:43:07.120
there's a certain kind of music to the way people talk when we're talking about ideas
link |
01:43:13.120
versus what that music sounds like when we're talking, when it's like bickering
link |
01:43:21.200
in the space of like, whether it's politics or funding or egos.
link |
01:43:27.520
It's a different sound to it.
link |
01:43:29.760
And I'm distinctly aware of the two.
link |
01:43:32.880
And I kind of sort of, to me personally, happiness was just like swimming around
link |
01:43:39.440
the one that like is the political stuff or the money stuff and all that or egos.
link |
01:43:47.680
And I think that's probably what Perlman is as well.
link |
01:43:50.160
Like the moment he senses there's any, as with the Fields Medal,
link |
01:43:53.680
like the moment you start to have any kind of drama around credit assignment,
link |
01:43:59.280
all those kinds of things, it's almost not that it's important who gets the credit.
link |
01:44:03.600
It's like the drama in itself gets in the way of the exploration of the ideas
link |
01:44:07.600
or the fundamental thing that makes science so damn beautiful.
link |
01:44:11.760
And you can really see that there's also a product of that Russian school of like doing science.
link |
01:44:17.280
And you can see that, that people were, during the Cold War, a lot of mathematicians,
link |
01:44:23.920
they were not making any money.
link |
01:44:25.840
They were doing math for the sake of math, like for the intellectual pleasure
link |
01:44:31.040
of like solving a difficult problem.
link |
01:44:33.040
And even if it was a flawed system and there were a lot of problems with that,
link |
01:44:37.920
they were able to actually achieve these and Perlman for me is the perfect product of that.
link |
01:44:45.920
He just cared about like working on tough problems.
link |
01:44:49.280
He didn't care about anything else.
link |
01:44:50.560
It was just math, pure math.
link |
01:44:53.680
Yeah, there's like for the broader audience, I think another example of that is like professional sports versus Olympics.
link |
01:45:02.160
Especially in Russia, I've seen that clear distinction where because the state manages
link |
01:45:08.720
so much of the Olympic process in Russia, as people know with the steroids, yes, yes, yes.
link |
01:45:14.800
But outside of the steroids thing is like the athlete can focus on the pure artistry of the sport.
link |
01:45:25.280
Like not worry about the money, not just in the way they talk about it, the way they think about it,
link |
01:45:30.960
the way they define excellence versus like in the perhaps a bit of a capitalist system
link |
01:45:37.760
in the United States with American football, with baseball, with basketball.
link |
01:45:42.640
So much of the discussion is about money.
link |
01:45:47.120
Now, of course, at the end of the day, it's about excellence and artistry and all that.
link |
01:45:52.080
But when the culture is so richly grounded in discussions of money and sort of this capitalistic like
link |
01:46:01.680
merch and businesses and all those kinds of things, it changes the nature of the activity.
link |
01:46:08.880
In a way that's hard again to describe in words, but when it's purely about the activity itself,
link |
01:46:16.320
it's almost like you quiet down all the noise enough to hear the signal, enough to hear the beauty.
link |
01:46:23.600
Like whenever you're talking about the money, that's when the marketing people come and the
link |
01:46:28.560
business people, the non creators come and they fill the room and they create drama and they know
link |
01:46:33.200
how to create the drama and the noise as opposed to the people who are truly excellent at what they
link |
01:46:37.920
do, the person in their arena, right? Like when you remove all the money and you just let that
link |
01:46:46.960
thing shine, that's when true excellence can come out. And that was of the few things that
link |
01:46:53.760
worked with the communist system in the Soviet Union to me at least as somebody who loves sport
link |
01:46:59.280
and loves mathematics and science, that worked well. Removing the money from the picture.
link |
01:47:11.120
Not that I'm saying poverty is good for science. There's some level in which not worrying about
link |
01:47:16.880
money is good for science. It's a weird, I'm not exactly sure what to make of that because
link |
01:47:22.000
capitalism works really damn well. But it's tricky how to find that balance.
link |
01:47:29.920
One Fields Medalist that is interesting to look at, and I think you mentioned it earlier,
link |
01:47:34.240
but is Cedric Villani, which might be the only Fields Medalist that is also a politician now.
link |
01:47:42.080
But so it's this brilliant French mathematician that won the Fields Medal. And after that,
link |
01:47:49.440
he decided that one of the ways that he could have the biggest leverage in pushing science in
link |
01:47:58.480
the direction that he thinks science should go would be to try to go into politics. And so that's
link |
01:48:05.040
what he did. And he has ran, I'm not sure if he has won any election, but I think he's running for
link |
01:48:12.080
a mayor of Paris or something like that. But it's this brilliant mathematician that before
link |
01:48:18.480
winning the Fields Medal had only been just a brilliant mathematician. But after that,
link |
01:48:24.160
he decided to go into politics to try to have an impact and try to change some of the things that
link |
01:48:29.600
he would complain about before. So there's that component as well.
link |
01:48:36.000
Yeah. And I've always thought mathematics and science should be like, James Bond
link |
01:48:40.720
would, in my eyes, I think be sexier if he did math. We should, as a society, put
link |
01:48:48.960
excellence in mathematics at the same level as being able to kill a man with your bare hands.
link |
01:48:53.920
Those are both useful features. That's admirable. It's like, oh, that makes the person interesting.
link |
01:49:02.000
Being extremely well read about history or philosophy, being good at mathematics, being
link |
01:49:07.200
able to kill a man with bare hands. Those are all the same in my book. So I think all are useful
link |
01:49:12.240
for action stars. And I think a society will benefit for giving more value to that. One of
link |
01:49:18.240
the things that bothers me about American culture is the, I don't know the right words to use,
link |
01:49:26.080
but the nerdiness associated with science. I don't think nerd is a good word in American
link |
01:49:36.160
culture because it's seen as weakness. There's images that come with that. And it's fine. You
link |
01:49:43.840
could be all kinds of shapes and colors and personalities, but to me, having sophisticated
link |
01:49:52.640
knowledge of science, being good at math, doesn't mean you're weak. In fact, it could be the very
link |
01:49:59.680
opposite. And so it's an interesting thing because it was very much differently viewed in the Soviet
link |
01:50:06.880
Union. So I know for sure as an existence proof that it doesn't have to be that way.
link |
01:50:15.440
I also feel like we lack a lot of role models. If you ask people to mention one mathematician
link |
01:50:23.040
that they know that he's alive today, I think a lot of people would struggle to answer that question.
link |
01:50:29.520
And I also think, I love Neil deGrasse Tyson. But there is, having more role models is good,
link |
01:50:40.720
like different kinds of personalities. He has kind of fun and it's like Bill Nye the
link |
01:50:47.840
science guy. I don't know if you guys know him. But Feynman is no longer there.
link |
01:50:57.200
Those kinds of personalities. Carl Sagan, man. Even Carl Sagan, yeah. A seriousness that's not
link |
01:51:04.640
playful. Not apologetical. Yeah, exactly. Not apologetic about being knowledgeable.
link |
01:51:09.920
In fact, the kind of energy where you feel self conscious about not having thought about
link |
01:51:22.400
some of these questions. Just like when I see James Bond, I feel bad about that I don't
link |
01:51:28.480
have never killed a man. I need to make sure I fix that. That's the way I feel. So the same
link |
01:51:33.280
way I want to feel like that way. Well, Carl Sagan talks, I feel like I need to have that
link |
01:51:37.920
same kind of seriousness about science. Like if I don't know something, I want to know it well.
link |
01:51:43.360
What about Terence Tao? He's kind of a superstar. What are your thoughts about him?
link |
01:51:48.080
True. It's probably one of the most famous mathematicians alive today. And probably one of
link |
01:51:52.480
I mean, regardless of like, is of course, he won a Fields Medal, is really smart and talented
link |
01:52:00.560
mathematician. It's also like a big inspiration for us, at least for some of the work that we
link |
01:52:09.600
do with Formaz Library. So Terence Tao is known for having, you know, a big blog and is pretty open
link |
01:52:15.840
about like his research and he also he tries to make his work as public as possible through his
link |
01:52:24.480
blog posts. In fact, there's a really interesting problem that got solved a couple of years ago.
link |
01:52:31.600
So Tao was working with on a problem on an Erdos problem, actually. So Paul Erdos was this
link |
01:52:38.800
mathematician from Hungary. And he was known for like the Erdos for a lot of things. But
link |
01:52:45.600
one of the things that he was also known was for the Erdos problem. So he was always like
link |
01:52:50.320
creating these problems and usually associating prizes with those problems.
link |
01:52:53.920
And a lot of those problems are still open. And some of them will be open for like maybe
link |
01:53:00.080
a couple of hundred years. And I think that's actually an interesting hack for him to collaborate
link |
01:53:04.480
with future mathematicians. You know, his name will keep coming up for future generations. But so
link |
01:53:11.440
Tao was working on one of these problems called the Erdos discrepancy. And he published a blog
link |
01:53:16.640
post on like about that problem about that problem. And he reached it like a dead end. And then all of
link |
01:53:23.920
a sudden there was this guy from Germany that wrote like a comment on his blog post saying,
link |
01:53:29.680
okay, like some of the so this problem is like a Sudoku like flavor and some of the machinery
link |
01:53:35.600
that we're using to solve Sudoku could be used here. And that was actually the key to solve the
link |
01:53:41.040
Erdos discrepancy problem. So there was a comment on his blog. And I think that that that for me is
link |
01:53:46.400
an example of like how to do again, going back to collaborative science online, and the power that
link |
01:53:53.360
it has. But Tao is also like pretty public about like some of the struggles and of being a mathematician,
link |
01:54:02.080
like and even he wrote about some of the unintended consequences of having extraordinary
link |
01:54:08.800
ability in a field. And he used himself as an example when he was growing up, he was extremely
link |
01:54:14.400
talented in mathematics from a young age like Tao was a person he won an medal in like one of the
link |
01:54:21.360
IMOs at the age I think was a gold medal at the age of 10 or something like that. And so he mentioned
link |
01:54:26.640
that when he was growing up, like and especially in college, when he was in a class that he enjoyed,
link |
01:54:32.320
it didn't it just came very natural for him and it didn't have to work hard to just ace the class.
link |
01:54:38.480
And when he found that the class was boring, like it didn't work and he barely passed, barely passed
link |
01:54:45.360
I think in college he almost failed two classes. And he was talking about that and how he brought
link |
01:54:51.680
those studying habits or like in existence of studying habits when he went to Princeton for
link |
01:54:57.600
his PhD. And in Princeton, when he started kind of delving into more complex problems
link |
01:55:04.480
in classes, he struggled a lot because he didn't have that those those habits like it wasn't taking
link |
01:55:10.480
notes and it was it wasn't studying hard when he when he faced problems. And he almost failed out of
link |
01:55:16.080
his PhD. He almost failed his PhD exam. And it talks about like having this conversation with
link |
01:55:23.680
this advisor and the advisor pointing out like you're not this is not working, you might have to get
link |
01:55:29.360
out of the program. And like how that was a kind of a turning point for him. And like it was super
link |
01:55:36.400
important in his career. So I think Tao is also like this figure that apart from being just an
link |
01:55:41.600
exceptional mathematician is also pretty open about what what it takes to to to be a mathematician
link |
01:55:47.040
and some of the struggles of these type of careers. And I think it's that's super important.
link |
01:55:52.080
In many ways, he's a contributor to open science and open humanity. He's being an open human.
link |
01:55:58.240
True. By communicating Scott Aronson is another in computer science world who's a very different
link |
01:56:04.960
style, very different style. But there's something about a blog that is authentic and real and just
link |
01:56:11.120
gives us a window into the into the mind and soul of all of these brilliant folks. So it's
link |
01:56:17.440
definitely a gift. Let me ask you about Fermat's library on Twitter, which I mean, I don't know
link |
01:56:24.640
how to describe it, people should definitely just follow from my library on Twitter. I keep
link |
01:56:29.520
following and unfollowing from my library because because it's so it gives when I follow it leads
link |
01:56:39.440
me on down rabbit holes often that that are very fruitful. But anyway, so the the posts you do
link |
01:56:50.720
with the on Twitter are just these beautiful are things that reveal some beautiful aspect of
link |
01:56:56.720
mathematics. Is there is there something you could say about the approach there? Yeah. And maybe
link |
01:57:08.320
maybe broadly what you find beautiful about mathematics and then more specifically how
link |
01:57:14.560
you convert that into a rigorous process of revealing that in tweet form.
link |
01:57:19.200
That's a good point. I think there's something about math that, you know, a lot of the mathematical
link |
01:57:24.080
content and you know, paid papers or like little proofs, you know, has in a way sort of an infinite
link |
01:57:31.920
off life. What I mean by that is that if you look at like Euclid's elements, it's as valid today as
link |
01:57:38.480
it was when he was created like 2000 years ago. And that's not true for a lot of other scientific
link |
01:57:44.240
fields. And so in regards to Twitter, I think there's also a very it's a very undexed underexplored
link |
01:57:54.480
platform from a learning perspective. I think if you look at content on Twitter, it's very easy to
link |
01:58:01.040
consume. It's very easy to read. And especially when you're trying to explain something, you know,
link |
01:58:09.680
we humans get a dopamine hit if we learn something new. And that's a very, very powerful feeling.
link |
01:58:16.880
And that's why, you know, people go to classes when you have a really good professor, you know,
link |
01:58:22.080
it's looking for those dopamine hits. And that's something that we try to explore when
link |
01:58:29.360
we're producing content on Twitter. Imagine if we could, if you would on a line to a restaurant,
link |
01:58:34.560
you could go to your phone to learn something new instead of going to a, you know, social network.
link |
01:58:41.920
And so, and I think it's very hard to sometimes to kind of provide that feeling because you need to
link |
01:58:50.400
sometimes digest content and put it in a way, you know, that it feeds 280 characters. And it
link |
01:58:58.400
requires a lot of sometimes time to do that, even though it's easy to consume, it's hard to make.
link |
01:59:04.160
But once you are able to provide that eureka moment to people, like that's very powerful,
link |
01:59:10.080
they get that dopamine hit and like you create this feedback cycle and people come back for more.
link |
01:59:15.840
And in Twitter, compared to like, you know, an online course for a book, you have a 0%
link |
01:59:20.560
dropout. So people will read the content. So that it's like, it's part of the creators,
link |
01:59:27.040
like the person that is creating the content, if you're able to actually get that feedback cycle,
link |
01:59:31.600
it's super, super powerful. Yeah. But some of this stuff is like, like, how the heck do you find
link |
01:59:37.440
that? And I don't know why it's so appealing. Like, this is from a, what is it? A couple days ago,
link |
01:59:47.360
I'll just read out the number 23456789 is the largest prime number with consecutive
link |
01:59:53.760
increasing digits. I mean, that is so cool. That's like some weird like glimpse into some
link |
02:00:02.720
deep universal truth, even though it's just a number. I mean, that's like so arbitrary. Like,
link |
02:00:09.040
why, why is it so pleasant that that's a thing, but it is in some way, it's almost like it is
link |
02:00:14.240
a little glimpse at some much bigger, like. And I think like, especially if we're talking about
link |
02:00:21.440
science, there's something unique about you go and with a lot of the tweets, you go sometimes from
link |
02:00:26.800
a state of not knowing something to knowing something. And that is very particular to science,
link |
02:00:32.160
science, math, physics. And that, again, is extra extremely addictive. And that's that's how I feel
link |
02:00:38.240
about that. And that's why I think people engage so much with our tweets and go into rabbit holes.
link |
02:00:45.200
And then they, you know, we start with prime numbers. And all of a sudden, you are spending hours
link |
02:00:50.000
reading number theory things, and you go into Wikipedia. And it was a lot of time there.
link |
02:00:56.800
Well, the variety is really interesting, too. There's human things. There's, there's physics
link |
02:01:01.440
things. There's like numeric things, like I just mentioned, but there's also more rigorous
link |
02:01:07.760
mathematical things. There's stuff that's tied to the history of math and the proofs. And
link |
02:01:13.200
there's visual, there's animations. There are looping animations that are incredible,
link |
02:01:17.840
that reveal something. There's Andrew Wiles on being smart. This is just me now, like,
link |
02:01:25.600
ignoring you guys and just going through. No, yeah, we're a bit like math drug dealers. We're
link |
02:01:29.920
just trying to get you hooked. We're trying to give you that hit and trying to get you hooked.
link |
02:01:34.720
Yes, some people are brighter than others. But I really believe that most people can really get
link |
02:01:39.760
to to quite a good level of mathematics if they're prepared to deal with these
link |
02:01:44.000
psychological issues of how to handle the situation of being stuck.
link |
02:01:47.600
Yeah, there's some truth to that. That's truth. I feel that's like really,
link |
02:01:52.800
that's some truth in terms of research and also about startups. You're stuck a lot of the time
link |
02:01:58.560
before you get to a breakthrough. And it's difficult to endure that process of like being stuck
link |
02:02:04.160
because you're not trying to to be in that position. I feel, yeah, that's
link |
02:02:09.920
Yeah, most people are broken by the stuckness or like their district. Like,
link |
02:02:13.760
uh, I've been very cognizant of the fact that more and more social media becomes a thing.
link |
02:02:24.080
Like distractions become a thing that that moment of being stuck is your mind wants to
link |
02:02:31.040
to go do stuff that's unrelated to being stuck and you should be stuck. I'm referring to small
link |
02:02:36.720
stucknesses. Like you're like trying to design something and it's a dead end, basically little
link |
02:02:42.480
dead ends, dead ends and programming dead ends and trying to think through something. And then
link |
02:02:47.600
your mind wants to like, like, like, uh, this is the problem with this like work life balance
link |
02:02:54.640
culture is like take a break. Like as if taking a break will solve everything. Sometimes it solves
link |
02:03:01.840
quite a bit, but like sometimes you need to sit in the stuckness and suffer a little bit and then
link |
02:03:06.080
take a break. But you definitely need to be and like most people quit from that psychological
link |
02:03:13.600
battle of being stuck. So success is people who, who, who, uh, persevere through that.
link |
02:03:20.720
Yeah. Yeah. And in the creative process, that's also true. I was the other day I was, I think
link |
02:03:26.000
was reading about this. What is his name? Ed Sheeran, like the musician was talking a little
link |
02:03:30.880
bit about the creative process and using was using this analogy of a faucet, like
link |
02:03:35.360
where you, when you turn on a faucet is as like the dirty water coming out in the beginning.
link |
02:03:40.480
And you just have to, you know, keep trusting that at some point you're clean, clean, clear water
link |
02:03:46.160
will come out, but you have to endure that process. Like in the beginning it's going to be dirty water
link |
02:03:50.880
and, and, and just, you know, embrace that. Yeah. Actually this, uh, the entirety of my
link |
02:03:56.880
YouTube channel and this podcast have been following that philosophy of dirty water.
link |
02:04:01.840
Like I've been, you know, I do believe that like you have to get all the crap out of your system
link |
02:04:07.120
first. And uh, sometimes it's all, it's all, sometimes it's all crappy work. I mean, I tend
link |
02:04:13.200
to be very self critical, but I do think that quantity leads the quality for some people.
link |
02:04:18.240
It does for my, the way my mind works is like just keep putting stuff out there, keep creating
link |
02:04:23.680
and, uh, the quality will come as opposed to sitting there waiting, not doing anything until
link |
02:04:32.160
the thing seems perfect because the perfect may never come.
link |
02:04:35.840
But just, just on like, on, on, on our Twitter, uh, like profile, I really, and sometimes when
link |
02:04:41.120
you look on, on some of those tweets, they might seem like pretty kind of, um, you know,
link |
02:04:46.560
why is this interesting? It's like so raw, uh, like it's just a number. But I really believe
link |
02:04:52.160
that especially with math or physics, it is possible to get everyone to love math or physics,
link |
02:04:58.400
even if you think you hate it. It's, it's not a function of the student or the person that is
link |
02:05:03.120
on the other side. I think it's just purely a function of like how you explain, uh, hidden
link |
02:05:08.480
beauty that they hadn't realized before. It's not easy, but I think it's like a lot of the times
link |
02:05:14.400
is on, like on the creator side to, to be able to like show that beauty to the other person.
link |
02:05:20.480
I think some of that is native to, to humans. We just have that curiosity and you look at
link |
02:05:25.680
small toddlers and babies and like them trying to figure things out. And there's just something
link |
02:05:30.560
that is born with us that we, we, we want for that understanding. We want to figure out the
link |
02:05:35.360
world around us. And, and so yeah, it shouldn't be like, uh, whether or not people are going,
link |
02:05:40.960
are going to, to enjoy it. Like I also really believe that everybody has that capacity to
link |
02:05:47.920
fall in love with, with math and physics. You mentioned startup. What do you think it takes
link |
02:05:54.000
to build a successful startup? Yeah, that it's what, what Louise was saying that, um, you need to
link |
02:06:01.440
end, to be able to endure being stuck. And, and I think the best way to put it is that startups
link |
02:06:08.400
don't have a linear reward function, right? You oftentimes don't get rewarded for effort.
link |
02:06:15.520
And, and, and in most of our lives, we go through these processes that do give you those small
link |
02:06:23.840
rewards for effort, right? In school, you study hard, generally you'll get a good grade. And
link |
02:06:27.760
then you get, you get like good grades ever, or you get grades every semester. And so you're,
link |
02:06:32.800
you're slowly getting rewarded and pushed in the right direction for, for startups. And startups
link |
02:06:39.840
are not the only thing that is like this, but for startups, it's, you know, you can put in a ton of
link |
02:06:44.640
effort into something that, and then get no reward for it, right? It's, it's like, like Sisyphus
link |
02:06:50.240
Boulder, where you were pushing that Boulder up the mountain and, and, and you get to the top
link |
02:06:56.240
and then it just rolls all the way back down. And, and so that's something that I think a lot of
link |
02:07:00.960
people are not equipped to deal with and can be incredibly demoralizing, especially if that happens
link |
02:07:07.440
more than, than a few times. And so, but I think it's absolutely essential to, to power through it,
link |
02:07:14.000
because by the nature of startups, it's often times, you know, you're dealing with, with,
link |
02:07:20.000
with non obvious ideas and things that, that might be contrarian. And so you're gonna, you're
link |
02:07:26.800
gonna run into, into that a lot. You're gonna do things that are not going to work out. And you
link |
02:07:31.360
need to be prepared to deal with that. But, but if we're not coming out of college, you're, you're
link |
02:07:36.880
just not equipped. I'm not sure if there's a way to train people to deal with those non linear
link |
02:07:42.640
reward functions, but it's definitely, I think, one of the most difficult things to, you know,
link |
02:07:48.640
about doing a startup. And also happens in research. Sometimes, you know, we're talking
link |
02:07:52.720
about the default state is being stuck. You just, you know, you don't know, like you try things,
link |
02:07:57.280
you get zero results, you close doors, you constantly closing doors, until you, you know,
link |
02:08:02.240
find something. And yeah, that is a big thing. What about sort of this point when you're stuck,
link |
02:08:08.880
there's a kind of decision, whether you, if you have a vision to persist through, through with
link |
02:08:14.960
this direction that you've been going along, or what a lot of startups do or businesses pivot.
link |
02:08:21.280
How do you decide whether like, to give up on a particular flavor of the way you've imagined
link |
02:08:29.040
the design, and to like, adjust it or completely, like, alter it.
link |
02:08:34.720
I think that's a core question for startups that I've asked myself exactly. And like, I've never
link |
02:08:40.640
been able to come up with a great framework to make those decisions. I think that's really at the
link |
02:08:46.160
core of, yeah, out of a lot of the toughest questions that, that people that's that started a
link |
02:08:53.360
company have to deal with. I think maybe the best framework that I was able to figure out is like,
link |
02:09:00.720
I was able to figure out is like, when you run out of ideas, you just, you know, you're exploring
link |
02:09:05.920
something is not working, you try it in a different angle, you know, we try different business model.
link |
02:09:11.760
When you run out of ideas, like, you don't have any more cards, just switch and yeah, it's not
link |
02:09:18.880
perfect. Because you also, it's, you have a lot of stories of startups was like,
link |
02:09:23.440
people kept pushing, and then, you know, that paid off. And then you have philosophies is like,
link |
02:09:30.640
fell fast, and pivot fast. So it's, you know, it's hard to, you know, balance these two worlds
link |
02:09:37.920
and understand what is the best framework. And I mean, if you look at Fermat's library, you're,
link |
02:09:44.160
maybe you can correct me, but it feels like you're an operating in a space
link |
02:09:48.080
where there's a lot of things that are broken, and or could be significantly improved. So it
link |
02:09:55.280
feels like there's a lot of possibilities for pivoting. Or like, how do you revolutionize
link |
02:10:01.200
science? How do you revolutionize the aggregation, the, the annotation, the commenting the community
link |
02:10:09.360
around information of knowledge, structured knowledge? I mean, that's kind of what like
link |
02:10:13.840
Stack Overflow and Stack Exchange has struggled with to come up with a solution. And they've come
link |
02:10:19.520
up, I think with an interesting set of solutions, that are also, I think flawed in some ways,
link |
02:10:24.160
but they're much, much better than the alternatives. But there's a lot of other possibilities. If we
link |
02:10:29.680
just look at papers, as we talked about, there's so many possible revolutions, and there are a lot
link |
02:10:34.000
of money to be potentially made as revolutions, plus coupled with that, the benefit to humanity.
link |
02:10:39.840
And so like, you're sitting there, like, I don't know how many people are legitimately,
link |
02:10:45.600
from a business perspective, playing with these ideas. It feels like there's a lot of ideas here.
link |
02:10:50.320
True. There is.
link |
02:10:51.360
Are you right now grinding in a particular direction? Like, is there a like a five year
link |
02:10:56.640
vision that you're thinking in your mind? For us, it's more like a 20 year vision,
link |
02:11:02.080
in the sense that we've consciously tried to make the decision of, so we run Formats as
link |
02:11:10.720
it's a side project. And it's a separate in the sense like it's not what we're working on full
link |
02:11:15.840
time. And, but our thesis there is that we actually think that it's that's a good thing,
link |
02:11:24.240
at least for this stage of Formats library. And also because some of these projects,
link |
02:11:29.920
you just, if you're coming from a start, from a startup framework, you probably try to try to fit
link |
02:11:39.680
every single idea into something that can change the world within three to five years. And there's
link |
02:11:44.800
just some problems that take longer than that. Right. And so, you know, we're talking about
link |
02:11:49.120
archive, and I'm very doubtful that you could grow like archive into what it is today, like within
link |
02:11:55.600
two or three years, no matter how much money you throw at it, there's just some things that can
link |
02:11:59.920
take longer. But you need to be able to power through the time that it takes. But if you look
link |
02:12:07.360
at it as it's okay, this is a company, this is a startup, we have to grow fast, we have to raise
link |
02:12:11.120
money, then then sometimes you might forgo those ideas because of that, because they don't very
link |
02:12:18.800
well fit into the typical startup framework. And so, for us, Formats, it's something that we're
link |
02:12:26.560
okay with growing, with having it grow slowly, and maybe taking many years. And that's why we
link |
02:12:33.280
think it's not a bad thing that it is a side project, because it makes it much more acceptable
link |
02:12:40.400
in a way to be able to be okay with that. That said, I think what happens is if you keep pushing
link |
02:12:47.920
new little features, new little ideas, I'd feel like there's like certain ideas will just become
link |
02:12:53.280
viral. And then you just won't be able to help yourself, but it'll revolutionize things. It
link |
02:12:59.520
feels like there needs to be, not needs to be, but there's opportunity for viral ideas to change
link |
02:13:07.600
science. Absolutely. And maybe we don't know what those are yet. It might be a very small kind of
link |
02:13:13.040
thing. Maybe you don't even know if should this be a for profit company doing these? It's the
link |
02:13:18.000
Wikipedia question. Yeah. There are a lot of questions, like really fundamental questions
link |
02:13:24.080
about this space that we've talked about. I mean, you take Wikipedia and you try to run it as a
link |
02:13:29.440
startup, and by now, we'd have a paywall, you'd be paying $9.99 a month to read more than $20.
link |
02:13:34.320
I mean, that's one view. The other, the ad driven model, so they rejected the ad driven model.
link |
02:13:41.440
I don't know if we could, I mean, this is a difficult question, you know,
link |
02:13:46.400
if archive was supported by ads, I don't know if that's bad for archive. If Fermat's library was
link |
02:13:52.880
supported by ads, I don't know. I don't, I'm not, it's not trivial to me. I'm, unlike I think a lot
link |
02:13:59.360
of people, I'm not against advertisements. I think ads when done well are really good. I think the
link |
02:14:06.000
problem with Facebook and all the social networks are the way, the lack of transparency around the
link |
02:14:10.960
way they use data, and the lack of control the users have over their data, not the fact that
link |
02:14:16.480
data is being collected and used to sell advertisements. It's a lack of transparency,
link |
02:14:21.360
lack of control. If you, if you do a good job of that, I feel like it's really nice way to make
link |
02:14:25.920
stuff free. Yeah, it's like Stack Overflow, right? I think they've done a good job with that,
link |
02:14:32.720
even though as we said, like they're capturing very little of the value that they're putting out
link |
02:14:37.120
there, right? But, but it makes it a sustainable company, and they're providing a lot of, it's a
link |
02:14:44.160
fantastic and very productive community. Let me ask a ridiculous tangent of a question.
link |
02:14:50.640
Louise, you wrote a paper on Game of Thrones, Battle of Winterfell, just as a side, I'm sorry,
link |
02:14:58.000
I noticed, I'm sure you've done a lot of ridiculous stuff like this. I just noticed that particular
link |
02:15:01.920
one by ridiculous, I mean, ridiculous, the awesome. Can you describe the approach in this work,
link |
02:15:08.160
which I believe is a legitimate publication? So going back to the original, like when we were
link |
02:15:14.400
talking about the backstory of papers and the importance of that, so this is actually, you
link |
02:15:18.880
know, there was a, when the last season of the show was airing, this was a during a company
link |
02:15:25.760
lunch, there was, in the last season, there's a really big battle against the forces of evil
link |
02:15:34.320
and the forces of good, and it's called the Battle of Winterfell. And in this battle,
link |
02:15:42.080
there are like these two armies, and there's a very particular thing that they have to take
link |
02:15:46.800
into account is that in the army of death, like if someone dies in the army of the living,
link |
02:15:52.320
like that person is going to, you know, be reborn as a soldier in the army of the dead.
link |
02:15:59.600
And so that was an important thing to take into account. And the initial conditions,
link |
02:16:03.760
as you specified, it's about 100,000 on each side. Exactly. So I was able to, like, based on
link |
02:16:08.800
some images, like on previous episodes, to figure out what was the size of the armies.
link |
02:16:12.640
And so what I want, what we wanted to, what we were theorizing was like, how many soldiers does,
link |
02:16:18.320
like, a soldier on the army of the living has to kill in order for them to be able to destroy
link |
02:16:25.680
the army of the dead without, like, losing, because every time one of the good soldiers dies,
link |
02:16:31.760
going to turn into, like, the other side. And so it's, so we were theorizing that, and I wrote
link |
02:16:37.280
a couple of differential equations, and I was able to figure out that based on the size of the armies,
link |
02:16:42.800
I think, I think was the ratio had to be, like, 1.7. So it had to kill, like, 1.7 soldiers of,
link |
02:16:49.200
like, the army of the dead in order for them to win the battle. Well, yeah, that's science,
link |
02:16:54.880
it's the most powerful. And this is also somehow a pitch for, like, a hiring pitch, in a sense,
link |
02:17:04.240
like, this is the kind of important science you do at lunch. Exactly. Well, it turned out to be,
link |
02:17:09.440
you know, as for people that have watched these shows, they know that every time you try to predict
link |
02:17:15.520
something that is going to happen, you're going to fail miserably, and that's what happens. So it
link |
02:17:19.520
was not at all important for the show, but we ended up, like, putting that out, and there
link |
02:17:25.920
was a lot of people that shared that. And I think it was some, like, elements of the show,
link |
02:17:29.920
the cast of the show, that actually retweeted that and shared that. That person was fun.
link |
02:17:34.720
I would love if this kind of calculation happened, like, during the making of the show, or, you know,
link |
02:17:41.280
I love it, like, in, for example, I now know Alex Garland, the director of Ex Machina,
link |
02:17:49.200
and I love it, and he doesn't seem to be some, not many people seem to do this, but I love it
link |
02:17:55.440
when directors and people who wrote the story really think through the technical details,
link |
02:18:01.840
like, whether it's knowing, like, how things, even if it's science fiction,
link |
02:18:06.960
if you were to try to do this, how would you do this? Like, Stephen Wolfram and his son were,
link |
02:18:13.760
were collaborating with the movie Arrival in designing the alien language of how you
link |
02:18:18.560
communicate with aliens. Like, how would you really have a math based language that could
link |
02:18:25.680
span the alien and being and the human being? So, I love it when they have that kind of rigor.
link |
02:18:33.360
The Martian was also big on that, like, the book in the movie was all about, like, can we actually,
link |
02:18:39.440
is this plausible? Can this happen? It was all about that.
link |
02:18:42.320
And that can really bring you in, like, sometimes those small details. I mean,
link |
02:18:47.200
the guy that wrote the Martian book is another book that is also filled with those, like,
link |
02:18:52.560
things that when you realize that, okay, these are grounded in science can just really bring you
link |
02:18:58.880
in. Yeah. Like, he has a book about a colony on the moon.
link |
02:19:03.120
A colony on the moon. And he goes about, like, all the details that would, you know,
link |
02:19:06.800
be required about setting up a colony in the moon and, like, things that he wouldn't think about,
link |
02:19:11.680
like, the fact that they would, you know, it's hard to bring, like, air to the moon so they wouldn't,
link |
02:19:18.640
like, how do you make that breathable, that environment breathable? You need to bring oxygen.
link |
02:19:23.760
But, like, you probably wouldn't bring nitrogen. So, what you do is, like, instead of having an
link |
02:19:31.280
atmosphere that is 100% oxygen, you, like, decrease the pressure so that you have the
link |
02:19:36.480
same ratio of oxygen on Earth, but, like, lowering the pressure here. And so, like,
link |
02:19:41.280
things like water boils at the lower temperature. So, people would have coffee and the coffee would
link |
02:19:47.360
be colder. Like, there was a problem in this environment in the moon. So, like, and these
link |
02:19:52.640
are, like, small things in the book. But I studied physics. So, like, when I read these,
link |
02:19:58.640
that throws me into, like, tangents. And I start researching that. And it's like,
link |
02:20:04.640
I really like to read books and watch movies when they go to that level of detail about science.
link |
02:20:11.920
Yeah. I think Interstellar was one where they also consulted heavily with the number of
link |
02:20:15.680
these. And I think even resulted in a couple of papers about, like, the black hole visualizations.
link |
02:20:24.000
Yeah. And there's even more examples of interesting science around, like, these
link |
02:20:28.080
fantasy. We were reading at some point, like, these guys that were trying to figure out if
link |
02:20:34.960
the Tolkien's Middle Earth, if it was round, if it was, like, a sphere, if it was, like, a flat
link |
02:20:42.800
flat. Based on the map. Based on the map and some of the references in the books. And so,
link |
02:20:50.240
yeah, we actually, I think we tweeted about that. Yeah, we did. Based on the distance
link |
02:20:54.080
between the cities, you can actually prove that that could be, like, a map of a sphere,
link |
02:20:59.120
or, like, a spheroid. And you can actually calculate the radius of that planet.
link |
02:21:03.760
That's fascinating. I mean, yeah, that's fascinating. But there's something about,
link |
02:21:11.600
like, calculating the number, like, exactly the calculation you did for the Battle of Winterfell
link |
02:21:20.320
is something fascinating about that. Because it's not, like, being, that's very mathematical,
link |
02:21:27.120
versus, like, grounded in physics. And that's really interesting. I mean,
link |
02:21:31.440
that's like injecting mathematics into fantasy. There's something magical about that.
link |
02:21:38.800
I see what you're saying. And that, for me, that's why I think it's also when you look at things like
link |
02:21:44.880
Fermat's last theorem, like, problems that are very kind of self contained and simple to state.
link |
02:21:50.560
I think, like, that's the same with that paper. It's very easy to understand the boundaries
link |
02:21:54.560
of the problem. And that, for me, that's why math is so appealing. And those, like,
link |
02:22:03.040
problems are also so appealing to the general public. It's not that they look simple, or that
link |
02:22:07.680
people think that they're easy to, like, solve. But I feel that a lot of the times they are
link |
02:22:13.680
almost intellectually democratic, because everyone understands the starting point.
link |
02:22:18.240
You know, you look at Fermat's last theorem, everyone understands, like, this is the universe
link |
02:22:24.000
of the problem. And the same maybe with that paper, everyone understands, okay, these are the
link |
02:22:27.600
starting conditions. And, yeah, the fact that it becomes intellectually democratic, and I think
link |
02:22:34.640
that's a huge motivation for people. And that's why so many people gravitate towards these, like,
link |
02:22:40.160
Riemann hypotheses, or Fermat's last theorem, or that simple paper, which is, like, just one page.
link |
02:22:44.560
It was very simple. And I just talked to somebody, I don't know if you know who he is,
link |
02:22:48.880
Jaco Willink, who was this person who, among many things, loves military tactics. So he would
link |
02:22:58.080
probably either publish a follow on paper, maybe you guys should collaborate. But he would see the
link |
02:23:04.000
fundamental, the basic assumptions that he started that paper with is flawed. Because, you know,
link |
02:23:08.960
there's like dragons, too, right? There's like, like, you have to integrate tactics. Because
link |
02:23:13.920
it's not, it's not, it's not a homogeneous system. It's not, I don't take into account the dragons
link |
02:23:19.040
and like, and he would say tactics fundamentally change the dynamics of the system. And so like,
link |
02:23:24.400
that's what happened. So, yeah, so at least from a scientific perspective, he was right,
link |
02:23:31.360
but he never published. So there you go. Let me ask the most important question. You guys are from
link |
02:23:36.480
Portugal, both in Portugal. So who is the greatest soccer player, footballer of all time?
link |
02:23:45.520
Yeah, I think we're a little bit biased on this topic. But I mean, I have a huge,
link |
02:23:53.040
I have a tremendous respect for what. Here we go.
link |
02:23:59.840
You can convince me. I mean, I have tremendous respect for what Ronaldo has achieved in his
link |
02:24:05.200
career. And I think soccer is one of those sports where I think you can get to maybe be
link |
02:24:09.840
one of the best players in the world. We, if you just have like natural talent, and even if you
link |
02:24:16.160
don't put a lot of hard work and discipline into soccer, you can be one of the best players in the
link |
02:24:21.680
world. And I think Ronaldo is kind of like, of course, he's naturally talented, but he also,
link |
02:24:26.560
Ronaldo would say the football from exactly from Portugal and not the Brazilian in this case.
link |
02:24:33.600
And so, and Ronaldo came from nothing, he's known from being probably one of the hardest working
link |
02:24:40.000
athletes in the game. And I see that sometimes a lot of these discussions about the best player,
link |
02:24:45.840
a lot of people tend to gravitate towards like, you know, this person is naturally talented,
link |
02:24:51.760
and the other person has to work hard. And so, as if it was bad, if he had to work hard to be
link |
02:24:58.960
good at something. And I think that's, you know, the, I think so many people fall into that trap.
link |
02:25:05.920
And the reason why so many people fall into that trap is because if you're saying that someone
link |
02:25:11.600
is good and achieved a lot of success by working hard, as opposed to achieving success because
link |
02:25:17.440
he has some sort of God given natural talent, that you can't explain why the person was born with
link |
02:25:22.720
that. What does it tell you about you? It tells you that maybe if you work hard on a lot of
link |
02:25:28.560
fields, you could have, could accomplish a lot of great things. And I think that's hard to digest
link |
02:25:33.200
for a lot of people. And in that way, Ronaldo's inspiring that. I think so. So you find hard
link |
02:25:38.560
work inspiring, but he's way too good looking. That's the, I don't like him probably. No, I like
link |
02:25:45.600
the part of the hard work and like, of him being like one of the hardest working athletes in Saga.
link |
02:25:51.680
So he is to you the greatest of all time? Is he up there? Is he will be number one? Okay.
link |
02:25:56.320
Okay. Do you agree with this? Hard or disagree? Well, I definitely disagree. I mean, I like him
link |
02:26:02.800
very much. He works hard. I admire, I admire, you know, would like, he's incredible goal scorer,
link |
02:26:12.480
right? But I, so first of all, Leo Messi, and there was some confusion because I've kept saying
link |
02:26:21.360
Maradona is my favorite player, but I think, I think Leo has surpassed them. So it's messy
link |
02:26:29.680
than Maradona than Pele for me. But the reason is, is there's certain aesthetic definitions of
link |
02:26:38.800
beauty that I admire, whether it came by hard work or through God given talent or through
link |
02:26:45.200
anything. It doesn't, it doesn't really matter to me. There's certain aesthetic,
link |
02:26:48.480
like, like genius when I, when I see it to me, and especially it doesn't have to be consistent.
link |
02:26:55.440
It isn't a case of Messi in case of the Ronaldo, but just even moments of genius,
link |
02:27:00.720
which is where Maradona really shines. It's, I, even if that doesn't translate into like
link |
02:27:06.720
results and goals being scored, right, right? And that's the challenge. I'm like, I did that
link |
02:27:11.360
because that's where people that tell me that Leo Messi is never, even on strong teams have led
link |
02:27:20.400
his, the national team, people as part of the World Cup, right? That's really important. And
link |
02:27:26.480
to me, no, it's the moment, like winning to me was never important. What's more important is the
link |
02:27:31.520
moments of genius. And, but you're, you're talking to the human story. And
link |
02:27:38.000
yeah, Cristiano Ronaldo definitely has the beautiful human story.
link |
02:27:42.800
Yeah. And I think you can't, for me, it's hard to decouple those two. I don't, I don't just look at,
link |
02:27:49.280
you know, the, the list of achievements, but I like how he got there and how he keeps pushing
link |
02:27:54.080
the boundaries at like almost 40. And how that sets up an example. Like maybe 10 years ago,
link |
02:27:59.440
I wouldn't have ever imagined that like one of the top players in the world could be a top
link |
02:28:03.840
player at like 37 or, but so, and there's an interesting tent. The human story is really
link |
02:28:08.320
important. But like, if you look at Ronaldo, he's like, he's somebody like kids could aspire to be.
link |
02:28:15.600
But at the same time, I also like Maradona, who like is a, is a tragic figure in many ways.
link |
02:28:21.600
It's like the, you know, the drugs, the, the temper, all of those things, that's beautiful too.
link |
02:28:27.840
Like I don't necessarily think to me, the flaws are beautiful too in athletes. I don't think
link |
02:28:36.480
you need to be perfect from a personality perspective. Those flaws are also beautiful.
link |
02:28:43.120
So, but yeah, there is something about hard work. And there's also something about the
link |
02:28:49.840
being an underdog and being able to carry a team. That's, that's an argument from Maradona.
link |
02:28:56.240
I don't know if you can make that argument for Messi and Ronaldo either, because they've all
link |
02:29:01.120
played on superstar teams for most of their lives. So I don't know how,
link |
02:29:08.240
you know, it's, it's difficult to know how they would do
link |
02:29:12.640
when they had to work, like did what Maradona had to do to carry a team on the shoulders.
link |
02:29:18.240
True. And Pele did as well. And so, depending on the, the context.
link |
02:29:22.880
Maybe you could argue that with the Portuguese national team, but we have a good team.
link |
02:29:27.680
Yeah. But maybe what Maradona did with, you know, Napples and, and a couple other teams,
link |
02:29:33.200
it's, it seems incredible.
link |
02:29:34.400
It speaks to the beauty of the game that, you know, we're talking about all these different
link |
02:29:37.680
players that have, or especially, you know, if you're comparing Messi and Ronaldo,
link |
02:29:41.840
they have such different, you know, styles of play and also even their bodies are so different.
link |
02:29:47.440
And, and, and, but these two very different players can be at the top of the game. And
link |
02:29:54.320
that's not, that's the, there are not a lot of other sports where you, where you have that,
link |
02:29:59.120
you know, like you have kind of a mental image of a basketball player and like the top basketball
link |
02:30:05.440
players kind of fit that mental image and they look a certain way. And, but for soccer, there's
link |
02:30:11.600
some, there's the, it's, it's not so much like that. And, and that's, I think that's, that's
link |
02:30:17.520
beautiful. But that's, that really adds something to the sport.
link |
02:30:21.440
Well, do you play soccer yourself? Have you played that in your life? What do you find
link |
02:30:25.760
beautiful about the game?
link |
02:30:26.960
Yeah. I mean, it's one of the, I'd say it's the biggest sport in Portugal. And so growing up,
link |
02:30:31.600
we played a lot.
link |
02:30:32.720
Did you see the paper from DeepMind? I didn't look at it where they're like,
link |
02:30:37.040
doing some analysis on soccer strategy.
link |
02:30:40.240
Yeah. Interesting.
link |
02:30:41.520
I saved that paper. I haven't read it yet. It's actually, I, when I was in college,
link |
02:30:47.520
I actually did some research on, on applying machine learning and statistics in sports.
link |
02:30:55.120
And in our case, in our case, we're doing it for basketball. But what they're effectively
link |
02:31:02.640
trying to do was, have you ever watched Moneyball? So they're trying to do something
link |
02:31:07.760
similar, right? Taking that, in this case, basketball, taking a statistical approach
link |
02:31:12.160
to, to basketball. The interesting thing there is that baseball is much more about having these
link |
02:31:19.280
discrete events that happen kind of in similar conditions. And so it's easier to take a statistical
link |
02:31:24.240
approach to it. Whereas basketball, it's a much more dynamic game. It's harder to measure.
link |
02:31:31.760
It's hard to replicate these conditions. And so you have to think about it in a slightly
link |
02:31:37.680
different way. And so we were doing work on that and working like with the Celtics to analyze the,
link |
02:31:43.360
the data that they had, like they had these cameras in the, in the arena, they were tracking
link |
02:31:47.120
the players. And so you, so they have, they had a ton of data, but they didn't really know what to
link |
02:31:50.800
do with it. And so we, we were doing work on that. And, and, and soccer is maybe an even a step
link |
02:31:57.120
further. It's, it's right. It's a game where you don't have as many in basketball, you have a lot
link |
02:32:01.920
of field goals. And so you can measure success. Soccer, it's, it's right. It's more of a Poisson
link |
02:32:07.600
process almost where it's like, you have a goal like or two in a game. In terms of metrics,
link |
02:32:12.640
I wonder if there's a way, and I've actually have thought about this in the past, never coming up
link |
02:32:16.880
with any good solution. If there's a way to definitively say whether it's messy or not,
link |
02:32:21.120
they're the greatest of all time. Like, like honestly, sort of measure, like convert the game
link |
02:32:26.720
of soccer into metrics, like you said, baseball, but like those moments are genius. Like, like,
link |
02:32:33.760
you know, if it's just about goals or passes that led to goals, that feels like it doesn't capture
link |
02:32:38.800
the genius of the play. Yeah, they'll be like, you know, like, like you kind of do, you have
link |
02:32:44.880
more metrics, for instance, in chess, right? And you can try to understand all heart of a move
link |
02:32:50.800
that was, you know, there's like Bobby Fisher as this move that like, that it's, I think it's
link |
02:32:56.080
called the move of the century, where you have to go so deep into the tree to understand that
link |
02:33:01.680
that was the right move, and you can quantify how hard it was. So it'd be interesting to try to think
link |
02:33:06.720
of those type of metrics, but say for soccer and computer vision, unlock some of that for us.
link |
02:33:10.960
That's, that's one possibility. I have a cool idea, a computer vision product legs that you
link |
02:33:15.840
could build for soccer. Like, if you could detect the ball and like imagine that it seems like
link |
02:33:24.640
totally doable right now. But like, if you could detect when the ball enters one of the goals,
link |
02:33:29.840
and like just had like, you know, a crowd cheering for you when you're playing soccer with your
link |
02:33:35.200
friends, every time you score a goal, or you had like the, the Champions League song going on.
link |
02:33:40.160
And like having that, like you go play soccer with your friends, just turn that on. And there's
link |
02:33:44.000
like a computer vision, like program, analyzing the ball every time there's a goal. Like if you
link |
02:33:48.880
miss, like there's a, you know, the fan reacting to that. And then it should be pretty simple by now.
link |
02:33:54.480
It's like, I think there's an opportunity. Yeah, just throwing that. I would go all out. But by the
link |
02:33:59.280
way, I did, I've never released, I was thinking of just putting on GitHub, but I did write exactly
link |
02:34:04.640
that, which is the trackers for the players for the, for the bodies of the player is this is the
link |
02:34:09.600
hard part, actually, the detection of player bodies and the ball is not hard. What's hard is
link |
02:34:16.000
is very like robust tracking through time of each of those. So like, so I've wrote a track of this
link |
02:34:24.000
pretty damn good. Is that open source? You open source? No, I've never released it because interesting
link |
02:34:29.680
because I thought like, I need to, this is the perfection thing because I knew it was going
link |
02:34:36.880
to be like, it's going to pull me in and it wasn't really that done. And so I've never actually been
link |
02:34:43.920
part of a GitHub project where it's like really active development. And I didn't want to make it,
link |
02:34:48.880
I knew there's a non zero probability that will become my life for like a half a year.
link |
02:34:53.520
That's just how much I love soccer and all of those kinds of things. And, and ultimately,
link |
02:34:58.320
it will be all for just the, the joy of analyzing the game, which I'm all for.
link |
02:35:04.000
I remember you also like one of, in one of the episodes you mentioned that you did also a lot
link |
02:35:07.360
of eye tracking analyses on like Joe Rogan's. That was the, that was the research side of my life.
link |
02:35:12.240
Interesting. Yeah. And you have that library, right? You, you kind of downloaded all the episodes.
link |
02:35:16.480
Yep. Allegedly. I, of course I didn't. If you're a lawyer, I'm listening to this.
link |
02:35:21.040
No, it is, I was listening to the episode where you mentioned that. And I was actually,
link |
02:35:25.040
there was something that I might ask you for, for access to that, to allegedly that library.
link |
02:35:31.040
But I was doing some, not, not regarding like eye tracking, but I was playing around with
link |
02:35:37.200
analyzing the distribution of silences on one of the Joe Rogan episodes. So like,
link |
02:35:43.040
I did that for the Elon conversation where it's like, you just take all the silences
link |
02:35:49.280
after Joe asks the question and Elon responded and you plot that distribution and like and see
link |
02:35:54.800
how, how, how, how that looks like. Yeah. I think there's a huge opportunity,
link |
02:35:59.280
especially long form podcast to do that kind of analysis bigger than Joe.
link |
02:36:04.480
Exactly. But it has to be in fairly unedited podcasts so that you don't cut the silences.
link |
02:36:09.600
So one of the benefits I have, like how I'm doing this podcast is like the, what we're
link |
02:36:13.920
recording today is there's individual audio that being recorded. So like I have the raw
link |
02:36:19.680
information. It's, when it's published, it's all combined together and individual video feeds. So
link |
02:36:24.720
even when you're listening, which I usually don't, I only show one video stream, I, I'll know,
link |
02:36:30.240
I can track your blinks and so on. But, but ultimately the hope is you don't need that raw
link |
02:36:37.840
data because if you don't need the raw data for whatever analysis you're doing, you can then
link |
02:36:41.920
do a huge number of podcasts because there's, it's quickly growing now, the number, especially
link |
02:36:46.480
comedians. There's quite a few comedians with, with long form podcasts and they have a lot of
link |
02:36:54.080
facial expressions. They have a lot of fun and all those kinds of things. And it's, it's prone
link |
02:36:58.160
for analysis. And it's, there's so many interesting things. That, that, that idea actually sparked
link |
02:37:03.840
because I was watching a Q and A by, by Steve Jobs and I think it was at MIT. And then like
link |
02:37:10.720
people's like, it did a talk there. And then the Q and A started and people starting asking
link |
02:37:15.440
questions that I was, I was working while listening to it. And like, someone asked the question
link |
02:37:20.000
and he goes like on a 20 second silence before answering the question. I like, I had to check
link |
02:37:24.880
if the, if the video hadn't paused or something. And, and I was thinking about like, if that is a
link |
02:37:32.080
feature of a person, like how long on average you take to respond to a question and if it's like,
link |
02:37:38.480
as to do with the, like how thoughtful you are, and if that changes over time.
link |
02:37:43.040
But it also could be, this is really fascinating metric because it also could be,
link |
02:37:47.360
it's certainly a feature of a person, but it's also a function of the question.
link |
02:37:50.400
True. Like if you normalize to the person, you can probably infer a bunch of stuff about the
link |
02:37:55.680
question. So it's a nice flag. Like it's a really strong signal, the length of that silence,
link |
02:38:01.360
relative to the usual silence they have. So one, the silence is a measure of how thoughtful they are.
link |
02:38:07.040
And two, the particular silence is a measure how thoughtful the question was.
link |
02:38:11.120
Thoughtful the question was. It's really interesting. I mean, yeah.
link |
02:38:13.680
Yeah. I just analyzed Elon's episode, but I think there's like room for exploration there.
link |
02:38:21.280
I feel like the average for comedians would be like, I mean, the time would be so small because
link |
02:38:26.000
you're trained to like, I would think you're reacting to Hickler, to reacting to all sorts
link |
02:38:30.400
of things. You have to be like so quick. Maybe. Yeah. But some of the greatest comedians are very
link |
02:38:35.840
good at sitting in the silence. I mean, there's Lucy Kay, they played with that because you have
link |
02:38:42.480
a rhythm. Like Dave Chappelle, a comedian who did a Joe's show recently, he has a, especially when
link |
02:38:52.720
he's just having a conversation, he does long pauses. It's kind of cool. It's one of the ways
link |
02:39:01.040
to have people hang in your ward is to play with the pauses, to play with the silences
link |
02:39:06.960
and the emphasis and like mid sentence. There's a bunch of different things that, it'd be interesting
link |
02:39:12.560
to really, really analyze, but still soccer to me is that one. I just want a conclusive
link |
02:39:20.080
definitive statement about, because like there are so many soccer highlights of both Messi and
link |
02:39:27.520
Ronaldo. I just feel like the raw data is there. I'm just in the side. Because you don't have
link |
02:39:35.920
that with Pelé and Maradona. Yeah, true. But here's a huge amount of high dev data,
link |
02:39:41.680
then the annoying, the difficult thing. And this is really hard for tracking. And this is actually
link |
02:39:46.880
where I kind of gave up. I didn't really give much effort, but I gave up to the way that highlights
link |
02:39:55.680
or usually football match filmed is they switched the camera. So they'll do a different
link |
02:40:01.600
switch perspective. So you have to, it's a really interesting computer vision problem. When the
link |
02:40:06.160
perspective is switched, you still have a lot of overlap about the players, but the perspective
link |
02:40:11.200
is sufficiently different that you have to like recompute everything. So there's two ways to solve
link |
02:40:17.200
this. One is doing it the full way where you're constantly doing the slam problem. You're doing
link |
02:40:22.720
a 3D reconstruction the whole time and projecting into that 3D world. But you could also, there could
link |
02:40:28.160
be some hacks that I wonder like some trick where you can hop like when the perspective shifts
link |
02:40:36.240
do a high probability tracking hops from one object to another. But I thought especially
link |
02:40:44.080
in exciting moments when you're passing players, like you're doing a single ball dribble across
link |
02:40:52.240
players and you switch perspective, which is when they often do when you're making a run on goal.
link |
02:40:56.400
If you switch a perspective, it feels like that's going to be really tricky to get right
link |
02:41:02.560
automatically. But in that case, for instance, I feel like if somebody released that data set or
link |
02:41:06.880
it's like you just have all like these this data set a massive data set of all these games from
link |
02:41:13.280
from say Ronaldo and Messi like and just you just add that in like whatever CSV format and some
link |
02:41:19.040
some publicly available data set like that. I feel like people just there would be so many
link |
02:41:24.640
cool things that you could do with it and you just set it free and then like the world would
link |
02:41:28.720
like do its thing and then like interesting things would come out of it. By the way, I have this data
link |
02:41:34.080
set. So the two the two things I've did of this scale is soccer. So is body pose and ball tracking
link |
02:41:41.200
for soccer and then I try it's the pupil tracking and blink tracking for it was Joe Rogan and a
link |
02:41:48.720
few other podcasts that I did. So those are the two data sets I have. Did you analyze any of your
link |
02:41:53.840
podcasts? No, I think I really started doing this podcast after after doing that work and it's difficult
link |
02:42:03.840
to maybe I'd be afraid of what I find. I'm already annoyed with my own voice and video like editing
link |
02:42:13.440
it. But perhaps that's the honest thing to do because one useful thing I bought doing computer
link |
02:42:19.120
vision about myself is like I know what I was thinking at the time. So you can start to like
link |
02:42:25.200
connect the particular the behavioral peculiarities of like the way you blink the way you squint
link |
02:42:33.120
the way you close your eyes like talking about details. There's it's like for example, I just
link |
02:42:39.760
closed my eyes. Is that a blink or no? Like figuring that out in terms of timing in terms of the
link |
02:42:46.320
blink dynamics is tricky. It's very doable. I think there's universal laws about what is a
link |
02:42:53.520
blink and what is a closed eye and all those things plus makeup and eyelashes. I actually
link |
02:42:59.680
have annoyingly long eyelashes. So I remember when I was doing a lot of this work, I cut off my
link |
02:43:05.200
eyelashes which when like especially it was funny like female colleagues were like what the
link |
02:43:11.040
fuck are you doing? Like no keep the eyelashes but because it got in the way made the computer vision
link |
02:43:16.480
a lot more difficult. But super interesting topics. Yeah, but speaking about the one still on the
link |
02:43:23.920
topic of the data sets for sports, there's one paper that and I actually annotated it on Fermat
link |
02:43:29.680
and it was published in the 90s. 90s, I believe. 90s or 80s, I forget. But the researcher was
link |
02:43:39.120
effectively looking at the hot and phenomena in basketball. So whether like the fact that you
link |
02:43:47.200
just made a field goal, if on your next attempt, if you're more likely to make it or not.
link |
02:43:56.720
And it was super interesting because I mean he pulled like I think 100 undergrads and I think
link |
02:44:02.160
from Stanford and Cornell and asking people like do you think that you have a higher likelihood of
link |
02:44:07.440
making your free throw if you just made one and I think it's like 68% said yes. They believe that.
link |
02:44:15.520
And then he looked at the data and this was back in as I said like a few decades ago and so I think
link |
02:44:22.720
he had the data set of about, he looked at it specifically for free throws and he had a data
link |
02:44:28.080
set of about 5,000 free throws and effectively what he found was that specifically in the case of
link |
02:44:36.640
free throws, for the aggregate data he didn't find that he couldn't really spot that correlation,
link |
02:44:44.800
that hot and correlation. So if you made the first one, you weren't more likely to make the second
link |
02:44:50.960
one. What he did find was that they were just better at the second one because you just got
link |
02:44:55.760
like maybe a tiny practice and you just attempted once and then you were going to be better at the
link |
02:45:00.960
next one and then I went and there's a data set on Kaggle that has like 600,000 free throws
link |
02:45:08.080
and I re ran the same computations and confirmed like you can see a very clear pattern that they're
link |
02:45:15.040
just better at their second free throw. That's interesting because I think there's similar,
link |
02:45:21.120
that kind of analysis is so awesome because I think with tennis they have like a fault like
link |
02:45:26.000
when you serve they have analysis of like are you most likely to miss the second serve if you
link |
02:45:30.960
missed the first obviously. I think that's the case. So that integrates, that's so cool when
link |
02:45:38.240
psychology is converted into metrics in that way and in sports it's especially cool because it's
link |
02:45:43.680
such a constrained system that you can really study human psychology because it's repeated,
link |
02:45:48.880
it's constrained, so many things are controlled which is something you rarely have in the wild
link |
02:45:55.200
psychological experiments so it's cool. Plus everyone loves it, like sports is really cool to
link |
02:46:01.200
analyze. People actually care about the results. Yeah, I still think, well like I and I will
link |
02:46:09.360
definitely publish this work on Messi versus Ronaldo and I've left to read it. Objective,
link |
02:46:14.320
fully objective. I'd love to peer review. Yeah, this is very true, this is not past peer review.
link |
02:46:21.920
Let me ask sort of an advice question to the young folks.
link |
02:46:29.360
You've explored a lot of fascinating ideas in your life. You built a startup, worked on physics,
link |
02:46:35.520
worked on computer science. What advice would you give to young people today in high school,
link |
02:46:40.800
maybe early college, about life, about career, about science and mathematics?
link |
02:46:46.640
I remember reading that Poincaré was once asked by a French journal about his advice for young
link |
02:46:59.440
people and what was his teaching philosophy and he said that one of the most important things
link |
02:47:04.960
that parents should teach their kids is how to be enthusiastic in regards to the mysteries of the
link |
02:47:11.600
world and that he said like striking that balance was actually one of the most important things
link |
02:47:16.720
between like in education. You want to have your kids be enthusiastic about the mysteries of the
link |
02:47:21.680
world but you also don't want to traumatize them like if you really force them into something
link |
02:47:25.520
and I think like especially if you're young, I think you should be curious and I think you should
link |
02:47:33.600
explore that curiosity to the fullest, to the point where you even become almost as an expert
link |
02:47:41.440
on that topic and you might start with something that it's small like you might start with
link |
02:47:48.000
you're interested in numbers and how to factor numbers into primes and then all of a sudden you
link |
02:47:52.400
go and you're like lost in number theory and you discover cryptography and then all of a
link |
02:47:57.920
sudden you're buying Bitcoin and I think you should do this. You should really try to fulfill
link |
02:48:04.160
this curiosity and you should live in a society that allows you to fulfill this curiosity which
link |
02:48:08.960
is also important and I think you should do this not to get to some sort of status or fame or money
link |
02:48:14.960
but I think this is the way this iterative process, I think this is the way to find happiness
link |
02:48:20.800
and I think this also allows you to find the meaning for your life. I think it's all about
link |
02:48:26.960
like being curious and being able to fulfill that curiosity and that path to fulfilling that
link |
02:48:33.120
your curiosity. Yeah, the start small and let the fire build this kind of interesting way to
link |
02:48:38.640
think about it. And you never know where you're going to end up. For us it's just a really good
link |
02:48:44.960
example. We started by doing this as an internal thing that we did in the company and then we
link |
02:48:51.440
started putting out there and now a lot of people follow it and know about it. And you still don't
link |
02:48:56.800
know where Fermara's library is going to end up actually. True, exactly. So yeah, I think that
link |
02:49:02.400
would be my piece of advice with very limited experience of course but yeah, I agree. I agree.
link |
02:49:10.560
I mean is there something in from particular as you're all from the computer science versus
link |
02:49:14.960
physics perspective? Do you regret not doing physics? Do you regret not doing computer science?
link |
02:49:21.280
Which one is the wiser, the better human being? This is messy versus Ronaldo. Those are very,
link |
02:49:29.040
I don't know if you would agree but they're kind of different disciplines.
link |
02:49:33.120
True, yeah, very much so. Actually, I had that question in my mind. I took physics classes
link |
02:49:43.440
as an undergrad or like besides what I had to take and it's definitely something that I considered
link |
02:49:50.800
at some point. And I do feel like later in life that might be something that I'm not sure if
link |
02:50:02.880
regret is the right word but it's kind of something that I can imagine in an alternative
link |
02:50:08.960
universe what would have happened if I had gone into physics. I try to think that like,
link |
02:50:17.040
well, it depends on what your path ends up being but that it's not super important, right? Like
link |
02:50:24.080
exactly what you decide to major on. I think Tim Urban, the blogger, had a good visualization
link |
02:50:33.120
of this where it's like, he has a picture where you have all sorts of paths that he could pursue
link |
02:50:39.360
in your life and then maybe you're in the middle of it and so there's maybe some paths that are
link |
02:50:43.440
not accessible to you but like the tree that is still in front of you gives you a lot of
link |
02:50:47.840
optionality and so there's still lessons to learn from that. Like we have a huge number of options
link |
02:50:53.200
now and probably you're just one to reflect. Like to try to derive wisdom from the one little
link |
02:51:02.720
path you've taken so far may be flawed because there's all these other paths you could have taken.
link |
02:51:07.040
Yeah. So it's like, so one, it's inspiring that you can take any path now and two,
link |
02:51:13.120
it's like the path you've taken so far is just one of many possible ones but it does seem that like
link |
02:51:20.400
physics and computer science both open a lot of doors and a lot of different doors.
link |
02:51:24.960
True. It's very interesting. It is. In this case and especially in our case because I could see the
link |
02:51:31.600
difference, I studied, I went to college in Europe and João went to college here in the US so I could
link |
02:51:38.000
see the difference and like the European system is more rigid in the sense that when you decide to
link |
02:51:43.920
study physics you don't have a lot, especially in the early years, you don't have a lot of,
link |
02:51:48.400
you can't choose to take like a class from like computer science course or something like that,
link |
02:51:52.880
don't have a lot of freedom to explore in that sense in university as opposed to here in the US
link |
02:51:57.120
where you have more freedom. And I think that's important. I think that's what constitutes a
link |
02:52:04.240
good kind of educational system is one that gravitates towards the interests of a student
link |
02:52:09.600
as you progress. But I think in order for you to do that, you need to explore different areas.
link |
02:52:14.960
And I felt like if I had a chance to take say more computer science class when I was in college,
link |
02:52:20.000
I would have probably have taken those classes but yeah, but I ended up like focusing maybe too
link |
02:52:25.280
too much in physics and I think here at least my perception is that you can explore more fields.
link |
02:52:32.960
But there is a kind of, it's funny, but physics can be difficult. So I don't see too many computer
link |
02:52:39.920
science people then exploring into physics. It's like the one, not the one, but one of the beneficial
link |
02:52:47.920
things of physics, it feels like it, was it Rutherford that said like basically that physics is
link |
02:52:57.440
the hard thing and everything is easy. So like there's a certain sense once you've figured out
link |
02:53:02.160
some basic like physics that it's not that you need the tools of physics to understand the other
link |
02:53:07.840
disciplines, it's that you're empowered by having done difficult shit. I mean, the ultimate I think
link |
02:53:13.120
is probably mathematics there. Yeah, true. So maybe just doing difficult things and proving to
link |
02:53:20.080
yourself that you can do difficult things, whatever those are. That's not positive, I believe.
link |
02:53:23.840
Not positive. Yeah. And I think like before I started a company, I worked in the financial
link |
02:53:30.640
sector for a bit. And like I think having a physics background, I felt I was not afraid of
link |
02:53:36.240
like learning like finance things. And I think like when you come from those backgrounds, you are
link |
02:53:41.280
generally not afraid of stepping into other fields and learning about those because
link |
02:53:46.640
yeah, I feel we've learned a lot of difficult things and yeah, that's an added benefit, I believe.
link |
02:53:54.000
This was an incredible conversation, Luis, Joao. We started with, who do we start with? Feynman
link |
02:54:01.280
ended up with Messi and Ronaldo. So this is like the perfect conversation. It's really an honor
link |
02:54:05.920
that you guys would waste all this time with me today. It was really fun. Thanks for talking.
link |
02:54:09.920
Thank you so much for having us. Yeah, thank you so much. Thanks for listening to this conversation
link |
02:54:14.640
with Luis and Joao Batala. And thank you to Skiff, Simply Safe, Indeed, NetSuite, and ForSigmatic.
link |
02:54:22.640
Check them out in the description to support this podcast. And now let me leave you with some words
link |
02:54:27.840
from Richard Feynman. Nobody ever figures out what life is all about. And it doesn't matter.
link |
02:54:33.280
Explore the world. Nearly everything is really interesting if you go into it deeply enough.
link |
02:54:39.120
Thank you for listening. I hope to see you next time.