fix: typo
This commit is contained in:
parent
81dbb4ad1e
commit
417303cf0b
|
@ -1,16 +1,15 @@
|
||||||
## Thoughts on the "evolutionary anchor" of Aleja Cotra's report
|
A concern about the "evolutionary anchor" of Ajeya Cotra's report
|
||||||
|
=============================================================
|
||||||
|
|
||||||
![](https://upload.wikimedia.org/wikipedia/commons/a/ae/Darwin%27s_finches_by_Gould.jpg)
|
<p><img src="https://upload.wikimedia.org/wikipedia/commons/a/ae/Darwin%27s_finches_by_Gould.jpg" class="img-medium-center"></p>
|
||||||
|
|
||||||
Within Effective Altruism, Aleja Cotra's report on artificial general intelligence (AGI) timelines has been influential in justifying or convincing members and organizations to work on AGI safety.
|
Within Effective Altruism, Ajeya Cotra's report on artificial general intelligence (AGI) timelines has been influential in justifying or convincing members and organizations to work on AGI safety. The report has a section on the "evolutionary anchor", i.e., an upper bound on how much compute it would take to reach artificial general intelligence. The section can be found in pages 24-28 of [this Google doc](https://docs.google.com/document/d/1k7qzzn14jgE-Gbf0CON7_Py6tQUp2QNodr_8VAoDGnY/edit#heading=h.gvc1xyxlemkd). As a summary, in the report's own words:
|
||||||
|
|
||||||
The report has a section on the "evolutionary anchor", i.e., an upper bound on how much compute it would take to reach artificial general intelligence. The section can be found in pages 24-28 of [this Google doc](https://docs.google.com/document/d/1k7qzzn14jgE-Gbf0CON7_Py6tQUp2QNodr_8VAoDGnY/edit#heading=h.gvc1xyxlemkd). As a summary, in the report's own words:
|
|
||||||
|
|
||||||
> This hypothesis states that we should assume on priors that training computation requirements will resemble the amount of computation performed in all animal brains over the course of evolution from the earliest animals with neurons to modern humans, because we should expect our architectures and optimization algorithms to be about as efficient as natural selection.
|
> This hypothesis states that we should assume on priors that training computation requirements will resemble the amount of computation performed in all animal brains over the course of evolution from the earliest animals with neurons to modern humans, because we should expect our architectures and optimization algorithms to be about as efficient as natural selection.
|
||||||
|
|
||||||
This anchor isn't all that important in the report's own terms: it only gets a 10% probability assigned to it in the final weighted average. But this bound is personally important to be because I do buy that if you literally reran evolution, or if you use as much computation as evolution, you would have a high chance of producing something as intelligent as humans, and so I think that it is particularly forceful as an "outside view".
|
This anchor isn't all that important in the report's own terms: it only gets a 10% probability assigned to it in the final weighted average. But this bound is personally important to be because I do buy that if you literally reran evolution, or if you use as much computation as evolution, you would have a high chance of producing something as intelligent as humans, and so I think that it is particularly forceful as an "outside view".
|
||||||
|
|
||||||
However, I'm not sure I buy some of the details of how the author arrives at this estimate:
|
However, I'm don't buy the details of how the author arrives at this estimate:
|
||||||
|
|
||||||
> The amount of computation done over evolutionary history can roughly be approximated by the following formula: (Length of time since earliest neurons emerged) \* (Total amount of computation occurring at a given point in time). My rough best guess for each of these factors is as follows:
|
> The amount of computation done over evolutionary history can roughly be approximated by the following formula: (Length of time since earliest neurons emerged) \* (Total amount of computation occurring at a given point in time). My rough best guess for each of these factors is as follows:
|
||||||
|
|
||||||
|
@ -20,16 +19,51 @@ However, I'm not sure I buy some of the details of how the author arrives at thi
|
||||||
|
|
||||||
> This implies that the total amount of computation done over the course of evolution from the first animals with neurons to humans was (\~1e16 seconds) \* (\~1e25 FLOP/s) = **\~1e41 FLOP**
|
> This implies that the total amount of computation done over the course of evolution from the first animals with neurons to humans was (\~1e16 seconds) \* (\~1e25 FLOP/s) = **\~1e41 FLOP**
|
||||||
|
|
||||||
Unlike what the reader might suspect, I don't particularly want to take issue with the assumption that "...the total amount of brain computation occurring inside animals with very few neurons is roughly comparable to the amount of brain computation occurring inside the animals with the largest brains." I haven't looked at the literature on this, and the assumption seems a prima facie plausible.
|
Unlike what the reader might suspect, I don't particularly want to take issue with the assumption that *"...the total amount of brain computation occurring inside animals with very few neurons is roughly comparable to the amount of brain computation occurring inside the animals with the largest brains"*. I haven't looked at the literature on this, and the assumption seems a prima facie plausible.
|
||||||
|
|
||||||
Instead, I don't buy the assumption that to simulate evolution, it would be enough to simulate the behaviour of all the neurons throughout history. Instead, I think that one would also have to simulate the stimuli to which these neurons are exposed. For instance, in the case of AlphaGo, one not only has to simulate the inner workings of the model, but also the state of the Go board. Likewise, to simulate evolution, one would not only have to simulate the neurons of the brains in it, but also the state of the world in which they are.
|
Instead, I don't buy the assumption that to simulate evolution, it would be enough to simulate the behaviour of all the neurons throughout history. Instead, I think that one would also have to simulate the stimuli to which these neurons are exposed, and for this one. For instance, in the case of AlphaGo, one not only has to simulate the inner workings of the model, but also the state of the Go board. Likewise, to simulate evolution, one would not only have to simulate the neurons of the brains in it, but also the state of the world in which they are.
|
||||||
|
|
||||||
It's not clear to me how big a world would one need to spawn intelligence life. For instance, I think that Minecraft requires on the order of ~500 GFLOPs to run[^1]. Conversely, if we want a planet as large as Earth, with a complex atmosphere & so on, this would exceed the computing power of current supercomputers[^2].
|
It's not clear to me how big a world would one need to spawn intelligence life. For instance, I think that Minecraft requires on the order of ~500 GFLOPs to run[^1]. Conversely, if we want a planet as large as Earth, with a complex atmosphere & so on, this might vastly exceed the computing power of current supercomputers[^2]. Working through these details might be a neat research project, and perhaps a good fit for an organization such as [AI Impacts](https://aiimpacts.org/) or [Epoch](https://epochai.org/), though I'm not sure how many estimates other than my own it would change. In any case, this would involve:
|
||||||
|
|
||||||
Working through these details might be a neat research projects, though I'm not sure it would change many estimates other than my own. In any case, this would involve:
|
1. Coming up with estimates of what the least fine-grained world that we would expect might be able to produce intelligence life if we simulated natural selection in it.
|
||||||
|
|
||||||
1. Coming up with estimates of what the least fine-grained world that we would expect might be able to produce intelligence life due to evolution
|
|
||||||
2. Calculating how much compute it would take to in fact simulate it.
|
2. Calculating how much compute it would take to in fact simulate it.
|
||||||
|
|
||||||
[^1]: This page: <https://minecraft.fandom.com/wiki/Java_Edition_hardware_requirements> claims that recommended requirements are an Intel Core i5 for the CPU, and a Radeon 7 200 for the GPU. The Intel CPU has a processing speed of 37.73 GFLOPs (<https://setiathome.berkeley.edu/cpu_list.php>) and the Radeon of 467.2 GFLOPs (<https://en.wikipedia.org/wiki/Radeon_200_series#Radeon_R7_250>). I'm not sure what .
|
### Propagation of beliefs
|
||||||
[^2]: E.g., see [this article](https://www.reuters.com/article/us-britain-weather-computer-idUSKBN20B00U) on a £1.2 billion, 60 petaFLOPS supercomputer built solely for better atmospheric prediction/simulation. Simulating the rest of the world would require more computing power on top of that. I haven't really
|
|
||||||
|
After reading this, I think that the reader should:
|
||||||
|
|
||||||
|
1. Make AI timelines (slightly) wider to the right
|
||||||
|
2. Slightly reduce credence in the argument chain in [The Most Important Century](https://www.cold-takes.com/most-important-century/) series, which leans heavily on the argument that AGI will be developed this century.
|
||||||
|
3. Slightly increase credence that something like [The Patient Philanthropy Project](https://founderspledge.com/funds/patient-philanthropy-fund), which pretty much conditions on no AGI soon, is a good idea.
|
||||||
|
4. The meta update: about whether arguments in favor of shorter timelines are found more often, and about how intensely Open Philanthropy reports have been scrutinized.
|
||||||
|
|
||||||
|
I think that the first three updates should be relatively small, on the order of 0.1% to 10%:
|
||||||
|
|
||||||
|
- Something like 0.1% if you are keeping careful track of your AI timelines in a spreadsheet and don't particularly care about the evolutionary anchor, but want to update a bit to not fall prey to confirmation bias.
|
||||||
|
- Something like 10% if you are replacing the whole of Ajeya Cotra's weight on the previous evolutionary anchor timelines with some unspecified but much longer timelines, and if you greatly favored Cotra's estimate over [Carlsmith's](https://arxiv.org/pdf/2206.13353.pdf) or [Davidson's](https://www.openphilanthropy.org/research/semi-informative-priors-over-ai-timelines/)
|
||||||
|
- You could also have a much larger update if you more predominantly cared about the evolutionary anchor.
|
||||||
|
|
||||||
|
I haven't given much thought about how the meta update should look like.
|
||||||
|
|
||||||
|
### Discussion
|
||||||
|
|
||||||
|
Alex Guzey mentions:
|
||||||
|
|
||||||
|
> indeed by only considering neurons fired instead information observed by the organisms you probably save yourself orders of magnitude of computational requirements. But you don't know a priori how exactly to make these neurons fire without looking at all of the environment and doing computation over it.
|
||||||
|
|
||||||
|
In conversation with Misha Yagudin, he mentions:
|
||||||
|
|
||||||
|
1. that it is not clear whether the report is talking about "simulating all of evolution" when saying something like "we should expect our architectures and optimization algorithms to be about as efficient as natural selection."
|
||||||
|
2. That he personally expects AI to require less compute than evolution.
|
||||||
|
3. Than even if you have to simulate an environment, this could have comparable computational requirements to computing the neurons firing in the ML model. But it could also be much greater. "It’s important if the computational requirements are compatible or greater. If it’s 10% more than baseline, this is within model’s precision. If it’s ×1 — that’s concerning, if it’s ×10 it’s a mistake, ×10^10 — that’s a disaster."
|
||||||
|
|
||||||
|
I broadly agree with these points. On 3., I don't like to use the term "Knightian uncertainty", but I've given an overly broad range between "Minecraft" and "simulate all of the earth at similar levels of precision as we currently simulate the atmosphere".
|
||||||
|
|
||||||
|
## Acknowledgements
|
||||||
|
|
||||||
|
<p><img src="https://i.imgur.com/7yuRrge.png" class="img-frontpage-center"></p>
|
||||||
|
|
||||||
|
Thanks to Alexey Guzey and Misha Yagudin for reviewing this post. Otherwise, my work is generally supported by the [Quantified Uncertainty Research Institute](https://quantifieduncertainty.org/).
|
||||||
|
|
||||||
|
[^1]: [This page](<https://minecraft.fandom.com/wiki/Java_Edition_hardware_requirements>) claims that recommended requirements are an Intel Core i5 for the CPU, and a Radeon 7 200 for the GPU. The Intel CPU has a processing speed of [37.73 GFLOPs](<https://setiathome.berkeley.edu/cpu_list.php>) and the Radeon of [467.2 GFLOPs](<https://en.wikipedia.org/wiki/Radeon_200_series#Radeon_R7_250>). I'm not sure what the utilization of the GPU/CPU is, but in this case it doesn't matter much because the estimate is so low compared to the cost of computing neurons.
|
||||||
|
[^2]: E.g., see [this article](https://www.reuters.com/article/us-britain-weather-computer-idUSKBN20B00U) on a £1.2 billion, 60 petaFLOPS supercomputer built solely for better atmospheric prediction/simulation. Simulating the rest of the world below the atmosphere would require more computing power on top of that. I'm also not clear on what the resolution on the atmospheric grid which these supercomputers are simulated is.
|
||||||
|
|
Loading…
Reference in New Issue
Block a user