March 17 - 24 will be Existential Choices Debate Week on the EA Forum. We’ll be discussing the debate statement "On the margin[1], it is better to work on reducing the chance of our[2] extinction than increasing the value of futures where we survive[3]".
Add this and future events to your calendar.
Like the last two debate weeks (1, 2), during the event you’ll find a banner on the front page with an axis going from “strong disagree” to “strong agree”, where forum users can place their avatar and attach a comment explaining their position.
We’ll also be hosting a “symposium” with Will MacAskill — a chance to join in a live debate with experts on the issue. Provisionally, this will happen on the Monday of debate week, in the comments of a symposium post[4].
If you’d like to take part, you can start thinking about writing a post to be published during the week. Posts considered a part of the event don’t have to answer the question directly — the rule of thumb is that if a post could plausibly help someone decide how to vote, it’s relevant for the week.
As always, message me if you have any questions, or would benefit from writing support.
Why this topic?
In recent years, we’ve somewhat moved away from the longtermism label (I, at least, see it pop up far less often on the Forum). This is partly for reasons well phrased by Scott Alexander in his post "Long-Termism" [sic][5] vs. "Existential Risk".
A movement whose descriptive priorities can be explained as extinction risk reduction may as well just say that, rather than appealing to a philosophy which concludes that extinction risk reduction is important. However, if we don’t discuss longtermism, we also won’t discuss a range of potential projects not captured by common-sense morality or extinction reduction: projects that aim for trajectory change, or a better future.
Now, terms like “post-AGI-governance” are starting to pop up… attempts to seed projects which hope to improve the long-term future in ways other than merely (haha) ensuring that it exists.
This seems like a point when it is important to ask the question — should we be doing this now? Are there promising projects to be sought out, researched, or directly worked on, which are more important than extinction reduction? Is this area of research and action a tangent, or a necessity?
Clarifications
Extinction is far narrower than “existential risk”: The extinction of “earth-originating intelligent life” means a future devoid of any value which could have come from those lives. You can understand this as a future with 0 value[6]. But “existential risk” is a broader term which includes irreversibly bad futures — full of suffering, led by an immortal dictator, trapped in pre-industrial scarcity forever. The future’s value could be below 0. In this debate, the ‘value of the future’ side, not the ‘extinction’ side, incorporates these risks which don’t route through extinction.
We are treating extinction reduction and increasing the value of the future as mutually exclusive: In order to make this a debate of (at least) two sides, we are separating interventions which reduce the risk of extinction, and interventions that increase the value of the future, via means other than extinction reduction. Otherwise, a substantive position in the argument, that extinction risk reduction is the best way to increase the value of the future, would be recorded as a neutral vote, rather than a strong agree.
Tractability is a part of this conversation. Including “on the margin” in the debate statement means that we can’t avoid thinking about tractability — i.e., where extra effort would actually do the most good, today. This makes the debate harder, but more action-relevant. Remember though — even though the core debate question relies on claims about tractability, you can still write about anything that could meaningfully influence a vote.
Please do ask for clarification publicly below - I'll add to this section if multiple people are confused about something.
How to take part:
Vote and comment
The simplest way to contribute to the debate week (though you can make it as complex as you like) is voting on the debate week banner, and commenting on your vote, describing your position. This comment will be attached to your icon on the banner, but it’ll also be visible on the debate week discussion thread, which will look like this.
Post
Everyone is invited to write posts for debate week! All you need to do is tag them with the “Existential Choices Debate Week” tag. Posts don’t have to end with a clear position on the debate statement to be useful. They can also:
- Summarise other arguments and classify them.
- Bring up considerations which might influence someone’s position on the statement.
- Crosspost content from elsewhere which contributes to the debate.
Message me if you have questions or would like feedback (anything from “is this post suitable?” to “does this post make my point clearly enough?”)
Turn up for the Symposium
We’ll hold the “Symposium” on the Monday of debate week. It’ll (probably)[7] be a post, like an AMA post, where a conversation between Will MacAskill and other experts will happen in the comments. If you log onto the Forum at that time, you can directly take part in the debate as a commenter.
We reserve the right to announce different moderation rules for this conversation. For example, we’ll consider hiding comments that aren’t on topic to make sure the discussion stays valuable.
Further reading:
A helpful term, “Maxipok”, comes from this paper by Nick Bostrom. In it, he writes:
- “It may be useful to adopt the following rule of thumb for moral action; we can call it Maxipok: **Maximize the probability of an okay outcome** [bolding mine], where an “okay outcome” is any outcome that avoids existential disaster. At best, this is a rule of thumb, a prima facie suggestion, rather than a principle of absolute validity, since there clearly are other moral objectives than preventing terminal global disaster.”
Also:
- Toby Ord’s work modelling future trajectories, and providing a taxonomy of the different ways we might influence, or think we are influencing, the long-term future.
- A perhaps relevant reading list on the “long reflection” — a characterisation of a time in the future when questions of value are solved and consolidated so that the best decisions can be made.
- This post, which argues for the value and persistence of locking the future into a regime of “good reflective governance”, a state which, ideally, would lead to the best futures.
- Brian Tomasik’s essay, warning of astronomical future suffering risks, also known as S-risks.
- "The option value argument doesn't work when it's most needed" - a post which argues, considering s-risks, that at least some effort should be devoted to better futures, not just extinction-avoidance. I take this post as representing a disagree vote on this debate axis.
- A post which represents an even stronger disagree vote on our debate axis - arguing that suffering risks and the potential for changing the trajectory of the future mean that extinction-risk reduction is not the highest expected value path to longtermist impact.
If there are other posts you think more people should read, please comment them below. I might highlight them during the debate week, or before.
1. ^ ‘on the margin’ = think about where we would get the most value out of directing the next indifferent talented person, or indifferent funder.
2. ^ ‘our’ = earth-originating intelligent life (i.e. we aren’t just talking about humans, because most of the value in expected futures is probably in worlds where digital minds matter morally and are flourishing).
3. ^ Through means other than extinction risk reduction.
4. ^ This may change based on the preferences of the participants. More details will come soon.
5. ^ Sorry - I’m reading a book right now (The Power Broker) with some really snarkily placed [sic]s and I couldn’t help it. Longtermism is the philosophy; long-termism is the vibe of Long Now, The Long View, the Welsh Future Generations Commission, etc.
6. ^ Or very close to zero when compared to other future trajectories. For example, worlds where only a small population of intelligent life exists on earth for a relatively short time are often treated as extinction scenarios when compared to worlds where humans or their descendants occupy the galaxy.
7. ^ This depends on the preferences of the participants, which are TBC.
Great news!
> If there are other posts you think more people should read, please comment them below. I might highlight them during the debate week, or before.
I am in the process of publishing a series of posts ("Evaluating the Existence Neutrality Hypothesis") related to the theme of the debate ("Extinction risks" vs. "Alignment risks / Future value"). The series is about evaluating how to update on those questions given our best knowledge about potential space-faring civilizations in the universe.
I will aim to publish several of the remaining posts during the debate week.
I want to make salient these propositions, which I consider very likely:
Considerations about just our solar system or value realized this century miss the point, by my lights. (Even if you reject 3.)
Related:
Given 3, a key question is what can we do to increase P(optimonium | ¬ AI doom)?
For example:
(More precisely, we should talk about the expected fraction of resources that are optimonium rather than the probability of optimonium, but probability might be a fine approximation.)
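One minimal way to write that distinction, with F as a hypothetical label (not the comment's own notation) for the fraction of reachable resources that ends up as optimonium:

```latex
% Binary framing: probability that any optimonium is produced at all
\[ P(\text{optimonium} \mid \neg\,\text{AI doom}) \]

% Expectation framing: F \in [0,1] is the fraction of reachable
% resources that ends up as optimonium
\[ \mathbb{E}\!\left[ F \mid \neg\,\text{AI doom} \right] \]
```

If outcomes are close to all-or-nothing (F near 0 or 1 in almost every world), the probability is a good proxy for the expectation; otherwise the expectation is the quantity of interest.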
I really like that you've chosen this topic and think it's an important one! I wrote my MA Philosophy thesis on this (in 2019, now outdated).
I want to flag that I disagree with this framing, as it's very anthropocentric. There are futures in which we go extinct but that are nevertheless highly valuable (happy sentient AI spreading via VNM probes). Perhaps more empirically relevant, I expect almost all effects to go via making the transition to superintelligence go well, and the most distinct action is focusing on digital sentience (which has little effect on extinction risk and much effect on the value of the future).
Does footnote #2 on the debate statement cover this? "Our" and "We" are supposed to refer to "earth-originating intelligent life", so "happy sentient AI spreading via VNM probes" would be included.
[edit- I'm very open to rephrasing it if not]
Ah yeah that seems fine then! "Life" is an imprecise term and I'd prefer "sentience" or "sentient beings" but maybe I'm overdoing it
PS- would it still be worth sharing the thesis, or some thoughts from it? You could claim late draft amnesty if you'd like to post it without editing it :)
It's here!
I think reducing the risk of misaligned AI takeover looks like a pretty good usage of people on the margin. My guess is that misaligned AI takeover typically doesn't result in extinction in the normal definition of the term (killing basically all humans within 100 years). (Maybe I think the chance of extinction-defined-normally given AI takeover is 1/3.)
Thus, for me, the bottom line of the debate statement comes down to whether misaligned AI takeover which doesn't result in extinction-defined-normally actually counts as extinction in the definition used in the post.
I don't feel like I understand how the definition you give of "a future with 0 value" handles cases like:
"Misaligned AIs takeover and have preferences that on their own have ~0 value from our perspective. However, these AIs keep most humans alive out of a small amount of kindness and due to acausal trade. Additionally, lots of stuff happens in our lightcone which is good due to acausal trade (but this was paid for by some entity that shared our preferences). Despite this, misaligned AI takeover is actually somewhat worse (from a pure longtermist perspective) than life on earth being wiped about prior to this point, because aliens were about 50% likely to be able to colonize most of our lightcone (or misaligned AIs they create would do this colonization) and they share our preferences substantially more than the AIs do."
More generally, my current overall guess at a preference ordering is something like: control by a relatively enlightened human society that shares my moral perspectives (and has relatively distributed power) > human control where power is roughly as democratic as now > human dictator > humans are driven extinct but primates aren't (so probably other primates develop an intelligent civilization in like 10-100 million years) > earth is wiped out totally (no AIs and no chance for intelligent civilization to re-evolve) > misaligned AI takeover > earth is wiped out and there aren't aliens so nothing ever happens with resources in our lightcone > various s-risk scenarios.
What line here counts as "extinction"? Does moving from misaligned AI takeover to "human control where power is roughly as democratic as now" count as an anti-extinction scenario?
One key question for the debate is: what can we do / what are the best ways to "increas[e] the value of futures where we survive"?
My guess is it's better to spend most effort on identifying possible best ways to "increas[e] the value of futures where we survive" and arguing about how valuable they are, rather than arguing about "reducing the chance of our extinction [vs] increasing the value of futures where we survive" in the abstract.
I agree - this is what I mean by my clarification of the tractability point above. One of the biggest considerations for me personally in this debate is whether there are any interventions in the 'increasing the value of the future' field which are as robust in their value as extinction risk reduction.
I'm going to struggle to cast a meaningful vote on this, since I find the 'existential risk' terminology as used in the OP more confusing than helpful: e.g. it includes non-existential considerations, and in practice it excludes non-extinction catastrophes from a discussion they should very much be part of, in favour of work chosen on the heuristic-but-insufficient grounds of focusing on events that have maximal extinction probability (i.e. AI).
I've argued here that non-extinction catastrophes could be as or more valuable to work on than immediate extinction events, even if all we care about is the probability of very long-term survival. For this reason I actually find Scott's linked post extremely misleading, since it frames his priorities as 'existential' risk, then pushes people entirely towards working on extinction risk - and gives reasons that would apply as well to non-extinction GCRs. I gave some alternate terminology here, and while I don't want to insist on my own clunky suggestions, I wish serious discussions would be more precise.
According to my understanding, the last two posts in the "Further reading" section actually represent disagreement with the proposed debate statement, given their emphasis on s-risks.
Thanks for the close reading - wish I could say that had been a test. I'll edit it now :)