The Singularity Institute’s current best guess on what to do with a general AI is to have it implement humanity’s coherent extrapolated volition (CEV) - what we would want if we “knew more, thought faster, were more the people we wished we were, had grown up farther together; where the extrapolation converges rather than diverges, where our wishes cohere rather than interfere; extrapolated as we wish that extrapolated, interpreted as we wish that interpreted”. This is quite a mouthful.
To trade brevity for decreased accuracy, another way of saying the above is that we want an AI that represents the spirit of humanity’s desires rather than just the letter.
Is CEV democratic? Yes, but it is a representative democracy, where humanity is represented by the aggregate of its extrapolated volition.
There are four objections to CEV I generally hear, summarized as follows:
1. The devil’s pact objection. In fiction as well as in real life, great-sounding deals often have a hidden catch. Why should we expect this to be any different?
2. The fear of patriarchy objection. All the talk of self-improving general AI and its potential capabilities make people nervous because of the power asymmetry it implies.
3. The anti-AI objection. Many people take the line that machines should be mindless tools to serve humans, and never anything more.
4. The “I’m too special to be extrapolated” objection. Quite a few people have the idea that the human mind is too complex to ever be understood in any significant detail, much less be extrapolated accurately.
Because the question of what goal system to give the first general artificial intelligence is obviously a pretty big deal, all objections deserve to be heard and considered. There are probably others beyond the above four, but I wanted to focus on the obvious ones for now.
In my mind, all of the above objections are rooted in valid motivations, but none of them should be deal-breakers. I will briefly respond to the objections.
The devil’s pact objection requires that one deal participant (in this case, the AI) has an innate ill will towards the other deal participant (in this case, humanity). The AI would have to secretly want to screw us over from the get-go. But because general AI will be built from scratch, and is not likely, at least initially, to be heavily inspired by the human brain, there is no reason for us to postulate that this sort of behavior will be present. In terms of actual development concerns, AI programmers should be watchful as to whether “shortcuts”, like modeling an extrapolated humanity but not actually implementing its desires, generate just as much positive utility for the AI as what we would consider the “real deal” - making the real world a better place.
The fear of patriarchy objection stems largely from history, wherein all of the relevant actors were members of our unique species, for which power is proven to corrupt. Power corrupts humans for evolutionary reasons - if one is on top of the heap, one had better take advantage of the opportunity to reward one’s allies and punish one’s enemies. This is pure evolutionary logic and need not be consciously calculated. AIs, which can be constructed entirely without selfish motivations, can be immune to these tendencies. Insofar as significant power asymmetries in general bother people, this seems hard to avoid in the long term - technological development will lead to a diversity of possible beings, and with this diversity will inevitably come a diversity in levels of capability and intelligence.
The anti-AI objection is just anthropocentric. If human-level AI is possible, it will be created sooner or later. It’s in our best interests to admit this and try to ensure that AI is on our side. Anti-AI bias in this area is no different than the other unfortunate biases held throughout history against minorities.
The final objection has to do with the complexity of extrapolation. Believe it or not, we engage in extrapolations every day. We can’t fit realistic computational duplicates of the people we know in our heads, so we use abstract models that work well for many pragmatic purposes. In a CEV-implementing AI, the models used might be more detailed than those we use, but need not simulate every single atom of every single biopolymer to perform a tractable extrapolation.
Are there any other obvious objections people might have to CEV? Addressing these objections could help strengthen the idea.