Intelligent Agent Foundations Forumsign up / log in
Intertheoretic utility comparison: examples
discussion post by Stuart Armstrong 421 days ago | discuss

A previous post introduced the theory of intertheoretic utility comparison. This post will give examples of how to do that comparison, by normalising individual utility functions.

The methods

All methods presented here obey the axioms of Relevant data, Continuity, Individual normalisation, and Symmetry. Later, we’ll see which ones follow Utility reflection, Cloning indifference, Weak irrelevance, and Strong irrelevance.

Max, min, mean

The maximum of a utility function \(u\) is \(\max_{s\in \mathbb{S}} u(s)\), while the minimum is \(\min_{s\in \mathbb{S}} u(s)\). The mean of \(u\) \(\sum_{s\in \mathbb{S}} u(s)/||\mathbb{S}||\).

  • The max-min normalisation of \([u]\) is the \(u\in [u]\) such that the maximum of \(u\) is \(1\) and the minimum is \(0\).

  • The max-mean normalisation of \([u]\) is the \(u\in [u]\) such that the maximum of \(u\) is \(1\) and the mean is \(0\).

The max-mean normalisation has an interesting feature: it’s precisely the amount of utility that an agent completely ignorant of its own utility, would pay to discover that utility (as a otherwise the agent would employ a random, ‘mean’, strategy).

For completeness, there is also:

  • The mean-min normalisation of \([u]\) is the \(u\in [u]\) such that the mean of \(u\) is \(1\) and the minimum is \(0\).

Controlling the spread

The last two methods find ways of controlling the spread of possible utilities. For any utility \(u\), define the mean difference: \(\sum_{s,s'\in\mathbb{S}} |u(s)-u(s')|\). And define the variance: \(\sum_{s\in\mathbb{S}} (u(s)-\mu)^2\), where \(\mu\) is the mean defined previously.

These lead naturally to:

  • The mean difference normalisation of \([u]\) is the \(u\in [u]\) such that \(u\) has a mean difference of \(1\).

  • The variance normalisation of \([u]\) is the \(u\in [u]\) such that \(u\) has a variance of \(1\).

Properties

The different normalisation methods obey the following axioms:

Max-min

Max-mean

Mean-min

Mean difference

Variance

Utility reflection

YES

NO

NO

YES

YES

Cloning indifference

YES

NO

NO

NO

NO

Weak Irrelevance

YES

YES

YES

NO

YES

Strong Irrelevance

YES

YES

YES

NO

NO

As can be seen, max-min normalisation, despite its crudeness, is the only one that obeys all the properties. If we have a measure on \(\mathbb{S}\), then ignoring the cloning axiom becomes more reasonable. Strong irrelevance can in fact be seen as an anti-variance; it’s because of its second order aspect that it fails this.



NEW LINKS

NEW POSTS

NEW DISCUSSION POSTS

RECENT COMMENTS

I found an improved version
by Alex Appel on A Loophole for Self-Applicative Soundness | 0 likes

I misunderstood your
by Sam Eisenstat on A Loophole for Self-Applicative Soundness | 0 likes

Caught a flaw with this
by Alex Appel on A Loophole for Self-Applicative Soundness | 0 likes

As you say, this isn't a
by Sam Eisenstat on A Loophole for Self-Applicative Soundness | 1 like

Note: I currently think that
by Jessica Taylor on Predicting HCH using expert advice | 0 likes

Counterfactual mugging
by Jessica Taylor on Doubts about Updatelessness | 0 likes

What do you mean by "in full
by David Krueger on Doubts about Updatelessness | 0 likes

It seems relatively plausible
by Paul Christiano on Maximally efficient agents will probably have an a... | 1 like

I think that in that case,
by Alex Appel on Smoking Lesion Steelman | 1 like

Two minor comments. First,
by Sam Eisenstat on No Constant Distribution Can be a Logical Inductor | 1 like

A: While that is a really
by Alex Appel on Musings on Exploration | 0 likes

> The true reason to do
by Jessica Taylor on Musings on Exploration | 0 likes

A few comments. Traps are
by Vadim Kosoy on Musings on Exploration | 1 like

I'm not convinced exploration
by Abram Demski on Musings on Exploration | 0 likes

Update: This isn't really an
by Alex Appel on A Difficulty With Density-Zero Exploration | 0 likes

RSS

Privacy & Terms