Intelligent Agent Foundations Forumsign up / log in
My recent posts
discussion post by Paul Christiano 47 days ago | Ryan Carey, Jessica Taylor, Patrick LaVictoire, Stuart Armstrong and Tsvi Benson-Tilsen like this | discuss

Over at medium, I’m continuing to write about AI control; here’s a roundup from the last month.

Many of these seem like interesting things to discuss here; would it be better to post each of these as a link when I write it?

Strategy

  • Prosaic AI control argues that AI control research should first consider the case where AI involves no “unknown unknowns.”
  • Handling destructive technology tries to explain the upside of AI control, if we live in a universe where we eventually need to build a singleton anyway.
  • Hard-core subproblems explains a concept I find helpful for organizing research.

Building blocks of ALBA

Terminology and concepts



NEW LINKS

NEW POSTS

NEW DISCUSSION POSTS

RECENT COMMENTS

I agree that the epistemic
by Tsvi Benson-Tilsen on Open problem: thin logical priors | 0 likes

A very similar idea is
by Paul Christiano on Online Learning 1: Bias-detecting online learners | 0 likes

I think the fact that traders
by Paul Christiano on Open problem: thin logical priors | 1 like

Prior to working more on
by Paul Christiano on Updatelessness and Son of X | 0 likes

It seems quite challenging to
by Vadim Kosoy on Towards learning incomplete models using inner pre... | 0 likes

> I see minimally
by Paul Christiano on My current take on the Paul-MIRI disagreement on a... | 0 likes

> If such a recipe existed
by Paul Christiano on My current take on the Paul-MIRI disagreement on a... | 0 likes

> My current estimate is that
by Paul Christiano on Towards learning incomplete models using inner pre... | 0 likes

Regarding exploration, I
by Vadim Kosoy on Towards learning incomplete models using inner pre... | 0 likes

If an AI causes its human
by Wei Dai on My current take on the Paul-MIRI disagreement on a... | 0 likes

This result features in the
by Ryan Carey on In memoryless Cartesian environments, every UDT po... | 0 likes

Cool! It seems to me that
by Paul Christiano on Towards learning incomplete models using inner pre... | 0 likes

I see what you're arguing.
by Jessica Taylor on Pursuing convergent instrumental subgoals on the u... | 0 likes

It's just meant to be a
by Jessica Taylor on My current take on the Paul-MIRI disagreement on a... | 0 likes

Thanks, I think I understand
by David Krueger on My current take on the Paul-MIRI disagreement on a... | 0 likes

RSS

Privacy & Terms (NEW 04/01/15)