 | Acausal trade: introduction | | post by Stuart Armstrong 12 days ago | discuss | |
| I’ve never really understood acausal trade. So in a short series of posts, I’ll attempt to analyse the concept well enough that I can grasp it, and hopefully so that others can grasp it as well.
| |
|
 | CIRL Wireheading | | post by Tom Everitt 16 days ago | Abram Demski and Stuart Armstrong like this | 1 comment | |
| Cooperative inverse reinforcement learning (CIRL) generated a lot of attention last year, as it seemed to do a good job aligning an agent’s incentives with its human supervisor’s. Notably, it led to an elegant solution to the shutdown problem. | |
|
 | Infinite ethics comparisons | | post by Stuart Armstrong 18 days ago | 1 comment | |
| Work done with Amanda Askell; the errors are mine.
It’s very difficult to compare utilities across worlds with infinite populations. For instance, it seems clear that world \(w_1\) is better than \(w_2\), if the numbers indicate the utilities of various agents:
- \(w_1 = 1,0,1,0,1,0,1,0,1,0, \ldots\)
- \(w_2 = 1,0,1,0,0,1,0,0,0,1, \ldots\)
| |