by Tom Everitt 9 days ago

That is a good question. I don’t think it is essential that the agent can move from $$s_1$$ to $$s_2$$, only that the agent is able to force a stay in $$s_2$$ if it wants to. The transition from $$s_1$$ to $$s_2$$ could instead happen randomly with some probability. The important thing is that the human’s action in $$s_1$$ does not reveal any information about $$s_2$$.

