Consider a modified version of the vacuum environment in Exercise vacuum-start-exercise, in which the agent is penalized one point for each movement.

Can a simple reflex agent be perfectly rational for this environment? Explain.
What about a reflex agent with state? Design such an agent.
How do your answers to 1 and 2 change if the agent’s percepts give it the clean/dirty status of every square in the environment?

Provide feedback

Saved searches