Semi-TD-Agent-in-Random-walk-environment-

This project first applies semi-gradient TD with state aggregation to a policy evaluation task, then repeats the same evaluation using semi-gradient TD with a simple neural network.
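As a rough illustration of the first approach, the sketch below shows one semi-gradient TD(0) update under state aggregation. It is not taken from the project's code: the group count (`NUM_GROUPS = 10`), the step size, the discount, and the helper names `group_of` and `td_update` are all assumptions made for the example. With state aggregation the value estimate of a state is simply the weight of its group, so the gradient is a one-hot vector and only that one weight changes.

```python
NUM_STATES = 500   # stated in this README
NUM_GROUPS = 10    # assumption: 10 equal groups of 50 states each


def group_of(state):
    """Map a state in 1..NUM_STATES to its group index in 0..NUM_GROUPS-1."""
    return (state - 1) * NUM_GROUPS // NUM_STATES


def td_update(w, state, reward, next_state, terminal, alpha=0.1, gamma=1.0):
    """Apply one semi-gradient TD(0) update to the weight list w in place.

    alpha and gamma are illustrative defaults, not values from the project.
    Returns the TD error so the caller can inspect it.
    """
    v = w[group_of(state)]
    # The value of a terminal state is defined to be zero.
    v_next = 0.0 if terminal else w[group_of(next_state)]
    td_error = reward + gamma * v_next - v
    # The gradient of the aggregated value is one-hot on the state's group,
    # so only that group's weight is adjusted.
    w[group_of(state)] += alpha * td_error
    return td_error
```

The update is called "semi-gradient" because the bootstrapped target `reward + gamma * v_next` is treated as a constant: no gradient flows through `v_next`.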

The agent evaluates a fixed policy in the 500-State Random Walk environment, which has exactly 500 states. Each episode starts with the agent at the center and ends when the agent reaches state 1 (far left) or state 500 (far right). At each step, the agent moves left or right with equal probability, and the environment determines how far it moves in the chosen direction.
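The environment described above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the project's implementation: the start state (`START_STATE = 250`), the maximum jump length (`MAX_JUMP = 100`), the uniform choice of jump distance, and the rewards (-1 at the left end, +1 at the right end, 0 otherwise) are all assumptions, since this README does not specify them.

```python
import random

NUM_STATES = 500    # stated in this README
START_STATE = 250   # assumption: "the center" taken as state 250
MAX_JUMP = 100      # assumption: the environment moves the agent 1..100 states


def step(state, rng):
    """Take one random-walk step; return (next_state, reward, done)."""
    direction = rng.choice((-1, 1))        # left or right with equal chance
    distance = rng.randint(1, MAX_JUMP)    # distance is chosen by the environment
    # Clip to the valid range so the walk stops at the end states.
    next_state = min(max(state + direction * distance, 1), NUM_STATES)
    if next_state == 1:
        return next_state, -1.0, True      # assumed reward at the far left
    if next_state == NUM_STATES:
        return next_state, 1.0, True       # assumed reward at the far right
    return next_state, 0.0, False


def run_episode(seed=0):
    """Run one episode from the center; return (visited states, final reward)."""
    rng = random.Random(seed)
    state, done, reward = START_STATE, False, 0.0
    trajectory = [state]
    while not done:
        state, reward, done = step(state, rng)
        trajectory.append(state)
    return trajectory, reward
```

Calling `run_episode(seed)` with different seeds yields independent episodes whose transitions could be fed to a TD learner one step at a time.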

This project is based on the assignments from the Coursera Reinforcement Learning Specialization: https://www.coursera.org/specializations/reinforcement-learning
