This article demonstrates my experience of training a reinforcement learning (RL) model on AWS.
Recently, I signed up for a free-tier account on AWS to delve deeper into their
various services, and that’s when I stumbled upon AWS DeepRacer—a thrilling car
racing game based on reinforcement learning. I found it interesting
and thought it was definitely worth a try, especially considering that new
users receive 10 free hours to train or evaluate models, along with 5GB of free
storage during the first month. Why not take advantage of this opportunity?
In DeepRacer, we have the opportunity to define our desired behavior and
indicate what is considered favorable or unfavorable to the car by creating a
Python function that returns a floating-point reward. This function serves as a
guiding principle, helping the car distinguish between right and wrong actions.
Additionally, we can configure training scenarios by selecting virtual tracks,
adjusting training durations, and fine-tuning the car’s steering and
acceleration sensitivity. During training, progress is constantly monitored,
measuring the percentage of the track completed before veering off-course. To
further assess the car’s performance, we can evaluate and monitor its behavior
through virtual evaluation videos after each training session.
Below is the python-based reward function I used:
You can see how the car performs during the evaluation phase: