data:image/s3,"s3://crabby-images/81b23/81b233d19098766e41bc80d4e8e64181d5feed2e" alt="Machine Learning with Swift"
Reinforcement learning
Reinforcement learning is special in the sense that it doesn't require a dataset (see the following diagram). Instead, it involves an agent who takes actions, changing the state of the environment. After each step, it gets a reward or punishment, depending on the state and previous actions. The goal is to obtain a maximum cumulative reward. It can be used to teach the computer to play video games or drive a car. If you think about it, reinforcement learning is the way our pets train us humans: by rewarding our actions with tail-wagging, or punishing with scratched furniture.
One of the central topics in reinforcement learning is the exploration-exploitation dilemma—how to find a good balance between exploring new options and using what is already known:
data:image/s3,"s3://crabby-images/4b4a9/4b4a9c49196fd3d57e54e45ca115889da15cb5c0" alt=""
Table 1.3: ML tasks:
data:image/s3,"s3://crabby-images/88569/8856991cdb5693a1f6d04f609ea14fb33800d925" alt=""