Читать книгу Artificial Intelligent Techniques for Wireless Communication and Networking - Группа авторов - Страница 19
i) Reward shaping
ОглавлениеFor faster learning, incentive shaping is a heuristic to change the reward of the task to ease learning. Reward shaping incorporates prior practical experience by providing intermediate incentives for actions that lead to the desired outcome. This approach is also used in deep reinforcement training to strengthen the learning process in environments with sparse and delayed rewards.