陪你度过漫长岁月



Course Notes: UCL Reinforcement Learning

Posted on 2017-09-09 | Category: Course Notes
David Silver's reinforcement learning course at UCL

Mind Maps

Intro to RL


[Figure: mind map for Intro to RL]

MDP


[Figure: mind map for MDP]

Planning by DP


[Figure: mind map for Planning by DP]

Model-Free Prediction


[Figure: mind map for Model-Free Prediction]

Model-Free Control


[Figure: mind map for Model-Free Control]

Value Function Approximation


[Figure: mind map for Value Function Approximation]

Policy Gradient


[Figure: mind map for Policy Gradient]

Integrating Learning and Planning


[Figure: mind map for Integrating Learning and Planning]

Exploration and Exploitation


[Figure: mind map for Exploration and Exploitation]

References

  • Reinforcement learning course
  • Reinforcement learning textbook
Tags: Reinforcement Learning
© 2024 Criss