
WindyGridWorldQLearning

Published 2013-04-19 File size: 2 KB
Download points: 1 Downloads: 31

Code description:

  Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states. This paper presents and proves in detail a convergence theorem for Q-learning based on that outlined in Watkins (1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed each iteration, rather than just one.
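
  The incremental update described above can be sketched as tabular Q-learning on a windy gridworld. This is a minimal Python illustration, not the packaged MATLAB code: the grid size, wind strengths, start and goal cells, the reward of -1 per step, and the helper names (step, q_learning, greedy_path_length) are all assumptions, borrowed from the textbook windy gridworld of Sutton and Barto.

```python
import random

# Assumed layout: the textbook windy gridworld (Sutton & Barto), which may
# differ from what the packaged MATLAB code uses.
ROWS, COLS = 7, 10
WIND = [0, 0, 0, 1, 1, 1, 2, 2, 1, 0]          # upward push per column
START, GOAL = (3, 0), (3, 7)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]   # up, down, left, right


def step(state, action):
    """Apply the move plus the wind of the current column, clipped to the grid."""
    r, c = state
    dr, dc = ACTIONS[action]
    r2 = min(max(r + dr - WIND[c], 0), ROWS - 1)
    c2 = min(max(c + dc, 0), COLS - 1)
    nxt = (r2, c2)
    return nxt, -1.0, nxt == GOAL


def q_learning(episodes=500, alpha=0.5, gamma=1.0, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(ROWS) for c in range(COLS)}
    for _ in range(episodes):
        s, done = START, False
        while not done:
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))                       # explore
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q[s][i])   # exploit
            s2, reward, done = step(s, a)
            # Bootstrap from the best next action; no bootstrap at the goal.
            target = reward if done else reward + gamma * max(q[s2])
            q[s][a] += alpha * (target - q[s][a])
            s = s2
    return q


def greedy_path_length(q, limit=100):
    """Follow the greedy policy from START; stop at GOAL or after `limit` steps."""
    s, steps = START, 0
    while s != GOAL and steps < limit:
        a = max(range(len(ACTIONS)), key=lambda i: q[s][i])
        s, _, _ = step(s, a)
        steps += 1
    return steps
```

  Calling q = q_learning() and then greedy_path_length(q) gives a rough check of how short a route the learned action-values produce. The epsilon-greedy policy is one simple way to keep sampling every action in every visited state, which is the condition the convergence result relies on.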

File list:

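Download note: Please do not use Xunlei (Thunder) to download. If a download fails, please try again; retries do not cost additional points.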


  • actuator11
    Another file of actuator example.
    Downloaded: 2014-01-16 17:33:33
    Points: 1
  • 8PSK
    MATLAB code for 8PSK modulation
    Downloaded: 2013-12-16 02:50:46
    Points: 1
  • paper.pdf
    A good example of image processing
    Downloaded: 2010-10-10 10:47:03
    Points: 1
  • Dijkstra
    Description: Dijkstra's algorithm implemented in MATLAB
    Downloaded: 2010-04-15 11:07:10
    Points: 1
  • adhe
    Edge detection in MATLAB
    Downloaded: 2012-12-30 14:12:48
    Points: 1
  • fwdlabprograms
    Optical fiber link budget analysis, PCM, and other programs written in MATLAB
    Downloaded: 2014-02-10 17:14:30
    Points: 1
  • als
    Solves a typical collaborative filtering problem with alternating least squares (ALS), including selection of the hyperparameters and the number of features on the training set
    Downloaded: 2021-03-15 19:19:22
    Points: 1
  • finite element code
    Description: Programs based on finite element theory, all of which run; also includes several numerical methods for reference
    Downloaded: 2020-06-21 02:20:02
    Points: 1
  • dss_1-0.tar
    A very good image denoising MATLAB code from abroad
    Downloaded: 2009-12-16 20:59:28
    Points: 1
  • DSP0404
    MATLAB digital signal processing code for simulating signal processing
    Downloaded: 2006-09-23 18:01:17
    Points: 1