登录
首页 » matlab » WindyGridWorldQLearning

WindyGridWorldQLearning

于 2013-04-19 发布 文件大小:2KB
0 229
下载积分: 1 下载次数: 31

代码说明:

  Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states. This paper presents and proves in detail a convergence theorem for Q,-learning based on that outlined in Watkins (1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed each iteration, rather than just one.

文件列表:

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • mirror HT
    对信号两端进行镜像延拓,可以用于消除Hilbert变换所带来的端点效应。(to extend both sides of signal)
    2009-11-05 21:31:22下载
    积分:1
  • framesubstract
    说明:  帧差法实现视频图像的读取和运动目标的检测。(The video frame difference method of reading and moving target detection.)
    2010-04-23 10:24:39下载
    积分:1
  • dip1
    image cropping in matlab a digital image processing project with .m files and presentation
    2011-05-27 18:30:34下载
    积分:1
  • matlabwavelet
    张德丰,《小波变换的matlab》,附带的程序,觉得这本书很好,很不错,值得参考(The wavelet transform matlab" , attached to the process, feel that this book very good, very good, it is also useful)
    2009-06-26 12:14:23下载
    积分:1
  • KNN
    本程序是用Matlab编写的用KNN算法解决文本分类问题的程序。(The program is written in Matlab using KNN algorithm to solve the text classification procedures.)
    2011-05-13 20:17:10下载
    积分:1
  • Fffuuzz_pidu
    模糊控制器:运用matlab建立复杂的模糊规则。既能开发程序源码码实现,也能使用matlab自带fuzzy工具箱完成。 可直接使用。 (Fuzzy controller: the use of Matlab to create complex fuzzy rules. Both the development of program source code implementation, can also use matlab own fuzzy toolbox to complete. Can be used directly.)
    2012-08-14 09:20:55下载
    积分:1
  • compressed--sensing-
    压缩感知算法,提出了一种新型的信道估计算法(compressed sensing although,A novel channel estimation algorithm is proposed)
    2014-02-19 15:12:54下载
    积分:1
  • MATLABrumem.zip
    介绍了MATLAB入门知识,里面有很多简单MATLAB程序,助于快速学习(Presented an introduction to MATLAB, there are many simple MATLAB program, help to quickly learn)
    2011-05-27 12:52:46下载
    积分:1
  • insdem06
    inertial navigation system 2
    2008-01-09 19:36:56下载
    积分:1
  • RBF
    Time series RBF predict
    2013-12-06 20:28:32下载
    积分:1
  • 696516资源总数
  • 106913会员总数
  • 8今日下载