登录
首页 » matlab » WindyGridWorldQLearning

WindyGridWorldQLearning

于 2013-04-19 发布 文件大小:2KB
0 168
下载积分: 1 下载次数: 31

代码说明:

  Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states. This paper presents and proves in detail a convergence theorem for Q,-learning based on that outlined in Watkins (1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed each iteration, rather than just one.

文件列表:

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • thinning
    细化,绝对管用,效果特别好特别好,是我自己一句一句写的,基于zhang算法细化,希望会大家有帮助,希望大家多多交流(absolutely useful!!!thank you for your communication!!)
    2020-10-24 14:20:01下载
    积分:1
  • xil
    MATLAB源码,P0201:MATLAB赋值 P0202:MATLAB中的for循环 P0203:MATLAB中的for循环和if条件 P0205:MATLAB图像处理的基本操作 P0206:MATLAB高级图像处理操作 P0207:根据RGB图像创建一幅灰度图像 P0208:二值图像的取反操作 P0209:用imshow函数显示图像 P0210:在同一个窗口内显示两幅图像(MATLAB source code, P0201 : MATLAB assignment P0202 : MATLAB for the cycle P0203 : MATLAB for the cycle, and if conditions P0205 : MATLAB image processing basic operation P0206 : High MATLAB image processing P0207 : According to RGB images to create a gray-scale image P0208 : binary images from the operation of the anti-P0209 : use imshow function displays images P0210 : in the same window shows two images)
    2006-12-09 20:22:53下载
    积分:1
  • ino.ent
    read text,image,audio and calculate entropy and information
    2011-10-19 01:46:20下载
    积分:1
  • MyGui.m
    This GUI will show normal,Rayleigh and rice distribution with the law of large number. It has an easy to understand interface and few parameters.
    2009-11-25 16:17:15下载
    积分:1
  • GA
    说明:  遗传算法的设计与实现 y=x.*sin(10*pi*x)+2 ( Genetic Algorithm Design and Implementation y=x.*sin(10*pi*x)+2 )
    2013-11-18 21:09:52下载
    积分:1
  • 01-07210
    Load balancing algorithm
    2012-10-05 15:27:39下载
    积分:1
  • SVM
    SVM MATLABSVM MATLABSVM MATLABSVM MATLAB(Support Vector Machine SVM and nuclear assembly MATLAB function)
    2009-04-18 16:24:09下载
    积分:1
  • Matlab7baseandimprove
    Matlab7基础与提高下载资料,很详细,书中的例程都包括在内了(Matlab7 base and improve the download information, very detailed, the book' s routines are included in)
    2009-09-25 00:28:52下载
    积分:1
  • Convey
    颜色空间转换。。。。。。。。。。。。。。。。。。。。。(Color space conversion)
    2011-05-10 22:31:23下载
    积分:1
  • RBF--and--BP
    用径向基神经网络和BP网络预测的源码,径向基神经网络优于BP网络预测。(With RBF neural network and BP network prediction source, RBF neural network is better than BP network prediction.)
    2014-12-22 22:39:40下载
    积分:1
  • 696518资源总数
  • 106148会员总数
  • 10今日下载