登录
首页 » matlab » WindyGridWorldQLearning

WindyGridWorldQLearning

于 2013-04-19 发布 文件大小:2KB
0 228
下载积分: 1 下载次数: 31

代码说明:

  Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states. This paper presents and proves in detail a convergence theorem for Q,-learning based on that outlined in Watkins (1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed each iteration, rather than just one.

文件列表:

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • MATLAB_JiaoCheng
    自己一直用的而且很不错的matlab教程.适合入门.(And he has been very good with the matlab tutorial. For entry.)
    2011-04-26 16:00:48下载
    积分:1
  • 70170707_LYpack emd
    MATLAB官网的EMD原始程序,里面包含了C语言的emd程序代码,经过验证该程序可以使用,实现了emd分解。(The official website of the MATLAB EMD original program, which contains the C language EMD code, after verification of the procedure can be used to achieve the EMD decomposition.)
    2021-02-03 01:10:00下载
    积分:1
  • abc_report
    说明:  Matlab 关于混沌的一个gui编程简介(Matlab Adventures in Bifurcations & Chaos" (ABC++) is a GUI application, which runs in a Matlab environment and has been designed for the exploration of bifurcations and chaos within the Chua Circuit paradigm.)
    2010-04-05 17:36:45下载
    积分:1
  • Knn
     MATLAB 聚类和近邻搜索模块代码如下(MATLAB clustering and nearest neighbor search module code is as follows)
    2011-04-28 22:48:22下载
    积分:1
  • 3
    说明:  例题全部代码. 薛定宇, 陈阳泉. 高等应用数学问题的Matlab求解.(All the code examples. Xue Dingyu, Chen Yangquan. MATLAB to solve higher applied mathematical problems.)
    2011-04-29 10:13:28下载
    积分:1
  • viterbi_deCODE
    卷积码的维特比译码仿真,主要是仿真误码率曲线。(Convolutional codes Viterbi decoding simulation, mainly simulation BER curves.)
    2010-05-25 22:16:24下载
    积分:1
  • The-application-of-Copula
    copula函数在海洋中的应用,并且举出了理论和实例,有一定参考作用(The application of Copula)
    2013-08-07 15:00:37下载
    积分:1
  • permutationica
    考虑排序问题的盲卷积分离模型算法程序,实现盲源分离。(Consider the scheduling problem of blind separation model convolution algorithm program to achieve blind source separation.)
    2010-08-30 10:13:41下载
    积分:1
  • data_analysis
    matlab 资料,具体是关于数据分析的matlab功能函数,方法的介绍(matlab data, specific data analysis on the matlab function, methods of introducing)
    2009-05-01 17:06:32下载
    积分:1
  • Kalman.tar
    Kalman filter related Matlab source codes
    2012-04-23 12:53:25下载
    积分:1
  • 696516资源总数
  • 106913会员总数
  • 8今日下载