登录
首页 » matlab » WindyGridWorldQLearning

WindyGridWorldQLearning

于 2013-04-19 发布 文件大小:2KB
0 154
下载积分: 1 下载次数: 31

代码说明:

  Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states. This paper presents and proves in detail a convergence theorem for Q,-learning based on that outlined in Watkins (1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed each iteration, rather than just one.

文件列表:

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • 2
    说明:  随机序列 实现随机序列 Random sequence (Random sequence )
    2011-12-05 19:07:45下载
    积分:1
  • 11
    说明:  思源科技出版 matlab基础与提高 11章 源代码(Springsoft matlab basis and published the source code to improve Chapter 11)
    2010-04-24 11:57:54下载
    积分:1
  • blueteeth
    台湾人写的GFSK调制和解调,非常清晰,便于学习(Taiwanese write GFSK modulation and demodulation, very clear, easy to learn)
    2014-10-17 17:49:48下载
    积分:1
  • FSM
    使用频域平滑的方法来实现信号循环谱估计的算法(The smooth frequency domain signal cyclic spectrum estimation algorithm)
    2013-04-01 16:58:59下载
    积分:1
  • digital_modulation
    非常好的一个MATLAB GUI学习实例!!!(Very good example of a MATLAB GUI learning! ! !)
    2010-09-11 15:49:48下载
    积分:1
  • Full-Face-Detector
    说明:  网上下载的外国的人脸检测程序,不知道有没有,传上来与大家分享(Download the foreign face detection program, do not know not, pass up and share)
    2011-03-22 09:34:46下载
    积分:1
  • 090403excelmatlab
    用MATLAB写的关于其中的数据保存到excel中的m文件程序。(Writing about the use of MATLAB data into one of the excel document in the process m.)
    2009-04-03 09:03:26下载
    积分:1
  • omp
    典型的正交匹配追踪OMP算法的matlab源程序,结构合理,简单易懂(Typical orthogonal matching pursuit OMP algorithm matlab source, reasonable structure, easy to understand)
    2013-07-14 21:43:58下载
    积分:1
  • JIAQUAN32N
    文章在分析灰色线性回归组合预测模型基本原理的基础上,利用MATLAB强大的矩阵运算功能,使MATLAB实现加权预测模型算法,作者通过实例分析验证了该程序的准确性和可靠性.(This paper analyzes the combination of gray linear regression prediction model based on the basic principle, the use of the powerful MATLAB matrix calculation function, so MATLAB algorithm to achieve weighted prediction model, the authors analyzed to verify the accuracy and reliability of the program by example.)
    2014-01-13 09:23:42下载
    积分:1
  • C56766
    Forensic science it the study of objects that relate to a crime. Forensic scientists study evidence so that it can be used in a court of law. Forensic scientists observe, classify, compare, use numbers, measure, predict, interpret data, and draw inferences based on evidence. In this lesson, students will use these science inquiry skills.In this dental image was classify by sift algorithm
    2014-01-22 15:16:09下载
    积分:1
  • 696518资源总数
  • 105877会员总数
  • 14今日下载