WindyGridWorldQLearning

Published 2013-04-19 · File size: 2 KB
Download points: 1 · Downloads: 31

Code description:

  Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states. This paper presents and proves in detail a convergence theorem for Q-learning based on that outlined in Watkins (1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed each iteration, rather than just one.
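
  The packaged MATLAB file is not shown on this page, so the following is only a minimal sketch of tabular Q-learning on a windy gridworld, written in Python for illustration. Everything in it is an assumption rather than the uploader's code: the 7x10 grid, the per-column wind strengths, the start and goal cells, and the reward of -1 per step follow the common textbook layout of this task, and the names `step`, `train`, and `greedy_path` are hypothetical. The update applied after each move is Q(s,a) ← Q(s,a) + α[r + γ max_a' Q(s',a') − Q(s,a)].

```python
import random

# Assumed textbook layout: 7 rows x 10 columns, wind pushes the agent upward.
ROWS, COLS = 7, 10
WIND = [0, 0, 0, 1, 1, 1, 2, 2, 1, 0]          # upward push per column
START, GOAL = (3, 0), (3, 7)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]   # up, down, left, right


def step(state, action):
    """Apply the move plus the wind of the current column, clipped to the grid."""
    row, col = state
    d_row, d_col = ACTIONS[action]
    new_row = min(max(row + d_row - WIND[col], 0), ROWS - 1)
    new_col = min(max(col + d_col, 0), COLS - 1)
    next_state = (new_row, new_col)
    return next_state, -1.0, next_state == GOAL


def greedy_action(q, state):
    """Index of the highest-valued action (first one wins on ties)."""
    values = q[state]
    return max(range(len(ACTIONS)), key=lambda a: values[a])


def train(episodes=3000, alpha=0.5, gamma=1.0, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(ROWS) for c in range(COLS)}
    for _ in range(episodes):
        state, done = START, False
        while not done:
            if rng.random() < epsilon:
                action = rng.randrange(len(ACTIONS))   # explore
            else:
                action = greedy_action(q, state)       # exploit
            next_state, reward, done = step(state, action)
            # Off-policy target: bootstrap from the best next action.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_path(q, max_steps=200):
    """Follow the greedy policy from START; stops at GOAL or after max_steps."""
    state, path = START, [START]
    for _ in range(max_steps):
        state, _reward, done = step(state, greedy_action(q, state))
        path.append(state)
        if done:
            break
    return path
```

  Calling `greedy_path(train())` returns the sequence of cells visited by the learned greedy policy. The continued epsilon-greedy exploration is what keeps every action sampled in every reachable state, which is the condition the convergence result above relies on.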


  • matlab1
    A small MATLAB program implementing the square-root method and the improved square-root method; suitable for study.
    2010-10-22 11:35:28 Download
    Points: 1
  • snake
    MATLAB version of the Snake game; a classic among classics, highly recommended.
    2009-04-25 12:18:17 Download
    Points: 1
  • ad_rec
    This paper investigates the behavior of one particular near-far resistant detector, the parallel fractional adaptive detector.
    2013-02-10 23:40:51 Download
    Points: 1
  • C_CMethod
    Computes the time delay and embedding dimension of a chaotic time series using this method.
    2012-08-29 11:25:22 Download
    Points: 1
  • ICPfull
    A complete MATLAB implementation of the ICP (iterative closest point) algorithm; powerful and well suited to matching data points.
    2011-11-15 13:40:05 Download
    Points: 1
  • ex3-network
    The code provides a matrix of point-to-point distances and computes the shortest path between the start and end points given as input.
    2013-11-23 02:19:50 Download
    Points: 1
  • binary
    C program for binary search.
    2015-03-06 14:58:59 Download
    Points: 1
  • gdqthresh
    An image segmentation method based on the transition region, written in MATLAB.
    2006-01-09 10:40:11 Download
    Points: 1
  • code
    MATLAB code for matrix multiplication.
    2013-02-02 21:36:14 Download
    Points: 1
  • Path_loss
    MATLAB path loss computation.
    2012-01-27 16:29:26 Download
    Points: 1