
WindyGridWorldQLearning

Published 2013-04-19. File size: 2 KB
Download points: 1. Downloads: 31

Code description:

  Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states. This paper presents and proves in detail a convergence theorem for Q-learning based on that outlined in Watkins (1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many Q-values can be changed each iteration, rather than just one.
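
  The package's own MATLAB source is not shown on this page, so the following is only a minimal Python sketch of tabular Q-learning, not the uploaded code. It assumes the classic 7x10 windy gridworld layout from Sutton and Barto (wind strengths, start and goal cells), a reward of -1 per step, and epsilon-greedy exploration. The names `step`, `q_learning` and `greedy_path_length`, and the values of `alpha`, `gamma` and `epsilon`, are illustrative assumptions.

```python
import random

# Assumed layout: the classic 7x10 windy gridworld (Sutton and Barto).
ROWS, COLS = 7, 10
WIND = [0, 0, 0, 1, 1, 1, 2, 2, 1, 0]  # upward push applied in each column
START, GOAL = (3, 0), (3, 7)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Apply the move plus the wind of the current column, clamped to the grid."""
    row, col = state
    d_row, d_col = ACTIONS[action]
    new_row = min(max(row + d_row - WIND[col], 0), ROWS - 1)
    new_col = min(max(col + d_col, 0), COLS - 1)
    next_state = (new_row, new_col)
    return next_state, -1.0, next_state == GOAL  # reward is -1 per step


def q_learning(episodes=500, alpha=0.5, gamma=1.0, epsilon=0.1, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration (assumed hyperparameters)."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(ROWS) for c in range(COLS)}
    for _ in range(episodes):
        state, done = START, False
        while not done:
            if rng.random() < epsilon:
                action = rng.randrange(len(ACTIONS))  # explore
            else:
                action = max(range(len(ACTIONS)), key=lambda a: q[state][a])
            next_state, reward, done = step(state, action)
            # Off-policy target: bootstrap from the best action in the next state.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_path_length(q, max_steps=1000):
    """Follow the greedy policy from START; stop at GOAL or after max_steps."""
    state, steps = START, 0
    while state != GOAL and steps < max_steps:
        action = max(range(len(ACTIONS)), key=lambda a: q[state][a])
        state, _, _ = step(state, action)
        steps += 1
    return steps
```

  Calling `q = q_learning()` and then `greedy_path_length(q)` gives the number of steps the learned greedy policy takes from the start cell. Each update changes a single table entry, which matches the "just one Q-value per iteration" setting of the convergence result described above.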


  • VBEMGMM
    Variational Bayesian EM for Gaussian mixture models, written in MATLAB.
    2010-05-25 23:01:33 Download
    Points: 1
  • wavelet
    Performs a 3-level 2-D Mallat wavelet decomposition of a standard image to obtain the high- and low-frequency wavelet coefficients, reconstructs the image from the wavelet coefficients of the low-frequency components LL1, LL2 and LL3 in turn, and computes the PSNR of each of the three reconstructions against the original image.
    2013-05-16 22:07:13 Download
    Points: 1
  • IAA algorithm description
    Description: The IAA iterative adaptive estimation algorithm for space-time adaptive processing (STAP), which can be used to process echo data received by radar. A simple subroutine.
    2021-05-13 08:30:02 Download
    Points: 1
  • cdmamodem
    CDMA code in MATLAB: a CDMA receiver and transmitter.
    2009-06-29 19:36:40 Download
    Points: 1
  • fpga_matlab
    Description: Designing an FPGA by building a model in MATLAB; the article uses a system model.
    2008-11-22 10:02:32 Download
    Points: 1
  • MATLAB_SIMULINK_QDPSK
    Abstract: A general-purpose QDPSK function module library was written with the SIMULINK feature of the MATLAB platform, and several visual simulations were built on top of it. They show the working principle and advantages of digital communication, demonstrate how well SIMULINK serves for simulation, and provide a good software platform for further study of QDPSK digital communication systems.
    2010-05-24 12:40:26 Download
    Points: 1
  • shift_LDPC
    A good simulation program for LDPC encoding based on cyclic shift registers.
    2007-05-13 17:12:25 Download
    Points: 1
  • Shooting method for two-point boundary value problems of ODE systems (shoot)
    Solves two-point boundary value problems for systems of ordinary differential equations in MATLAB using the shooting method.
    2020-12-15 20:59:13 Download
    Points: 1
  • first
    Two-dimensional Fourier transform: applies 2-D Fourier transform processing to images. (Two-Dimensional Fast Fourier Transform. The purpose of this project is to develop a 2-D FFT program "package" that will be used in several other projects that follow. Your implementation must have the capabilities to: (a) Multiply the input image by (-1)^(x+y) to center the transform for filtering. (b) Multiply the resulting (complex) array by a real function (in the sense that the real coefficients multiply both the real and imaginary parts of the transforms). Recall that multiplication of two images is done on pairs of corresponding elements. (c) Compute the inverse Fourier transform. (d) Multiply the result by (-1)^(x+y) and take the real part. (e) Compute the spectrum.)
    2010-11-27 23:15:03 Download
    Points: 1
  • immerge
    This program fuses images obtained from different sensors to produce a better image.
    2010-05-14 22:09:29 Download
    Points: 1