-
WindyGridWorldQLearning
Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian
domains. It amounts to an incremental method for dynamic programming which imposes limited computational
demands. It works by successively improving its evaluations of the quality of particular actions at particular states.
This paper presents and proves in detail a convergence theorem for Q-learning based on that outlined in Watkins
(1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions
are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions
to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed
each iteration, rather than just one.
- Downloaded: 2013-04-19 14:23:35
- Points: 1
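The incremental improvement of action-values described in the abstract above can be sketched as a tabular update. This is a minimal illustration, not the code in the WindyGridWorldQLearning package: the chain environment, the function names `q_learning_update` and `train_chain`, and the values of `alpha` and `gamma` are all assumptions chosen for the example.

```python
import random


def q_learning_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning update in place and return the new value."""
    best_next = max(q[next_state])  # value of the greedy action in the next state
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
    return q[state][action]


def train_chain(n_states=5, episodes=2000, alpha=0.1, gamma=0.9, seed=0):
    """Learn Q-values on a hypothetical chain (an assumption, not the windy grid world).

    Action 0 moves left, action 1 moves right. Reaching the rightmost
    state yields reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    goal = n_states - 1
    for _ in range(episodes):
        state = rng.randrange(goal)  # start in a random non-terminal state
        while state != goal:
            # Uniform random exploration, so every action is sampled in every state.
            action = rng.randrange(2)
            next_state = max(state - 1, 0) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0
            q_learning_update(q, state, action, reward, next_state, alpha, gamma)
            state = next_state
    return q
```

Calling `train_chain()` returns a table in which moving right should score higher than moving left in every non-terminal state, since the only reward lies at the right end of the chain.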
-
horner-algo
Horner's algorithm, implemented for use in MATLAB
- Downloaded: 2015-03-26 16:50:23
- Points: 1
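For readers unfamiliar with Horner's algorithm, the idea can be sketched in a few lines. This is an illustrative Python version, not the MATLAB code in the horner-algo package; the function name `horner` and the highest-degree-first coefficient order are assumptions made for the example.

```python
def horner(coeffs, x):
    """Evaluate a polynomial at x using Horner's rule.

    coeffs lists the coefficients from the highest degree down to the
    constant term (an assumed convention for this sketch).
    """
    result = 0
    for c in coeffs:
        # Fold in one coefficient per step: (...((a_n * x + a_{n-1}) * x + ...) + a_0
        result = result * x + c
    return result
```

For example, `horner([2, -6, 2, -1], 3)` evaluates 2x^3 - 6x^2 + 2x - 1 at x = 3 with three multiplications and three additions, giving 5.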
-
Steel-strip-mechanism-analysis_2
Steel wire motion analysis written in MATLAB; it analyzes the wire's velocity and acceleration characteristics.
- Downloaded: 2013-11-07 20:49:30
- Points: 1
-
SF_Romberg
MATLAB implementation of the Romberg algorithm.
- Downloaded: 2007-12-05 18:24:57
- Points: 1
-
GenerateCode
Automatically generates pseudo-random codes, plots the three generated sets of pseudo-random codes, and stores the three sets of pseudo-random values in a new matrix that is sent to the Workspace for convenient later use.
- Downloaded: 2009-11-08 11:02:17
- Points: 1
-
Advanced-PID-control
Complete source code from the companion CD for advanced PID control MATLAB simulation. Feel free to download, thank you.
- Downloaded: 2012-06-23 12:57:46
- Points: 1
-
WCDMA
WCDMA simulation with complete program source code, covering each functional module; runnable.
- Downloaded: 2013-05-13 19:19:59
- Points: 1
-
Getting-Started-with-HW
Hardware co-simulation on the MATLAB platform using ZedBoard, Zynq, and similar hardware. The article, Getting Started with HW, covers environment setup and debugging methods.
- Downloaded: 2014-01-10 17:35:15
- Points: 1
-
amam_ampm
Power amplifier nonlinearity simulation; the input is a BPSK signal and the amplifier is modeled as a Volterra model.
- Downloaded: 2021-04-16 09:48:53
- Points: 1
-
Mengtekaluo
Courseware on the Monte Carlo method. I read through it and found it decent; please share.
- Downloaded: 2010-09-08 17:14:11
- Points: 1