-
WindyGridWorldQLearning
Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian
domains. It amounts to an incremental method for dynamic programming which imposes limited computational
demands. It works by successively improving its evaluations of the quality of particular actions at particular states.
This paper presents and proves in detail a convergence theorem for Q,-learning based on that outlined in Watkins
(1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions
are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions
to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed
each iteration, rather than just one.
- 2013-04-19 14:23:35下载
- 积分:1
-
cdpei
importannt orogram for frft
- 2013-09-22 17:25:03下载
- 积分:1
-
mohu
此为模糊PID自适应控制程序,可以用于仿真或者实验,经过本人验证可直接运行(This is a fuzzy adaptive PID control procedures can be used to simulate or experiment can be run directly after I verify)
- 2014-10-29 21:57:52下载
- 积分:1
-
fuhuaqiuji
MATLAB语言编写变步长复化梯形求积公式的算法程序(函数式M文件)(MATLAB language-based variable step-size re-trapezoidal quadrature formula of the algorithm program (functional M file))
- 2009-12-16 16:15:41下载
- 积分:1
-
Decoupling
Direct Torque Control, Induction Motor
Vector Control Without an Encoder dtc
- 2013-09-01 18:47:36下载
- 积分:1
-
gaussianfilter
The noises are one of the research topic in image processing. In digital images removal of gaussian noises using bilateral filter is performd in this file
- 2015-02-27 19:45:31下载
- 积分:1
-
bananaex
一个非高斯目标方法的例子,研究不同自适应方法(A non-Gaussian examples of the target method to study the different adaptive methods)
- 2012-04-18 09:51:25下载
- 积分:1
-
holospectrum-program
能够提取旋转机械信号特征的全息谱分析程序(matlab programs of holospectrum method for feature extraction of signals )
- 2011-07-20 20:05:25下载
- 积分:1
-
improve-leach
无线传感器网络leach的matlab仿真程序,有详细的仿真模块和程序说明。
- 2010-12-18 09:54:23下载
- 积分:1
-
imageprocess
该程序是在matlab环境下gui界面编写的,集成了图像处理的基本功能。图像大小变换,加噪去噪、滤波、插值、截图、旋转角度等功能(The program in matlab environment prepared by the gui interface, integrated image processing functions. The size of the image transformation, adding noise denoising, filtering, interpolation, screenshots, rotation angle)
- 2013-01-01 16:36:47下载
- 积分:1