-
WindyGridWorldQLearning
Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian
domains. It amounts to an incremental method for dynamic programming which imposes limited computational
demands. It works by successively improving its evaluations of the quality of particular actions at particular states.
This paper presents and proves in detail a convergence theorem for Q,-learning based on that outlined in Watkins
(1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions
are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions
to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed
each iteration, rather than just one.
- 2013-04-19 14:23:35下载
- 积分:1
-
Appendix1B_K_cross_validation
sourse code for cross validation
- 2013-09-15 19:34:45下载
- 积分:1
-
qpsk
this is simple qpsk for matlab code
- 2013-09-11 09:56:11下载
- 积分:1
-
11111444
简易一级倒立摆MATLAB仿真,做的很完整可以当课设直接使用(MATLAB simulation of a simple pendulum, doing very complete set can be directly used as lesson)
- 2013-11-28 17:08:45下载
- 积分:1
-
FIR
1、系统仅处于训练状态。训练序列为2cos(2π×f1×n/fs),
干扰为sin(2π×f2×n/fs);序列长度为100个样点。
其中n=0…99,f1=1kHz,f2=2kHz,fs=8kHz。
2、采用LMS算法,均衡器输入为训练序列加上干扰,均衡器阶数为21阶,步长因子β=0.01。
3、在Matlab中编写浮点LMS算法,绘制均衡器的输出波形,并与训练序列进行对比。
(1, the system only in the training status. The training sequence is 2cos (2πf1n/fs), interference sin (2πf2n/fs) sequence length of 100 samples. Where n = 0 ... 99, f1 = 1kHz, f2 = 2kHz, fs = 8kHz. 2, using LMS algorithm, the input to the equalizer training sequence plus interference equalizer order is 21 bands, step factor β = 0.01. 3, the preparation of the floating-point LMS algorithm in Matlab, draw the output waveform equalizer, and with the training sequence comparison.)
- 2013-12-05 14:53:23下载
- 积分:1
-
NLM_for_timeSeries
ECG NML for timeseries
- 2014-11-05 20:13:10下载
- 积分:1
-
CHAZHI
插值;线性插值;样条插值;普通插值 ;编程(Interpolation one)
- 2015-01-18 10:40:45下载
- 积分:1
-
matalab-stbc
空时编码的matalb仿真程序,包含stbc(Space-time coding matalb simulation program, contains STBC)
- 2013-05-08 01:47:46下载
- 积分:1
-
kmeans
kmean函数详细说明,很不错哦!真的很好!(kmean function details, very good, oh!)
- 2010-08-13 20:18:54下载
- 积分:1
-
0914
stochastic diff equ, 随即微分方程的,比较使用(stochastic diff equ,)
- 2009-05-01 18:05:21下载
- 积分:1