suntton-RL-book-demo
代码说明:
sutton强化学习书籍的所有matlab例子,学习很有用,不易找到(all matlab demo about sutton s book for reinforcement learning)
文件列表:
suntton强化学习书籍所有代码
...........................\Chapter 2 (Evaluative Feedback)
...........................\...............................\binary_bandit_exps.m,5861,2010-09-28
...........................\...............................\binary_bandit_exps_Script.m,791,2010-09-28
...........................\...............................\exercise_2_11.m,5069,2010-09-28
...........................\...............................\exercise_2_11_Script.m,590,2010-09-28
...........................\...............................\exercise_2_5.m,4401,2010-09-28
...........................\...............................\exercise_2_7.m,4056,2010-09-28
...........................\...............................\exercise_2_7_Script.m,1159,2010-09-28
...........................\...............................\n_armed_testbed.m,4603,2010-09-28
...........................\...............................\n_armed_testbed_softmax.m,5122,2010-09-28
...........................\...............................\opt_initial_values.m,3739,2010-09-28
...........................\...............................\opt_initial_values_Script.m,777,2010-09-28
...........................\...............................\persuit_method.m,5405,2010-09-28
...........................\...............................\persuit_method_Script.m,592,2010-09-28
...........................\...............................\reinforcement_comparison_methods.m,4663,2010-09-28
...........................\...............................\reinforcement_comparison_methods_Script.m,887,2010-09-28
...........................\...............................\sample_discrete.m,962,2010-09-28
...........................\Chapter 3 (The Reinforcement Learning Problem)
...........................\..............................................\rr_action_bellman.m,2712,2010-09-28
...........................\..............................................\rr_state_bellman.m,2083,2010-09-28
...........................\Chapter 4 (Dynamic Programming)
...........................\...............................\cmpt_P_and_R.m,1732,2010-09-28
...........................\...............................\ex_4_2_sys_solv.m,301,2010-09-28
...........................\...............................\ex_4_5_policy_evaluation.m,2771,2010-09-28
...........................\...............................\ex_4_5_policy_improvement.m,3893,2010-09-28
...........................\...............................\ex_4_5_rhs_state_value_bellman.m,1422,2010-09-28
...........................\...............................\ex_4_5_Script.m,3162,2010-09-28
...........................\...............................\gam_rhs_state_bellman.m,1031,2010-09-28
...........................\...............................\gam_Script.m,2785,2010-09-28
...........................\...............................\iter_poly_gw_inplace.m,4302,2010-09-28
...........................\...............................\iter_poly_gw_not_inplace.m,4500,2010-09-28
...........................\...............................\jcr_example.m,2290,2010-09-28
...........................\...............................\jcr_policy_evaluation.m,2679,2010-09-28
...........................\...............................\jcr_policy_improvement.m,3085,2010-09-28
...........................\...............................\jcr_rhs_state_value_bellman.m,1020,2010-09-28
...........................\Chapter 5 (Monte Carlo Methods)
...........................\...............................\cmpt_bj_value_fn.m,3288,2010-09-28
...........................\...............................\determineReward.m,477,2010-09-28
...........................\...............................\ex_5_4_Script.m,4422,2010-09-28
...........................\...............................\gen_rt_episode.m,4947,2010-09-28
...........................\...............................\handValue.m,527,2010-09-28
...........................\...............................\init_unif_policy.m,954,2010-09-28
...........................\...............................\mcEstQ.m,1365,2010-09-28
...........................\...............................\mc_es_bj_Script.m,5642,2010-09-28
...........................\...............................\mk_rt.m,558,2010-09-28
...........................\...............................\rt_pol_mod.m,1533,2010-09-28
...........................\...............................\shufflecards.m,288,2010-09-28
...........................\...............................\soft_policy_bj_Script.m,7929,2010-09-28
...........................\...............................\stateFromHand.m,754,2010-09-28
...........................\...............................\velState2PosActions.m,2140,2010-09-28
...........................\Chapter 6 (Temporal Difference Learning)
...........................\........................................\cmpt_arms_err.m,799,2010-09-28
...........................\........................................\eg_6_2_learn.m,1675,2010-09-28
...........................\........................................\eg_rw_batch_learn.m,4020,2010-09-28
...........................\........................................\learn_cw.m,6125,2010-09-28
...........................\........................................\learn_cw_Script.m,2556,2010-09-28
...........................\........................................\mk_arms_error_plt.m,1131,2010-09-28
...........................\........................................\mk_batch_arms_error_plt.m,918,2010-09-28
...........................\........................................\mk_fig_6_6.m,772,2010-09-28
...........................\........................................\plot_cw_policy.m,1899,2010-09-28
...........................\........................................\plot_gw_policy.m,2150,2010-09-28
...........................\........................................\run_all_gw_Script.m,1172,2010-09-28
...........................\........................................\R_learn_acq.m,3888,2010-09-28
...........................\........................................\R_learn_acq_Script.m,1829,2010-09-28
...........................\........................................\wgw_w_kings.m,5066,2010-09-28
...........................\........................................\wgw_w_kings_n_wind.m,4668,2010-09-28
...........................\........................................\wgw_w_kings_n_wind_Script.m,1521,2010-09-28
...........................\........................................\wgw_w_kings_Script.m,1499,2010-09-28
...........................\........................................\wgw_w_stoch_wind.m,4981,2010-09-28
...........................\........................................\wgw_w_stoch_wind_Script.m,1632,2010-09-28
...........................\........................................\windy_gw.m,4044,2010-09-28
...........................\........................................\windy_gw_Script.m,1491,2010-09-28
...........................\Chapter 7 (Eligibility Traces)
...........................\..............................\eg_7_5_episode.m,1285,2010-09-28
...........................\..............................\eg_7_5_learn_at.m,1783,2010-09-28
...........................\..............................\eg_7_5_learn_rt.m,1777,2010-09-28
...........................\..............................\eg_7_5_Script.m,1718,2010-09-28
...........................\..............................\gw_w_et.m,4619,2010-09-28
...........................\..............................\gw_w_et_Script.m,1693,2010-09-28
...........................\..............................\rw_accumulating_vs_replacing_Script.m,2480,2010-09-28
...........................\..............................\rw_episode.m,1218,2010-09-28
...........................\..............................\rw_offline_ntd_learn.m,2070,2010-09-28
...........................\..............................\rw_offline_ntd_learn_Script.m,1947,2010-09-28
...........................\..............................\rw_offline_tdl_learn.m,2618,2010-09-28
...........................\..............................\rw_offline_tdl_learn_Script.m,2205,2010-09-28
...........................\..............................\rw_online_ntd_learn.m,2554,2010-09-28
...........................\..............................\rw_online_ntd_learn_Script.m,1983,2010-09-28
...........................\..............................\rw_online_tdl_learn.m,2443,2010-09-28
...........................\..............................\rw_online_tdl_learn_Script.m,2195,2010-09-28
...........................\..............................\rw_online_w_et.m,1710,2010-09-28
...........................\..............................\rw_online_w_et_Script.m,2111,2010-09-28
...........................\..............................\rw_online_w_replacing_traces.m,1796,2010-09-28
...........................\Chapter 8 (Generailzation and Function Approximation)
...........................\.....................................................\do_mnt_car_Exps.m,1848,2010-09-28
...........................\.....................................................\GetTiles_Mex.C,2774,2010-09-28
...........................\.....................................................\GetTiles_Mex_Script.m,809,2010-09-28
...........................\.....................................................\get_ctg.m,939,2010-09-28
...........................\.....................................................\linAppFn.m,684,2010-09-28
...........................\.....................................................\mnt_car_learn.m,5579,2010-09-28
下载说明:请别用迅雷下载,失败请重下,重下不扣分!


