Title: The Essential Dynamics Algorithm: Essential Results
Authors: Martin, Martin C.
Date: 2004-10-08; 2003-05-01
Publisher: MIT
Source:
Type:
Subject: AI
Reinforcement learning
bicycle
policy search
Markov decision processes
Description: This paper presents a novel algorithm for learning in a class of stochastic Markov decision processes (MDPs) with continuous state and action spaces that trades speed for accuracy. A transform of the stochastic MDP into a deterministic one is presented which captures the essence of the original dynamics, in a sense made precise. In this transformed MDP, the calculation of values is greatly simplified. The online algorithm estimates the model of the transformed MDP and simultaneously does policy search against it. Bounds on the error of this approximation are proven, and experimental results in a bicycle riding domain are presented. The algorithm learns near optimal policies in orders of magnitude fewer interactions with the stochastic MDP, using less domain knowledge. All code used in the experiments is available on the project's web site.
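
The abstract's core idea, stated at a high level, is to replace the stochastic MDP with a deterministic surrogate that captures its essential dynamics, learn that surrogate online, and run policy search against it. The sketch below is only an illustration of that general recipe under simplifying assumptions, not the paper's algorithm: the toy linear-Gaussian dynamics, the least-squares model of the expected next state, the linear policy, and the hill-climbing search are all hypothetical choices made for the example.

```python
# Minimal sketch (assumptions, not the paper's implementation): fit a
# deterministic model of the expected next state from samples of a stochastic
# MDP, then do policy search against that deterministic model.

import numpy as np

rng = np.random.default_rng(0)

# --- Hypothetical stochastic MDP with continuous states and actions ---------
def step_stochastic(state, action):
    """True (unknown) dynamics: linear drift plus Gaussian noise."""
    A = np.array([[0.9, 0.1], [0.0, 0.95]])
    B = np.array([[0.0], [0.1]])
    noise = rng.normal(scale=0.05, size=2)
    next_state = A @ state + (B @ action).ravel() + noise
    reward = -float(next_state @ next_state)        # stay near the origin
    return next_state, reward

# --- Deterministic surrogate: expected next state ----------------------------
def fit_expected_dynamics(n_samples=2000):
    """Fit E[s' | s, a] by least squares on sampled transitions (linear model)."""
    X, Y = [], []
    for _ in range(n_samples):
        s = rng.uniform(-1, 1, size=2)
        a = rng.uniform(-1, 1, size=1)
        s2, _ = step_stochastic(s, a)
        X.append(np.concatenate([s, a]))
        Y.append(s2)
    W, *_ = np.linalg.lstsq(np.asarray(X), np.asarray(Y), rcond=None)
    return lambda s, a: np.concatenate([s, a]) @ W   # deterministic transition

# --- Policy search against the deterministic model ---------------------------
def rollout_return(theta, model, horizon=30):
    """Return of a linear policy a = theta @ s in the learned deterministic MDP."""
    s = np.array([1.0, 0.0])
    total = 0.0
    for _ in range(horizon):
        a = np.atleast_1d(theta @ s)
        s = model(s, a)
        total += -float(s @ s)
    return total

def policy_search(model, iters=200, sigma=0.1):
    """Simple hill climbing on policy parameters (illustrative choice only)."""
    theta = np.zeros(2)
    best = rollout_return(theta, model)
    for _ in range(iters):
        cand = theta + rng.normal(scale=sigma, size=2)
        val = rollout_return(cand, model)
        if val > best:
            theta, best = cand, val
    return theta, best

if __name__ == "__main__":
    model = fit_expected_dynamics()
    theta, value = policy_search(model)
    print("policy parameters:", theta, "return under deterministic model:", value)
```

Because every planning rollout happens in the learned deterministic model, the policy search itself needs no further interaction with the stochastic MDP, which is the kind of sample-efficiency gain the abstract reports for the bicycle domain.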
Language: English

Similar articles:

Description Of Procedures In Automotive Engine Plants by Artzner, Denis; Whitney, Dr. Daniel
Reading Courtesy Amounts on Handwritten Paper Checks by Palacios, Rafael; Wang, Patrick S.P.; Gupta, Amar
On Trees and Logs by Pavlova, Anna; Cass, David
Saturn, The GM/UAW Partnership by Rubinstein, Saul; Kochan, Thomas
Academic Earmarks and the Returns to Lobbying by De Figueiredo, John M.; Silverman, Brian S.