Title:	Using combination of actions in reinforcement learning
Authors:	Karanik, Marcelo J.; Gramajo, Sergio D.
Date:	2010-03-22
Publisher:	Universidad Nacional de La Plata
Source:
Type:	Article
Subject:	reinforcement learning; SARSA; action combination; optimal policy; Computer Science; Informatics; Software
Description:	Software agents are programs that can observe their environment and act in an attempt to reach their design goals. In most cases, the choice of a particular agent architecture determines the agent's behaviour in response to the different problem states. However, there are some problem domains in which it is desirable for the agent to learn a good action execution policy by interacting with its environment. This kind of learning is called Reinforcement Learning (RL), and it is useful in the process control area. Given a problem state, the agent selects an action and receives an immediate reward; the estimates for every action are then updated and, after a certain period of time, the agent learns which action is best to execute. Most reinforcement learning algorithms execute one simple action at a time, even when two or more could be used. This work uses RL algorithms to find an optimal policy in a gridworld problem and proposes a mechanism to combine actions of different types.
Language:	English
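
The learning loop summarised in the description (select an action, receive an immediate reward, update the action estimates) can be illustrated with a minimal tabular SARSA sketch on a small gridworld. This is not the authors' implementation: the 4x4 grid, the reward of -1 per move, the hyperparameters, and every name used here (SIZE, GOAL, step, epsilon_greedy, sarsa, greedy_path) are assumptions made for illustration. The action-combination mechanism proposed in the article is not detailed in this record, so the sketch covers only plain single-action SARSA.

```python
import random

# Hypothetical 4x4 gridworld: start at the top-left cell, goal at the bottom-right.
SIZE = 4
GOAL = (SIZE - 1, SIZE - 1)
ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def step(state, action):
    """Move one cell (walls keep the agent in place); -1 per move, 0 on reaching the goal."""
    dr, dc = ACTIONS[action]
    row = min(max(state[0] + dr, 0), SIZE - 1)
    col = min(max(state[1] + dc, 0), SIZE - 1)
    next_state = (row, col)
    reward = 0.0 if next_state == GOAL else -1.0
    return next_state, reward, next_state == GOAL


def epsilon_greedy(q, state, epsilon, rng):
    """Pick a random action with probability epsilon, otherwise the best-valued one."""
    if rng.random() < epsilon:
        return rng.choice(list(ACTIONS))
    return max(ACTIONS, key=lambda a: q[(state, a)])


def sarsa(episodes=2000, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular SARSA; the hyperparameter values are illustrative assumptions."""
    rng = random.Random(seed)
    q = {((r, c), a): 0.0 for r in range(SIZE) for c in range(SIZE) for a in ACTIONS}
    for _ in range(episodes):
        state = (0, 0)
        action = epsilon_greedy(q, state, epsilon, rng)
        done = False
        while not done:
            next_state, reward, done = step(state, action)
            next_action = epsilon_greedy(q, next_state, epsilon, rng)
            # On-policy target: uses the action actually chosen for the next state.
            target = reward + (0.0 if done else gamma * q[(next_state, next_action)])
            q[(state, action)] += alpha * (target - q[(state, action)])
            state, action = next_state, next_action
    return q


def greedy_path(q, max_steps=50):
    """Follow the greedy policy from the start cell, giving up after max_steps moves."""
    state, path = (0, 0), [(0, 0)]
    while state != GOAL and len(path) <= max_steps:
        action = max(ACTIONS, key=lambda a: q[(state, a)])
        state, _, _ = step(state, action)
        path.append(state)
    return path
```

Calling greedy_path(sarsa()) returns the cells visited when the learned estimates are followed greedily from the start cell. The step cap is there because a greedy policy read from partially learned estimates is not guaranteed to reach the goal.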
