Title:	Model-based Bayesian reinforcement learning in complex domains
Authors:	Ross, Stéphane
Date:	2008
Publisher:	McGill University - MCGILL
Source:
Type:	Electronic Thesis or Dissertation
Subject:	Applied Sciences - Artificial Intelligence
Description:	Reinforcement learning has emerged as a useful framework for learning to perform a task optimally from experience in unknown systems. A major problem for such learning algorithms is how to optimally balance exploration of the system, to gather knowledge, against exploitation of current knowledge, to complete the task. Model-based Bayesian Reinforcement Learning (BRL) methods provide an optimal solution to this problem by formulating it as a planning problem under uncertainty. However, the complexity of these methods has so far limited their applicability to small and simple domains. To improve the applicability of model-based BRL, this thesis presents several extensions to more complex and realistic systems, such as partially observable and continuous domains. To improve learning efficiency in large systems, this thesis includes another extension that automatically learns and exploits the structure of the system. Approximate algorithms are proposed to efficiently solve the resulting inference and planning problems.
Language:	en

1 Investigations on the form-genera Beauveria and Tritirachium, by MacLeod, Donald Murdock
2 Seismic sensitivity of tall guyed telecommunication towers, by Ghodrati Amiri, Gholamreza
3 Exploring the Relationship Between Assets and Family Stress Among Low-Income Families, by Rothwell, David W.; Han, Chang-Keun
4 The case for asset-based interventions with indigenous peoples: Evidence from Hawai‘i, by Rothwell, David W.
5 Second Thoughts: Who Almost Participates in an IDA Program?, by Rothwell, David W.; Han, Chang-Keun
6 Treatment and recovery in first-episode psychosis: a qualitative analysis of client experiences, by Windell, Deborah L.
7 Geology of the Mutton Bay Intrusion and surrounding area, North Shore, Gulf of St. Lawrence, Quebec, by Davies, Raymond
8 Recent contributions to the phenomenology of musical time: a critical survey, by Beaudreau, Pierre