UvA DARE | Search results

UvA-DARE

Digital Academic Repository

UvA-DARE

Results: 11

Number of items: 11

Bakker, B., Whiteson, S., Kester, L., & Groen, F. C. A. (2010). Traffic light control by multiagent reinforcement learning systems. In R. Babuška, & F. C. A. Groen (Eds.), Interactive collaborative information systems (pp. 475-510). (Studies in computational intelligence; No. 281). Springer. https://doi.org/10.1007/978-3-642-11688-9_18
Grappiolo, C., Whiteson, S., Pavlin, G., & Bakker, B. (2009). Integrating distributed Bayesian inference and reinforcement learning for sensor management. In 12th International Conference on Information Fusion (FUSION 2009): Seattle, Washington, USA, 6 - 9 July 2009 (pp. 93-101). IEEE. http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5203741
van Seijen, H., Bakker, B., & Kester, L. (2008). Switching between different state representations in reinforcement learning. In A. Gammerman (Ed.), Proceedings of the IASTED International Conference on Artificial Intelligence and Applications: As part of the 26th IASTED International Multi-Conference on Applied Informatics, February 11-13, 2008, Innsbruck, Austria (pp. 595-163). ACTA. http://www.actapress.com/Abstract.aspx?paperId=32307
Kuyer, L., Whiteson, S., Bakker, B., & Vlassis, N. (2008). Multiagent reinforcement learning for urban traffic control using coordination graphs. In W. Daelemans, B. Goethals, & K. Morik (Eds.), Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2008, Antwerp, Belgium, September 15-19, 2008 : proceedings (Vol. I, pp. 656-671). (Lecture Notes in Computer Science; Vol. 5211). Springer. https://doi.org/10.1007/978-3-540-87479-9_61
Bakker, B. (2007). Reinforcement learning by backpropagation through an LSTM model/critic. In Proceedings of the IEEE symposium on Approximate Dynamic Programming and Reinforcement Learning (pp. 127-134). http://www.science.uva.nl/research/isla/pub/Bakker07ADPRL.pdf
van Seijen, H. H., Bakker, B., & Kester, L. J. H. M. (2007). Reinforcement Learning with Multiple, Qualitatively Different State Representations. In Proceedings of NIPS 2007 Workshop on Hierarchical Organization of Behavior
Zivkovic, Z., Bakker, B., & Kröse, B. J. A. (2006). Hierarchical map building and planning based on graph partitioning. In Proc. IEEE Int. Conf. on Robotics and Automation (pp. 803-809)
Bakker, B., Zivkovic, Z., & Kröse, B. J. A. (2005). Hierarchical dynamic programming for robot path planning. In Proceedings IEEE/RSJ International Conference on Intelligent Robots and Systems (pp. 3720-3725)
Kok, J. R., 't Hoen, P. J., Bakker, B., & Vlassis, N. (2005). Utile coordination: Learning interdependencies among cooperative agents. In Proceedings IEEE Symposium on Computational Intelligence and Games (CIG'05) (pp. 29-36).
Zivkovic, Z., Bakker, B., & Kröse, B. J. A. (2005). Hierarchical map building using visual landmarks and geometric constraints. In Proceedings IEEE/RSJ International Conference on Intelligent Robots and Systems (pp. 7-12)

Page 1 of 2