Search results
Results: 11
-
Bakker, B., Whiteson, S., Kester, L., & Groen, F. C. A. (2010). Traffic light control by multiagent reinforcement learning systems. In R. Babuška, & F. C. A. Groen (Eds.), Interactive collaborative information systems (pp. 475-510). (Studies in computational intelligence; No. 281). Springer. https://doi.org/10.1007/978-3-642-11688-9_18
-
Grappiolo, C., Whiteson, S., Pavlin, G., & Bakker, B. (2009). Integrating distributed Bayesian inference and reinforcement learning for sensor management. In 12th International Conference on Information Fusion (FUSION 2009): Seattle, Washington, USA, 6-9 July 2009 (pp. 93-101). IEEE. http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5203741
-
van Seijen, H., Bakker, B., & Kester, L. (2008). Switching between different state representations in reinforcement learning. In A. Gammerman (Ed.), Proceedings of the IASTED International Conference on Artificial Intelligence and Applications: As part of the 26th IASTED International Multi-Conference on Applied Informatics, February 11-13, 2008, Innsbruck, Austria (pp. 595-163). ACTA. http://www.actapress.com/Abstract.aspx?paperId=32307
-
Kuyer, L., Whiteson, S., Bakker, B., & Vlassis, N. (2008). Multiagent reinforcement learning for urban traffic control using coordination graphs. In W. Daelemans, B. Goethals, & K. Morik (Eds.), Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2008, Antwerp, Belgium, September 15-19, 2008: proceedings (Vol. I, pp. 656-671). (Lecture Notes in Computer Science; Vol. 5211). Springer. https://doi.org/10.1007/978-3-540-87479-9_61
-
Bakker, B. (2007). Reinforcement learning by backpropagation through an LSTM model/critic. In Proceedings of the IEEE symposium on Approximate Dynamic Programming and Reinforcement Learning (pp. 127-134). http://www.science.uva.nl/research/isla/pub/Bakker07ADPRL.pdf