UvA DARE | Search results

UvA-DARE

Digital Academic Repository

Results: 56

Höpner, N. R. (2026). Algorithms for knowledge-guided sequential decision-making: Integrating graphs, demonstrations, human and cross-agent experience. [Thesis, fully internal, Universiteit van Amsterdam].
Hoepner, N., Kuric, D., & van Hoof, H. (2025). Making Universal Policies Universal. In Y. Vorobeychik, S. Das, & A. Nowe (Eds.), AAMAS '25: Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems : May 19-23, 2025, Detroit, Michigan, USA (pp. 2553-2555). International Foundation for Autonomous Agents and Multiagent Systems. https://doi.org/10.48550/arXiv.2502.14777
Hoepner, N., Tiddi, I., & van Hoof, H. (2025). Data Augmentation for Instruction Following Policies via Trajectory Segmentation. In T. Walsh, J. Shah, & Z. Kolter (Eds.), Proceedings of the 39th Annual AAAI Conference on Artificial Intelligence: February 25-March 4, 2025, Philadelphia, Pennsylvania, USA (Vol. 16, pp. 17214-17222). AAAI Press. https://doi.org/10.1609/aaai.v39i16.33892
Mussi, M., Metelli, A. M., Restelli, M., Losapio, G., Bessa, R. J., Boos, D., Borst, C., Leto, G., Castagna, A., Chavarriaga, R., Dias, D., Egli, A., Eisenegger, A., El Manyari, Y., Fuxjäger, A., Geraldes, J., Hamouche, S., Hassouna, M., Lemetayer, B., ... Zanotti, G. (2025). Human-AI interaction in safety-critical network infrastructures. iScience, 28(9), Article 113400. https://doi.org/10.1016/j.isci.2025.113400
Kuric, D., Infante, G., Gómez, V., Jonsson, A., & van Hoof, H. (2024). Planning with a Learned Policy Basis to Optimally Solve Complex Tasks. In S. Bernardini, & C. Muise (Eds.), Proceedings of the Thirty-Fourth International Conference on Automated Planning and Scheduling: June 1–6, 2024, Alberta, Canada (pp. 333-341). (ICAPS; Vol. 34). AAAI Press. https://doi.org/10.1609/icaps.v34i1.31492
Loftin, R., Çelikok, M. M., van Hoof, H., Kaski, S., & Oliehoek, F. A. (2024). Uncoupled Learning of Differential Stackelberg Equilibria with Commitments. In AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems : May 6-10, 2024, Auckland, New Zealand (pp. 1265-1273). International Foundation for Autonomous Agents and Multiagent Systems. https://www.ifaamas.org/Proceedings/aamas2024/pdfs/p1265.pdf
Giri, C., Granmo, O. C., & van Hoof, H. (2024). Accelerated Tsetlin Machine Inference Through Incremental Model Re-evaluation. In 2024 International Symposium on the Tsetlin Machine (ISTM 2024): Pittsburgh, Pennsylvania, USA, 28-30 August 2024 (pp. 93-100). IEEE. https://doi.org/10.1109/ISTM62799.2024.10931270
Huang, J., Oosterhuis, H., Mansoury, M., van Hoof, H., & de Rijke, M. (2024). Going Beyond Popularity and Positivity Bias: Correcting for Multifactorial Bias in Recommender Systems. In SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval : July 14-18, 2024, Washington, DC, USA (pp. 416-426). Association for Computing Machinery. https://doi.org/10.1145/3626772.3657749
Wöhlke, J. G. (2024). Reinforcement learning and planning for autonomous agent navigation: With a focus on sparse reward settings. [Thesis, fully internal, Universiteit van Amsterdam].
Mansoury, M., Mobasher, B., & van Hoof, H. (2024). Mitigating Exposure Bias in Online Learning to Rank Recommendation: A Novel Reward Model for Cascading Bandits. In CIKM '24: Proceedings of the 33rd ACM International Conference on Information and Knowledge Management : October 21-25, 2024, Boise, ID, USA (pp. 1638-1648). Association for Computing Machinery. https://doi.org/10.1145/3627673.3679763
