Analyzing Grid Log Data with Affinity Propagation

Authors
Publication date 2013
Host editors
  • M Ali
  • T. Bosse
  • K.V. Hindriks
  • M. Hoogendoorn
  • C.M Jonker
  • J. Treur
Book title Recent trends in applied artificial intelligence: 26th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems (IEA/AIE) Amsterdam, The Netherlands, June 17-21, 2013 Proceedings
ISBN
  • 9783642385766
Series Lecture Notes in Artificial Intelligence, 7906
Pages (from-to) 83-91
Number of pages 9
Publisher New York: Springer
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
In this paper we present an unsupervised learning approach to detect meaningful job traffic patterns in Grid log data. Manual anomaly detection on modern Grid environments is troublesome given their increasing complexity, the distributed, dynamic topology of the network and heterogeneity of the jobs being executed. The ability to automatically detect meaningful events with little or no human intervention is therefore desirable. We evaluate our method on a set of log data collected on the Grid. Since we lack a priori knowledge of patterns that can be detected and no labelled data is available, an unsupervised learning method is followed. We cluster jobs executed on the Grid using Affinity Propagation. We try to explain discovered clusters using representative features and we label them with the help of domain experts. Finally, as a further validation step, we construct a classifier for five of the detected clusters and we use it to predict the termination status of unseen jobs.
Document type Chapter
Language English
Published at https://doi.org/10.1007/978-3-642-38577-3_9
Permalink to this page
Back