Analyzing Grid Log Data with Affinity Propagation
| Authors |
|
|---|---|
| Publication date | 2013 |
| Host editors |
|
| Book title | Recent trends in applied artificial intelligence: 26th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems (IEA/AIE) Amsterdam, The Netherlands, June 17-21, 2013 Proceedings |
| ISBN |
|
| Series | Lecture Notes in Artificial Intelligence, 7906 |
| Pages (from-to) | 83-91 |
| Number of pages | 9 |
| Publisher | New York: Springer |
| Organisations |
|
| Abstract |
In this paper we present an unsupervised learning approach to detect meaningful job traffic patterns in Grid log data. Manual anomaly detection on modern Grid environments is troublesome given their increasing complexity, the distributed, dynamic topology of the network and heterogeneity of the jobs being executed. The ability to automatically detect meaningful events with little or no human intervention is therefore desirable. We evaluate our method on a set of log data collected on the Grid. Since we lack a priori knowledge of patterns that can be detected and no labelled data is available, an unsupervised learning method is followed. We cluster jobs executed on the Grid using Affinity Propagation. We try to explain discovered clusters using representative features and we label them with the help of domain experts. Finally, as a further validation step, we construct a classifier for five of the detected clusters and we use it to predict the termination status of unseen jobs.
|
| Document type | Chapter |
| Language | English |
| Published at | https://doi.org/10.1007/978-3-642-38577-3_9 |
| Permalink to this page | |