Frequency Ratio: a method for dealing with missing values within nearest neighbour search

Open Access
Authors
  • R. Janssen
  • P. Spronck
  • P. Dibbets
  • A. Arntz
Publication date 2015
Journal Journal of Systems Integration
Volume | Issue number 6 | 3
Pages (from-to) 3-14
Organisations
  • Faculty of Social and Behavioural Sciences (FMG) - Psychology Research Institute (PsyRes)
Abstract
In this paper we introduce the Frequency Ratio (FR) method for dealing with missing values within nearest neighbour search. We test the FR method on known medical datasets from the UCI machine learning repository. We compare the accuracy of the FR method with five commonly used methods (three "imputation" and two "bypassing" methods) for dealing with values that are "missing completely at random" (MCAR) for the purpose of classification. We discovered that in most cases, the FR method outperforms the other methods. We conclude that the FR method is a strong addition to the commonly used methods for dealing with missing values within the nearest neighbour method.
Document type Article
Language English
Published at https://doi.org/10.20470/jsi.v6i3.233
Downloads
502264 (Final published version)
Permalink to this page
Back