- Frequency Ratio: a method for dealing with missing values within nearest neighbour search
- Journal of Systems Integration
- Volume | Issue number
- 6 | 3
- Pages (from-to)
- Document type
- Faculty of Social and Behavioural Sciences (FMG)
- Psychology Research Institute (PsyRes)
In this paper we introduce the Frequency Ratio (FR) method for dealing with missing values within nearest neighbour search. We test the FR method on known medical datasets from the UCI machine learning repository. We compare the accuracy of the FR method with five commonly used methods (three "imputation" and two "bypassing" methods) for dealing with values that are "missing completely at random" (MCAR) for the purpose of classification. We discovered that in most cases, the FR method outperforms the other methods. We conclude that the FR method is a strong addition to the commonly used methods for dealing with missing values within the nearest neighbour method.
If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library, or send a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible.