Kernel-based multi-imputation for missing data

UTSePress Research/Manakin Repository

Search UTSePress Research


Advanced Search

Browse

My Account

Show simple item record

dc.contributor.author Zhang Shichao en_US
dc.contributor.author Qin Yongsong en_US
dc.contributor.author Zhu Xiaofeng en_US
dc.contributor.author Zhang Jilian en_US
dc.contributor.author Zhang Chengqi en_US
dc.contributor.editor Li, Y; Looi, M; Zhoong, N en_US
dc.date.accessioned 2009-11-09T02:45:42Z
dc.date.available 2009-11-09T02:45:42Z
dc.date.issued 2006 en_US
dc.identifier 2006005284 en_US
dc.identifier.citation Zhang Shichao et al. 2006, 'Kernel-based multi-imputation for missing data', IOS Press, Amsterdam, Netherlands, pp. 106-111. en_US
dc.identifier.issn 1-58603-615-7 en_US
dc.identifier.other E1 en_US
dc.identifier.uri http://hdl.handle.net/10453/1914
dc.description.abstract Missing or incomplete data is a very important problem in many fields of research. Such as in active media technology, opinion polls, market research surveys, mail enquiries, medical studies, and other scientific experiments. Missing data imputation is a challenging issue in machine learning and data mining (Zhang et al. 2004; Batista & Monard 2003). Many missing data analysis techniques are of single-imputation which missing values are filled in by a plausible estimate such as the mean or median for that variable on other participants. However, single-imputation cannot provide valid standard errors and confidence intervals, since it ignores the uncertainty implicit in the fact that the imputed values are not the actual values. Recently, much research on missing data analysis has focused on multi-imputation techniques for addressing the issues in single-imputation (Faris et al. 2002; Little et al. 1987; Schaffer 2002; Taylor et al. 2002; Zhang 2004). Little et al. (1987) proposed a multiple imputation procedure to replace each missing value with a set of plausible values that represent the uncertainty about the right value to impute. The multiple-imputed-data sets are then analyzed using a standard procedure for complete data and combining the results from these analyses. In this paper, a kernel-based nonparametric multiple-imputation (KBNM) is proposed under MAR (missing Y mainly depends on X) and MCAR (MCAR is when the probability of missing a value is the same for all variables). The rest of this paper is organized as follows. Our kernel-based multiple imputation method is described in Section 2. Section 3 presents a series of experimental results on simulation models and a real-world dataset (from UeI) to compare the performances between our KBNM approach and the NORM. Conclusions are given in Section 4. en_US
dc.publisher IOS Press en_US
dc.relation.isbasedon http://www.iospress.nl/flyers_b/fl9781586036157.pdf en_US
dc.title Kernel-based multi-imputation for missing data en_US
dc.parent Advances in Intelligent IT: Active Media Technology 2006 en_US
dc.journal.volume en_US
dc.journal.number en_US
dc.publocation Amsterdam, The Netherlands en_US
dc.identifier.startpage 106 en_US
dc.identifier.endpage 111 en_US
dc.cauo.name Software Engineering en_US
dc.conference The 4th International Conference on Active Media Technology (AMT06) en_US
dc.conference.location Brisbane, Australia en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record