| dc.contributor.author | Zhang Shichao | en_US |
| dc.contributor.author | Qin Yongsong | en_US |
| dc.contributor.author | Zhu Xiaofeng | en_US |
| dc.contributor.author | Zhang Jilian | en_US |
| dc.contributor.author | Zhang Chengqi | en_US |
| dc.contributor.editor | Li, Y; Looi, M; Zhoong, N | en_US |
| dc.date.accessioned | 2009-11-09T02:45:42Z | |
| dc.date.available | 2009-11-09T02:45:42Z | |
| dc.date.issued | 2006 | en_US |
| dc.identifier | 2006005284 | en_US |
| dc.identifier.citation | Zhang Shichao et al. 2006, 'Kernel-based multi-imputation for missing data', IOS Press, Amsterdam, Netherlands, pp. 106-111. | en_US |
| dc.identifier.issn | 1-58603-615-7 | en_US |
| dc.identifier.other | E1 | en_US |
| dc.identifier.uri | http://hdl.handle.net/10453/1914 | |
| dc.description.abstract | Missing or incomplete data is a very important problem in many fields of research. Such as in active media technology, opinion polls, market research surveys, mail enquiries, medical studies, and other scientific experiments. Missing data imputation is a challenging issue in machine learning and data mining (Zhang et al. 2004; Batista & Monard 2003). Many missing data analysis techniques are of single-imputation which missing values are filled in by a plausible estimate such as the mean or median for that variable on other participants. However, single-imputation cannot provide valid standard errors and confidence intervals, since it ignores the uncertainty implicit in the fact that the imputed values are not the actual values. Recently, much research on missing data analysis has focused on multi-imputation techniques for addressing the issues in single-imputation (Faris et al. 2002; Little et al. 1987; Schaffer 2002; Taylor et al. 2002; Zhang 2004). Little et al. (1987) proposed a multiple imputation procedure to replace each missing value with a set of plausible values that represent the uncertainty about the right value to impute. The multiple-imputed-data sets are then analyzed using a standard procedure for complete data and combining the results from these analyses. In this paper, a kernel-based nonparametric multiple-imputation (KBNM) is proposed under MAR (missing Y mainly depends on X) and MCAR (MCAR is when the probability of missing a value is the same for all variables). The rest of this paper is organized as follows. Our kernel-based multiple imputation method is described in Section 2. Section 3 presents a series of experimental results on simulation models and a real-world dataset (from UeI) to compare the performances between our KBNM approach and the NORM. Conclusions are given in Section 4. | en_US |
| dc.publisher | IOS Press | en_US |
| dc.relation.isbasedon | http://www.iospress.nl/flyers_b/fl9781586036157.pdf | en_US |
| dc.title | Kernel-based multi-imputation for missing data | en_US |
| dc.parent | Advances in Intelligent IT: Active Media Technology 2006 | en_US |
| dc.journal.volume | en_US | |
| dc.journal.number | en_US | |
| dc.publocation | Amsterdam, The Netherlands | en_US |
| dc.identifier.startpage | 106 | en_US |
| dc.identifier.endpage | 111 | en_US |
| dc.cauo.name | Software Engineering | en_US |
| dc.conference | The 4th International Conference on Active Media Technology (AMT06) | en_US |
| dc.conference.location | Brisbane, Australia | en_US |