• Login
    View Item 
    •   MKSU Digital Repository Home
    • Conferences / Workshops / Seminars
    • Conferences
    • 1st International Conference
    • View Item
    •   MKSU Digital Repository Home
    • Conferences / Workshops / Seminars
    • Conferences
    • 1st International Conference
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Performance of Imputation Methods towards Increasing Percentage of Missing Values

    Thumbnail
    View/Open
    Full text (327.8Kb)
    Date
    2018-04
    Author
    Brice, Kenfac D. P.
    Mwita, Peter N.
    Roger, Kamga T. I.
    Metadata
    Show full item record
    Abstract
    The aim of this paper is to study the performance of eightdifferent existing imputation methodsused on simulatedand real dataset. The methods are compared in term of their ability to estimate the missing observationsand estimate some statistics(mean, standard deviation and coefficient of a regression) using the full data set completed by the imputation. The comparisons are made using root mean square error, mean absolute deviationand bias observed after estimation of statistics. Simulation results using specific simulated data and bootstrap show that Mean Imputation and Complete case analysisare the best method in completing the data set and in obtaining best estimators for statistics.However, the results are subject to major changes if parameters like sample size, number of replication and type of distribution chosen are modified. In short with real data, result will change depending on the structure of dataset to impute. For example, application of the simulation results to a Rwandan dataset on smallholder farmers revealed that k-NN is the best method in reconstructing and Multiple Imputation can be used as imputation method in case we are to estimate some statistics. Our final conclusion is that imputation methods cannot be compared since in most cases their performance is parametrically linked to the data. We finally proposed a methodology and a simulation protocol to identify for any data set which imputationmethod will give the best results and therefore should be applied in priority. Key words. Bias, Bootstrap, Imputation,Root Mean Squared Error, Mean Absolute error.
    URI
    http://ir.mksu.ac.ke/handle/123456780/768
    Collections
    • 1st International Conference [83]

    DSpace software copyright © 2002-2015  DuraSpace
    Contact Us | Send Feedback
    Theme by 
    @mire NV
     

     

    Browse

    All of Digital RepositoryCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsBy Submit DateThis CollectionBy Issue DateAuthorsTitlesSubjectsBy Submit Date

    My Account

    LoginRegister

    DSpace software copyright © 2002-2015  DuraSpace
    Contact Us | Send Feedback
    Theme by 
    @mire NV