Article

Limitations Of Common Solutions To Missing Data

Topic: Business ConsultingPublished October 12, 2008

Legacy signals

Legacy popularity: 928 legacy views

A previous article discussed some of the causes of missing data and some of the consequences of analyzing only complete cases. This newsletter will discuss some other common ways of dealing with missing data, with a discussion of their advantages and disadvantages.nnAvailable case analysis (pairwise deletion) calculates each step of the analysis separately using the cases that have data available for that step. Therefore, a case with data missing on one variable will be used only in steps that do not involve that variable. The advantage is that the sample size for each individual analysis is generally higher than with complete case analysis, but the results are unbiased only if the data are MCAR. It can also lead to mathematical problems in computing estimates of some parameters, and is not recommended.nnMost other methods involve imputation-replacing the missing values with an estimate, then analyzing the full data set as if the imputed values were actual observed values. There are many ways to choose an estimate. The following are common methods:nn * Mean: the mean of the observed values for that variablen * Substitution: the value from a new individual who was not selected to be in the samplen * Hot deck: a randomly chosen value from an individual who has similar values on other variablesn * Cold deck: a systematically chosen value from an individual who has similar values on other variablesn * Regression: the predicted value obtained by regressing the missing variable on other variablesn * Stochastic regression: the predicted value from a regression plus a random residual value.n * Interpolation and extrapolation: an estimated value from other observations from the same individual.nnImputation is popular because it is conceptually simple and because the resulting sample has the same number of observations as the full data set. It can be very tempting when complete-case analysis eliminates a large proportion of the data set. But it has limitations. Some imputation methods result in biased parameter estimates, such as means and correlations, unless the data are MCAR. The bias is often worse than with complete-case analysis, especially for mean imputation. The extent of the bias depends on many factors, including the missing data mechanism, the proportion of the data that is missing, and the information available in the data set.nnMoreover, all of these imputation methods underestimate standard errors. Since the imputed observations are themselves estimates, their values have corresponding random error. Despite this, imputed values are treated as actual observations in analyses. The extra source of error is ignored, resulting in too-small standard errors and too-small p-values. Furthermore, although imputation is conceptually simple, it is usually difficult to do well in practice. Therefore, these imputation methods are not satisfactory in most circumstances.nnTwo alternate methods maintain the full sample size and can result in unbiased estimates of parameters and standard errors for ignorable missing data: multiple imputation and maximum likelihood estimation. These techniques are now available in common statistical software. Subsequent newsletters will describe these methods and discuss their availability in software packages.

Further reading

Further Reading

4 total

Article

The medical device sector demands greater regulatory standards worldwide. Firms must ensure product safety and quality for patient well-being. Implementing the ISO 13485standards for medical devices can help meet these expectations. Skilled ISO 13485 consultants can assist in the implementation journey,and this delivers measurable value. This ISO is not about a paperwork exercise, but it offers practical implementation procedures. It allows medical firms to design efficient q

February 17, 2026

Article

Are You Worried That Competitors Are Ahead in Ways We Can’t See? How to Stop Playing Blind and Start Seeing What Actually Matters: Weekly Winning StrategiesrnMany companies lose because they fight ghosts. Imagining competitor advantage that doesn’t exist. Missing the real threats right in front of them. Stop worrying about invisible competitors and start seeing what matters. The Panic That Wastes MillionsrnA fintech startup approached us in 2025 with $800K in their bank a

February 8, 2026

Article

Inventory management is one of the most important parts of running a successful business. No matter if you own a retail store, a restaurant, or a small warehouse, knowing what products you have in stock helps you avoid losses and serve customers better. When inventory is poorly managed, businesses often face common problems such as missing items, overstocked shelves, or products running out at the wrong time. These issues can directly affect profits and customer trust. In the

January 16, 2026

Article

Inventory management is one of the most important parts of running a successful business. No matter if you own a retail store, a restaurant, or a small warehouse, knowing what products you have in stock helps you avoid losses and serve customers better. When inventory is poorly managed, businesses often face common problems such as missing items, overstocked shelves, or products running out at the wrong time. These issues can directly affect profits and customer trust.rnIn th

January 16, 2026