Regression Analysis with Block Missing Values and Variables Selection

Chien-Pai Han, Yan Li

Abstract


We consider a regression model when a block of observations is missing, i.e. there are a group of observations with all the explanatory variables or covariates observed and another set of observations with only a block of the variables observed. We propose an estimator of the regression coefficients that is a combination of two estimators, one based on the observations with no missing variables, and the other the set all observations after deleting of the block of variables with missing values. The proposed combined estimator will be compared with the uncombined estimators. If the experimenter suspects that the variables with missing values may be deleted, a preliminary test will be performed to resolve the uncertainty. If the preliminary test of the null hypothesis that regression coefficients of the variables with missing value equal to zero is accepted, then only the data with no missing values are used for estimating the regression coefficients. Otherwise the combined estimator is used. This gives a preliminary test estimator. The properties of the preliminary test estimator and comparisons of the estimators are studied by a Monte Carlo study

Keywords


Missing data; Combined estimator; Preliminary test estimator; Comparisons of regression coefficient estimators; Missing values

Full Text:

PDF


DOI: http://dx.doi.org/10.18187/pjsor.v7i2-Sp.303

Refbacks

  • There are currently no refbacks.




Copyright (c)

Title

Regression Analysis with Block Missing Values and Variables Selection

Keywords

Missing data; Combined estimator; Preliminary test estimator; Comparisons of regression coefficient estimators; Missing values

Description

We consider a regression model when a block of observations is missing, i.e. there are a group of observations with all the explanatory variables or covariates observed and another set of observations with only a block of the variables observed. We propose an estimator of the regression coefficients that is a combination of two estimators, one based on the observations with no missing variables, and the other the set all observations after deleting of the block of variables with missing values. The proposed combined estimator will be compared with the uncombined estimators. If the experimenter suspects that the variables with missing values may be deleted, a preliminary test will be performed to resolve the uncertainty. If the preliminary test of the null hypothesis that regression coefficients of the variables with missing value equal to zero is accepted, then only the data with no missing values are used for estimating the regression coefficients. Otherwise the combined estimator is used. This gives a preliminary test estimator. The properties of the preliminary test estimator and comparisons of the estimators are studied by a Monte Carlo study

Date

2011-07-01

Identifier


Source

Pakistan Journal of Statistics and Operation Research; Vol 7. No. 2-Sp, Oct 2011



Print ISSN: 1816-2711 | Electronic ISSN: 2220-5810