A Novel Control-Variates Approach for Performative Gradient-Based Learners with Missing Data

Published:

We propose a new, principled approach to missing data problems that can reduce both the bias and the variance of any predictive model trained on such data with (stochastic) gradient descent. The method can fill in missing values with an arbitrary, potentially biased, imputation model, because it corrects the bias introduced by imputation with a control variates method, yielding unbiased estimates for the gradient updates. Theoretically, we prove that our control variates approach improves the convergence of stochastic gradient descent under common missing data settings. Empirically, we show that our method outperforms competing imputation methods on various applications and across different missing data patterns.
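
To make the control variates idea concrete, the following is a minimal sketch of a control-variate gradient correction. It illustrates the general technique under stated assumptions and is not the algorithm from the paper: it assumes a one-parameter linear model with squared loss, that only the targets are missing, and that they are missing completely at random (MCAR). The names `grad`, `cv_gradient`, `make_toy_data`, and `random_mask`, as well as the coefficient `c`, are hypothetical and chosen for illustration only.

```python
import random


def grad(w, x, y):
    """Per-row gradient of 0.5 * (w * x - y) ** 2 with respect to w."""
    return (w * x - y) * x


def mean(values):
    return sum(values) / len(values)


def cv_gradient(w, xs, ys, y_imputed, observed, c=1.0):
    """Control-variate gradient estimate when some targets are missing.

    xs        : inputs for all rows
    ys        : true targets (only read where observed[i] is True)
    y_imputed : imputed targets for all rows (may be biased)
    observed  : booleans marking rows whose true target is available
    c         : control-variate coefficient (c=0 ignores the imputations)
    """
    obs = [i for i, is_obs in enumerate(observed) if is_obs]
    if not obs:
        raise ValueError("need at least one observed row")
    # Gradient from true targets, available on observed rows only.
    g_true_obs = mean([grad(w, xs[i], ys[i]) for i in obs])
    # Same gradient computed from imputed targets: on observed rows, and on all rows.
    g_imp_obs = mean([grad(w, xs[i], y_imputed[i]) for i in obs])
    g_imp_all = mean([grad(w, x, y) for x, y in zip(xs, y_imputed)])
    # Assumption: under MCAR, (g_imp_obs - g_imp_all) has zero mean over masks,
    # so subtracting it does not bias the estimate, however biased the imputer is.
    return g_true_obs - c * (g_imp_obs - g_imp_all)


def make_toy_data(n=200, seed=0):
    """Hypothetical toy data: y = 2x + noise, with a deliberately biased imputer."""
    rng = random.Random(seed)
    xs = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    ys = [2.0 * x + rng.gauss(0.0, 0.1) for x in xs]
    y_imputed = [1.5 * x + 0.3 for x in xs]  # biased on purpose
    return xs, ys, y_imputed


def random_mask(n, p_observed, rng):
    """MCAR mask: each row is observed independently with probability p_observed."""
    mask = [rng.random() < p_observed for _ in range(n)]
    if not any(mask):
        mask[rng.randrange(n)] = True  # keep at least one observed row
    return mask


if __name__ == "__main__":
    xs, ys, y_imputed = make_toy_data()
    mask = random_mask(len(xs), 0.5, random.Random(1))
    w = 0.0
    print("full-data gradient :", mean([grad(w, x, y) for x, y in zip(xs, ys)]))
    print("imputation only    :", mean([grad(w, x, y) for x, y in zip(xs, y_imputed)]))
    print("control variates   :", cv_gradient(w, xs, ys, y_imputed, mask))
```

In this sketch the imputed-target gradient serves as the control variate: its average over all rows is known exactly, so the gap between its observed-row average and its all-row average can be subtracted from the observed-row gradient. How much variance this removes depends on how strongly the imputed and true gradients are correlated. With `c=0` the estimator reduces to using observed rows only.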

Conference

Recommended citation:

@inproceedings{han2023novel,
  title={A Novel Control-Variates Approach for Performative Gradient-Based Learners with Missing Data},
  author={Han, Xing and Hu, Jing and Ghosh, Joydeep},
  booktitle={2023 International Joint Conference on Neural Networks (IJCNN)},
  pages={1--8},
  year={2023},
  organization={IEEE}
}