Patents
[Google Scholar]
Population modeling system based on multiple data sources having missing entries
Inventors Bishal Santra, Howard Mizes, Kush Motwani
Publication date 2022/2/2
Patent office US
Patent number 11,256,957
Application number 16/694,118
Description
A neural network is used to model the joint distribution of attributes across multiple health surveys. These multiple health surveys include large scale survey datasets and small scale survey datasets. The neural network model is trained using a combined dataset of the large scale survey datasets and the small scale survey datasets. The large scale survey datasets and the small scale survey datasets may include missing value indicators. The joint distribution of attributes modeled by the neural network model are the used to impute substitute values for the missing values to thereby create an output large scale dataset that does not include missing values.