[Google Scholar]

Population modeling system based on multiple data sources having missing entries

Inventors Bishal Santra, Howard Mizes, Kush Motwani

Publication date 2022/2/2

Patent office US

Patent number 11,256,957

Application number 16/694,118

Description

A neural network is used to model the joint distribution of attributes across multiple health surveys. These multiple health surveys include large scale survey datasets and small scale survey datasets. The neural network model is trained using a combined dataset of the large scale survey datasets and the small scale survey datasets. The large scale survey datasets and the small scale survey datasets may include missing value indicators. The joint distribution of attributes modeled by the neural network model are the used to impute substitute values for the missing values to thereby create an output large scale dataset that does not include missing values.