Can Synthea be used to generate microdata? #1569
-
Dear Synthea community I am a researcher from a group looking for a tool to generate synthetic data. For example, le's say I want to generate the data of 10k patients with Alzheimer's Disease, including scores of Mini Mental State Examination (MMSE), a test typically administered to this class of patients. Can this be done (possibly requiring editing/extending an existing module), thus creating 10k MMSE scores with a distribution matching the real AD MMSE distribution? Sorry for the probably naive question. Claudio from Neuroinformatica |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Synthea is designed by default to generate a population that matches the demographics (e.g., age, gender, race/ethnicity, education, economic, SDoH distributions, etc) of a specific named area (e.g., state, or town/city). The disease modules determine the prevalence of disease and associated treatments and the resulting medical records (and claims), that are applied to that population.
Yes, this can be done. By default the population will match the demographics for the area you are simulating -- so you'd expect to see the appropriate rate of Alzheimer's, and not everyone having that diagnosis.... unless you make direct edits to the modules, create a well-considered Keep Module, defined a Module Override file, or use some other approach.
Yes, that is the default behavior. |
Beta Was this translation helpful? Give feedback.
Synthea is designed by default to generate a population that matches the demographics (e.g., age, gender, race/ethnicity, education, economic, SDoH distributions, etc) of a specific named area (e.g., state, or town/city). The disease modules determine the prevalence of disease and associated treatments and the resulting medical records (and claims), that are applied to that population.
Yes, this can be done. By default the population will match the demographics for the area you are simulating -- so you'd expect to see the a…