Population Informatics Lab


Congratulations. Great job!!!

Theo has his first paper published on a systematic review of Frequent ED use in Journal of Evaluation in Clinical Practice!

Nikita has won multiple student poster awards on her research on HPV vaccinations!
  • Student poster award at APHA 2018.
  • First place in Graduate poster presentation in the health sciences category at the TAMU student research week 2019.
  • Finalist in Jean Brender Student Research Poster Contest and Symposium 2019.

We are hosting Dr. Yong Choi on December 10th. Please join us in his talk on Smart Homes to support Aging. Co-Hosted with the Department of Industrial & Systems Engineering and the Applied Cognitive Ergonomics Lab.

Dr. Hye-Chung Kum was awarded the Presidential Impact Fellow at Texas A&M University 2018.

Dr. Hye-Chung Kum recieved the TAMU RDF funding for Spring 2018 to implement the Texas ViDaL (Virtual Data Library)! Please let us know if you have a pilot project you want to start.

Paper on information privacy and record linkage got the CHI2018 Honourable Mention Award (top 5% of all submissions).

New collaboration in neuroscience!
Dr. Hye-Chung Kum (Co-Chair) has been funded, with Dr. Tomáš Paus (Chair), to organize a workshop in 2020 on "Towards a Digital Ethology of Adolescence and Youth" by Ernst Strüngmann Forum. ESF is a German foundation which promotes interdisciplinary communication and research on high-priority issues in basic science by hosting an intellectual retreat, a week-long think tank, with international leaders in diverse fields to hold open dialogue as a means to overcome intellectual dead-ends, conceptualize new ways to grasp issues, and build future collaborations. The workshop will result in a co-edited book from MIT Press.

Dr. Hye-Chung Kum (Chair) and Dr. Luiza Antonie (Co-Chair) organized the Data Linkage Repository (DLREP) Workgroup. The workgroup aims to engage diverse experts in record linkage around data and code. Repository funded by NSF, in collaboration with ICPSR.

Recently accepted publication:

Ragan, E., Kum, H.-C., Ilangovan, G., and Wang, H. (2018). Balancing Privacy and Information Disclosure in Interactive Record Linkage with Visual Masking. Proceedings of the SIGCHI conference on Human factors in computing systems. ACM. CHI2018 Honourable Mention Award (top 5% of all submissions). Also invited to be presented at the Fourteenth Symposium on Usable Privacy and Security (SOUPS) Aug 2018 as a poster.

Giannouchos, T.*, Kum, H.-C., Foster, M., Ohsfeldt, R. Characteristics and predictors of adult frequent emergency departments’ users in the United States: a systematic literature review. Journal of Evaluation in Clinical Practice. In press.

Choi, D. T.*, Kum, H.-C., Park, S.*, Ohsfeldt, R. L., Shen, Y., Parikh, N. D., & Singal, A. G. (2019). Hepatocellular Carcinoma Screening is Associated with Increased Survival of Patients with Cirrhosis. Clinical Gastroenterology and Hepatology. 17. 976-987.

Trueblood, A. Pant, A., Kim, J., Kum, H-C., Perez, M. Das, S., Shipp, E. (2019). A semi-automated tool for identifying agricultural roadway crashes in crash narratives. Traffic Injury Prevention. In press.

Correa, M. F.*, Li, Y., Kum, H.-C., & Lawley, M. A. (2018). Assessing the Effect of Clinical Inertia on Diabetes Outcomes: a Modeling Approach. Journal of general internal medicine, 1-7.

Telemonitoring in Texas. 2019. Technical Report 2019-001-2.


The Population Informatics Lab applies informatics, data science, and computational methods to the increasingly large digital traces available to advance public health, social science, and population research. This research group is a joint effort between UNC-CH and Texas A&M (TAMU). We currently collaborate with the Renaissance Computing Institute, Carolina Population Center, The Odum Institute, the Gillings School of Global Public Health, and the School of Social Work at UNC and the School of Public Health and the Department of Computer Science at Texas A&M. We specialize in data science, KDD (Knowledge Discovery and Datamining), data integration, visualization, decision support systems, health informatics, computational social science, and privacy. UNC-CH Population Informatics Lab Website

Population Informatics News !

What is Population Informatics?

Computational Social Science is an emerging research area at the intersection of health science, social sciences, computer science, and statistics in which quantitative methods and computational tools are applied to big data about people to answer social science questions. Broadly speaking there are two approaches as follows:
  • Population Informatics : The systematic study of populations via secondary analysis of massive data collections (termed “big data”) about people. In particular, we focus most on improving health outcomes for a population and the data science approach which is about generating actionable information from raw data. Another important aspect of population informatics is Public health informatics which is more about how to best utilize the information generated using data science to improve public health.
  • Simulations (i.e., Agent Based Modeling (ABM) ) : Discover useful information and knowledge about our society through simulating the actions and interactions of autonomous agents (individuals and groups/organizations). Many of the parameters to model autonomous agents come from Population Informatics research.

Our Social Genome : A Federated Data System of Digital Data about People

New scientific opportunities are emerging as a result of increasingly effective data organization, access, and usage. Many fields of study have been transformed to a new level by new tools and data infrastructure. For example, the analysis of DNA sequence data has transformed medical research. We need to push the frontier of social sciences by doing the same with digital data available about our society; this will enable us to gain fundamental insights into the many facets of our society. A key source of information about all aspects of our society resides in government administrative data and various private operational data. From the day we are born until our death, most all of our activities leave footprints in various digital data systems. Birth, marriage, and death certificates are filed with the government, education records remain with departments of public instruction, and traces of employment can be found in the ESC UI (Employment Security Commission Unemployment Insurance) wage data. Without a doubt, a well-integrated data system that can encompass much of the data systems will hold the footprints of our society, our social genome. The two main hurdles to building such a system to transform the social sciences are (1) privacy concerns and the laws in place to protect individual confidentiality, and (2) the physiology of administrative data, which is fragmented, short-lived, and sometimes has questionable reliability. Our group’s research focuses on resolving these two barriers to building a federated data system of digital data about people for research. Once resolved, we can build the social genome data infrastructure that could finally allow us to move toward understanding how current policies play out in our society and how to make informed policies using information and knowledge gathered from these digital traces. Together, our digital traces collectively capture the footprints of our society. Like the human genome, the social genome data has much buried in the massive almost chaotic data. If properly analyzed and interpreted, this social genome could offer crucial insights into many of the most challenging problems facing our society (i.e. affordable and accessible quality healthcare, economics, education, employment, and welfare) For more details ...