To help you get started, we've compiled a variety of datasets and APIs for inspiration. Many of these datasets have already been cleaned and normalized, so they are ready to be explored using AI tools. Many are intended for research purposes only, so if you want to use the data in your startup, be sure to read any associated license agreements to check for commercial restrictions. Also note that you are not restricted to basing your idea on the datasets below. You may discover other open source datasets that inspire your creativity, or you may bring your own proprietary datasets if you wish.
And if there's a dataset you think we should add to the list, please send it to us.
Public health data available on the internet can be broadly categorized by the information they contain and the purposes they serve. Here are some of the main categories of public health data:
1. Disease Surveillance Data: This type of data includes information on the occurrence, prevalence, and distribution of diseases within a specific population or region. It helps public health authorities monitor and respond to disease outbreaks, track trends, and implement control measures.
2. Health Indicators and Vital Statistics: These data encompass a range of health indicators, such as mortality rates, birth rates, life expectancy, and various demographic statistics. They provide insights into the overall health and well-being of a population.
3. Behavioral Risk Factor Data: Behavioral risk factor data focus on lifestyle and behavior-related factors that influence health outcomes. These can include data on smoking, physical activity, diet, alcohol consumption, and other health-related behaviors.
4. Healthcare Utilization and Access Data: This category includes data on how health services are used, how accessible they are, and the capacity of healthcare facilities. It helps assess healthcare system efficiency and identify gaps in healthcare access.
5. Environmental Health Data: Environmental health data cover factors related to the environment and its impact on human health. This can include air and water quality, exposure to hazardous substances, and data on environmental health risks.
6. Social Determinants of Health Data: These data encompass socioeconomic factors, such as income, education, employment, housing, and social support, which influence health outcomes and health disparities.
7. Infectious Disease Outbreak Data: This category includes data related to outbreaks of infectious diseases, tracking the spread of the disease, identifying the source of the outbreak, and implementing control measures.
8. Immunization Data: Immunization data track vaccination coverage rates, vaccine-preventable disease rates, and efforts to improve immunization coverage.
9. Maternal and Child Health Data: These data focus on the health of mothers and children, including prenatal care, maternal mortality, infant mortality, and child health indicators.
10. Chronic Disease Data: Chronic disease data cover non-communicable diseases like cardiovascular diseases, diabetes, cancer, and respiratory conditions, providing information on prevalence, risk factors, and management.
11. Mental Health Data: Mental health data include prevalence rates of mental disorders, access to mental health services, and mental health outcomes.
12. Healthcare Quality and Safety Data: This category encompasses data related to healthcare quality, patient safety, healthcare-associated infections, and medical errors.
13. Global Health Data: Global health data cover health-related data from various countries and regions, facilitating cross-country comparisons and supporting international health initiatives.
14. Emergency and Disaster Preparedness Data: These data focus on preparedness for natural disasters, pandemics, and other emergencies, including planning, response, and recovery efforts.
15. Public Health Interventions and Programs Data: Data in this category pertain to the effectiveness and impact of public health interventions, such as vaccination campaigns, health education programs, and disease control initiatives.
16. Health Policy and Health System Data: These data provide insights into health policies, health system performance, healthcare financing, and health governance.
These categories are not exhaustive, and there may be some overlap between them. Public health data sources can be diverse and may be provided by various organizations, including governments, international agencies, research institutions, and non-governmental organizations. It is essential to use reliable and up-to-date sources to ensure the accuracy and validity of the data used for public health analysis and decision-making.
Here is a list of top sources of public health data, with a brief description of each source:
1. World Health Organization (WHO) - Global Health Observatory (GHO): The WHO's GHO provides a comprehensive collection of health-related data from countries around the world. It covers various health indicators, disease statistics, health systems information, and more. - Website: https://www.who.int/data/gho
2. Centers for Disease Control and Prevention (CDC) - Data & Statistics: The CDC offers a wide range of health-related data and statistics on topics such as diseases, public health, nutrition, and other health-related issues in the United States. - Website: https://www.cdc.gov/datastatistics/index.html
3. National Institutes of Health (NIH) - Health Information: NIH provides data and statistics related to biomedical research, health conditions, clinical trials, and other health-related information. - Website: https://www.nih.gov/health-information
4. European Centre for Disease Prevention and Control (ECDC) - Health Topics: ECDC offers data and statistics on infectious diseases in Europe, along with risk assessments and epidemiological reports. - Website: https://www.ecdc.europa.eu/en
5. Data.gov - Health Datasets: Data.gov is the official data portal of the United States government, providing access to various public datasets, including health-related data from different federal agencies. - Website: https://catalog.data.gov/dataset/?_organization_limit=0&organization=hhs-gov
6. HealthData.gov - Health Datasets: HealthData.gov focuses specifically on health-related datasets from agencies such as the Department of Health and Human Services (HHS) and Centers for Medicare & Medicaid Services (CMS). - Website: https://healthdata.gov/
7. Organisation for Economic Co-operation and Development (OECD) - Health Data: OECD provides health statistics for its member countries, enabling comparisons and analysis of health-related indicators across nations. - Website: https://data.oecd.org/health.htm
8. UNICEF Data - Child and Maternal Health Data: UNICEF's data portal offers information on child and maternal health, nutrition, immunization, and other related areas. - Website: https://data.unicef.org/
9. Health Indicators Warehouse: This resource offers health data for the United States, including chronic diseases, behavioral risk factors, and healthcare access. - Website: https://healthdata.gov/State/Health-Indicators-Warehouse/gd8q-h5hm
10. Global Burden of Disease Study (GBD): The GBD study provides comprehensive data on the impact of diseases and injuries worldwide, offering insights into global health trends. - Website: https://www.healthdata.org/research-analysis/gbd
11. World Bank - Health Nutrition and Population Data: The World Bank's data platform includes health, nutrition, and population data from various countries. - Website: https://databank.worldbank.org/source/health-nutrition-and-population-statistics
12. United Nations - World Health Organization: The United Nations' website provides access to WHO's health-related data and reports. - Website: https://www.un.org/en/academic-impact/who
13. National Health Service (NHS) Digital - Data and Information Services: NHS Digital offers health-related data and information services in the United Kingdom. - Website: https://digital.nhs.uk/data
14. Australian Institute of Health and Welfare (AIHW): AIHW provides a wide range of health-related data, statistics, and reports in Australia. - Website: https://www.aihw.gov.au/reports-data
15. Canadian Institute for Health Information (CIHI): CIHI offers health data, statistics, and information related to healthcare in Canada. - Website: https://www.cihi.ca/en/access-data-and-reports
16. European Union Open Data Portal - Health Data: The EU's open data portal provides access to health-related datasets and information from EU member states. - Website: https://data.europa.eu/en/publications/datastories/open-health-data-european-data-portal
17. Health Systems Trust (South Africa): Health Systems Trust offers health data and research in South Africa. - Website: https://www.hst.org.za/
18. Public Health England (PHE) - Health Data and Analysis: PHE provides health data and analysis for public health in England. - Website: https://www.gov.uk/guidance/phe-data-and-analysis-tools
19. India Health Data Portal: This portal provides health data and statistics for India. - Website: https://www.india.gov.in/nhm-health-statistics-information-portal
20. Korea Centers for Disease Control and Prevention (KCDC) - Statistics and Data: KCDC offers statistics and data on diseases and public health in South Korea. - Website: https://ghdx.healthdata.org/organizations/korea-centers-disease-control-and-prevention-kcdc
21. Japan Ministry of Health, Labour, and Welfare (MHLW): MHLW provides health-related data and information in Japan. - Website: https://www.mhlw.go.jp/english/database/
22. Singapore Ministry of Health - Health Statistics: The Ministry of Health in Singapore offers health statistics and information. - Website: https://www.moh.gov.sg/resources-statistics
23. Brazilian Institute of Geography and Statistics (IBGE) - Health Indicators: IBGE provides health indicators and data in Brazil. - Website: https://www.ibge.gov.br/en/statistics/social/health.html
24. Mexican Institute of Statistics and Geography (INEGI) - Health Data: INEGI offers health-related data in Mexico. - Website: https://en.www.inegi.org.mx/
25. Chile Ministry of Health - Health Statistics: The Chilean Ministry of Health provides health statistics and information. - Website: https://ghdx.healthdata.org/organizations/department-statistics-and-health-information-ministry-health-chile
26. New Zealand Ministry of Health - Health Data and Stats: The Ministry of Health in New Zealand offers health data and statistics. - Website: https://www.health.govt.nz/nz-health-statistics
27. Colombia National Institute of Health - Health Statistics: The National Institute of Health in Colombia provides health statistics and data. - Website: https://www.healthdata.org/research-analysis/health-by-location/profiles/colombia
28. Argentina Ministry of Health - Health Information: The Ministry of Health in Argentina offers health information and data. - Website: https://www.argentina.gob.ar/salud
29. South Africa Department of Health - Health Statistics: The Department of Health in South Africa provides health statistics and data. - Website: https://www.health.gov.za/
30. Kenya Ministry of Health - Health Data and Statistics: The Ministry of Health in Kenya offers health data and statistics. - Website: http://dsl.health.go.ke/
31. Nigeria Federal Ministry of Health - Health Data: The Federal Ministry of Health in Nigeria provides health data and information. - Website: https://www.health.gov.ng/
32. Ghana Ministry of Health - Health Statistics: The Ministry of Health in Ghana offers health statistics and data. - Website: https://www.moh.gov.gh/facts-figures/
33. U.S. Agency for Healthcare Research and Quality (AHRQ) - Healthcare Data: AHRQ provides data and research on healthcare quality and outcomes in the United States. - Website: https://www.ahrq.gov/data/index.html
34. U.S. Census Bureau - Health Insurance Data: The U.S. Census Bureau offers data on health insurance coverage in the United States. - Website: https://www.census.gov/topics/health/health-insurance.html
35. U.S. National Cancer Institute (NCI) - Cancer Statistics: NCI provides cancer-related statistics and data. - Website: https://www.cancer.gov/about-cancer/understanding/statistics
36. U.S. Food and Drug Administration (FDA) - Adverse Events Reporting System (FAERS): FDA's FAERS provides data on adverse events related to drugs and medical products. - Website: https://open.fda.gov/data/faers/
37. U.S. National Center for Health Statistics (NCHS): NCHS offers health statistics and data in the United States. - Website: https://www.cdc.gov/nchs/index.htm
38. U.S. Substance Abuse and Mental Health Services Administration (SAMHSA) - Data: SAMHSA provides data and statistics on substance abuse and mental health. - Website: https://www.samhsa.gov/data/
39. U.S. Environmental Protection Agency (EPA) - Health Data: EPA offers health-related data and information on environmental health hazards. - Website: https://www.epa.gov/data
40. U.S. National Library of Medicine (NLM) - Health Data Resources: NLM provides access to various health data resources and databases. - Website: https://www.nlm.nih.gov/
41. U.S. National Heart, Lung, and Blood Institute (NHLBI) - Data: NHLBI offers data and resources related to heart, lung, and blood diseases. - Website: https://www.nhlbi.nih.gov/grants-and-training/funding-opportunities-and-contacts/NHLBI-heart-failure-data-challenge/access-data
42. U.S. National Institute on Aging (NIA) - Health and Aging Data: NIA provides data and statistics on health and aging. - Website: https://www.nia.nih.gov/
43. U.S. National Institute of Mental Health (NIMH) - Data and Statistics: NIMH offers data and statistics on mental health. - Website: https://www.nimh.nih.gov/health/statistics
44. U.S. National Institute on Drug Abuse (NIDA) - Data and Statistics: NIDA provides data and statistics on drug abuse. - Website: https://nida.nih.gov/research-topics/trends-statistics
45. U.S. National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) - Data and Statistics: NIDDK offers data and statistics on diabetes, digestive diseases, and kidney diseases. - Website: https://www.niddk.nih.gov/
46. U.S. Department of Veterans Affairs (VA) - Health Data: The VA provides health data and information related to veterans. - Website: https://www.data.va.gov/
47. U.S. Census Bureau - Population Health Data: The U.S. Census Bureau offers population health data and information. - Website: https://www.census.gov/topics/health/data.html
48. World Health Organization (WHO) Global Health Expenditure Database: WHO's database provides data on global health expenditures. - Website: https://apps.who.int/nha/database
49. World Health Organization (WHO) Global Health Estimates: WHO's Global Health Estimates provide data on mortality and health-related indicators. - Website: https://www.who.int/data/global-health-estimates
50. Global Health Data Exchange (GHDx): GHDx is a data repository managed by the Institute for Health Metrics and Evaluation (IHME). - Website: https://ghdx.healthdata.org/
51. Medical Appointment No-Shows - Website: https://www.kaggle.com/joniarroba/noshowappointments
52. SEER Cancer Incidence Database - Website: https://seer.cancer.gov/statistics-network/explorer/application.html
53. CDC Cause of Death - Website: https://wonder.cdc.gov/
54. Mental Health in Tech Survey - Website: https://www.kaggle.com/datasets/osmi/mental-health-in-tech-survey
55. Prescription-based Prediction - Website: https://www.kaggle.com/roamresearch/prescriptionbasedprediction
56. Hospital Charges for Inpatients - Website: https://www.kaggle.com/speedoheck/inpatient-hospital-charges
57. US County-Level Mortality - Website: https://www.kaggle.com/datasets/IHME/us-countylevel-mortality
58. Exercise Pattern Prediction - Website: https://www.kaggle.com/athniv/exercisepatternpredict
59. Human Activity Recognition - Website: https://archive.ics.uci.edu/dataset/240/human+activity+recognition+using+smartphones
60. Data.gov: This is the official data portal of the United States government, providing access to a wide range of public datasets, including health-related data from various federal agencies. - Website: https://www.data.gov/
61. OECD Health Data: The Organisation for Economic Co-operation and Development (OECD) offers health statistics for its member countries. The data covers a broad range of health indicators and can be useful for cross-country comparisons. - Website: https://stats.oecd.org/Index.aspx?DataSetCode=HEALTH_STAT
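Several of the sources above also expose their data programmatically. As a rough illustration, the sketch below pulls one indicator series from the World Bank using only the Python standard library. It assumes the World Bank's public Indicators API at `https://api.worldbank.org/v2` (the list above links only to the DataBank page) and assumes that `SP.DYN.LE00.IN` is the indicator code for life expectancy at birth. The function names are hypothetical helpers, so check the official API documentation before relying on any of this.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: base URL of the World Bank's public Indicators API.
WORLD_BANK_API = "https://api.worldbank.org/v2"


def build_indicator_url(country, indicator, start_year, end_year, per_page=100):
    """Build a request URL for one indicator and one country code."""
    query = urlencode(
        {
            "format": "json",
            "date": f"{start_year}:{end_year}",
            "per_page": per_page,
        },
        safe=":",  # keep the year range readable as 2015:2020
    )
    return f"{WORLD_BANK_API}/country/{country}/indicator/{indicator}?{query}"


def parse_indicator_payload(payload):
    """Return {year: value} from a decoded response, skipping missing values.

    Assumes the response is a two-element list: paging metadata, then rows
    that each carry a "date" and a "value" field.
    """
    if not isinstance(payload, list) or len(payload) < 2 or payload[1] is None:
        return {}
    return {
        int(row["date"]): row["value"]
        for row in payload[1]
        if row.get("value") is not None
    }


def fetch_indicator(country, indicator, start_year, end_year):
    """Download and parse one indicator series (requires network access)."""
    url = build_indicator_url(country, indicator, start_year, end_year)
    with urlopen(url, timeout=30) as response:
        return parse_indicator_payload(json.load(response))
```

For example, `fetch_indicator("BR", "SP.DYN.LE00.IN", 2015, 2020)` would request the series for Brazil and return a dictionary keyed by year. Keeping URL construction and parsing separate from the download makes it easy to check both against a saved response without touching the network.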
Here are some health and fitness APIs, along with the data each one offers:
1. Fitbit API: - Data Offered: The Fitbit API allows access to data from Fitbit wearable devices and the Fitbit app. It provides information on daily activity levels (steps, distance, active minutes), heart rate data, sleep duration and patterns, calories burned, and weight measurements. - Website: https://dev.fitbit.com/build/reference/web-api/
2. Apple HealthKit API: - Data Offered: The HealthKit API is available for developers to integrate with Apple's Health app on iOS devices. It offers a wide range of health and fitness data, including steps, heart rate, nutrition, sleep, reproductive health, and data from third-party health apps connected to HealthKit. - Website: https://developer.apple.com/documentation/healthkit
3. Google Fit API: - Data Offered: The Google Fit API provides access to health and fitness data collected by Google Fit on Android devices and compatible apps. It includes data on daily activity, step count, distance, heart rate, sleep, and weight. - Website: https://developers.google.com/fit/rest
4. MyFitnessPal API: - Data Offered: The MyFitnessPal API allows access to nutrition and exercise data from the MyFitnessPal app. It offers information on food intake (calories, nutrients), exercise log, and weight. - Website: https://myfitnesspalapi.com/docs/about/
5. OpenFDA API: - Data Offered: The OpenFDA API provides access to a wide range of health-related data, including drug adverse event reports, drug labeling information, medical device recalls, and food-related data, collected by the U.S. Food and Drug Administration (FDA). - Website: https://open.fda.gov/apis/
6. Nutritionix API: - Data Offered: The Nutritionix API offers access to a large database of nutrition data, including information on food items, their nutrient content, calories, fat, carbohydrates, and other nutrients. - Website: https://www.nutritionix.com/business/api
7. COVID-19 APIs: - Data Offered: Various APIs related to COVID-19 provide real-time data on the number of cases, deaths, and recoveries worldwide, as well as data on vaccine distribution and administration.
Please note that the availability and data offered by these APIs may change over time, and some APIs may require authentication or subscription for access to certain data. Before using any API, make sure to review the terms of use and data access policies provided by the respective API providers.
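To give a feel for what working with one of these APIs looks like, here is a minimal sketch against the OpenFDA drug adverse event endpoint. It assumes the endpoint `https://api.fda.gov/drug/event.json`, the searchable field `patient.drug.medicinalproduct`, and the countable field `patient.reaction.reactionmeddrapt`. The helper names are hypothetical, and the field names should be verified against the OpenFDA documentation linked above.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: OpenFDA endpoint for drug adverse event reports.
OPENFDA_DRUG_EVENT = "https://api.fda.gov/drug/event.json"


def build_reaction_count_url(drug_name, limit=10):
    """Build a URL that counts reported reactions for one drug name."""
    query = urlencode(
        {
            # Assumed field names; see the OpenFDA field reference.
            "search": f'patient.drug.medicinalproduct:"{drug_name}"',
            "count": "patient.reaction.reactionmeddrapt.exact",
            "limit": limit,
        }
    )
    return f"{OPENFDA_DRUG_EVENT}?{query}"


def top_reactions(payload):
    """Return (term, count) pairs from a decoded count-query response.

    Assumes each entry in "results" has a "term" and a "count" field.
    """
    return [(row["term"], row["count"]) for row in payload.get("results", [])]


def fetch_top_reactions(drug_name, limit=10):
    """Download and parse the reaction counts (requires network access).

    urlopen raises urllib.error.HTTPError on non-2xx responses, so callers
    may want to catch that for drugs with no matching reports.
    """
    with urlopen(build_reaction_count_url(drug_name, limit), timeout=30) as resp:
        return top_reactions(json.load(resp))
```

A call such as `fetch_top_reactions("aspirin", limit=5)` would return the most frequently reported reaction terms with their report counts. Light use like this may work without an API key, but that is an assumption to confirm in the provider's terms of use.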
Here are some public health data "hidden gems": sources that might not be widely known but offer valuable information for public health research and analysis:
1. World Health Organization (WHO) Global Health Observatory (GHO): The WHO's GHO provides a wealth of global health data, including statistics on diseases, health systems, mortality, and risk factors. It offers a comprehensive view of global health trends and challenges. - Website: https://www.who.int/data/gho
2. Global Health Data Exchange (GHDx): Managed by the Institute for Health Metrics and Evaluation (IHME), GHDx offers a vast collection of global health data, including mortality rates, disease burden, risk factors, and health financing indicators. - Website: http://ghdx.healthdata.org/
3. HealthData.gov: HealthData.gov is the U.S. government's data portal specifically focused on health-related datasets. It offers datasets from various federal agencies, providing valuable insights into public health issues in the United States. - Website: https://www.healthdata.gov/
4. European Centre for Disease Prevention and Control (ECDC) - Health Topics: ECDC offers data and statistics on infectious diseases in Europe, along with risk assessments and epidemiological reports. It is a valuable resource for tracking disease outbreaks and patterns in the European region. - Website: https://www.ecdc.europa.eu/en
5. National Health and Nutrition Examination Survey (NHANES): Conducted by the U.S. Centers for Disease Control and Prevention (CDC), NHANES is a series of surveys that provide data on the health and nutrition status of the U.S. population. It offers detailed information on various health indicators. - Website: https://www.cdc.gov/nchs/nhanes/
6. Global Burden of Disease Study (GBD): The GBD study provides comprehensive data on the impact of diseases and injuries worldwide. It offers insights into global health trends, burden of diseases, and risk factors. - Website: http://ghdx.healthdata.org/gbd-results-tool
7. National Cancer Institute's Surveillance, Epidemiology, and End Results (SEER) Program: SEER provides data on cancer incidence, prevalence, and survival in the United States. It is a valuable resource for cancer-related research and analysis. - Website: https://seer.cancer.gov/
8. National Violent Death Reporting System (NVDRS): Managed by the U.S. CDC, NVDRS provides data on violent deaths, including suicides and homicides, in the United States. It offers detailed information on circumstances and contributing factors. - Website: https://www.cdc.gov/violenceprevention/datasources/nvdrs/
9. Institute for Health Metrics and Evaluation (IHME) Data Visualization: IHME offers a variety of interactive data visualizations and tools that provide valuable insights into global health metrics and trends. - Website: http://www.healthdata.org/results/data-visualizations
10. Human Mortality Database: The Human Mortality Database offers mortality and population data from various countries, facilitating cross-national and historical comparisons of mortality trends. - Website: https://www.mortality.org/
11. County Health Rankings & Roadmaps: This resource provides county-level data on various health factors and outcomes in the United States. It is useful for understanding health disparities at the local level. - Website: https://www.countyhealthrankings.org/
Remember that data availability and sources may change over time, so it's essential to check the websites of these resources for the most up-to-date information and access to the data. These hidden gems can offer researchers, policymakers, and public health professionals valuable insights to inform evidence-based decision-making and initiatives in the field of public health.