Challenge Overview
Welcome to the PseudoVet - Scrape Morbidity Profiles for War Eras Challenge.
Overview
PseudoVet is an automated patient data fabrication engine which provides a set of active synthetic patients and clinical data that can be used for healthcare software development. Development against real patient data unnecessarily exposes patient health information (PHI) and personally identifiable information (PII) and cannot be used by developers outside of the VA network. However, fully functional, realistic datasets can be used safely in development, testing, training and other non-production environments in compliance with the Health Information Technology for Economic and Clinical Health Act (HITECH Act) and other regulations. Development against current fabricated data is not useful because the data sets are outdated, which requires development teams to spend time developing data sets to use in lieu of writing code or require licenses and cannot be shared.
Challenge Requirements
This challenge is a similar data searching/ scraping challenge similar to the PseudoVet - Scrape US Veteran Health Records Data Challenge that we had launched last month.
In this challenge, we want to focus on finding/ scraping data related for morbidity profiles for US veterans for the following war eras
-
World War II
-
Korean Conflict
-
Vietnam War
-
Persian Gulf War
The key requirements are listed below and you will need to search public sources on the Internet/ .gov sites to find this data.
Some good starting points are mentioned below (but feel free to pick other sites)
-
Official list of War Eras: https://fas.org/sgp/crs/natsec/RS21405.pdf
-
Veteran Data
-
https://www.data.va.gov/
-
Morbidity data
We seek the following detail:
-
Types of morbidities affecting veterans for each war era
-
ICD-10 Code of morbidity
-
% of veterans that were affected with each morbidity during specific war era
You can lookup ICD-10 codes here: https://www.nuemd.com/icd-10/codes
See this link for more details on the ICD10 format.
Example file (Korean War): https://docs.google.com/spreadsheets/d/1LHHcMVOM_uw0xANCAhkAnLLq1Cunzy0or63DJKlDzO0/edit?usp=sharing
There is no coding or implementation involved in this challenge
Final Submission Guidelines
Final Submission Guidelines
-
Zipped datasets in CSV format (double quote enclosed text) for each war era. (4 files total)
-
Table of contents listing references where data was obtained. Submissions which do not list valid references will be disqualified.
Review Criteria
-
The co-pilot and PM will review the submissions for data authenticity as well as the volume of dataset provided. The submission with most credible data as per the above spec will be chosen as winners. All submissions will be judged on a scale of 1-10. There will be no appeals or appeals response.