Skip to content

Instantly share code, notes, and snippets.

@nedaresa
Forked from gajomi/instructions.txt
Created March 25, 2020 04:14
Show Gist options
  • Save nedaresa/4f40ee75b398a340d2e7f2cbd6862edc to your computer and use it in GitHub Desktop.
Save nedaresa/4f40ee75b398a340d2e7f2cbd6862edc to your computer and use it in GitHub Desktop.
Healthcare data engineering with pandas practice problem
Instructions:
(1) Load ICD category code descriptions into dataframe. Do not store the file on disk.
url: https://github.com/kamillamagna/ICD-10-CSV/blob/master/categories.csv?raw=true
(2) Load ICD block descriptions into a dataframe. Do no store any files to disk
url: https://www.aapc.com/icd-10/
hint: consider using `pd.read_html`
(3) Count all the icd10 codes characterized as diseases according to the ICD block description
(4) From the codes in step (3) isolate though refering to 'viral' or 'virus' and concatenate, separating by semicolon
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment