@gajomi
Created March 25, 2020 04:05
Healthcare data engineering with pandas practice problem
Instructions:
(1) Load ICD category code descriptions into a dataframe. Do not store the file on disk.
url: https://github.com/kamillamagna/ICD-10-CSV/blob/master/categories.csv?raw=true
(2) Load ICD block descriptions into a dataframe. Do not store any files on disk.
url: https://www.aapc.com/icd-10/
hint: consider using `pd.read_html`
(3) Count all the ICD-10 codes characterized as diseases according to the ICD block descriptions
(4) From the codes in step (3), isolate those referring to 'viral' or 'virus' and concatenate them, separated by semicolons
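
One possible approach to steps (1) through (4) is sketched below. It is not a reference solution and rests on several assumptions that should be checked against the live data: that categories.csv has no header row and two columns (code, description); that the first table returned by `pd.read_html` for the AAPC page holds a code range such as "A00-B99" in its first column and the block description in its second; that "characterized as diseases" means the block description contains the word "disease"; and that step (4) asks for the matching codes (not their descriptions) to be joined. All function and column names are hypothetical.

```python
import pandas as pd

CATEGORIES_URL = "https://github.com/kamillamagna/ICD-10-CSV/blob/master/categories.csv?raw=true"
BLOCKS_URL = "https://www.aapc.com/icd-10/"


def load_categories(source):
    # Step (1): read straight from the URL, so nothing is written to disk.
    # Assumption: no header row, columns are (code, description).
    return pd.read_csv(source, header=None, names=["code", "description"], dtype=str)


def load_blocks(source):
    # Step (2): pd.read_html returns a list of DataFrames, one per <table>.
    # Assumption: the first table has the code range and description
    # in its first two columns.
    table = pd.read_html(source)[0].iloc[:, :2]
    table.columns = ["range", "block_description"]
    return table.astype(str)


def split_range(blocks):
    # Turn "A00-B99" into start="A00", end="B99"; rows that do not look
    # like a range are dropped.
    pattern = r"^\s*([A-Z][0-9][0-9A-Z])\s*[-–]\s*([A-Z][0-9][0-9A-Z])"
    bounds = blocks["range"].str.extract(pattern)
    return blocks.assign(start=bounds[0], end=bounds[1]).dropna(subset=["start", "end"])


def tag_with_block(categories, blocks):
    # Pair every category with every block, then keep the pairs where the
    # three-character category falls inside the block's range. Plain string
    # comparison is enough because the codes share a fixed-width layout.
    pairs = categories.merge(blocks, how="cross")
    key = pairs["code"].str[:3]
    return pairs[(key >= pairs["start"]) & (key <= pairs["end"])]


def disease_codes(tagged):
    # Step (3): keep codes whose block description mentions "disease".
    mask = tagged["block_description"].str.contains("disease", case=False, na=False)
    return tagged[mask]


def viral_codes_joined(diseases):
    # Step (4): keep codes whose own description mentions viral/virus,
    # then join the codes with semicolons.
    mask = diseases["description"].str.contains(r"viral|virus", case=False, na=False)
    return ";".join(diseases.loc[mask, "code"])


def main():
    categories = load_categories(CATEGORIES_URL)
    blocks = split_range(load_blocks(BLOCKS_URL))
    diseases = disease_codes(tag_with_block(categories, blocks))
    print(len(diseases))
    print(viral_codes_joined(diseases))
```

Calling `main()` runs all four steps against the live URLs. `pd.read_html` needs an HTML parser such as lxml installed. If the site rejects the default request, fetching the page text separately and passing it to `pd.read_html` wrapped in `io.StringIO` still keeps everything in memory.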