steps to integrate:
- parse the html
- get the root, shake it
- emflatten it with a temporary array
- make a content string; for each flattened item, add it to the content (do fancy formatting/whitespace removal if you want); do other stuff to the content string if you want (eg trimming, adding title, etc)