Creating a system for identifying keywords in textual humanitarian data
This is a brief crash course for creating a system to categorize key words in sets of data with known classifications of segments of text. Although it is written in a humanitarian context, it can be flexibly used elsewhere.
Overview: What's in the data?
A sample dataset could include the following format where it is normalized: