We have an uncleansed raw dataset containing EPA Water System Violations. These violations are associated with Water Districts or Water Systems who supply our everyday drinking water.
In a real world situation, you'd be responsible for cleansing and wrangling the dataset into a readable datamodel. However, for this exercise, you will be delivered the sampled set in a pre-cleansed SQL database environment