Some Google Refine snippets and resources to help with cleaning PVA data
Re-add the leading 0 to the 7-digit property numbers
- Numeric facet on
property
and filter for 7-digit values only - Run a transform on the faceted
property
data. Concatenate a 0 to the front of the 7-digit values'0' + value
- Remove the facet.
Get rid of dollar signs ($) and commas (,)
value.replace('$','').replace(',','')
Get rid of tilde-A (Ã)
value.replace('Ã','')
Concatenating across columns. This can be useful for creating unique IDs when there are multiple records for a property
cells['column name'].value + cells['other column name'].value