Skip to content

Instantly share code, notes, and snippets.

@tillahoffmann
Last active November 7, 2022 14:25
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save tillahoffmann/fc12349c02950e43a9edefe5907eb62c to your computer and use it in GitHub Desktop.
Save tillahoffmann/fc12349c02950e43a9edefe5907eb62c to your computer and use it in GitHub Desktop.

Data accompanying the publication "Wastewater Catchment Areas in Great Britain"

Code to reproduce the results can be found in the accompanying repository. The resultant data comprise

  • catchments_consolidated.*: geospatial data as a shapefile in the British National Grid projection, including auxiliary files. Each feature has the following attributes:

    • identifier: a unique identifier for the catchment based on its geometry. These identifiers are stable across different versions of the data provided the geometry of the associated catchment remains unchanged.
    • company: the water company that contributed the feature.
    • name: the name of the catchment as provided by the water company.
    • comment (optional): an annotation providing additional information about the catchment, e.g. overlaps with other catchments.
  • waterbase_consolidated.csv: wastewater treatment plant metadata reported under the UWWTD between 2006 and 2018. See here for the original data. The columns comprise:

    • uwwState: whether the treatment work is active or inactive.
    • rptMStateKey: key of the member state (should be UK or GB for all entries).
    • uwwCode: unique treatment works identifier in the UWWTD database.
    • uwwName: name of the treatment works.
    • uwwLatitude and uwwLongitude: GPS coordinates of the treatment works in degrees.
    • uwwLoadEnteringUWWTP: actual load entering the treatment works measured in BOD person equivalents, corresponding to an "organic biodegradable load having a five-day biochemical oxygen demand (BOD5) of 60 g of oxygen per day".
    • uwwCapacity: potential treatment capacity measured in BOD person equivalents.
    • version: the reporting version (incremented with each reporting cycling, corresponding to two years).
    • year: the reporting year.

    Note that there are some data quality issues, e.g. treatment works UKENNE_YW_TP000055 and UKENNE_YW_TP000067 are both named Doncaster (Bentley) in 2006.

  • waterbase_catchment_lookup.csv: lookup table to walk between catchments and treatment works. The columns comprise:

    • identifier and name: catchment identifier and name as used in catchments_consolidated.*.
    • uwwCode and uwwName: treatment works identifier and name as used in waterbase_consolidated.csv.
    • distance: distance between the catchment and treatment works in British National Grid projection (approximately metres).
  • lsoa_catchment_lookup.csv: lookup table to walk between catchments and Lower Layer Super Output Areas (LSOAs). The columns comprise:

    • identifier: catchment identifier as used in catchments_consolidated.*.
    • LSOA11CD: LSOA identifier as used in the 2011 census.
    • intersection_area: area of the intersection between the catchment and LSOA in British National Grid projection (approximately square metres).
This file has been truncated, but you can view the full file.
View raw

(Sorry about that, but we can’t show files that are this big right now.)

View raw

(Sorry about that, but we can’t show files that are this big right now.)

View raw

(Sorry about that, but we can’t show files that are this big right now.)

View raw

(Sorry about that, but we can’t show files that are this big right now.)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment