Skip to content

Instantly share code, notes, and snippets.

View Kahdam's full-sized avatar

Arthur ROGER Kahdam

View GitHub Profile
@Kahdam
Kahdam / italy_listing_clean.sql
Created October 16, 2025 11:54
Cleaning the raw csv given for the assessement of HousingAnywhere
CREATE OR REPLACE VIEW `housing-anywhere-studapart.Ha_data_assessment.italy_listing_clean` AS
WITH src AS (
SELECT
NULLIF(TRIM(REGEXP_REPLACE(CAST(city AS STRING), '^(\"|\')+|(\"|\')+$', '')), '') AS city_raw,
NULLIF(TRIM(REGEXP_REPLACE(CAST(category AS STRING), '^(\"|\')+|(\"|\')+$', '')), '') AS category_raw,
LOWER(NULLIF(TRIM(CAST(country_code AS STRING)), '')) AS country_code_raw,
NULLIF(TRIM(REGEXP_REPLACE(CAST(furnished AS STRING), '^(\"|\')+|(\"|\')+$', '')), '') AS furnished_raw,
NULLIF(TRIM(REGEXP_REPLACE(CAST(registration_possible AS STRING), '^(\"|\')+|(\"|\')+$', '')), '') AS registration_possible_raw,
NULLIF(TRIM(REGEXP_REPLACE(CAST(washing_machine AS STRING), '^(\"|\')+|(\"|\')+$', '')), '') AS washing_machine_raw,
NULLIF(TRIM(REGEXP_REPLACE(CAST(tv AS STRING), '^(\"|\')+|(\"|\')+$', '')), '') AS tv_raw,