Skip to content

Instantly share code, notes, and snippets.

@lfy79001
Created May 28, 2024 03:28
Show Gist options
  • Save lfy79001/95d57c0b788d63985784ef58c90430b3 to your computer and use it in GitHub Desktop.
Save lfy79001/95d57c0b788d63985784ef58c90430b3 to your computer and use it in GitHub Desktop.
SELECT
*
FROM (
SELECT
reference_name,
COUNT(reference_name) / r.length AS variant_density,
COUNT(reference_name) AS variant_count,
r.length AS reference_length
FROM
`bigquery-public-data.genomics_cannabis.MNPR01_201703` v,
`bigquery-public-data.genomics_cannabis.MNPR01_reference_201703` r
WHERE
v.reference_name = r.name
AND EXISTS (
SELECT
1
FROM
UNNEST(v.call) AS call
WHERE
EXISTS (
SELECT
1
FROM
UNNEST(call.genotype) AS gt
WHERE
gt > 0))
GROUP BY
reference_name,
r.length ) AS d
ORDER BY
variant_density DESC;
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment