Skip to content

Instantly share code, notes, and snippets.

@matthew-brett
Created August 28, 2020 12:04
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save matthew-brett/0a350f2a8d863f0bbebe69ab9fd6f294 to your computer and use it in GitHub Desktop.
Save matthew-brett/0a350f2a8d863f0bbebe69ab9fd6f294 to your computer and use it in GitHub Desktop.
select pip_version, distro_name, distro_version, (download_count / total * 100) as percent
from (
SELECT REGEXP_REPLACE(REGEXP_REPLACE(REGEXP_REPLACE(LOWER(details.distro.name), " (gnu/)?linux", ""), "^red.?hat.*", "redhat"), "os x","macos") AS distro_name,
details.distro.version as distro_version,
details.installer.version as pip_version,
COUNT(*) AS download_count,
SUM(COUNT(*)) OVER () AS total
FROM `the-psf.pypi.downloads*`
WHERE
details.installer.version IS NOT NULL
AND details.distro.name IS NOT NULL
AND details.distro.name != 'os x'
-- Only query the last 30 days of history
AND _TABLE_SUFFIX
BETWEEN FORMAT_DATE(
'%Y%m%d', DATE_SUB(CURRENT_DATE(), INTERVAL 30 DAY))
AND FORMAT_DATE('%Y%m%d', CURRENT_DATE())
GROUP BY
pip_version, distro_name, distro_version
ORDER BY
pip_version, download_count DESC
)
WHERE download_count / total * 100 > 0.5
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment