Roy Duineveld royduin

<?php
/*
Plugin Name: Import demo
Plugin URI: http://royduineveld.nl
Description: A demo import for my blog
Version: 1.0
Author: Roy Duineveld
Author URI: http://royduineveld.nl
*/
royduin / crawl.php
Last active September 24, 2016 18:05
Search for public git repositories in the Alexa 1 million top ranked websites list, see: https://royduineveld.nl/hacking-public-git-repositories/
<?php
// Download the list from: http://s3.amazonaws.com/alexa-static/top-1m.csv.zip
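// Read the unzipped list into memory; each row has the form "rank,domain".
$csv = file_get_contents('top-1m.csv');
// Split on any line ending (\n or \r\n) and drop the trailing blank line,
// which would otherwise trigger an undefined-offset notice below.
$lines = preg_split('/\R/', trim($csv));
$counter = 1;
foreach ($lines as $line) {
    // The domain is the second CSV column.
    $url = explode(',', $line)[1];
    echo $counter . ' ' . $url . PHP_EOL;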