Skip to content

Instantly share code, notes, and snippets.

@pistocop
Last active February 11, 2021 19:36
Show Gist options
  • Save pistocop/d30b4b49f716875085b0b78c1bae8d79 to your computer and use it in GitHub Desktop.
Save pistocop/d30b4b49f716875085b0b78c1bae8d79 to your computer and use it in GitHub Desktop.
[medium][subreddit-downloader]example.sh
# Init
$ git clone https://github.com/pistocop/subreddit-comments-dl.git
$ cd subreddit-comments-dl
$ pip install -r requirements.txt
# Download the AskReddit comments of the last 30 submissions
$ python src/subreddit_downloader.py AskReddit --batch-size 10 --laps 3 --reddit-id <reddit_id> --reddit-secret <reddit_secret> --reddit-username <reddit_username>
2021-02-11 19:54:44.175 | INFO | __main__:main:241 - Start download: UTC range: [None, None], direction: `before`, batch size: 10, total submissions to fetch: 30
2021-02-11 19:54:49.769 | INFO | codetiming._timer:stop:57 - Lap 0/3 completed in 0.1m | [new/tot]: 0/0
2021-02-11 19:54:54.583 | INFO | codetiming._timer:stop:57 - Lap 1/3 completed in 0.1m | [new/tot]: 2/2
2021-02-11 19:57:10.515 | INFO | codetiming._timer:stop:57 - Lap 2/3 completed in 2.3m | [new/tot]: 11/13
2021-02-11 19:57:10.525 | INFO | __main__:main:287 - Stop download: lap 3/3 [total]: 51
2021-02-11 19:57:10.526 | INFO | codetiming._timer:stop:57 - Total downloading time: 2.5m
# Download the News comments after 1 January 2021
$ python src/subreddit_downloader.py News --batch-size 512 --laps 3 --reddit-id <reddit_id> --reddit-secret <reddit_secret> --reddit-username <reddit_username> --utc-after 1609459201
2021-02-11 20:28:18.428 | INFO | __main__:main:241 - Start download: UTC range: [None, 1609459201], direction: `after`, batch size: 512, total submissions to fetch: 1536
2021-02-11 20:31:09.061 | INFO | codetiming._timer:stop:57 - Lap 0/3 completed in 2.8m | [new/tot]: 0/0
2021-02-11 20:32:05.762 | INFO | codetiming._timer:stop:57 - Lap 1/3 completed in 0.9m | [new/tot]: 883/883
2021-02-11 20:32:54.005 | INFO | codetiming._timer:stop:57 - Lap 2/3 completed in 0.8m | [new/tot]: 1133/2016
2021-02-11 20:32:54.008 | INFO | __main__:main:287 - Stop download: lap 3/3 [total]: 2696
2021-02-11 20:32:54.009 | INFO | codetiming._timer:stop:57 - Total downloading time: 4.6m
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment