urllib2 vs urllib3 vs requests; updated for Python 3
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#!/usr/bin/env python3 | |
import time | |
# REMEMBER TO CHANGE THE VARIABLES BELOW BEFORE YOU RUN THE SCRIPT | |
_URL = 'http://localhost/tmp/derp.html' | |
_NUMBER = 1000 | |
def test_urllib2(): | |
# Per page-top note from https://python.readthedocs.io/en/v2.7.2/library/urllib2.html | |
import urllib.request | |
try: | |
response = urllib.request.urlopen(_URL) | |
except (urllib.request.HTTPError, e): | |
response = e | |
response.code | |
return response.read() | |
def test_urllib3(): | |
import urllib3 | |
http = urllib3.PoolManager() | |
response = http.request('GET', _URL) | |
response.status | |
return response.data | |
def test_requests(): | |
import requests | |
response = requests.get(_URL) | |
response.status_code | |
return response.text | |
if __name__ == '__main__': | |
from timeit import Timer | |
t_urllib2 = Timer("test_urllib2()", "from __main__ import test_urllib2") | |
print('{0} urllib_request: {1}'.format(_NUMBER, t_urllib2.timeit(number=_NUMBER))) | |
t_urllib3 = Timer("test_urllib3()", "from __main__ import test_urllib3") | |
print('{0} urllib3: {1}'.format(_NUMBER, t_urllib3.timeit(number=_NUMBER))) | |
t_requests = Timer("test_requests()", "from __main__ import test_requests") | |
print('{0} requests: {1}'.format(_NUMBER, t_requests.timeit(number=_NUMBER))) |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Found this script as I needed to figure out which HTTP library should I use, and I figured I want to test it out for fun. The results for https://en.wiktionary.org/wiki/abdi#Malay are run under the following conditions:
Results
1 execution:
100 executions:
500 executions:
1000 executions:
I don't realistically need anything beyond 1000 executions as in my use case, it will almost always be a single request just to check if the entry exists.
Regardless, it's clear that
requests
wins out by a wider margin the more requests/executions you make. Coupled with its more user-friendly API, this is what I'll be going with.