Kimikazu Kato hamukazu

hamukazu / Reuters.py
Last active June 1, 2017 03:35 — forked from herrfz/Reuters.py
Reuters-21578 keyword extraction
# Reuters-21578 dataset downloader and parser
#
# Author: Eustache Diemert <eustache@diemert.fr>
# http://scikit-learn.org/stable/auto_examples/applications/plot_out_of_core_classification.html
#
# Modified by @herrfz to build a pandas DataFrame from the original SGML
# License: BSD 3 clause
from __future__ import print_function