Skip to content

Instantly share code, notes, and snippets.

View yubessy's full-sized avatar

Shotaro Tanaka yubessy

View GitHub Profile
@yubessy
yubessy / myhtmlclean.py
Created June 29, 2014 03:03
HTMLをクリーンアップ
# -*- coding: utf-8 -*-
# stdlib
import re
LEFT_SPACES = re.compile(r'\s+<')
RIGHT_SPACES = re.compile(r'>\s+')
SCRIPT_TAG = re.compile(r'<script[^>]*>.*?</script>')
COMMENT = re.compile(r'<!--[\s\S]*?-->')
# -*- coding: utf-8 -*-
# third-party lib
import networkx
def blank_split(text):
u"""
半角スペースによる単語分割
"""
return text.split()
@yubessy
yubessy / myshorten.py
Created June 29, 2014 03:16
長い文字列を短く表示
# -*- coding: utf-8 -*-
def shorten(s, length=80):
u"""
長い文字列を"abraca ... tabra"のように短く表示
"""
if len(s) < length:
return s
else:
if isinstance(s, unicode):
@yubessy
yubessy / mywiki.py
Created June 29, 2014 03:19
MediaWIki APIを使って項目の別名を取得したりなど
#-*- encoding: utf-8 -*-
# thirdlib
import requests
BASE_URL = "http://en.wikipedia.org/w/api.php"
BLLIMIT = 250
def get_regular_name(q):
@yubessy
yubessy / Gemfile
Last active August 29, 2015 14:04
LESSやらSassやらを使わずにRailsにBootstrapを導入 ref: http://qiita.com/yubessy/items/d01a1e4ea54741331942
gem 'twitter-bootstrap-rails', :git => 'git://github.com/seyhunak/twitter-bootstrap-rails.git', :branch => 'bootstrap3'
@yubessy
yubessy / Python3
Last active May 14, 2019 11:34
Python3の...(Ellipsisオブジェクト)について ref: https://qiita.com/yubessy/items/cc1ca4dbc3161f84285e
>>> str(Ellipsis)
'Ellipsis'
>>> str(pass)
File "<stdin>", line 1
str(pass)
^
SyntaxError: invalid syntax
@yubessy
yubessy / インストール
Created January 16, 2015 10:13
Pythonで正規表現にマッチする文字列をランダムに生成する ref: http://qiita.com/yubessy/items/6df954f3ad1806c17fdd
$ pip install rstr
@yubessy
yubessy / file0.txt
Last active August 29, 2015 14:13
あまり知られていないPythonの言語仕様(Python3.4以降対応) ref: http://qiita.com/yubessy/items/bfcce577e2266ce86641
>>> c1 = 1 + 1j
>>> c2 = 1 - 2j
>>> c1 + c2
(2-1j)
>>> c1 * c2
(3-1j)
>>> c1 / c2
(-0.2+0.6j)
@yubessy
yubessy / sqlite3db.py
Created July 9, 2015 08:32
Stupid SQLite3 handler
import re
import sqlite3
class SQLite3DB:
_IDENTIFIER_RE = re.compile(r'^[a-zA-Z_]\w*$')
def __init__(self, filename):
self._con = sqlite3.connect(filename, isolation_level=None)
self._con.row_factory = sqlite3.Row
@yubessy
yubessy / nyt_example.yaml
Last active October 23, 2015 08:34
The New York Times Annotated Corpus の各フィールドのデータ例
identifier:
guid: "129961"
fileName: "0129961.xml"
url: "http://query.nytimes.com/gst/fullpage.html?res=940DE4D9103CF930A15750C0A96E948260"
alternativeURL:
date:
publicationDate: "2004-09-03T00:00:00Z"
correctDate: "1987-02-04T00:00:00Z"
analyzed_text:
articleAbstract: "Gov James E McGreevey, ..."