Skip to content

Instantly share code, notes, and snippets.

@mrluanma
Created September 4, 2012 14:40
Show Gist options
  • Save mrluanma/3621775 to your computer and use it in GitHub Desktop.
Save mrluanma/3621775 to your computer and use it in GitHub Desktop.
Python 登录新浪微博(requests 真的比 urllib2 强了 2^^32 倍 pip install requests)
requests==2.4.3
rsa==3.1.4
# -*- coding: utf-8 -*-
from __future__ import absolute_import, division, print_function, unicode_literals
import re
import json
import base64
import binascii
import rsa
import requests
import logging
logging.basicConfig(level=logging.DEBUG)
WBCLIENT = 'ssologin.js(v1.4.5)'
user_agent = (
'Mozilla/5.0 (Windows NT 5.1) AppleWebKit/536.11 (KHTML, like Gecko) '
'Chrome/20.0.1132.57 Safari/536.11'
)
session = requests.session()
session.headers['User-Agent'] = user_agent
def encrypt_passwd(passwd, pubkey, servertime, nonce):
key = rsa.PublicKey(int(pubkey, 16), int('10001', 16))
message = str(servertime) + '\t' + str(nonce) + '\n' + str(passwd)
passwd = rsa.encrypt(message.encode('utf-8'), key)
return binascii.b2a_hex(passwd)
def wblogin(username, password):
resp = session.get(
'http://login.sina.com.cn/sso/prelogin.php?'
'entry=sso&callback=sinaSSOController.preloginCallBack&'
'su=%s&rsakt=mod&client=%s' %
(base64.b64encode(username.encode('utf-8')), WBCLIENT)
)
pre_login_str = re.match(r'[^{]+({.+?})', resp.text).group(1)
pre_login = json.loads(pre_login_str)
pre_login = json.loads(pre_login_str)
data = {
'entry': 'weibo',
'gateway': 1,
'from': '',
'savestate': 7,
'userticket': 1,
'ssosimplelogin': 1,
'su': base64.b64encode(requests.utils.quote(username).encode('utf-8')),
'service': 'miniblog',
'servertime': pre_login['servertime'],
'nonce': pre_login['nonce'],
'vsnf': 1,
'vsnval': '',
'pwencode': 'rsa2',
'sp': encrypt_passwd(password, pre_login['pubkey'],
pre_login['servertime'], pre_login['nonce']),
'rsakv' : pre_login['rsakv'],
'encoding': 'UTF-8',
'prelt': '115',
'url': 'http://weibo.com/ajaxlogin.php?framelogin=1&callback=parent.si'
'naSSOController.feedBackUrlCallBack',
'returntype': 'META'
}
resp = session.post(
'http://login.sina.com.cn/sso/login.php?client=%s' % WBCLIENT,
data=data
)
login_url = re.search(r'replace\([\"\']([^\'\"]+)[\"\']',
resp.text).group(1)
resp = session.get(login_url)
login_str = re.match(r'[^{]+({.+?}})', resp.text).group(1)
return json.loads(login_str)
if __name__ == '__main__':
print(json.dumps(wblogin('XXXXXXXX@gmail.com', 'XXXXXXXX'), ensure_ascii=False))
# timeline
print(session.get('http://weibo.com/').text)
@dreampuf
Copy link

Nice!

@seckcoder
Copy link

貌似加密方式已经改成rsa2

@uimeet
Copy link

uimeet commented May 23, 2013

改造一下兴许能用

@mrluanma
Copy link
Author

mrluanma commented Jul 2, 2013

@seckcoder @uimeet 已更新为最新的加密方式。

@lizy1630
Copy link

谢谢!已经改成python3也成功啦~

@Logic0
Copy link

Logic0 commented Mar 8, 2014

给你32个赞!

@lincolnge
Copy link

client.statuses.upload.post(status=ctx.request['status'], pic=f)
问问 如何传入 f 的值?
从 中~
THX

@fatpo
Copy link

fatpo commented Jun 1, 2014

这个。。竟然更新这么勤快。。还。。能用。。兼职无情

@xiaolukuan
Copy link

非常感谢,不过我很想了解怎么分析它的加密过程,下次要是在更改加密算法的话,也能自己修改代码,忘不吝赐教,本人邮箱380046897@qq.com 特希望能加个企鹅赐教,企鹅:380046897

@wgchen
Copy link

wgchen commented Jun 30, 2014

我想问一下,此程序登陆成功后怎么运用它的cookie信息来抓取微博的内容?请不吝赐教

@liudragonfly
Copy link

@wgchen 可以使用cookielib来保存cookie信息,之后抓取微博内容时会重新发送cookie

@tvrcgo
Copy link

tvrcgo commented Jul 23, 2014

要验证码,怎么办

@zja711
Copy link

zja711 commented Aug 22, 2014

要如何发布微博?

@mrluanma
Copy link
Author

@wgchen requests.session 会自动使用 cookie 维护 session,wblogin 之后直接使用 session.get(URL) 访问需要抓取的新浪微博页面即可。

@findmyway
Copy link

thanks, 很有帮助.
另外,似乎第75行的正则需要改一下,
现在没法匹配到'}}'
不过问题不大

@android1122
Copy link

报错2092什么意思

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment