Skip to content

Instantly share code, notes, and snippets.

Chris Le chrisle

Block or report user

Report or block chrisle

Hide content and notifications from this user.

Learn more about blocking users

Contact Support about this user’s behavior.

Learn more about reporting abuse

Report abuse
View GitHub Profile
@chrisle
chrisle / gist:1885478
Created Feb 22, 2012
ImportXML for Google Search
View gist:1885478
=ArrayFormula(RegexReplace(RegexExtract(ImportXML("http://www.google.com/search?q=KEYWORDHERE", "//h3/a/@href"), "http.*"), "\&sa.*", ""))
@chrisle
chrisle / gist:2252209
Created Mar 30, 2012
CURL as GoogleBot 2.1
View gist:2252209
curl --user-agent "Googlebot/2.1 (+http://www.google.com/bot.html)" -v $@
View gist:2319642
curl --user-agent "Mozilla/5.0 (Windows NT 6.1; rv:12.0) Gecko/20120403211507 Firefox/12.0" http://www.google.com/search\?q\=law%20firm%20boston,%20ma
View gist:2319730
function myFunction() {
var response = UrlFetchApp.fetch("http://www.google.com/search?q=law%20firm%20boston,%20ma&num=10").getContentText();
Logger.log(response);
}
@chrisle
chrisle / gist:2785524
Created May 25, 2012
Curl as ImportXML
View gist:2785524
curl --user-agent "-" $@
@chrisle
chrisle / functions.py
Created Aug 14, 2012
Facebook Likes for Excel using DataNitro
View functions.py
""" Facebook likes for Excel using DataNitro
chrisl@seerinteractive.com
http://www.seeinteractive.com/blog/get-facebook-likes-in-excel-using-datanitro
"""
import urllib2
import json
def facebook_likes(url):
facebook_url = "https://graph.facebook.com/?ids=" + url
raw_data = urllib2.urlopen(facebook_url).read()
@chrisle
chrisle / csv_writer.rb
Created Nov 2, 2012
JSON > CSV > data_miner > database
View csv_writer.rb
# Converts JSON data into CSV and writes to a temporary CSV file
require 'ruport'
# see ruport_19.rb
require 'monkey_patches/ruport_19'
class CsvWriter
# Initialize an instance of CsvWriter
def initialize
@chrisle
chrisle / gist:4206925
Created Dec 4, 2012
newspaper to seomoz
View gist:4206925
require 'mechanize'
require 'linkscape'
agent = Mechanize.new
agent.user_agent_alias = 'Mac Safari'
# Put your state here
state = "PA"
page = agent.get "http://newsmap.mhlakhani.com/data/US-#{state}"
View capybara_with_phantom_js.rb
module CapybaraWithPhantomJs
include Capybara
# Create a new PhantomJS session in Capybara
def new_session
# Register PhantomJS (aka poltergeist) as the driver to use
Capybara.register_driver :poltergeist do |app|
Capybara::Poltergeist::Driver.new(app)
end
View escaped_fragment.rb
# Looks for the escaped fragment meta tag. If found, gets the HTML snapshot
# instead
module GoogleBotSimulator::EscapedFragment
def has_meta_fragment?
(@response.search('//meta[@name="fragment"]/@content').to_s == '!') ? true : false
end
def url_with_escaped_fragment(url)
You can’t perform that action at this time.