@jiahaoli57
Forked from korakot/selenium.py
Created September 24, 2021 12:38
Use selenium in Colab
# install chromium, its driver, and selenium
!apt update
!apt install chromium-chromedriver
!pip install selenium
# set options: headless, no sandbox, no /dev/shm usage
from selenium import webdriver
options = webdriver.ChromeOptions()
options.add_argument('--headless')
options.add_argument('--no-sandbox')
options.add_argument('--disable-dev-shm-usage')
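# Why these three flags? A minimal sketch that keeps each flag next to its
# reason. The names COLAB_CHROME_FLAGS and make_colab_options are hypothetical
# helpers for illustration; they are not part of selenium or kora.

```python
# Each Chrome flag used above, mapped to the reason it is needed in Colab.
COLAB_CHROME_FLAGS = {
    "--headless": "Colab has no display, so Chrome must run without a window",
    "--no-sandbox": "Colab runs as root, and Chrome will not start as root with the sandbox on",
    "--disable-dev-shm-usage": "write shared memory files to /tmp instead of the small /dev/shm",
}


def make_colab_options(flags=COLAB_CHROME_FLAGS):
    """Build ChromeOptions carrying every flag in `flags` (hypothetical helper)."""
    # Imported here so the table above can be inspected without selenium installed.
    from selenium import webdriver

    options = webdriver.ChromeOptions()
    for flag in flags:
        options.add_argument(flag)
    return options


# Print the flags with their reasons as a quick reference.
for flag, reason in COLAB_CHROME_FLAGS.items():
    print(f"{flag}: {reason}")
```

# Usage, assuming selenium is installed: webdriver.Chrome(options=make_colab_options())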
# open it, go to a website, and get results
wd = webdriver.Chrome(options=options)
wd.get("https://www.website.com")
print(wd.page_source) # results
# divs = wd.find_elements("css selector", "div")  # find_elements_by_css_selector was removed in Selenium 4.3
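# To pull text out of the page with a plain Selenium driver, something like
# the sketch below may help. element_texts is a hypothetical helper name; it
# relies only on the Selenium 4 call driver.find_elements(by, value), where
# the string "css selector" is the value of By.CSS_SELECTOR.

```python
def element_texts(driver, css):
    """Return the visible text of every element matching a CSS selector."""
    # "css selector" is the same locator strategy as By.CSS_SELECTOR,
    # written as a string so no extra import is needed.
    elements = driver.find_elements("css selector", css)
    return [element.text for element in elements]
```

# Usage, assuming wd is the driver opened above: element_texts(wd, "div")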
# I created my own library to make it even easier
!pip install kora -q
from kora.selenium import wd
wd.get("https://www.website.com")
print(wd.page_source) # results
# I added a few helpers
divs = wd.select("div") # CSS selector
div = divs[0]
span = div.select1("span") # returns the first result
wd # as the last line of a cell, shows a screenshot