ISTDSA_DS_Bootcamp_Project2_WebScraping_Regression_P2
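PAGE_LINKS = []

def getPageLinks(url):
    # getAndParseURL is not defined in this file; it is presumably defined
    # elsewhere in the project and returns the parsed HTML of the page at `url`.
    html = getAndParseURL(url)
    # For each canonical <link> tag found, append the URLs of pages 1 to 50.
    for i in html.findAll("link", {"rel": "canonical"}):
        for b in range(1, 51):
            PAGE_LINKS.append(url + "?page={}".format(b))
    return PAGE_LINKS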
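
Because PAGE_LINKS is a module-level list, calling getPageLinks more than once keeps appending to the same list, so later calls return earlier links as well. A minimal sketch of a variant that returns a fresh list on each call is shown below. The name `build_page_links`, the `last_page` parameter, and the example URL are hypothetical and not part of the original gist, and the sketch only builds the URLs: it does not fetch the page or check for a canonical link.

```python
def build_page_links(url, last_page=50):
    """Return the listing-page URLs for pages 1..last_page of `url`."""
    # A new list is built on every call, so repeated calls do not accumulate.
    return ["{}?page={}".format(url, page) for page in range(1, last_page + 1)]


# Hypothetical example URL, used only for illustration.
links = build_page_links("https://example.com/listings", last_page=3)
print(links)
```

Keeping the page count as a parameter also makes it easier to test on a small range before requesting all 50 pages.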