Skip to content

Instantly share code, notes, and snippets.

@Mahelita
Created April 17, 2021 17:52
Show Gist options
  • Save Mahelita/a6a934071f926a944d57ad0c6c99852d to your computer and use it in GitHub Desktop.
Save Mahelita/a6a934071f926a944d57ad0c6c99852d to your computer and use it in GitHub Desktop.
Scrape tonie tracks
Display the source blob
Display the rendered blob
Raw
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@steve8x8
Copy link

steve8x8 commented Jan 7, 2023

Unfortunately, retrieving a "series" using requests.get() for me doesn't return the same information I'd get via the browser. Example: https://tonies.com/de-de/tonies/?series=anne-kaffeekanne ("tonies.de" gets replaced by "tonies.com/de-de" and "tonies/${series}" becomes "tonies/?series=${series}") - in the browser I get 1 hit while the python code returns some random, and unrelated, stuff :(
Any suggestions what might go wrong here?

Also I'm still trying to find out what's happening in the 4th stage... I'm getting some rather bad matches

@steve8x8
Copy link

steve8x8 commented Jan 9, 2023

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment