Skip to content

Instantly share code, notes, and snippets.

@limushi
Forked from fuwiak/hh_parser.py
Created August 30, 2019 07:48
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save limushi/9a6bed6f49f1068f5978949a3aee2bef to your computer and use it in GitHub Desktop.
Save limushi/9a6bed6f49f1068f5978949a3aee2bef to your computer and use it in GitHub Desktop.
import requests
import pandas as pd
number_of_pages = 100
#number_of_ads = number_of_pages * per_page
job_title = ["'Data Analyst' and 'data scientist'"]
for job in job_title:
data=[]
for i in range(number_of_pages):
url = 'https://api.hh.ru/vacancies'
par = {'text': job, 'area':'113','per_page':'10', 'page':i}
r = requests.get(url, params=par)
e=r.json()
data.append(e)
vacancy_details = data[0]['items'][0].keys()
df = pd.DataFrame(columns= list(vacancy_details))
ind = 0
for i in range(len(data)):
for j in range(len(data[i]['items'])):
df.loc[ind] = data[i]['items'][j]
ind+=1
csv_name = job+".csv"
df.to_csv(csv_name)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment