Skip to content

Instantly share code, notes, and snippets.

@JimmyJames404
Created November 25, 2021 22:09
Show Gist options
  • Save JimmyJames404/3085f8c004d17d960d2edd7df220d559 to your computer and use it in GitHub Desktop.
Save JimmyJames404/3085f8c004d17d960d2edd7df220d559 to your computer and use it in GitHub Desktop.
  from flashgeotext.geotext import GeoText

  geotext = GeoText()


  input_text = "Senior Enterprise Mobility Engineer New York, New York, United States San Francisco, California, United States"
  x=geotext.extract(input_text=input_text, span_info=True)
  x=str(x)
  Word_list = x.split("'")

  for i in range(len(Word_list)):
      #print(Word_list[i])
      xy = Word_list[i]
      if xy in input_text:
          input_text = input_text.replace(xy, "")

  print(input_text)

Descripcion de lo necesario: Actividad actual -> Depuracion de nombres de posiciones

  • Eliminar todo despues de salto de linea
  • Eliminar todo despues de ' - ''
  • Eliminar todo despues de ' [ '
  • Eliminar todo despues de ' , '
  • Eliminar Ciudades
  • Ajustar salto de linea en webscraper....replace( '\n' , ' ')
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment