Skip to content

Instantly share code, notes, and snippets.

@hamletbatista
Created September 4, 2019 10:59
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save hamletbatista/fcef87e6cae4c3f975162a0305672852 to your computer and use it in GitHub Desktop.
Save hamletbatista/fcef87e6cae4c3f975162a0305672852 to your computer and use it in GitHub Desktop.
import re
i = len(df["Custom Extraction 1"])#.split('<img src="')
images = list()
for i, image_list in enumerate(df["Custom Extraction 1"]):
#print(i)
images.extend(df["Custom Extraction 1"][i].split('<img src="'))
#new_image_urls = re.split(r'<img .+ src="', df["Custom Extraction 1"][i])
images.extend(new_image_urls)
print(images)
#example partial output
#['',
# 'https://alpacas.com/pub/media/wysiwyg/Banner_image/blankets_2.jpg" alt="" /> ',
# 'https://alpacas.com/pub/media/wysiwyg/promo_banner/scarves-2.jpg" alt="" /> ',
# 'https://alpacas.com/pub/media/wysiwyg/promo_banner/blankets-3.jpg" alt="" /> ',
# 'https://alpacas.com/pub/media/wysiwyg/promo_banner/sweaters-2.jpg" alt="" /> ',
# 'https://alpacas.com/pub/media/wysiwyg/promo_banner/bears-3.jpg" alt="" /> ',
# 'https://alpacas.com/pub/media/wysiwyg/promo_banner/yarn-2.jpg" alt="" /> ',
# 'https://alpacas.com/pub/media/wysiwyg/promo_banner/case_for_support_cover_-2.jpg" alt="" /> ',
# 'https://alpacas.com/pub/media/wysiwyg/promo_banner/alliyma_banner_website_1.jpg" alt="" /> ',
# 'https://alpacas.com/pub/media/wysiwyg/promo_banner/two_alpacas.jpg" alt="" />',
# '',
# 'https://alpacas.com/pub/media/wysiwyg/testimonial/1._women_edited.jpg" alt="" />',
# '<img style="display: block; margin-left: auto; margin-right: auto;" src="https://alpacas.com/pub/media/wysiwyg/panel/shippingusa.jpg" alt="" /> <img style="display: block; margin-left: auto; margin-right: auto;" src="https://alpacas.com/pub/media/wysiwyg/panel/usa-shipping-3.jpg" alt="" /> <img style="display: block; margin-left: auto; margin-right: auto;" src="https://alpacas.com/pub/media/wysiwyg/panel/intl-shipping-3.jpg" alt="" />',
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment