Skip to content

Instantly share code, notes, and snippets.

@soeffing
Created June 4, 2017 22:00
Show Gist options
  • Save soeffing/3229fa39ad103fd9832c4419c04f883f to your computer and use it in GitHub Desktop.
Save soeffing/3229fa39ad103fd9832c4419c04f883f to your computer and use it in GitHub Desktop.
In [39]: intent_counts['purchase'].most_common(20)
Out[39]:
[(u'www.amazon.com', 418),
(u'www.walmart.com', 222),
(u'www.target.com', 85),
(u'www.google.com', 31),
(u'www.walgreens.com', 31),
(u'www.ebay.com', 26),
(u'www.samsclub.com', 18),
(u'www.mccormick.com', 16),
(u'www.shoprite.com', 14),
(u'www.heb.com', 11),
(u'www.cvs.com', 11),
(u'www.instacart.com', 10),
(u'www.schwans.com', 9),
(u'www.beermenus.com', 9),
(u'www.dollartree.com', 8),
(u'www.webstaurantstore.com', 7),
(u'www.thekitchn.com', 7),
(u'nuts.com', 7),
(u'books.google.com', 7),
(u'www.bettycrocker.com', 7)]
In [40]: intent_counts['information'].most_common(20)
Out[40]:
[(u'en.wikipedia.org', 215),
(u'www.dictionary.com', 146),
(u'www.merriam-webster.com', 92),
(u'www.businessdictionary.com', 81),
(u'dictionary.cambridge.org', 80),
(u'www.thefreedictionary.com', 62),
(u'www.urbandictionary.com', 37),
(u'www.webopedia.com', 29),
(u'www.computerhope.com', 24),
(u'en.oxforddictionaries.com', 22),
(u'www.investopedia.com', 19),
(u'www.youtube.com', 17),
(u'whatis.techtarget.com', 16),
(u'www.vocabulary.com', 14),
(u'www.thoughtco.com', 14),
(u'thelawdictionary.org', 10),
(u'www.thebalance.com', 10),
(u'www.psychologytoday.com', 9),
(u'www.mathsisfun.com', 9),
(u'www.webmd.com', 8)]
In [41]: intent_counts['local'].most_common(20)
Out[41]:
[(u'www.tripadvisor.com', 259),
(u'www.yelp.com', 254),
(u'www.travelmath.com', 110),
(u'www.mapquest.com', 83),
(u'www.zillow.com', 54),
(u'www.opentable.com', 43),
(u'www.walmart.com', 38),
(u'bestflyfishing.us', 36),
(u'www.hotels.com', 34),
(u'www.indeed.com', 32),
(u'www.ushospitalfinder.com', 26),
(u'www.facebook.com', 26),
(u'en.wikipedia.org', 23),
(u'churches.find-near-me.info', 22),
(u'beststateparks.com', 20),
(u'bestbrewpubs.org', 19),
(u'besthiking.info', 18),
(u'www.collegesimply.com', 16),
(u'www.hotelplanner.com', 15),
(u'bestnaturecenters.com', 14)]
In [42]: intent_counts['comparison'].most_common(20)
Out[42]:
[(u'www.google.com', 42),
(u'www.amazon.com', 32),
(u'cars.usnews.com', 21),
(u'www.techradar.com', 19),
(u'www.health.com', 14),
(u'www.lifewire.com', 14),
(u'www.pinterest.com', 14),
(u'www.cnet.com', 13),
(u'www.popularmechanics.com', 13),
(u'www.eatthis.com', 12),
(u'en.wikipedia.org', 11),
(u'www.caranddriver.com', 11),
(u'www.huffingtonpost.com', 11),
(u'foursquare.com', 10),
(u'thewirecutter.com', 9),
(u'www.timeout.com', 8),
(u'www.seriouseats.com', 8),
(u'thesweethome.com', 8),
(u'www.youtube.com', 8),
(u'www.kbb.com', 8)]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment