Skip to content

Instantly share code, notes, and snippets.

@jefftriplett
Created October 10, 2013 18:10
Show Gist options
  • Save jefftriplett/6922884 to your computer and use it in GitHub Desktop.
Save jefftriplett/6922884 to your computer and use it in GitHub Desktop.
rss feed finder for open lawrence
We can make this file beautiful and searchable if this error is corrected: No commas found in this CSV file in line 0.
http://amyklamet.com/rss.php
http://archaeology.biodiversity.ku.edu/rss.xml
http://computation.llnl.gov/rss.xml
http://dhumc.com/site/feed/
http://finalfridayslawrence.wordpress.com/feed/
http://freestateflyfishers.org/feed/
http://heritagebaptistchurch.cc/wp-content/plugins/nextgen-gallery/xml/media-rss.php
http://kansaspublicradio.org/index.php?format=feed&type=atom
http://kansaszencenter.org/?feed=rss
http://lawrenceartscenter.org/comments/feed/
http://lawrencebrewers.org/wp/?feed=rss2
http://lawrencecommunitynews.com/geeklog-1.7.2/public_html/backend/lcn_rss2_0.xml
http://lawrencekids.net/comments/feed/
http://lawrenceks.org/rss.xml
http://lcpj.org/lcpj/?q=node/feed
http://naturalhistory.ku.edu/rss.xml
http://prairiemoon.org/?feed=rss2
http://streetphoto.com/feed/
http://thegranada.com/home/feed/
http://wickedbroadband.blogspot.com/feeds/posts/default
http://www.1863lawrence.com/comments/feed/
http://www.1inchrist.org/rss.xml
http://www.basehorinfo.com/rss/headlines/
http://www.cccparish.org/controls/cms_v2/components/rss/rss.aspx?sid=270&gid=1&calcid=2478&page_id=408
http://www.circlesranch.com/comments/feed/
http://www.fimel.org/comments/feed/
http://www.gagemgmt.com/feed/
http://www.hedgesrealty.com/rss2_0/pageReport/updated
http://www.hitecollision.com/feed
http://www.hongmark.com/wushu/index.xml
http://www.ittc.ku.edu/rss/
http://www.kanren.net/index.php?format=feed&type=rss
http://www.lasl.org/Feeds/News/?org=lasl.org
http://www.lawrenceartguild.org/rss.xml
http://www.lawrencehall.org/rss.xml
http://www.lawrenceks.org/rss.xml
http://www.lawrencetransit.org/rss.xml
http://www.lfmchurch.org/rss.xml
http://www.mattneedham.com/featured.atom
http://www.medchem.ku.edu/rss.xml
http://www.news.ku.edu/headlines.xml
http://www.pachamamas.com/home/feed/
http://www.pcitraining.edu/rss.xml
http://www.rilinglaw.com/comments/feed/
http://www.saint-johns.net/feed/
http://www.saintmargaret.org/feed
http://www.thelawrencechurchofchrist.org/feeds/posts/default
http://www.tlclawrence.org/index.php?format=feed&type=atom
http://www.visitlawrence.com/rss.xml
http://www.walterlawrencetrophy.com/?feed=rss2
http://www.warrenmcelwain.com/home/feed
http://www2.kusports.com/rss/headlines/
http://www2.ljworld.com/rss/headlines/news/
"""
requires feedfinder from:
http://www.aaronsw.com/2002/feedfinder/feedfinder.py
"""
import feedfinder
def main():
urls = open('openlawrence-com-nutch/urls/seed.txt', 'r').read()
for url in urls.split():
try:
print url, "|", feedfinder.feed(url)
except Exception as e:
print '*', e
print '--'
if __name__ == '__main__':
main()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment