Failing me with the attached config.
urd:/home/reagle> rawdog -uw
Feed:
http://blog.epistemographer.com/?feed=rss2
HTTP Status: 301
New URL:
http://www.epistemographer.com/?feed=rss2
The feed has moved permanently to a new URL.
You should update its entry in your config file.
Traceback (most recent call last):
File "/home/reagle/bin/rawdog", line 31, in <module>
launch()
File "/home/reagle/bin/rawdog", line 24, in launch
sys.exit(main(sys.argv[1:]))
File "/home/reagle/bin/rawdog-2.6/rawdoglib/rawdog.py", line 1490, in main
rawdog.write(config)
File "/home/reagle/bin/rawdog-2.6/rawdoglib/rawdog.py", line 1371, in
write
if not plugins.call_hook("output_write_files", self, config, articles,
article_dates):
File "/home/reagle/bin/rawdog-2.6/rawdoglib/plugins.py", line 70, in
call_hook
if not func(*args):
File "/home/reagle/bin/rawdog-2.6/plugins/paged-output.py", line 59, in
output_write_files
rawdog.write_article(f, article, config)
File "/home/reagle/bin/rawdog-2.6/rawdoglib/rawdog.py", line 1166, in
write_article
description = detail_to_html(entry_info[key], False, config,
force_preformatted)
File "/home/reagle/bin/rawdog-2.6/rawdoglib/rawdog.py", line 151, in
detail_to_html
return sanitise_html(html, detail["base"], inline, config)
File "/home/reagle/bin/rawdog-2.6/rawdoglib/rawdog.py", line 79, in
sanitise_html
html = feedparser._resolveRelativeURIs(html, baseurl, "UTF-8")
File "/home/reagle/bin/rawdog-2.6/rawdoglib/feedparser.py", line 1633, in
_resolveRelativeURIs
p.feed(htmlSource)
File "/home/reagle/bin/rawdog-2.6/rawdoglib/feedparser.py", line 1480, in
feed
sgmllib.SGMLParser.feed(self, data)
File "/usr/lib/python2.5/sgmllib.py", line 99, in feed
self.goahead(0)
File "/usr/lib/python2.5/sgmllib.py", line 133, in goahead
k = self.parse_starttag(i)
File "/usr/lib/python2.5/sgmllib.py", line 291, in parse_starttag
self.finish_starttag(tag, attrs)
File "/usr/lib/python2.5/sgmllib.py", line 333, in finish_starttag
self.unknown_starttag(tag, attrs)
File "/home/reagle/bin/rawdog-2.6/rawdoglib/feedparser.py", line 1628, in
unknown_starttag
_BaseHTMLProcessor.unknown_starttag(self, tag, attrs)
File "/home/reagle/bin/rawdog-2.6/rawdoglib/feedparser.py", line 1497, in
unknown_starttag
value = unicode(value, self.encoding)
File "encodings/utf_8.py", line 16, in decode
UnicodeDecodeError: 'utf8' codec can't decode bytes in position 6-8: invalid
data
# Sample rawdog config file. Copy this into your ~/.rawdog/ directory, and edit
# it to suit your preferences.
# All paths in this file should be either absolute, or relative to your .rawdog
# directory.
# The maximum number of articles to show on the generated page.
# Set this to 0 for no limit.
maxarticles 200
# The maximum age in minutes of articles to show on the generated page.
# Set this to 0 for no limit.
maxage 0
# The format to write day headings in. (See "man strftime" if you want to
# change this.)
dayformat %A, %d %B
# The format to write time headings in. Use "%I:%M %p" if you prefer
# a 12-hour clock.
timeformat %H:%M
# The template file to use, or "default" to use the built-in template
# (which is probably sufficient for most users). Use "rawdog -t" to show
# the template currently in use as a starting-point for customisation.
# The following strings will be replaced in the output:
# __version__ The rawdog version in use
# __refresh__ The HTML 4 <meta http-equiv="refresh" ...> header
# __items__ The aggregated items
# __feeds__ The listing of feeds
# Note that rawdog's output is always in the UTF-8 encoding, so you should
# preserve the <meta http-equiv="Content-type" ...> header from the default
# template when building your own.
template template
# Where to write the output HTML to. You should place style.css in the same
# directory. Specify this as "-" to write the HTML to stdout.
outputfile /home/reagle/data/2web/reagle.org/joseph/rawdog/rawdog.html
# Whether to use (1) or not use (0) a <meta http-equiv="Refresh" ...> tag
# in the generated HTML to indicate that the page should be refreshed
# automatically. If this is turned on, then the page will refresh every N
# minutes, where N is the shortest feed period value specified below.
# (This works by controlling whether the default template includes
# __refresh__; if you use a custom template, __refresh__ is always
# available.)
userefresh 0
# Whether to show the list of active feeds in the generated HTML.
# (This works by controlling whether the default template includes
# __feeds__; if you use a custom template, __feeds__ is always
# available.)
showfeeds 1
# The time in seconds that rawdog will wait before considering a feed
# unreachable when trying to connect. If you're getting lots of timeout
# errors and are on a slow connection, increase this.
timeout 30
plugindirs /home/reagle/bin/rawdog-2.6/plugins/
# The feeds you want to watch, in the format "feed period url [args]".
# The period is the minimum time in minutes between updates; if less
# than period minutes have passed, "rawdog update" will skip that feed.
# Specifying a period less than 30 minutes is considered to be bad manners; it
# is suggested that you make the period as long as possible.
# Arguments are optional, and are of the form "key=value", seperated by spaces;
# possible arguments are:
# user User for HTTP basic authentication
# password Password for HTTP basic authentication
# You can specify as many feeds as you like.
#feed 180
http://secretfeed.example.com/secret.rss user=bob password=secret
# Tech/Gadgets
feed 60
http://www.advogato.org/rss/articles.xml
#feed 60
http://diveintomark.org/xml/rss.xml
feed 60
http://www.zeldman.com/feed/zeldman.xml
feed 60
http://rss.slashdot.org/Slashdot/slashdot
feed 60
http://www.pheedo.com/f/palminfocentercom
feed 60
http://www.newsforge.com/newsforge.rss
feed 60
http://www.linuxtoday.com/backend/my-netscape.rdf
feed 60
http://www.kde.org/dotkdeorg.rdf
feed 60
http://www.gizmodo.com/index.rdf
#feed 60
http://www.debianplanet.org/module.php?mod=node&op=feed # they keep updating their feed when there's no new content
feed 60
http://www.dpreview.com/news/dpr.rdf
feed 60
http://www.arstechnica.com/etc/rdf/ars.rdf
feed 60
http://www.pythonware.com/daily/rss.xml
feed 60
http://rdfweb.org/mt/foaflog/index.rdf
feed 60
http://rdfweb.org/mt/foaflog/index.rdf
#feed 60
http://microdoc-news.info/rss
feed 60
http://weblogs.mozillazine.org/hyatt/blogger_rss.xml
feed 60
http://www.kk.org/cooltools/index.xml
feed 120
http://www.periapsis.org/archives/categories/tellico.xml
feed 120
http://debian-administration.org/articles.rdf
feed 60
http://distrowatch.com/news/dw.xml
#feed 120
http://palmaddict.typepad.com/palmaddicts/index.rdf
feed 60
http://planet.ubuntulinux.org/rss10.xml
feed 60
http://digg.com/rss/index.xml
#feed 60
http://stateless.geek.nz/feed/
#feed 240
http://feeds.feedburner.com/770fan
feed 120
http://tamspalm.tamoggemon.com/feed/
#stuff
#feed 30
http://www.techbargains.com/rss.xml
feed 30
http://www.slickdeals.net/rss.php
#feed 30
http://newyork.craigslist.org/ele/index.rss
#feed 30
http://newyork.craigslist.org/sys/index.rss
#feed 30
http://newyork.craigslist.org/zip/index.rss
feed 240
http://blog.openmobl.com/feeds/posts/default?alt=rss
# Communities/Research
feed 60
http://www.livejournal.com/users/presnick/data/rss
feed 60
http://www.professional-lurker.com/index.rdf
feed 60
http://feeds.feedburner.com/TechnologyTheSocial
feed 60
http://jeays.net/wikipedia/announcements.xml
feed 120
http://successfulacademic.typepad.com/successful_academic_tips/index.rdf
feed 120
http://blog.epistemographer.com/?feed=rss2
feed 240
http://wikizine.blogspot.com/atom.xml
feed 240
http://ddc.typepad.com/foaf.rdf
feed 60
http://groups.google.com/group/social-media-research/feed/rss_v2_0_topics.xml
# Andreas?
feed 60
http://www.aaronsw.com/weblog/index.xml
feed 60
http://dannyayers.com/index.rdf
feed 60
http://joi.ito.com/index.xml
feed 60
http://ross.typepad.com/blog/index.rdf
feed 60
http://www.kuro5hin.org/backend.rdf
feed 60
http://www.corante.com/many/index.rdf
feed 60
http://terranova.blogs.com/terra_nova/index.rdf
feed 60
http://www.w3.org/2000/08/w3c-synd/home.rss
feed 60
http://www.oreillynet.com/pub/feed/20?format=rss1
#feed 60
http://www.gothamist.com/archives/news_nyc/index.php
feed 60
http://www.woostercollective.com/rss/index.xml
feed 60
http://healthhacker.org/satoroams/?feed=rss2
#Research: Digital studies and history
feed 240
http://digitalhistoryhacks.blogspot.com/rss.xml
feed 240
http://ragesossscholar.blogspot.com/atom.xml
feed 120
http://journalism.nyu.edu/pubzone/weblogs/pressthink/index.rdf
feed 120
http://michaelzimmer.blogspot.com/atom.xml
#Wikipedia
feed 240
http://www.dufoundation.org/blog/?feed=rss2
feed 240
http://www.andrewlih.com/blog/feed/
#feed 240
http://wikipediasignpost.com/feed.rss
feed 240
http://cormaggio.org/?feed=rss
feed 120
http://blog.jimmywales.com/index.php/feed/rss/
feed 120
http://nonbovine-ruminations.blogspot.com/rss.xml
feed 120
http://blog.citizendium.org/feed/rss/
feed 60
http://planet.wikimedia.org/atom.xml
# Interesting People
feed 60
http://goatee.net/,rss
feed 60
http://blogs.law.harvard.edu/philg/xml/rss.xml
feed 60
http://www.bestkungfu.com/index.rss
feed 60
http://blogs.law.harvard.edu/thinking/xml/rss.xml
#feed 60
http://blogs.law.harvard.edu/red/xml/rss.xml
#feed 60
http://www.areta.net/areta.xml
feed 60
http://www.nyu.edu/classes/siva/index.rdf
feed 60
http://saladwithsteve.com/index.rdf
feed 60
http://www.lessig.org/blog/index.rdf
feed 60
http://www.livejournal.com/users/cris/data/rss
feed 60
http://boingboing.net/rss.xml
feed 60
http://www.tbray.org/ongoing/ongoing.atom
#feed 60
http://www.livejournal.com/users/sandhawke/data/rss
feed 120
http://asweknowit.ca/channels/webnews/rss/webnews.rss
feed 60
http://anamnesiac.com/index.rdf
feed 60
http://tinywords.com/feed/
feed 60
http://impressive.net/weblogs/fogo/index.rss
#feed 60
http://www.markcrispinmiller.blogspot.com/
feed 120
http://stayfree.typepad.com/stayfree/index.rdf
feed 120
http://wm.sieheauch.de/wp-rss2.php
feed 60
http://mika.yukidoke.org/nikki/index.rdf
feed 120
http://dullicious.net/?feed=rss2
feed 120
http://headrush.typepad.com/creating_passionate_users/index.rdf
_______________________________________________
rawdog-users mailing list
rawdog-users@???
http://lists.us-lot.org/mailman/listinfo/rawdog-users