irc bot for announcing rss/atom feeds (http://hackage.haskell.org/package/rss2irc)
#2doesn't handle unicode correctly ?
Reported by nanq in #haskell just now.
The feed item for a new upload of nanq
contains:
<description><i>Added by fosskers, Wed Jan 6 17:30:05 UTC 2016.</i><p>Performs 漢字検定 (National Kanji Exam) level analysis on given Kanji.</description>
In a web browser this renders as:
<i>Added by fosskers, Wed Jan 6 17:30:05 UTC 2016.</i><p>Performs 漢字検定 (National Kanji Exam) level analysis on given Kanji.
but in IRC (erc) I saw
* hackagebot nanq 1.1.1 - Performs "W^\\232 (National Kanji Exam) level analysis on given Kanji. https://hackage.haskell.org/package/nanq-1.1.1 (fosskers)
Something like that.
there's some usage of the nefarious ByteString.Char8 pack/unpack that would better be UTF-8 Text encoding/decoding - will try to fix it
I think I fixed it, though I haven't yet tested it on a wild unicode RSS feed...
part 1 in the IRC layer (also includes connection fix) http://hub.darcs.net/claude/rss2irc/patch/b84582c105ed28dfd9fdf5def0228d5e90a1a2a7 part 2 in the Feed layer http://hub.darcs.net/claude/rss2irc/patch/a295ec681d4a9a1767281f44714528d3240e3a4c
Screenshot from glguy on #haskell yesterday, possibly this issue: http://imgur.com/a/GIZZa
Better link to claude's fixes as of today: http://hub.darcs.net/simon/rss2irc/patches