Ticket #405 (closed enhancement: fixed)

Opened 2 years ago

Last modified 5 weeks ago

Request for new news feeds

Reported by: kovidgoyal Owned by: kovidgoyal
Priority: minor Milestone:
Component: News downloading Version: trunk
Keywords: New York Cc: flower_os@…, sam@…

Description (last modified by kovidgoyal) (diff)

This is a meta bug o collect requests for new news feeds, so that someone can look at them:

Requests so far:

* The Jerusalem Post ?http://www.jpost.com/

Attachments

Volkskrant.recipe Download (3.3 KB) - added by mellian 7 weeks ago.
Volkskrant paid version

Change History

  Changed 2 years ago by choard

  • priority changed from minor to trivial
  • type changed from defect to enhancement

the last 2 are reversed; it's washington post and chicago tribune. is web2lrf available on the mac version?

  Changed 2 years ago by kovidgoyal

  • description modified (diff)

Yes it is

  Changed 2 years ago by kovidgoyal

  • description modified (diff)

  Changed 2 years ago by kovidgoyal

  • description modified (diff)

follow-up: ↓ 6   Changed 2 years ago by ralfw

in reply to: ↑ 5   Changed 2 years ago by jaguaracer

Replying to ralfw:

The Japan Times (?http://feeds.feedburner.com/japantimes_news)

Financial Times would be great. ?http://www.ft.com/servicestools/newstracking/rss

Personally I'm looking at markets. Its in RSS but for the life of me, I can't get it into libprs500 in basic mode. Any help would be great.

  Changed 22 months ago by ChuckEggDotCom

Recipe for FORTUNE magazine

class Advanced User Recipe?1211822316(Basic News Recipe?):

title = u'Fortune' oldest_article = 31 max_articles_per_feed = 100

feeds = [(u'Fortune', u'?http://labs.echoditto.com/projects/fulltextrss/?url=http%3A%2F%2Frss.cnn.com%2Frss%2Fmoney_latest.rss')]

  Changed 22 months ago by ChuckEggDotCom

Recipe for BUSINESS WEEK magazine:

class Advanced User Recipe?1211822401(Basic News Recipe?):

title = u'Business Week' oldest_article = 7 max_articles_per_feed = 100

feeds = [(u'Top Stories', u'?http://www.businessweek.com/topStories/rss/topStories.rss'), (u'Top News', u'?http://www.businessweek.com/rss/bwdaily.rss'), (u'Asia', u'?http://www.businessweek.com/rss/asia.rss'), (u'Autos', u'?http://www.businessweek.com/rss/autos/index.rss'), (u'Classic Cars', u'?http://www.businessweek.com/rss/autos/classic_cars/index.rss'), (u'Europe', u'?http://www.businessweek.com/rss/europe.rss'), (u'Hybrids', u'?http://www.businessweek.com/rss/hybrids/index.rss'), (u'Auto Reviews', u'?http://www.businessweek.com/rss/autos/reviews/index.rss'), (u'Innovation & Design', u'?http://www.businessweek.com/rss/innovate.rss'), (u'Architecture', u'?http://www.businessweek.com/rss/architecture.rss'), (u'Brand Equity', u'?http://www.businessweek.com/rss/brandequity.rss'), (u'Auto Design', u'?http://www.businessweek.com/rss/carbuff.rss'), (u'Game Room', u'?http://www.businessweek.com/rss/gameroom.rss'), (u'Technology', u'?http://www.businessweek.com/rss/technology.rss'), (u'Investing', u'?http://www.businessweek.m/rss/investor.rss'), (u'Small Business', u'?http://www.businessweek.com/rss/smallbiz.rss'), (u'Careers', u'?http://www.businessweek.com/rss/careers.rss'), (u'B-Schools', u'?http://www.businessweek.com/rss/bschools.rss'), (u'Magazine Selections', u'?http://www.businessweek.com/rss/magazine.rss'), (u'CEO Guide to Tech', u'?http://www.businessweek.com/rss/ceo_guide_tech.rss')]

  Changed 22 months ago by kovidgoyal

  • description modified (diff)

Thanks, I'll add Business Week as a built-in recipe in the next release.

  Changed 18 months ago by mikediaz

Tried Scientific American in vain. Can you help me grab the feeds that appear here?

?http://www.sciam.com/page.cfm?section=rss

Here is the advanced mode text of what I entered

class Advanced User Recipe?1221785185(Basic News Recipe?):

title = u'Scientific American' oldest_article = 30 max_articles_per_feed = 100

feeds = [(u'Latest News', u'?http://rss.sciam.com/ScientificAmerican-News'), (u'Global', u'?http://rss.sciam.com/ScientificAmerican-Global'), (u'Health', u'?http://rss.sciam.com/sciam/health'), (u'Space', u'?http://rss.sciam.com/sciam/space'), (u'Technology', u'?http://rss.sciam.com/sciam/technology'), (u'Biology', u'?http://rss.sciam.com/sciam/biology'), (u'Mind & Brain', u'?http://rss.sciam.com/sciam/mind-and-brain'), (u"What's Next", u'?http://rss.sciam.com/sciam/whats-next'), (u'Archeology & Paleontology', u'?http://www.sciam.com/page.cfm?section=rsscategory&alias=archaeology-and-paleontology'), (u'Physics', u'?http://www.sciam.com/page.cfm?section=rsscategory&alias=physics'), (u'Math', u'?http://rss.sciam.com/sciam/math'), (u'History of Science', u'?http://www.sciam.com/page.cfm?section=rsscategory&alias=history-of-science'), (u'Chemistry', u'?http://rss.sciam.com/sciam/chemistry'), (u'Mind Matters', u'?http://rss.sciam.com/ScientificAmerican-MindBlog')]

  Changed 18 months ago by mikediaz

This one works for Discover Magazine in case you want to use publish it. could not update the wiki. hope it helps

class Advanced User Recipe?1221785375(Basic News Recipe?):

title = u'Discover Magazine' oldest_article = 33 max_articles_per_feed = 20

feeds = [(u'Technology', u'?http://discovermagazine.com/topics/technology/rss.xml'), (u'Health - Medicine', u'?http://discovermagazine.com/topics/health-medicine/rss.xml'), (u'Mind Brain', u'?http://discovermagazine.com/topics/mind-brain/rss.xml'), (u'Space', u'?http://discovermagazine.com/topics/space/rss.xml'), (u'Human Origins', u'?http://discovermagazine.com/topics/human-origins/rss.xml'), (u'Living World', u'?http://discovermagazine.com/topics/living-world/rss.xml'), (u'Environment', u'?http://discovermagazine.com/topics/environment/rss.xml'), (u'Physics & Math', u'?http://discovermagazine.com/topics/physics-math/rss.xml'), (u'Vital Signs', u'?http://discovermagazine.com/columns/vital-signs/rss.xml'), (u"20 Things you didn't know about...", u'?http://discovermagazine.com/columns/20-things-you-didnt-know/rss.xml'), (u'Fuzzy Math', u'?http://discovermagazine.com/columns/fuzzy-math/rss.xml'), (u'The Brain', u'?http://discovermagazine.com/columns/the-brain/rss.xml'), (u'Stupid Science Word of the Month', u'?http://discovermagazine.com/columns/stupid-science-word-of-the-month/rss.xml'), (u'Science Not Fiction', u'?http://blogs.discovermagazine.com/sciencenotfiction/wp-rss.php')]

  Changed 18 months ago by kovidgoyal

Added recipes for discover and sciam, thanks.

  Changed 17 months ago by mccande

Is anyone planning a recipe for the London Times based on URL's like this rather than the rss feeds? This gives an almost complete copy of the newspaper.

Thanks in advance

?http://www.timesonline.co.uk/tol/newspapers/the_times/tuesday/?days=Tuesday

  Changed 12 months ago by Ezmegaz

hy! there is one hungaryan news site WWW.INDEX.HU, and if i download news from there " ?http://index.hu/x?t=/24ora/rss/ " the fonts are incredible big..., can you slove the problem of that rss? THX

  Changed 12 months ago by kovidgoyal

a good place to ask for help with recipes is the custom recipes thread in the calibre forum

in reply to: ↑ description   Changed 11 months ago by GPThomson

I'd like to to have the news from the following website added to calibre. It is all Dutch speaking (Belgium):

Knack (?http://www.knack.be/)

Humo (?http://www.humo.be/)

De Gentenaar (?http://www.nieuwsblad.be/)

Thanks!

  Changed 10 months ago by pixnaps

Request news feed for 'Inside Higher Ed': ?http://www.insidehighered.com/news/

follow-up: ↓ 19   Changed 10 months ago by Ezmegaz

hungaryan news site "index.hu" fss feeds

class Advanced User Recipe?1242148664(Basic News Recipe?):

title = u'INDEX.HU' oldest_article = 3 max_articles_per_feed = 50

feeds = [(u'ALL', u'?http://index.hu/24ora/rss/'), (u'BELF\xd6LD', u'?http://index.hu/belfold/rss/default/'), (u'K\xdcLF\xd6LD', u'?http://index.hu/kulfold/rss/default/'), (u'BULV\xc1R', u'?http://index.hu/bulvar/rss/default/'), (u'GAZDAS\xc1G', u'?http://index.hu/gazdasag/rss/default/'), (u'TECH', u'?http://index.hu/tech/rss/main/'), (u'KULT\xdaRA', u'?http://index.hu/kultur/rss/main/'), (u'TUDOM\xc1NY', u'?http://index.hu/tudomany/rss/main/'), (u'SPORT', u'?http://sportgeza.hu/24ora/'), (u'V\xc9LEM\xc9NY', u'?http://index.hu/velemeny/rss/default/')]

in reply to: ↑ 18   Changed 10 months ago by Ezmegaz

(u'SPORT', u'?http://sportgeza.hu/24ora/')

please remove this feed from sources

  Changed 10 months ago by Ezmegaz

pcworld.hu feeds -hungaryan news site

class Advanced User Recipe?1242149390(Basic News Recipe?):

title = u'PCWORLD.HU' oldest_article = 3 max_articles_per_feed = 50

feeds = [(u'H\xedrek', u'?http://pcworld.hu/rss/rss.xml'), (u'Hardver h\xedrek', u'?http://www.pcworld.hu/rss/rss_hardverhirek.xml'), (u'Szoftver h\xedrek', u'?http://www.pcworld.hu/rss/rss_szoftverhirek.xml'), (u'Hardver cikkek', u'?http://www.pcworld.hu/rss/rss_hardvercikkek.xml'), (u'Szoftver cikkek', u'?http://www.pcworld.hu/rss/rss_szoftvercikkek.xml'), (u'Mobil h\xedrek', u'?http://www.pcworld.hu/rss/rss_mobil.xml'), (u'\xdczleti h\xedrek', u'?http://www.pcworld.hu/rss/rss_uzlet.xml'), (u'Let\xf6lt\xe9sek', u'?http://www.pcworld.hu/rss/rss_letoltes.xml'), (u'PC World TV', u'?http://tv.pcworld.hu/rss/rss_hun_pcw.xml'), (u'Tudta-e...?', u'?http://pcworld.hu/rss/rss_tudtae.xml')]

  Changed 10 months ago by floweros

  • cc flower_os@… added

?Slashdot.org - News for nerds. Stuff that matters.

English

Slashdot.py Download

class Advanced User Recipe?1238825681(Basic News Recipe?):

title = u'Slashdot.org' oldest_article = 7 max_articles_per_feed = 100 no_stylesheets = True keep_only_tags = [dict(name='div',attrs={'id':'article'})] remove_tags = [

dict(name='div',attrs={'id':'userlogin-title'}), dict(name='div',attrs={'id':'userlogin-content'}), dict(name='div',attrs={'id':'commentwrap'}), dict(name='span',attrs={'id':'more_comments_num_a'}), ]

feeds = [

(u'Slashdot', u'?http://rss.slashdot.org/Slashdot/slashdot?m=5072'), (u'/. IT', u'?http://rss.slashdot.org/Slashdot/slashdotIT'), (u'/. Hardware', u'?http://rss.slashdot.org/Slashdot/slashdotHardware'), (u'/. Linux', u'?http://rss.slashdot.org/Slashdot/slashdotLinux'), (u'/. Your Rights Online', u'?http://rss.slashdot.org/Slashdot/slashdotYourRightsOnline') ]

  Changed 10 months ago by Ezmegaz

hungarian rss feeds

class Advanced User Recipe?1242561863(Basic News Recipe?):

title = u'168\xf3ra' oldest_article = 4 max_articles_per_feed = 50

feeds = [(u'Itthon', u'?http://www.168ora.hu/static/rss/cikkek_itthon.xml'), (u'Gl\xf3busz', u'?http://www.168ora.hu/static/rss/cikkek_globusz.xml'), (u'Punch', u'?http://www.168ora.hu/static/rss/cikkek_punch.xml'), (u'Arte', u'?http://www.168ora.hu/static/rss/cikkek_arte.xml'), (u'Buxa', u'?http://www.168ora.hu/static/rss/cikkek_buxa.xml'), (u'Sebess\xe9g', u'?http://www.168ora.hu/static/rss/cikkek_sebesseg.xml'), (u'Tud\xe1s', u'?http://www.168ora.hu/static/rss/cikkek_tudas.xml'), (u'Sport', u'?http://www.168ora.hu/static/rss/cikkek_sport.xml'), (u'V\xe9lem\xe9ny', u'?http://www.168ora.hu/static/rss/cikkek_velemeny.xml'), (u'Dolce Vita', u'?http://www.168ora.hu/static/rss/cikkek_dolcevita.xml'), (u'R\xe1di\xf3', u'?http://www.168ora.hu/static/rss/radio.xml')]

class Advanced User Recipe?1242561928(Basic News Recipe?):

title = u'ATV' oldest_article = 5 max_articles_per_feed = 50

feeds = [(u'H\xedrek', u'?http://atv.hu/rss/1'), (u'Cikkek', u'?http://atv.hu/rss/2')]

class Advanced User Recipe?1242561945(Basic News Recipe?):

title = u'H\xedrszerz\u0151' oldest_article = 5 max_articles_per_feed = 50

feeds = [(u'Belf\xf6ld', u'?http://www.hirszerzo.hu/rss.belfold.xml'), (u'K\xfclf\xf6ld', u'?http://www.hirszerzo.hu/rss.kulfold.xml'), (u'Profit', u'?http://www.hirszerzo.hu/rss.profit.xml'), (u'Shake', u'?http://www.hirszerzo.hu/rss.shake.xml'), (u'Publicisztika', u'?http://www.hirszerzo.hu/rss.publicisztika.xml'), (u'Elemz\xe9s', u'?http://www.hirszerzo.hu/rss.elemzes.xml'), (u'Sorok k\xf6z\xf6tt', u'?http://www.hirszerzo.hu/rss.sorok_kozott.xml'), (u'Gal\xe9ria', u'?http://www.hirszerzo.hu/rss.galeria.xml'), (u'Patro', u'?http://www.hirszerzo.hu/rss.patro.xml')]

  Changed 10 months ago by TMF

This is a news feed for Le Temps (see ?Wikipedia article), Switzerland's leading French-language newspaper.

class Advanced User Recipe?1243078936(Basic News Recipe?):

title = u'Le Temps' oldest_article = 7 max_articles_per_feed = 100 no_stylesheets = True remove_tags = [dict(name='div', attrs={'id':'footer'})] remove_tags = [dict(name='div', attrs={'class':'box links'})] remove_tags = [dict(name='script')] extra_css = '.heading {font-size: 13px; line-height: 15px; margin: 20px 0;} \n h2 {font-size: 24px; line-height: 25px; margin-bottom: 14px;} \n .author {font-size: 11px; margin: 0 0 5px 0;} \n .lead {font-weight: 700; margin: 10px 0;} \n p {margin: 0 0 10px 0;}'

feeds = [

('Actualit?', '?http://www.letemps.ch/rss/site/'), ('Monde', '?http://www.letemps.ch/rss/site/actualite/monde'), ('Suisse & R?gions', '?http://www.letemps.ch/rss/site/actualite/suisse_regions'), ('Sciences & Environnement', '?http://www.letemps.ch/rss/site/actualite/sciences_environnement'), ('Soci?t?', '?http://www.letemps.ch/rss/site/actualite/societe'), ('Economie & Finance', '?http://www.letemps.ch/rss/site/economie_finance'), ('Economie & Finance - Finance', '?http://www.letemps.ch/rss/site/economie_finance/finance'), ('Economie & Finance - Fonds de placement', '?http://www.letemps.ch/rss/site/economie_finance/fonds_placement'), ('Economie & Finance - Carri?res', '?http://www.letemps.ch/rss/site/economie_finance/carrieres'), ('Culture', '?http://www.letemps.ch/rss/site/culture'), ('Culture - Cin?ma', '?http://www.letemps.ch/rss/site/culture/cinema'), ('Culture - Musiques', '?http://www.letemps.ch/rss/site/culture/musiques'), ('Culture - Sc?nes', '?http://www.letemps.ch/rss/site/culture/scenes'), ('Culture - Arts plastiques', '?http://www.letemps.ch/rss/site/culture/arts_plastiques'), ('Livres', '?http://www.letemps.ch/rss/site/culture/livres'), ('Opinions', '?http://www.letemps.ch/rss/site/opinions'), ('Opinions - Editoriaux', '?http://www.letemps.ch/rss/site/opinions/editoriaux'), ('Opinions - Invit?s', '?http://www.letemps.ch/rss/site/opinions/invites'), ('Opinions - Chroniques', '?http://www.letemps.ch/rss/site/opinions/chroniques'), ('Life Style?', '?http://www.letemps.ch/rss/site/lifestyle'), ('Life Style? - Luxe', '?http://www.letemps.ch/rss/site/lifestyle/luxe'), ('Life Style? - Horlogerie & Joaillerie', '?http://www.letemps.ch/rss/site/lifestyle/horlogerie_joaillerie'), ('Life Style? - Design', '?http://www.letemps.ch/rss/site/lifestyle/design'), ('Life Style? - Voyages', '?http://www.letemps.ch/rss/site/lifestyle/voyages'), ('Life Style? - Gastronomie', '?http://www.letemps.ch/rss/site/lifestyle/gastronomie'), ('Life Style? - Architecture & Immobilier', '?http://www.letemps.ch/rss/site/lifestyle/architecture_immobilier'), ('Life Style? - Automobile', '?http://www.letemps.ch/rss/site/lifestyle/automobile'), ('Sports', '?http://www.letemps.ch/rss/site/actualite/sports'),

]

def print_version(self, url):

return url.replace('Page', 'Facet/print')

  Changed 10 months ago by TMF

This is a news feed for WOZ Die Wochenzeitung (?German Wikipedia article, a German language Swiss left-wing weekly.

class Advanced User Recipe?1243078936(Basic News Recipe?):

title = u'WOZ Die Wochenzeitung' oldest_article = 7 max_articles_per_feed = 100 no_stylesheets = True remove_tags = [dict(name='p', attrs={'class':'arrow_top'})] remove_tags = [dict(name='p', attrs={'class':'bottom_right'})] remove_tags = [dict(name='script')] extra_css = '#print_titel{vertical-align: bottom; text-align: left; color: #666666; background-color: white; padding-top: 30px; padding-bottom: 10px; border-bottom: 1px solid #999999;} \n #title{text-align: left; font-size: large; font-weight: 600; padding-top: 0px; padding-bottom: 6px;} \n h3 {text-align: left; font-size: large; font-weight: 600; padding-top: 0px; padding-bottom: 6px;} \n #lead{font-weight: 600; padding-bottom: 6px;} \n h2{font-weight: 600; padding-bottom: 6px;} #author{color: #666666; padding-top: 0px; padding-bottom: 0px;} \n h4{color: #666666; padding-top: 0px; padding-bottom: 0px;} \n #author2 {color: #666666; padding-top: 0px; padding-bottom: 0px;} \n .dotted_line{padding-top: 0px; margin-bottom: 18px; border-bottom: 1px dotted #666666;} \n .intro{margin: 0 auto; font-weight: 600; padding-bottom: 18px;} \n h5{margin: 0 auto; font-weight: 600; padding-bottom: 18px;} \n .intro2{margin: 0 auto; font-weight: 600;} \n .text{padding-bottom: 18px;} \n .subtitle{margin: 0 auto; font-weight: 600; padding-bottom: 10px;} \n .articletitle{margin: 0 auto; font-weight: 600; padding-bottom: 10px;} \n #content_infobox{margin-top: 20px; margin-left: 0px; margin-right: 0px; margin-bottom: 10px; text-align: left; border-bottom: 1px solid #999999;} \n .content_infobox_titel{padding-top: 6px; padding-bottom: 8px; padding-left: 8px; padding-right: 8px; font-weight: 600; border-top: 1px solid #999999; border-bottom: 1px dotted #999999;} \n .content_infobox_text{padding-top: 6px; padding-bottom: 12px; padding-left: 8px; padding-right: 8px;} \n .box_gray{padding-top: 4px; padding-left: 7px; padding-right: 7px; padding-bottom: 4px;} \n .box_white { padding-top: 4px; padding-left: 7px; padding-right: 7px; padding-bottom: 4px;} \n .content_infobox_mehr{margin-top: 20px; margin-left: 0px; margin-right: 0px; margin-bottom: 10px; text-align: left; width: 600px; border-bottom: 1px solid #999999;}'

feeds = [('WOZ Die Wochenzeitung - Headlines', '?http://www.woz.ch/inhalt/headlinesRSS.php'),]

def print_version(self, url):

return url.replace('rss/', 'print_')

  Changed 8 months ago by kiodane

  • keywords craigslist added
  • component changed from web2lrf to News downloading

Best of for Craigslist (broken down by city)

class Advanced User Recipe?1247284744(Basic News Recipe?):

title = u'craigslist - Best Of' oldest_article = 365 max_articles_per_feed = 100

feeds = [(u'Best of craigslist', u'?http://www.craigslist.org/about/best/all/index.rss'), (u'Ann Arbor', u'?http://www.craigslist.org/about/best/aaa/index.rss'), (u'Asheville', u'?http://www.craigslist.org/about/best/ash/index.rss'), (u'Austin', u'?http://www.craigslist.org/about/best/aus/index.rss'), (u'Baltimore', u'?http://www.craigslist.org/about/best/bal/index.rss'), (u'Birmingham', u'?http://www.craigslist.org/about/best/bhm/index.rss'), (u'Boston', u'?http://www.craigslist.org/about/best/bos/index.rss'), (u'Vermont', u'?http://www.craigslist.org/about/best/brl/index.rss'), (u'Columbia', u'?http://www.craigslist.org/about/best/cae/index.rss'), (u'Charlotte', u'?http://www.craigslist.org/about/best/cha/index.rss'), (u'Chico', u'?http://www.craigslist.org/about/best/chc/index.rss'), (u'Chicago', u'?http://www.craigslist.org/about/best/chi/index.rss'), (u'Charleston', u'?http://www.craigslist.org/about/best/chs/index.rss'), (u'Cleveland', u'?http://www.craigslist.org/about/best/cle/index.rss'), (u'Calgary', u'?http://www.craigslist.org/about/best/clg/index.rss'), (u'Colorado Springs', u'?http://www.craigslist.org/about/best/cos/index.rss'), (u'Dallas', u'?http://www.craigslist.org/about/best/dal/index.rss'), (u'Denver', u'?http://www.craigslist.org/about/best/den/index.rss'), (u'Detroit Metro', u'?http://www.craigslist.org/about/best/det/index.rss'), (u'Des Moines', u'?http://www.craigslist.org/about/best/dsm/index.rss'), (u'Eau Claire', u'?http://www.craigslist.org/about/best/eau/index.rss'), (u'Grand Rapids', u'?http://www.craigslist.org/about/best/grr/index.rss'), (u'Hawaii', u'?http://www.craigslist.org/about/best/hnl/index.rss'), (u'Jacksonville', u'?http://www.craigslist.org/about/best/jax/index.rss'), (u'Knoxville', u'?http://www.craigslist.org/about/best/knx/index.rss'), (u'Kansas City', u'?http://www.craigslist.org/about/best/ksc/index.rss'), (u'South Florida', u'?http://www.craigslist.org/about/best/mia/index.rss'), (u'Minneapolis', u'?http://www.craigslist.org/about/best/min/index.rss'), (u'Maine', u'?http://www.craigslist.org/about/best/mne/index.rss'), (u'Montreal', u'?http://www.craigslist.org/about/best/mon/index.rss'), (u'Nashville', u'?http://www.craigslist.org/about/best/nsh/index.rss'), (u'New York', u'?http://www.craigslist.org/about/best/nyc/index.rss'), (u'Orange County', u'?http://www.craigslist.org/about/best/orc/index.rss'), (u'Portland', u'?http://www.craigslist.org/about/best/pdx/index.rss'), (u'Phoenix', u'?http://www.craigslist.org/about/best/phx/index.rss'), (u'Pittsburgh', u'?http://www.craigslist.org/about/best/pit/index.rss'), (u'Rhode Island', u'?http://www.craigslist.org/about/best/prv/index.rss'), (u'Raleigh', u'?http://www.craigslist.org/about/best/ral/index.rss'), (u'Rochester', u'?http://www.craigslist.org/about/best/rcs/index.rss'), (u'San Antonio', u'?http://www.craigslist.org/about/best/sat/index.rss'), (u'Santa Barbara', u'?http://www.craigslist.org/about/best/sba/index.rss'), (u'San Diego', u'?http://www.craigslist.org/about/best/sdo/index.rss'), (u'Seattle-Tacoma', u'?http://www.craigslist.org/about/best/sea/index.rss'), (u'Sf Bay Area', u'?http://www.craigslist.org/about/best/sfo/index.rss'), (u'Salt Lake City', u'?http://www.craigslist.org/about/best/slc/index.rss'), (u'Spokane', u'?http://www.craigslist.org/about/best/spk/index.rss'), (u'St Louis', u'?http://www.craigslist.org/about/best/stl/index.rss'), (u'Sydney', u'?http://www.craigslist.org/about/best/syd/index.rss'), (u'Toronto', u'?http://www.craigslist.org/about/best/tor/index.rss'), (u'Vancouver BC', u'?http://www.craigslist.org/about/best/van/index.rss'), (u'Washington DC', u'?http://www.craigslist.org/about/best/wdc/index.rss')]

follow-up: ↓ 27   Changed 8 months ago by kovidgoyal

Thanks, added.

in reply to: ↑ 26   Changed 8 months ago by kiodane

Replying to kovidgoyal:

Thanks, added.

On second thought, it should just be the first link, as the others are only the cities that were on the first page of results in the RSS on the day I pulled the data. The complete city list is cumbersome, so just the "best of/All" page should be covered.

title = u'craigslist - Best Of' oldest_article = 365 max_articles_per_feed = 100

feeds = [(u'Best of craigslist', u'?http://www.craigslist.org/about/best/all/index.rss')]

  Changed 8 months ago by kovidgoyal

OK changed.

  Changed 7 months ago by devilinside

  • keywords craigslist removed

Hungarian Tech Net? recipe

class Hun Tech Net?(Basic News Recipe?):

title = u'Tech Net?' oldest_article = 3 description = u'Az ut\xf3bbi 3 nap Tech Net? h\xedrei' language = _('Hungarian') lang = 'hu' encoding = 'utf-8' author = 'Devilinside' max_articles_per_feed = 30 timefmt = ' [%Y, %b %d, %a]' extra_css =

body{font-family:Arial,Helvetica,sans-serif; font-size:small;} h1{font-size:large;}

remove_tags_after = dict(name='ul', attrs={'class':'cikk_bottom box'}) remove_tags_before = dict(name='div', attrs={'id':'c-main'}) remove_tags = [dict(name='div', attrs={'class':'wrp clr'})]

feeds = [(u'C\xedmlap', u'?http://www.technet.hu/rss/cimoldal/'), (u'Tech Tud?', u'?http://www.technet.hu/rss/techtud/'), (u'PDA M\xe1nia', u'?http://www.technet.hu/rss/pdamania/'), (u'Telefon', u'?http://www.technet.hu/rss/telefon/'), (u'Sz\xe1m\xedt\xf3g\xe9p', u'?http://www.technet.hu/rss/notebook/'), (u'GPS', u'?http://www.technet.hu/rss/gps/')]

  Changed 7 months ago by kovidgoyal

thanks, added

  Changed 7 months ago by tarun

'The Times of India' is the most read english newspaper(daily) in India. I tried my best but could only get the headlines, content went missing and now I'm struggling. After customising the Sony reader itself resets.

Useful links: Website URL: www.timesofindia.com

Main RSS feed: ?http://timesofindia.indiatimes.com/rssfeeds/-2128936835.cms

Replacement URL links: a). News Link ?http://timesofindia.indiatimes.com/news/india/Karnataka-HC-judge-declares-his-assets-/articleshow/4933280.cms b). Print version of above ?http://timesofindia.indiatimes.com/articleshow/msid-4933280,prtpage-1.cms

Kindly send me the code and incorporate in your new release, TOI is important in India.

Thanks, Tarun.

in reply to: ↑ description   Changed 7 months ago by bhandarisaurabh1987

  • type changed from enhancement to task

Replying to kovidgoyal:

This is a meta bug o collect requests for new news feeds, so that someone can look at them: Requests so far: * ?http://www.izvestia.ru/ * ?http://www.denverpost.com * ?http://www.usatoday.com * The Economist (with access to subscriber content) * New Yorker * Atlantic Monthly * Washington Post * Chicago Tribune * Spiegel Online English version (?http://www.spiegel.de/international/) * The Nation (?http://www.thenation.com/) * Harper's Magazine (?http://www.harpers.org/) * Mother Jones Magazine (?http://motherjones.com/) * Tom Dispatch?.org (?http://tomdispatch.org/) * Alter Net?.org (?http://alternet.org/) * The Jerusalem Post ?http://www.jpost.com/ *The Business World ?http://www.businessworld.in/ *business standard?http://www.business-standard.com/india/

  Changed 6 months ago by devilinside

  • priority changed from trivial to minor
  • type changed from task to enhancement

Hungarian military news portal recipe:

#!/usr/bin/env python
# vim:fileencoding=UTF-8:ts=4:sw=4:sta:et:sts=4:ai
from __future__ import with_statement

__license__   = 'GPL v3'
__copyright__ = '2009, Kovid Goyal <kovid@kovidgoyal.net>'
__docformat__ = 'restructuredtext en'

from calibre.web.feeds.news import BasicNewsRecipe

class HunMilNews(BasicNewsRecipe):
    title          = u'Honvedelem.hu'
    oldest_article = 3
    description = u'Katonah\xedrek'
    language = _('Hungarian')
    lang = 'hu'
    encoding = 'windows-1250'
    category = 'news, military'

    no_stylesheets         = True


    __author__ = 'Devilinside'
    max_articles_per_feed = 16
    no_stylesheets = True



    keep_only_tags = [dict(name='div', attrs={'class':'cikkoldal_cikk_cim'}),
 dict(name='div', attrs={'class':'cikkoldal_cikk_alcim'}),
 dict(name='div', attrs={'class':'cikkoldal_datum'}),
 dict(name='div', attrs={'class':'cikkoldal_lead'}),
 dict(name='div', attrs={'class':'cikkoldal_szoveg'}),
 dict(name='img', attrs={'class':'ajanlo_kep_keretes'}),
        ]



    feeds          = [(u'Misszi\xf3k', u'http://www.honvedelem.hu/rss_b?c=22'), 
 (u'Aktu\xe1lis hazai h\xedrek', u'http://www.honvedelem.hu/rss_b?c=3'), 
 (u'K\xfclf\xf6ldi h\xedrek', u'http://www.honvedelem.hu/rss_b?c=4'), 
 (u'A h\xf3nap t\xe9m\xe1ja', u'http://www.honvedelem.hu/rss_b?c=6'), 
 (u'Riport', u'http://www.honvedelem.hu/rss_b?c=5'), 
 (u'Portr\xe9k', u'http://www.honvedelem.hu/rss_b?c=7'), 
 (u'Haditechnika', u'http://www.honvedelem.hu/rss_b?c=8'), 
 (u'Programok, esem\xe9nyek', u'http://www.honvedelem.hu/rss_b?c=12')
        ]

Regards

  Changed 6 months ago by nicholasdp

Would it be possible to make a recipe for "Harvard Business Review" with access to subscriber content?

?http://hbr.harvardbusiness.org/

Thanks!

  Changed 6 months ago by kovidgoyal

I will need your account information for that: send it via email to kovid at kovidgoyal dot net

  Changed 6 months ago by bhandarisaurabh1987

can anyone make recipe for business standard ,it's a business news daily ?http://www.business-standard.com/india/

  Changed 5 months ago by irishdaze

  • type changed from enhancement to task

Please add to the next release the Ars Technica Recipe (by Michael Warner) if it works after testing? (props to Michael Warner!)

follow-up: ↓ 39   Changed 5 months ago by kovidgoyal

  • type changed from task to enhancement

calibre already has an Ars Technica recipe

in reply to: ↑ 38   Changed 5 months ago by irishdaze

  • status changed from new to closed
  • resolution set to worksforme

Replying to kovidgoyal:

calibre already has an Ars Technica recipe

I just looked again and found it! My apologies, I have no idea how I missed it the first time! Thanks for setting me straight. :-)

  Changed 5 months ago by dueyfinster

Irish Independent:

__license__   = 'GPL v3'
__copyright__ = '2009 Neil Grogan'
# 
# Irish Independent Recipe
#
import re 
from calibre.web.feeds.news import BasicNewsRecipe

class IrishIndependent(BasicNewsRecipe):
    title          = u'Irish Independent'
    description = 'Irish and World news from Irelands Bestselling Daily Broadsheet'
    __author__	= 'Neil Grogan'
    oldest_article = 7
    max_articles_per_feed = 100
    remove_tags_before = dict(id='article')
    remove_tags_after  = [dict(name='div', attrs={'class':'toolsBottom'})]
    remove_tags    = [
	        dict(name='div',attrs={'class':'toolsBottom'}),
	        dict(name='div',attrs={'class':'toolsTop'}),
	        dict(name='div',attrs={'class':'boxRelated'}),
	        dict(name='div',attrs={'class':'section first'}),
	        dict(name='div',attrs={'class':'tabIt'}),
	        dict(name='div',attrs={'class':'inner'})
	        ]

    feeds	= [
    (u'Frontpage News', u'http://www.independent.ie/rss'),
    (u'World News', u'http://www.independent.ie/world-news/rss'),
    (u'Technology', u'http://www.independent.ie/business/technology/rss'),
    (u'Sport', u'http://www.independent.ie/sport/rss'),
    (u'Entertainment', u'http://www.independent.ie/entertainment/rss'),
    (u'Independent Woman', u'http://www.independent.ie/lifestyle/independent-woman/rss'),
    (u'Education',u'http://www.independent.ie/education/rss'),
    (u'Lifestyle', u'http://www.independent.ie/lifestyle/rss'),
    (u'Travel', u'http://www.independent.ie/travel/rss'), 
    (u'Letters', u'http://www.independent.ie/opinion/letters/rss'),
    (u'Weather', u'http://www.independent.ie/weather/rss')
    ]
    
# If text only articles are desired
#def print_version(self, url):
#    return '%s?service=Print' % url

Just a small thanks for an awesome program ;)

  Changed 3 months ago by davotibarna

The first Norwegian news source - the most popular newspaper in Norway:

class Aftenposten(Basic News Recipe?):

title = u'Aftenposten' author = 'davotibarna' description = 'Norske nyheter' language = 'no' oldest_article = 5 max_articles_per_feed = 100 no_stylesheets = True encoding = 'ISO-8859-1'

feeds = [(u'Aftenposten', u'?http://www.aftenposten.no/eksport/rss-1_0/')]

def print_version(self, url):

return url.replace('#xtor=RSS-3', '?service=print')

  Changed 3 months ago by davotibarna

Better using a block for the code:

class Aftenposten(BasicNewsRecipe):
    title          = u'Aftenposten'
    __author__     = 'davotibarna'
    description    = 'Norske nyheter'
    language = 'no'
    oldest_article = 5
    max_articles_per_feed = 100
    no_stylesheets = True
    encoding = 'ISO-8859-1'

    feeds          = [(u'Aftenposten', u'http://www.aftenposten.no/eksport/rss-1_0/')]

    def print_version(self, url):
        return url.replace('#xtor=RSS-3', '?service=print')

  Changed 3 months ago by BarryTX

  • keywords Houston, Chronicle added
  • status changed from closed to reopened
  • resolution worksforme deleted

Calibre = great program. I hope I can get some help for a custom recipe for The Houston Chronicle. Houston is one of the biggest cities in the US and only has one major newspaper, the Chronicle.

I have a basic recipe that will pull feeds but no pictures. I tried to modify the LA Times recipe but have not had any success. Would it be possible to get a robust Houston Chronicle recipe added to Calibre?

The list of Houston Chronicle news feeds are at www.chron.com/news/rss. If I look at the raw feeds in a section, the links follow the format of ?http://feeds.chron.com/houstonchronicle/<section>.

An example of an individual article link looks like this:

?http://www.chron.com/disp/story.mpl/...n+%26+Texas%29

Thanks for all your support!

  Changed 3 months ago by kovidgoyal

Houston Chronicle will be in the next release.

  Changed 2 months ago by very-simple

  • cc sam@… added
  • keywords New York added; Houston, Chronicle removed

I would love to see a recipe for New York Magazine - here is the page of RSS feeds: ?http://nymag.com/newyork/rss/

Note that, for some reason, some feeds (sections) are headline-only and some are full feed (like vulture and daily intelligencer).

I started trying to put this together myself, but I'm not very good at it.

Thanks in advance!

  Changed 2 months ago by stanfieldm

I would love a feed for the Minneapolis Star Tribune:

?http://www.startribune.com/help/12374506.html

Thanks!

  Changed 2 months ago by kovidgoyal

  • status changed from reopened to closed
  • resolution set to fixed

Fixesed in branch trunk. The fix will be in the next release.

  Changed 2 months ago by rogeliodh

Hi, I've created a recipe for "La Jornada", a Mexican newspaper.

__license__   = 'GPL v3'
__copyright__ = '2009, Rogelio Dominguez <rogelio.dominguez at gmail.com>'
'''
www.jornada.unam.mx
'''

from calibre.web.feeds.news import BasicNewsRecipe

import re

class LaJornada(BasicNewsRecipe):
    title          = u'La Jornada'
    oldest_article = 1
    max_articles_per_feed = 100
    keep_only_tags = [dict(id=['article-cont'])]
    remove_tags    = [dict(name='div', attrs={'class':['go gui','go gui top']})]
    no_stylesheets = True
    # Remove capitalized initial letter
    preprocess_regexps = [
                         (re.compile(r'<div class="inicial">(.*)</div>.*<p class="s-s">', re.DOTALL|re.IGNORECASE),
                          lambda match: match.group(1)),
                         ]
    feeds          = [(u'Edicion completa', u'http://www.jornada.unam.mx/rss/edicion.xml')]

  Changed 2 months ago by pablofunes

Hi Rogelio, funny I was also working on a recipe for La Jornada! Mine uses separate feeds, so it creates a table of contents with each section in the newspaper which is nice.

I don't know how to make the pictures go to the top, not the bottom of each article.

?http://bugs.calibre-ebook.com/attachment/wiki/UserRecipes/LaJornada.recipe

Cheers, Pablo Funes.

  Changed 7 weeks ago by qwe

Changed 7 weeks ago by mellian

Volkskrant paid version

  Changed 5 weeks ago by spqr

Hi, I'd really love to see a recipe for the german ?Scinexx.de. They offer a quite simple Print-Version, so it should be not too complicated, to compose a recipe (for someone more familiar w/ python than me...

E.g. from Article "?http://scinexx.de/wissen-aktuell-11161-2010-02-01.html" go to print version "?http://scinexx.de/inc/artikel_drucken.php?id=11161&a_flag=1"

Thx in advance, Josef

  Changed 5 weeks ago by kovidgoyal

best place to ask for custom recipes is the custom recipes thread in the calibre forum, where lots of recipe writers hang out.

Closing this ticket, in the future requests should go to that thread.

?http://www.mobileread.com/forums/showthread.php?t=32543

Note: See TracTickets for help on using tickets.