Ticket #405 (closed enhancement: fixed)
New news feed
| Reported by: | kovidgoyal | Owned by: | kovidgoyal |
|---|---|---|---|
| Priority: | minor | Milestone: | |
| Component: | News downloading | Version: | trunk |
| Keywords: | New York | Cc: | flower_os@…, sam@… |
Description (last modified by kovidgoyal)
This is a meta bug to collect requests for new news feeds, so that someone can look at them:
Requests so far:
- http://www.izvestia.ru/
- http://www.denverpost.com
- http://www.usatoday.com
- The Economist (with access to subscriber content)
- New Yorker
- Atlantic Monthly
- Washington Post
- Chicago Tribune
- Spiegel Online English version ( http://www.spiegel.de/international/)
- The Nation ( http://www.thenation.com/)
- Harper's Magazine ( http://www.harpers.org/)
- Mother Jones Magazine ( http://motherjones.com/)
- TomDispatch.org ( http://tomdispatch.org/)
- AlterNet.org ( http://alternet.org/)
- The Jerusalem Post http://www.jpost.com/
Change History
comment:1 Changed 3 years ago by choard
- Priority changed from minor to trivial
- Type changed from defect to enhancement
comment:5 follow-up: ↓ 6 Changed 3 years ago by ralfw
The Japan Times ( http://feeds.feedburner.com/japantimes_news)
comment:6 in reply to: ↑ 5 Changed 3 years ago by jaguaracer
Replying to ralfw:
The Japan Times ( http://feeds.feedburner.com/japantimes_news)
Financial Times would be great. http://www.ft.com/servicestools/newstracking/rss
Personally I'm looking at markets. It's in RSS, but for the life of me I can't get it into libprs500 in basic mode. Any help would be great.
comment:7 Changed 2 years ago by ChuckEggDotCom
Recipe for FORTUNE magazine
class AdvancedUserRecipe1211822316(BasicNewsRecipe):
    title = u'Fortune'
    oldest_article = 31
    max_articles_per_feed = 100
    feeds = [(u'Fortune', u'http://labs.echoditto.com/projects/fulltextrss/?url=http%3A%2F%2Frss.cnn.com%2Frss%2Fmoney_latest.rss')]
comment:8 Changed 2 years ago by ChuckEggDotCom
Recipe for BUSINESS WEEK magazine:
class AdvancedUserRecipe1211822401(BasicNewsRecipe):
    title = u'Business Week'
    oldest_article = 7
    max_articles_per_feed = 100
    feeds = [
        (u'Top Stories', u'http://www.businessweek.com/topStories/rss/topStories.rss'),
        (u'Top News', u'http://www.businessweek.com/rss/bwdaily.rss'),
        (u'Asia', u'http://www.businessweek.com/rss/asia.rss'),
        (u'Autos', u'http://www.businessweek.com/rss/autos/index.rss'),
        (u'Classic Cars', u'http://www.businessweek.com/rss/autos/classic_cars/index.rss'),
        (u'Europe', u'http://www.businessweek.com/rss/europe.rss'),
        (u'Hybrids', u'http://www.businessweek.com/rss/hybrids/index.rss'),
        (u'Auto Reviews', u'http://www.businessweek.com/rss/autos/reviews/index.rss'),
        (u'Innovation & Design', u'http://www.businessweek.com/rss/innovate.rss'),
        (u'Architecture', u'http://www.businessweek.com/rss/architecture.rss'),
        (u'Brand Equity', u'http://www.businessweek.com/rss/brandequity.rss'),
        (u'Auto Design', u'http://www.businessweek.com/rss/carbuff.rss'),
        (u'Game Room', u'http://www.businessweek.com/rss/gameroom.rss'),
        (u'Technology', u'http://www.businessweek.com/rss/technology.rss'),
        (u'Investing', u'http://www.businessweek.com/rss/investor.rss'),
        (u'Small Business', u'http://www.businessweek.com/rss/smallbiz.rss'),
        (u'Careers', u'http://www.businessweek.com/rss/careers.rss'),
        (u'B-Schools', u'http://www.businessweek.com/rss/bschools.rss'),
        (u'Magazine Selections', u'http://www.businessweek.com/rss/magazine.rss'),
        (u'CEO Guide to Tech', u'http://www.businessweek.com/rss/ceo_guide_tech.rss'),
    ]
comment:9 Changed 2 years ago by kovidgoyal
- Description modified
Thanks, I'll add Business Week as a built-in recipe in the next release.
comment:10 Changed 2 years ago by mikediaz
Tried Scientific American in vain. Can you help me grab the feeds that appear here?
http://www.sciam.com/page.cfm?section=rss
Here is the advanced mode text of what I entered:
class AdvancedUserRecipe1221785185(BasicNewsRecipe):
    title = u'Scientific American'
    oldest_article = 30
    max_articles_per_feed = 100
    feeds = [
        (u'Latest News', u'http://rss.sciam.com/ScientificAmerican-News'),
        (u'Global', u'http://rss.sciam.com/ScientificAmerican-Global'),
        (u'Health', u'http://rss.sciam.com/sciam/health'),
        (u'Space', u'http://rss.sciam.com/sciam/space'),
        (u'Technology', u'http://rss.sciam.com/sciam/technology'),
        (u'Biology', u'http://rss.sciam.com/sciam/biology'),
        (u'Mind & Brain', u'http://rss.sciam.com/sciam/mind-and-brain'),
        (u"What's Next", u'http://rss.sciam.com/sciam/whats-next'),
        (u'Archeology & Paleontology', u'http://www.sciam.com/page.cfm?section=rsscategory&alias=archaeology-and-paleontology'),
        (u'Physics', u'http://www.sciam.com/page.cfm?section=rsscategory&alias=physics'),
        (u'Math', u'http://rss.sciam.com/sciam/math'),
        (u'History of Science', u'http://www.sciam.com/page.cfm?section=rsscategory&alias=history-of-science'),
        (u'Chemistry', u'http://rss.sciam.com/sciam/chemistry'),
        (u'Mind Matters', u'http://rss.sciam.com/ScientificAmerican-MindBlog'),
    ]
comment:11 Changed 2 years ago by mikediaz
This one works for Discover Magazine, in case you want to publish it. I could not update the wiki. Hope it helps.
class AdvancedUserRecipe1221785375(BasicNewsRecipe):
    title = u'Discover Magazine'
    oldest_article = 33
    max_articles_per_feed = 20
    feeds = [
        (u'Technology', u'http://discovermagazine.com/topics/technology/rss.xml'),
        (u'Health - Medicine', u'http://discovermagazine.com/topics/health-medicine/rss.xml'),
        (u'Mind Brain', u'http://discovermagazine.com/topics/mind-brain/rss.xml'),
        (u'Space', u'http://discovermagazine.com/topics/space/rss.xml'),
        (u'Human Origins', u'http://discovermagazine.com/topics/human-origins/rss.xml'),
        (u'Living World', u'http://discovermagazine.com/topics/living-world/rss.xml'),
        (u'Environment', u'http://discovermagazine.com/topics/environment/rss.xml'),
        (u'Physics & Math', u'http://discovermagazine.com/topics/physics-math/rss.xml'),
        (u'Vital Signs', u'http://discovermagazine.com/columns/vital-signs/rss.xml'),
        (u"20 Things you didn't know about...", u'http://discovermagazine.com/columns/20-things-you-didnt-know/rss.xml'),
        (u'Fuzzy Math', u'http://discovermagazine.com/columns/fuzzy-math/rss.xml'),
        (u'The Brain', u'http://discovermagazine.com/columns/the-brain/rss.xml'),
        (u'Stupid Science Word of the Month', u'http://discovermagazine.com/columns/stupid-science-word-of-the-month/rss.xml'),
        (u'Science Not Fiction', u'http://blogs.discovermagazine.com/sciencenotfiction/wp-rss.php'),
    ]
comment:12 Changed 2 years ago by kovidgoyal
Added recipes for discover and sciam, thanks.
comment:13 Changed 23 months ago by mccande
Is anyone planning a recipe for the London Times based on URLs like this rather than the RSS feeds? This gives an almost complete copy of the newspaper.
Thanks in advance
http://www.timesonline.co.uk/tol/newspapers/the_times/tuesday/?days=Tuesday
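A minimal sketch of how per-day URLs like the one above might be built in Python. It assumes, without confirmation from the site, that every weekday follows the same pattern as the Tuesday link; `DAYS`, `TIMES_BASE` and `times_day_url` are hypothetical names and not part of calibre.

```python
import datetime

# Assumption: all weekdays use the same URL layout as the Tuesday example.
TIMES_BASE = 'http://www.timesonline.co.uk/tol/newspapers/the_times/'

# Spelled out explicitly so the result does not depend on the system locale.
DAYS = ('Monday', 'Tuesday', 'Wednesday', 'Thursday',
        'Friday', 'Saturday', 'Sunday')


def times_day_url(day):
    """Return the index-page URL for the edition published on `day`."""
    name = DAYS[day.weekday()]  # date.weekday(): Monday is 0
    return '%s%s/?days=%s' % (TIMES_BASE, name.lower(), name)


if __name__ == '__main__':
    print(times_day_url(datetime.date.today()))
```

A recipe would still have to parse the article links out of that index page; the sketch only covers building the address.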
comment:14 Changed 18 months ago by Ezmegaz
Hi! There is a Hungarian news site, www.index.hu, and if I download news from there (http://index.hu/x?t=/24ora/rss/) the fonts are incredibly big. Can you solve the problem with that RSS feed? Thanks.
comment:15 Changed 18 months ago by kovidgoyal
A good place to ask for help with recipes is the custom recipes thread in the calibre forum.
comment:16 in reply to: ↑ description Changed 17 months ago by GPThomson
I'd like to have the news from the following websites added to calibre. They are all Dutch-language (Belgium):
Knack ( http://www.knack.be/)
Humo ( http://www.humo.be/)
De Gentenaar ( http://www.nieuwsblad.be/)
Thanks!
comment:17 Changed 16 months ago by pixnaps
Request news feed for 'Inside Higher Ed': http://www.insidehighered.com/news/
comment:18 follow-up: ↓ 19 Changed 16 months ago by Ezmegaz
Hungarian news site "index.hu" RSS feeds
class AdvancedUserRecipe1242148664(BasicNewsRecipe):
    title = u'INDEX.HU'
    oldest_article = 3
    max_articles_per_feed = 50
    feeds = [
        (u'ALL', u'http://index.hu/24ora/rss/'),
        (u'BELF\xd6LD', u'http://index.hu/belfold/rss/default/'),
        (u'K\xdcLF\xd6LD', u'http://index.hu/kulfold/rss/default/'),
        (u'BULV\xc1R', u'http://index.hu/bulvar/rss/default/'),
        (u'GAZDAS\xc1G', u'http://index.hu/gazdasag/rss/default/'),
        (u'TECH', u'http://index.hu/tech/rss/main/'),
        (u'KULT\xdaRA', u'http://index.hu/kultur/rss/main/'),
        (u'TUDOM\xc1NY', u'http://index.hu/tudomany/rss/main/'),
        (u'SPORT', u'http://sportgeza.hu/24ora/'),
        (u'V\xc9LEM\xc9NY', u'http://index.hu/velemeny/rss/default/'),
    ]
comment:19 in reply to: ↑ 18 Changed 16 months ago by Ezmegaz
(u'SPORT', u'http://sportgeza.hu/24ora/')
Please remove this feed from the sources.
comment:20 Changed 16 months ago by Ezmegaz
pcworld.hu feeds (Hungarian news site)
class AdvancedUserRecipe1242149390(BasicNewsRecipe):
    title = u'PCWORLD.HU'
    oldest_article = 3
    max_articles_per_feed = 50
    feeds = [
        (u'H\xedrek', u'http://pcworld.hu/rss/rss.xml'),
        (u'Hardver h\xedrek', u'http://www.pcworld.hu/rss/rss_hardverhirek.xml'),
        (u'Szoftver h\xedrek', u'http://www.pcworld.hu/rss/rss_szoftverhirek.xml'),
        (u'Hardver cikkek', u'http://www.pcworld.hu/rss/rss_hardvercikkek.xml'),
        (u'Szoftver cikkek', u'http://www.pcworld.hu/rss/rss_szoftvercikkek.xml'),
        (u'Mobil h\xedrek', u'http://www.pcworld.hu/rss/rss_mobil.xml'),
        (u'\xdczleti h\xedrek', u'http://www.pcworld.hu/rss/rss_uzlet.xml'),
        (u'Let\xf6lt\xe9sek', u'http://www.pcworld.hu/rss/rss_letoltes.xml'),
        (u'PC World TV', u'http://tv.pcworld.hu/rss/rss_hun_pcw.xml'),
        (u'Tudta-e...?', u'http://pcworld.hu/rss/rss_tudtae.xml'),
    ]
comment:21 Changed 16 months ago by floweros
- Cc flower_os@… added
Slashdot.org - News for nerds. Stuff that matters.
English
class AdvancedUserRecipe1238825681(BasicNewsRecipe):
    title = u'Slashdot.org'
    oldest_article = 7
    max_articles_per_feed = 100
    no_stylesheets = True
    keep_only_tags = [dict(name='div', attrs={'id':'article'})]
    remove_tags = [
        dict(name='div', attrs={'id':'userlogin-title'}),
        dict(name='div', attrs={'id':'userlogin-content'}),
        dict(name='div', attrs={'id':'commentwrap'}),
        dict(name='span', attrs={'id':'more_comments_num_a'}),
    ]
    feeds = [
        (u'Slashdot', u'http://rss.slashdot.org/Slashdot/slashdot?m=5072'),
        (u'/. IT', u'http://rss.slashdot.org/Slashdot/slashdotIT'),
        (u'/. Hardware', u'http://rss.slashdot.org/Slashdot/slashdotHardware'),
        (u'/. Linux', u'http://rss.slashdot.org/Slashdot/slashdotLinux'),
        (u'/. Your Rights Online', u'http://rss.slashdot.org/Slashdot/slashdotYourRightsOnline'),
    ]
comment:22 Changed 16 months ago by Ezmegaz
Hungarian RSS feeds
class AdvancedUserRecipe1242561863(BasicNewsRecipe):
    title = u'168\xf3ra'
    oldest_article = 4
    max_articles_per_feed = 50
    feeds = [
        (u'Itthon', u'http://www.168ora.hu/static/rss/cikkek_itthon.xml'),
        (u'Gl\xf3busz', u'http://www.168ora.hu/static/rss/cikkek_globusz.xml'),
        (u'Punch', u'http://www.168ora.hu/static/rss/cikkek_punch.xml'),
        (u'Arte', u'http://www.168ora.hu/static/rss/cikkek_arte.xml'),
        (u'Buxa', u'http://www.168ora.hu/static/rss/cikkek_buxa.xml'),
        (u'Sebess\xe9g', u'http://www.168ora.hu/static/rss/cikkek_sebesseg.xml'),
        (u'Tud\xe1s', u'http://www.168ora.hu/static/rss/cikkek_tudas.xml'),
        (u'Sport', u'http://www.168ora.hu/static/rss/cikkek_sport.xml'),
        (u'V\xe9lem\xe9ny', u'http://www.168ora.hu/static/rss/cikkek_velemeny.xml'),
        (u'Dolce Vita', u'http://www.168ora.hu/static/rss/cikkek_dolcevita.xml'),
        (u'R\xe1di\xf3', u'http://www.168ora.hu/static/rss/radio.xml'),
    ]

class AdvancedUserRecipe1242561928(BasicNewsRecipe):
    title = u'ATV'
    oldest_article = 5
    max_articles_per_feed = 50
    feeds = [
        (u'H\xedrek', u'http://atv.hu/rss/1'),
        (u'Cikkek', u'http://atv.hu/rss/2'),
    ]

class AdvancedUserRecipe1242561945(BasicNewsRecipe):
    title = u'H\xedrszerz\u0151'
    oldest_article = 5
    max_articles_per_feed = 50
    feeds = [
        (u'Belf\xf6ld', u'http://www.hirszerzo.hu/rss.belfold.xml'),
        (u'K\xfclf\xf6ld', u'http://www.hirszerzo.hu/rss.kulfold.xml'),
        (u'Profit', u'http://www.hirszerzo.hu/rss.profit.xml'),
        (u'Shake', u'http://www.hirszerzo.hu/rss.shake.xml'),
        (u'Publicisztika', u'http://www.hirszerzo.hu/rss.publicisztika.xml'),
        (u'Elemz\xe9s', u'http://www.hirszerzo.hu/rss.elemzes.xml'),
        (u'Sorok k\xf6z\xf6tt', u'http://www.hirszerzo.hu/rss.sorok_kozott.xml'),
        (u'Gal\xe9ria', u'http://www.hirszerzo.hu/rss.galeria.xml'),
        (u'Patro', u'http://www.hirszerzo.hu/rss.patro.xml'),
    ]
comment:23 Changed 16 months ago by TMF
This is a news feed for Le Temps (see Wikipedia article), Switzerland's leading French-language newspaper.
class AdvancedUserRecipe1243078936(BasicNewsRecipe):
    title = u'Le Temps'
    oldest_article = 7
    max_articles_per_feed = 100
    no_stylesheets = True
    # One list, so that all three rules apply (repeated assignments would keep only the last).
    remove_tags = [
        dict(name='div', attrs={'id':'footer'}),
        dict(name='div', attrs={'class':'box links'}),
        dict(name='script'),
    ]
    extra_css = '.heading {font-size: 13px; line-height: 15px; margin: 20px 0;} \n h2 {font-size: 24px; line-height: 25px; margin-bottom: 14px;} \n .author {font-size: 11px; margin: 0 0 5px 0;} \n .lead {font-weight: 700; margin: 10px 0;} \n p {margin: 0 0 10px 0;}'
    feeds = [
        ('Actualité', 'http://www.letemps.ch/rss/site/'),
        ('Monde', 'http://www.letemps.ch/rss/site/actualite/monde'),
        ('Suisse & Régions', 'http://www.letemps.ch/rss/site/actualite/suisse_regions'),
        ('Sciences & Environnement', 'http://www.letemps.ch/rss/site/actualite/sciences_environnement'),
        ('Société', 'http://www.letemps.ch/rss/site/actualite/societe'),
        ('Economie & Finance', 'http://www.letemps.ch/rss/site/economie_finance'),
        ('Economie & Finance - Finance', 'http://www.letemps.ch/rss/site/economie_finance/finance'),
        ('Economie & Finance - Fonds de placement', 'http://www.letemps.ch/rss/site/economie_finance/fonds_placement'),
        ('Economie & Finance - Carrières', 'http://www.letemps.ch/rss/site/economie_finance/carrieres'),
        ('Culture', 'http://www.letemps.ch/rss/site/culture'),
        ('Culture - Cinéma', 'http://www.letemps.ch/rss/site/culture/cinema'),
        ('Culture - Musiques', 'http://www.letemps.ch/rss/site/culture/musiques'),
        ('Culture - Scènes', 'http://www.letemps.ch/rss/site/culture/scenes'),
        ('Culture - Arts plastiques', 'http://www.letemps.ch/rss/site/culture/arts_plastiques'),
        ('Livres', 'http://www.letemps.ch/rss/site/culture/livres'),
        ('Opinions', 'http://www.letemps.ch/rss/site/opinions'),
        ('Opinions - Editoriaux', 'http://www.letemps.ch/rss/site/opinions/editoriaux'),
        ('Opinions - Invités', 'http://www.letemps.ch/rss/site/opinions/invites'),
        ('Opinions - Chroniques', 'http://www.letemps.ch/rss/site/opinions/chroniques'),
        ('LifeStyle', 'http://www.letemps.ch/rss/site/lifestyle'),
        ('LifeStyle - Luxe', 'http://www.letemps.ch/rss/site/lifestyle/luxe'),
        ('LifeStyle - Horlogerie & Joaillerie', 'http://www.letemps.ch/rss/site/lifestyle/horlogerie_joaillerie'),
        ('LifeStyle - Design', 'http://www.letemps.ch/rss/site/lifestyle/design'),
        ('LifeStyle - Voyages', 'http://www.letemps.ch/rss/site/lifestyle/voyages'),
        ('LifeStyle - Gastronomie', 'http://www.letemps.ch/rss/site/lifestyle/gastronomie'),
        ('LifeStyle - Architecture & Immobilier', 'http://www.letemps.ch/rss/site/lifestyle/architecture_immobilier'),
        ('LifeStyle - Automobile', 'http://www.letemps.ch/rss/site/lifestyle/automobile'),
        ('Sports', 'http://www.letemps.ch/rss/site/actualite/sports'),
    ]

    def print_version(self, url):
        return url.replace('Page', 'Facet/print')
comment:24 Changed 16 months ago by TMF
This is a news feed for WOZ Die Wochenzeitung (see German Wikipedia article), a German-language Swiss left-wing weekly.
class AdvancedUserRecipe1243078936(BasicNewsRecipe):
    title = u'WOZ Die Wochenzeitung'
    oldest_article = 7
    max_articles_per_feed = 100
    no_stylesheets = True
    # One list, so that all three rules apply (repeated assignments would keep only the last).
    remove_tags = [
        dict(name='p', attrs={'class':'arrow_top'}),
        dict(name='p', attrs={'class':'bottom_right'}),
        dict(name='script'),
    ]
    extra_css = '#print_titel{vertical-align: bottom; text-align: left; color: #666666; background-color: white; padding-top: 30px; padding-bottom: 10px; border-bottom: 1px solid #999999;} \n #title{text-align: left; font-size: large; font-weight: 600; padding-top: 0px; padding-bottom: 6px;} \n h3 {text-align: left; font-size: large; font-weight: 600; padding-top: 0px; padding-bottom: 6px;} \n #lead{font-weight: 600; padding-bottom: 6px;} \n h2{font-weight: 600; padding-bottom: 6px;} #author{color: #666666; padding-top: 0px; padding-bottom: 0px;} \n h4{color: #666666; padding-top: 0px; padding-bottom: 0px;} \n #author2 {color: #666666; padding-top: 0px; padding-bottom: 0px;} \n .dotted_line{padding-top: 0px; margin-bottom: 18px; border-bottom: 1px dotted #666666;} \n .intro{margin: 0 auto; font-weight: 600; padding-bottom: 18px;} \n h5{margin: 0 auto; font-weight: 600; padding-bottom: 18px;} \n .intro2{margin: 0 auto; font-weight: 600;} \n .text{padding-bottom: 18px;} \n .subtitle{margin: 0 auto; font-weight: 600; padding-bottom: 10px;} \n .articletitle{margin: 0 auto; font-weight: 600; padding-bottom: 10px;} \n #content_infobox{margin-top: 20px; margin-left: 0px; margin-right: 0px; margin-bottom: 10px; text-align: left; border-bottom: 1px solid #999999;} \n .content_infobox_titel{padding-top: 6px; padding-bottom: 8px; padding-left: 8px; padding-right: 8px; font-weight: 600; border-top: 1px solid #999999; border-bottom: 1px dotted #999999;} \n .content_infobox_text{padding-top: 6px; padding-bottom: 12px; padding-left: 8px; padding-right: 8px;} \n .box_gray{padding-top: 4px; padding-left: 7px; padding-right: 7px; padding-bottom: 4px;} \n .box_white { padding-top: 4px; padding-left: 7px; padding-right: 7px; padding-bottom: 4px;} \n .content_infobox_mehr{margin-top: 20px; margin-left: 0px; margin-right: 0px; margin-bottom: 10px; text-align: left; width: 600px; border-bottom: 1px solid #999999;}'
    feeds = [('WOZ Die Wochenzeitung - Headlines', 'http://www.woz.ch/inhalt/headlinesRSS.php'),]

    def print_version(self, url):
        return url.replace('rss/', 'print_')
comment:25 Changed 14 months ago by kiodane
- Keywords craigslist added
- Component changed from web2lrf to News downloading
Best Of feeds for craigslist (broken down by city)
class AdvancedUserRecipe1247284744(BasicNewsRecipe):
    title = u'craigslist - Best Of'
    oldest_article = 365
    max_articles_per_feed = 100
    feeds = [
        (u'Best of craigslist', u'http://www.craigslist.org/about/best/all/index.rss'),
        (u'Ann Arbor', u'http://www.craigslist.org/about/best/aaa/index.rss'),
        (u'Asheville', u'http://www.craigslist.org/about/best/ash/index.rss'),
        (u'Austin', u'http://www.craigslist.org/about/best/aus/index.rss'),
        (u'Baltimore', u'http://www.craigslist.org/about/best/bal/index.rss'),
        (u'Birmingham', u'http://www.craigslist.org/about/best/bhm/index.rss'),
        (u'Boston', u'http://www.craigslist.org/about/best/bos/index.rss'),
        (u'Vermont', u'http://www.craigslist.org/about/best/brl/index.rss'),
        (u'Columbia', u'http://www.craigslist.org/about/best/cae/index.rss'),
        (u'Charlotte', u'http://www.craigslist.org/about/best/cha/index.rss'),
        (u'Chico', u'http://www.craigslist.org/about/best/chc/index.rss'),
        (u'Chicago', u'http://www.craigslist.org/about/best/chi/index.rss'),
        (u'Charleston', u'http://www.craigslist.org/about/best/chs/index.rss'),
        (u'Cleveland', u'http://www.craigslist.org/about/best/cle/index.rss'),
        (u'Calgary', u'http://www.craigslist.org/about/best/clg/index.rss'),
        (u'Colorado Springs', u'http://www.craigslist.org/about/best/cos/index.rss'),
        (u'Dallas', u'http://www.craigslist.org/about/best/dal/index.rss'),
        (u'Denver', u'http://www.craigslist.org/about/best/den/index.rss'),
        (u'Detroit Metro', u'http://www.craigslist.org/about/best/det/index.rss'),
        (u'Des Moines', u'http://www.craigslist.org/about/best/dsm/index.rss'),
        (u'Eau Claire', u'http://www.craigslist.org/about/best/eau/index.rss'),
        (u'Grand Rapids', u'http://www.craigslist.org/about/best/grr/index.rss'),
        (u'Hawaii', u'http://www.craigslist.org/about/best/hnl/index.rss'),
        (u'Jacksonville', u'http://www.craigslist.org/about/best/jax/index.rss'),
        (u'Knoxville', u'http://www.craigslist.org/about/best/knx/index.rss'),
        (u'Kansas City', u'http://www.craigslist.org/about/best/ksc/index.rss'),
        (u'South Florida', u'http://www.craigslist.org/about/best/mia/index.rss'),
        (u'Minneapolis', u'http://www.craigslist.org/about/best/min/index.rss'),
        (u'Maine', u'http://www.craigslist.org/about/best/mne/index.rss'),
        (u'Montreal', u'http://www.craigslist.org/about/best/mon/index.rss'),
        (u'Nashville', u'http://www.craigslist.org/about/best/nsh/index.rss'),
        (u'New York', u'http://www.craigslist.org/about/best/nyc/index.rss'),
        (u'Orange County', u'http://www.craigslist.org/about/best/orc/index.rss'),
        (u'Portland', u'http://www.craigslist.org/about/best/pdx/index.rss'),
        (u'Phoenix', u'http://www.craigslist.org/about/best/phx/index.rss'),
        (u'Pittsburgh', u'http://www.craigslist.org/about/best/pit/index.rss'),
        (u'Rhode Island', u'http://www.craigslist.org/about/best/prv/index.rss'),
        (u'Raleigh', u'http://www.craigslist.org/about/best/ral/index.rss'),
        (u'Rochester', u'http://www.craigslist.org/about/best/rcs/index.rss'),
        (u'San Antonio', u'http://www.craigslist.org/about/best/sat/index.rss'),
        (u'Santa Barbara', u'http://www.craigslist.org/about/best/sba/index.rss'),
        (u'San Diego', u'http://www.craigslist.org/about/best/sdo/index.rss'),
        (u'Seattle-Tacoma', u'http://www.craigslist.org/about/best/sea/index.rss'),
        (u'Sf Bay Area', u'http://www.craigslist.org/about/best/sfo/index.rss'),
        (u'Salt Lake City', u'http://www.craigslist.org/about/best/slc/index.rss'),
        (u'Spokane', u'http://www.craigslist.org/about/best/spk/index.rss'),
        (u'St Louis', u'http://www.craigslist.org/about/best/stl/index.rss'),
        (u'Sydney', u'http://www.craigslist.org/about/best/syd/index.rss'),
        (u'Toronto', u'http://www.craigslist.org/about/best/tor/index.rss'),
        (u'Vancouver BC', u'http://www.craigslist.org/about/best/van/index.rss'),
        (u'Washington DC', u'http://www.craigslist.org/about/best/wdc/index.rss'),
    ]
comment:26 follow-up: ↓ 27 Changed 14 months ago by kovidgoyal
Thanks, added.
comment:27 in reply to: ↑ 26 Changed 14 months ago by kiodane
Replying to kovidgoyal:
Thanks, added.
On second thought, it should just be the first link, as the others are only the cities that were on the first page of results in the RSS on the day I pulled the data. The complete city list is cumbersome, so just the "best of/All" page should be covered.
    title = u'craigslist - Best Of'
    oldest_article = 365
    max_articles_per_feed = 100
    feeds = [(u'Best of craigslist', u'http://www.craigslist.org/about/best/all/index.rss')]
comment:28 Changed 14 months ago by kovidgoyal
OK changed.
comment:29 Changed 13 months ago by devilinside
- Keywords craigslist removed
Hungarian TechNet recipe
class HunTechNet(BasicNewsRecipe):
    title = u'TechNet'
    oldest_article = 3
    description = u'Az ut\xf3bbi 3 nap TechNet h\xedrei'
    language = _('Hungarian')
    lang = 'hu'
    encoding = 'utf-8'
    __author__ = 'Devilinside'
    max_articles_per_feed = 30
    timefmt = ' [%Y, %b %d, %a]'
    extra_css = '''
        body{font-family:Arial,Helvetica,sans-serif; font-size:small;}
        h1{font-size:large;}
    '''
    remove_tags_after = dict(name='ul', attrs={'class':'cikk_bottom box'})
    remove_tags_before = dict(name='div', attrs={'id':'c-main'})
    remove_tags = [dict(name='div', attrs={'class':'wrp clr'})]
    feeds = [
        (u'C\xedmlap', u'http://www.technet.hu/rss/cimoldal/'),
        (u'TechTud', u'http://www.technet.hu/rss/techtud/'),
        (u'PDA M\xe1nia', u'http://www.technet.hu/rss/pdamania/'),
        (u'Telefon', u'http://www.technet.hu/rss/telefon/'),
        (u'Sz\xe1m\xedt\xf3g\xe9p', u'http://www.technet.hu/rss/notebook/'),
        (u'GPS', u'http://www.technet.hu/rss/gps/'),
    ]
comment:30 Changed 13 months ago by kovidgoyal
thanks, added
comment:31 Changed 12 months ago by tarun
'The Times of India' is the most-read English-language daily newspaper in India. I tried my best but could only get the headlines; the content went missing, and now I'm struggling. After customising, the Sony reader itself resets.
Useful links:
- Website URL: www.timesofindia.com
- Main RSS feed: http://timesofindia.indiatimes.com/rssfeeds/-2128936835.cms
Replacement URL links:
- a) News link: http://timesofindia.indiatimes.com/news/india/Karnataka-HC-judge-declares-his-assets-/articleshow/4933280.cms
- b) Print version of the above: http://timesofindia.indiatimes.com/articleshow/msid-4933280,prtpage-1.cms
Kindly send me the code and incorporate it in your new release; TOI is important in India.
Thanks, Tarun.
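A minimal sketch of the URL rewrite described in comment 31, written as a plain function whose body could serve as a recipe's print_version(self, url). It assumes every article URL ends in /articleshow/<number>.cms like the single example given; `toi_print_url` is a hypothetical name, and the pattern has only been checked against that one pair of links.

```python
import re

# Assumption: article URLs end in /articleshow/<numeric id>.cms,
# as in the one example posted in this ticket.
ARTICLE_ID = re.compile(r'/articleshow/(\d+)\.cms')


def toi_print_url(url):
    """Map a Times of India article URL to its print-version URL."""
    match = ARTICLE_ID.search(url)
    if match is None:
        # Unknown layout: fall back to the original page.
        return url
    return ('http://timesofindia.indiatimes.com/articleshow/'
            'msid-%s,prtpage-1.cms' % match.group(1))


if __name__ == '__main__':
    print(toi_print_url(
        'http://timesofindia.indiatimes.com/news/india/'
        'Karnataka-HC-judge-declares-his-assets-/articleshow/4933280.cms'))
```

Returning the input unchanged for unrecognised URLs means a layout change degrades to downloading the normal page instead of failing.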
comment:32 in reply to: ↑ description Changed 12 months ago by bhandarisaurabh1987
- Type changed from enhancement to task
Replying to kovidgoyal:
- The Business World http://www.businessworld.in/
- Business Standard http://www.business-standard.com/india/
comment:33 Changed 12 months ago by devilinside
- Priority changed from trivial to minor
- Type changed from task to enhancement
Hungarian military news portal recipe:
#!/usr/bin/env python
# vim:fileencoding=UTF-8:ts=4:sw=4:sta:et:sts=4:ai
from __future__ import with_statement

__license__ = 'GPL v3'
__copyright__ = '2009, Kovid Goyal <kovid@kovidgoyal.net>'
__docformat__ = 'restructuredtext en'

from calibre.web.feeds.news import BasicNewsRecipe

class HunMilNews(BasicNewsRecipe):
    title = u'Honvedelem.hu'
    oldest_article = 3
    description = u'Katonah\xedrek'
    language = _('Hungarian')
    lang = 'hu'
    encoding = 'windows-1250'
    category = 'news, military'
    no_stylesheets = True
    __author__ = 'Devilinside'
    max_articles_per_feed = 16
    keep_only_tags = [dict(name='div', attrs={'class':'cikkoldal_cikk_cim'}),
                      dict(name='div', attrs={'class':'cikkoldal_cikk_alcim'}),
                      dict(name='div', attrs={'class':'cikkoldal_datum'}),
                      dict(name='div', attrs={'class':'cikkoldal_lead'}),
                      dict(name='div', attrs={'class':'cikkoldal_szoveg'}),
                      dict(name='img', attrs={'class':'ajanlo_kep_keretes'}),
                      ]
    feeds = [(u'Misszi\xf3k', u'http://www.honvedelem.hu/rss_b?c=22'),
             (u'Aktu\xe1lis hazai h\xedrek', u'http://www.honvedelem.hu/rss_b?c=3'),
             (u'K\xfclf\xf6ldi h\xedrek', u'http://www.honvedelem.hu/rss_b?c=4'),
             (u'A h\xf3nap t\xe9m\xe1ja', u'http://www.honvedelem.hu/rss_b?c=6'),
             (u'Riport', u'http://www.honvedelem.hu/rss_b?c=5'),
             (u'Portr\xe9k', u'http://www.honvedelem.hu/rss_b?c=7'),
             (u'Haditechnika', u'http://www.honvedelem.hu/rss_b?c=8'),
             (u'Programok, esem\xe9nyek', u'http://www.honvedelem.hu/rss_b?c=12')
             ]
Regards
comment:34 Changed 12 months ago by nicholasdp
Would it be possible to make a recipe for "Harvard Business Review" with access to subscriber content?
http://hbr.harvardbusiness.org/
Thanks!
comment:35 Changed 12 months ago by kovidgoyal
I will need your account information for that: send it via email to kovid at kovidgoyal dot net
comment:36 Changed 12 months ago by bhandarisaurabh1987
Can anyone make a recipe for Business Standard? It's a business news daily: http://www.business-standard.com/india/
comment:37 Changed 11 months ago by irishdaze
- Type changed from enhancement to task
Could you please add the Ars Technica recipe (by Michael Warner) to the next release, if it works after testing? (Props to Michael Warner!)
comment:38 follow-up: ↓ 39 Changed 11 months ago by kovidgoyal
- Type changed from task to enhancement
calibre already has an Ars Technica recipe
comment:39 in reply to: ↑ 38 Changed 11 months ago by irishdaze
- Status changed from new to closed
- Resolution set to worksforme
Replying to kovidgoyal:
calibre already has an Ars Technica recipe
I just looked again and found it! My apologies, I have no idea how I missed it the first time! Thanks for setting me straight. :-)
comment:40 Changed 10 months ago by dueyfinster
Irish Independent:
__license__ = 'GPL v3'
__copyright__ = '2009 Neil Grogan'
#
# Irish Independent Recipe
#
import re
from calibre.web.feeds.news import BasicNewsRecipe

class IrishIndependent(BasicNewsRecipe):
    title = u'Irish Independent'
    description = 'Irish and World news from Irelands Bestselling Daily Broadsheet'
    __author__ = 'Neil Grogan'
    oldest_article = 7
    max_articles_per_feed = 100
    remove_tags_before = dict(id='article')
    remove_tags_after = [dict(name='div', attrs={'class':'toolsBottom'})]
    remove_tags = [
        dict(name='div',attrs={'class':'toolsBottom'}),
        dict(name='div',attrs={'class':'toolsTop'}),
        dict(name='div',attrs={'class':'boxRelated'}),
        dict(name='div',attrs={'class':'section first'}),
        dict(name='div',attrs={'class':'tabIt'}),
        dict(name='div',attrs={'class':'inner'})
    ]
    feeds = [
        (u'Frontpage News', u'http://www.independent.ie/rss'),
        (u'World News', u'http://www.independent.ie/world-news/rss'),
        (u'Technology', u'http://www.independent.ie/business/technology/rss'),
        (u'Sport', u'http://www.independent.ie/sport/rss'),
        (u'Entertainment', u'http://www.independent.ie/entertainment/rss'),
        (u'Independent Woman', u'http://www.independent.ie/lifestyle/independent-woman/rss'),
        (u'Education',u'http://www.independent.ie/education/rss'),
        (u'Lifestyle', u'http://www.independent.ie/lifestyle/rss'),
        (u'Travel', u'http://www.independent.ie/travel/rss'),
        (u'Letters', u'http://www.independent.ie/opinion/letters/rss'),
        (u'Weather', u'http://www.independent.ie/weather/rss')
    ]

    # If text only articles are desired
    #def print_version(self, url):
    #    return '%s?service=Print' % url
Just a small thanks for an awesome program ;)
comment:41 Changed 9 months ago by davotibarna
The first Norwegian news source - the most popular newspaper in Norway:
class Aftenposten(BasicNewsRecipe):
    title = u'Aftenposten'
    __author__ = 'davotibarna'
    description = 'Norske nyheter'
    language = 'no'
    oldest_article = 5
    max_articles_per_feed = 100
    no_stylesheets = True
    encoding = 'ISO-8859-1'
    feeds = [(u'Aftenposten', u'http://www.aftenposten.no/eksport/rss-1_0/')]

    def print_version(self, url):
        return url.replace('#xtor=RSS-3', '?service=print')
comment:42 Changed 9 months ago by davotibarna
Better using a block for the code:
class Aftenposten(BasicNewsRecipe):
    title = u'Aftenposten'
    __author__ = 'davotibarna'
    description = 'Norske nyheter'
    language = 'no'
    oldest_article = 5
    max_articles_per_feed = 100
    no_stylesheets = True
    encoding = 'ISO-8859-1'
    feeds = [(u'Aftenposten', u'http://www.aftenposten.no/eksport/rss-1_0/')]

    def print_version(self, url):
        return url.replace('#xtor=RSS-3', '?service=print')
comment:43 Changed 9 months ago by BarryTX
- Keywords Houston, Chronicle added
- Status changed from closed to reopened
- Resolution worksforme deleted
Calibre = great program. I hope I can get some help for a custom recipe for The Houston Chronicle. Houston is one of the biggest cities in the US and only has one major newspaper, the Chronicle.
I have a basic recipe that will pull feeds but no pictures. I tried to modify the LA Times recipe but have not had any success. Would it be possible to get a robust Houston Chronicle recipe added to Calibre?
The list of Houston Chronicle news feeds is at www.chron.com/news/rss. If I look at the raw feeds in a section, the links follow the format http://feeds.chron.com/houstonchronicle/<section>.
An example of an individual article link looks like this:
http://www.chron.com/disp/story.mpl/...n+%26+Texas%29
Thanks for all your support!
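A small sketch of how the feed format mentioned in comment 43 could be turned into a recipe's feeds list. The section slug used in the example is a placeholder, not a real Houston Chronicle section; the actual names would have to be taken from www.chron.com/news/rss. `CHRON_FEED_BASE` and `chron_feeds` are hypothetical names.

```python
# Base taken from the pattern http://feeds.chron.com/houstonchronicle/<section>
CHRON_FEED_BASE = 'http://feeds.chron.com/houstonchronicle/'


def chron_feeds(sections):
    """Build (title, url) pairs in the shape calibre recipes use for `feeds`.

    `sections` is a list of (title, slug) pairs; the slugs are supplied by
    the caller and are not validated against the site.
    """
    return [(title, CHRON_FEED_BASE + slug) for title, slug in sections]


if __name__ == '__main__':
    # 'example-section' is a made-up slug for illustration only.
    for title, url in chron_feeds([(u'Example section', 'example-section')]):
        print('%s -> %s' % (title, url))
```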
comment:44 Changed 8 months ago by kovidgoyal
Houston Chronicle will be in the next release.
comment:45 Changed 8 months ago by very-simple
- Cc sam@… added
- Keywords New York added; Houston, Chronicle removed
I would love to see a recipe for New York Magazine - here is the page of RSS feeds: http://nymag.com/newyork/rss/
Note that, for some reason, some feeds (sections) are headline-only and some are full feed (like vulture and daily intelligencer).
I started trying to put this together myself, but I'm not very good at it.
Thanks in advance!
comment:46 Changed 8 months ago by stanfieldm
I would love a feed for the Minneapolis Star Tribune:
http://www.startribune.com/help/12374506.html
Thanks!
comment:47 Changed 8 months ago by kovidgoyal
- Status changed from reopened to closed
- Resolution set to fixed
Fixed in branch trunk. The fix will be in the next release.
comment:48 Changed 8 months ago by rogeliodh
Hi, I've created a recipe for "La Jornada", a Mexican newspaper.
__license__ = 'GPL v3'
__copyright__ = '2009, Rogelio Dominguez <rogelio.dominguez at gmail.com>'
'''
www.jornada.unam.mx
'''
from calibre.web.feeds.news import BasicNewsRecipe
import re

class LaJornada(BasicNewsRecipe):
    title = u'La Jornada'
    oldest_article = 1
    max_articles_per_feed = 100
    keep_only_tags = [dict(id=['article-cont'])]
    remove_tags = [dict(name='div', attrs={'class':['go gui','go gui top']})]
    no_stylesheets = True
    # Remove capitalized initial letter
    preprocess_regexps = [
        (re.compile(r'<div class="inicial">(.*)</div>.*<p class="s-s">', re.DOTALL|re.IGNORECASE),
         lambda match: match.group(1)),
    ]
    feeds = [(u'Edicion completa', u'http://www.jornada.unam.mx/rss/edicion.xml')]
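The effect of the preprocess_regexps entry can be checked in isolation with plain re. The sketch below assumes each (pattern, function) pair is applied to the page as a regex substitution. The sample markup is invented for illustration and is not taken from the La Jornada site.

```python
import re

# Pattern and replacement function copied from the recipe.
pattern = re.compile(
    r'<div class="inicial">(.*)</div>.*<p class="s-s">',
    re.DOTALL | re.IGNORECASE)


def strip_initial(html):
    # Keep only the drop-cap letter and discard the wrapper markup,
    # so the letter rejoins the paragraph that follows it.
    return pattern.sub(lambda match: match.group(1), html)


# Hypothetical markup, for illustration only.
sample = '<div class="inicial">E</div>\n<p class="s-s">l texto sigue.</p>'
print(strip_initial(sample))
```

Both `.*` groups are greedy and re.DOTALL lets them cross line breaks, so on a page with several matching divs a single match would run from the first div to the last paragraph. A non-greedy `.*?` might be safer if that ever turns out to be a problem.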
comment:49 Changed 8 months ago by pablofunes
Hi Rogelio, funny, I was also working on a recipe for La Jornada! Mine uses separate feeds, so it creates a table of contents with each section of the newspaper, which is nice.
I don't know how to make the pictures go to the top rather than the bottom of each article.
http://bugs.calibre-ebook.com/attachment/wiki/UserRecipes/LaJornada.recipe
Cheers, Pablo Funes.
comment:50 Changed 7 months ago by qwe
comment:51 Changed 7 months ago by spqr
Hi, I'd really love to see a recipe for the German site Scinexx.de. They offer quite a simple print version, so it should not be too complicated to compose a recipe (for someone more familiar with Python than me...).
E.g. from the article "http://scinexx.de/wissen-aktuell-11161-2010-02-01.html" go to the print version "http://scinexx.de/inc/artikel_drucken.php?id=11161&a_flag=1".
Thx in advance, Josef
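The article-to-print mapping described above could be sketched like this. It assumes every article URL ends in -<id>-<yyyy>-<mm>-<dd>.html as in the single example given, which has not been checked against other Scinexx pages. In a recipe the function would be a print_version(self, url) method.

```python
import re

# Assumed URL shape, inferred from the one example in the comment above:
#   ...-<article id>-<yyyy>-<mm>-<dd>.html
_ARTICLE_ID = re.compile(r'-(\d+)-\d{4}-\d{2}-\d{2}\.html$')


def print_version(url):
    match = _ARTICLE_ID.search(url)
    if match is None:
        # Unknown URL shape: fall back to the normal article page.
        return url
    return ('http://scinexx.de/inc/artikel_drucken.php?id=%s&a_flag=1'
            % match.group(1))


print(print_version('http://scinexx.de/wissen-aktuell-11161-2010-02-01.html'))
```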
comment:52 Changed 7 months ago by kovidgoyal
The best place to ask for custom recipes is the custom recipes thread in the calibre forum, where lots of recipe writers hang out.
Closing this ticket; in the future, requests should go to that thread.
comment:53 Changed 4 months ago by RedTie45
comment:54 Changed 4 months ago by kovidgoyal
The best place to request custom recipes is http://www.mobileread.com/forums/showthread.php?t=32543
comment:55 Changed 4 months ago by admins
Really interesting wiki! Now I have a new source for reading in Russian.
comment:58 Changed 3 months ago by Satarsa
- Summary changed from Request for new news feeds to New news feed
The recipe for the Russian close-to-science newspaper 'Troitskiy variant' (Троицкий вариант).
from calibre.web.feeds.news import BasicNewsRecipe

class Trv(BasicNewsRecipe):
    __license__ = 'GPL v3'
    __copyright__ = '2010, Vadim Dyadkin dyadkin@lns.pnpi.spb.ru'
    # Title and feed name are 'Троицкий вариант' written as Unicode escapes.
    title = u'\u0422\u0440\u043e\u0438\u0446\u043a\u0438\u0439 \u0432\u0430\u0440\u0438\u0430\u043d\u0442'
    oldest_article = 30
    max_articles_per_feed = 100
    recursion = 4
    no_stylesheets = True
    simultaneous_downloads = 1
    keep_only_tags = [dict(name='h1'),
                      dict(name='div', attrs={'id' : 'content'})
                      ]
    remove_tags = [dict(name='div', attrs={'class' : ['dateright', 'postmeta', 'adsense-post', 'comments', 'nocomments', 'widgetarea', 'breadcrumb']}),
                   {'id' : ['sidebar', 'l_sidebar', 'r_sidebar', 'footer', 'homepageright0']},
                   {'style' : 'clear:both;'},
                   dict(name='ul'),
                   dict(name='h2')
                   ]
    feeds = [(u'\u0422\u0440\u043e\u0438\u0446\u043a\u0438\u0439 \u0432\u0430\u0440\u0438\u0430\u043d\u0442', u'http://trv-science.ru/feed/')]


The last two are reversed; it's Washington Post and Chicago Tribune. Is web2lrf available on the Mac version?