Ticket #405 (closed enhancement: fixed)

Opened 3 years ago

Last modified 3 months ago

New news feed

Reported by: kovidgoyal Owned by: kovidgoyal
Priority: minor Milestone:
Component: News downloading Version: trunk
Keywords: New York Cc: flower_os@…, sam@…

Description (last modified by kovidgoyal) (diff)

This is a meta bug o collect requests for new news feeds, so that someone can look at them:

Requests so far:

Attachments

Volkskrant.recipe Download (3.3 KB) - added by mellian 7 months ago.
Volkskrant paid version
tagesspiegel.recipe Download (3.8 KB) - added by ipaschke 4 months ago.
Der Tagesspiegel, German

Change History

comment:1 Changed 3 years ago by choard

  • Priority changed from minor to trivial
  • Type changed from defect to enhancement

the last 2 are reversed; it's washington post and chicago tribune. is web2lrf available on the mac version?

comment:2 Changed 3 years ago by kovidgoyal

  • Description modified (diff)

Yes it is

comment:3 Changed 3 years ago by kovidgoyal

  • Description modified (diff)

comment:4 Changed 3 years ago by kovidgoyal

  • Description modified (diff)

comment:5 follow-up: ↓ 6 Changed 3 years ago by ralfw

comment:6 in reply to: ↑ 5 Changed 3 years ago by jaguaracer

Replying to ralfw:

The Japan Times ( http://feeds.feedburner.com/japantimes_news)

Financial Times would be great.  http://www.ft.com/servicestools/newstracking/rss

Personally I'm looking at markets. Its in RSS but for the life of me, I can't get it into libprs500 in basic mode. Any help would be great.

comment:7 Changed 2 years ago by ChuckEggDotCom

Recipe for FORTUNE magazine

class Advanced User Recipe?1211822316(Basic News Recipe?):

title = u'Fortune' oldest_article = 31 max_articles_per_feed = 100

feeds = [(u'Fortune', u' http://labs.echoditto.com/projects/fulltextrss/?url=http%3A%2F%2Frss.cnn.com%2Frss%2Fmoney_latest.rss')]

comment:8 Changed 2 years ago by ChuckEggDotCom

Recipe for BUSINESS WEEK magazine:

class Advanced User Recipe?1211822401(Basic News Recipe?):

title = u'Business Week' oldest_article = 7 max_articles_per_feed = 100

feeds = [(u'Top Stories', u' http://www.businessweek.com/topStories/rss/topStories.rss'), (u'Top News', u' http://www.businessweek.com/rss/bwdaily.rss'), (u'Asia', u' http://www.businessweek.com/rss/asia.rss'), (u'Autos', u' http://www.businessweek.com/rss/autos/index.rss'), (u'Classic Cars', u' http://www.businessweek.com/rss/autos/classic_cars/index.rss'), (u'Europe', u' http://www.businessweek.com/rss/europe.rss'), (u'Hybrids', u' http://www.businessweek.com/rss/hybrids/index.rss'), (u'Auto Reviews', u' http://www.businessweek.com/rss/autos/reviews/index.rss'), (u'Innovation & Design', u' http://www.businessweek.com/rss/innovate.rss'), (u'Architecture', u' http://www.businessweek.com/rss/architecture.rss'), (u'Brand Equity', u' http://www.businessweek.com/rss/brandequity.rss'), (u'Auto Design', u' http://www.businessweek.com/rss/carbuff.rss'), (u'Game Room', u' http://www.businessweek.com/rss/gameroom.rss'), (u'Technology', u' http://www.businessweek.com/rss/technology.rss'), (u'Investing', u' http://www.businessweek.m/rss/investor.rss'), (u'Small Business', u' http://www.businessweek.com/rss/smallbiz.rss'), (u'Careers', u' http://www.businessweek.com/rss/careers.rss'), (u'B-Schools', u' http://www.businessweek.com/rss/bschools.rss'), (u'Magazine Selections', u' http://www.businessweek.com/rss/magazine.rss'), (u'CEO Guide to Tech', u' http://www.businessweek.com/rss/ceo_guide_tech.rss')]

comment:9 Changed 2 years ago by kovidgoyal

  • Description modified (diff)

Thanks, I'll add Business Week as a built-in recipe in the next release.

comment:10 Changed 2 years ago by mikediaz

Tried Scientific American in vain. Can you help me grab the feeds that appear here?

 http://www.sciam.com/page.cfm?section=rss

Here is the advanced mode text of what I entered

class Advanced User Recipe?1221785185(Basic News Recipe?):

title = u'Scientific American' oldest_article = 30 max_articles_per_feed = 100

feeds = [(u'Latest News', u' http://rss.sciam.com/ScientificAmerican-News'), (u'Global', u' http://rss.sciam.com/ScientificAmerican-Global'), (u'Health', u' http://rss.sciam.com/sciam/health'), (u'Space', u' http://rss.sciam.com/sciam/space'), (u'Technology', u' http://rss.sciam.com/sciam/technology'), (u'Biology', u' http://rss.sciam.com/sciam/biology'), (u'Mind & Brain', u' http://rss.sciam.com/sciam/mind-and-brain'), (u"What's Next", u' http://rss.sciam.com/sciam/whats-next'), (u'Archeology & Paleontology', u' http://www.sciam.com/page.cfm?section=rsscategory&alias=archaeology-and-paleontology'), (u'Physics', u' http://www.sciam.com/page.cfm?section=rsscategory&alias=physics'), (u'Math', u' http://rss.sciam.com/sciam/math'), (u'History of Science', u' http://www.sciam.com/page.cfm?section=rsscategory&alias=history-of-science'), (u'Chemistry', u' http://rss.sciam.com/sciam/chemistry'), (u'Mind Matters', u' http://rss.sciam.com/ScientificAmerican-MindBlog')]

comment:11 Changed 2 years ago by mikediaz

This one works for Discover Magazine in case you want to use publish it. could not update the wiki. hope it helps

class Advanced User Recipe?1221785375(Basic News Recipe?):

title = u'Discover Magazine' oldest_article = 33 max_articles_per_feed = 20

feeds = [(u'Technology', u' http://discovermagazine.com/topics/technology/rss.xml'), (u'Health - Medicine', u' http://discovermagazine.com/topics/health-medicine/rss.xml'), (u'Mind Brain', u' http://discovermagazine.com/topics/mind-brain/rss.xml'), (u'Space', u' http://discovermagazine.com/topics/space/rss.xml'), (u'Human Origins', u' http://discovermagazine.com/topics/human-origins/rss.xml'), (u'Living World', u' http://discovermagazine.com/topics/living-world/rss.xml'), (u'Environment', u' http://discovermagazine.com/topics/environment/rss.xml'), (u'Physics & Math', u' http://discovermagazine.com/topics/physics-math/rss.xml'), (u'Vital Signs', u' http://discovermagazine.com/columns/vital-signs/rss.xml'), (u"20 Things you didn't know about...", u' http://discovermagazine.com/columns/20-things-you-didnt-know/rss.xml'), (u'Fuzzy Math', u' http://discovermagazine.com/columns/fuzzy-math/rss.xml'), (u'The Brain', u' http://discovermagazine.com/columns/the-brain/rss.xml'), (u'Stupid Science Word of the Month', u' http://discovermagazine.com/columns/stupid-science-word-of-the-month/rss.xml'), (u'Science Not Fiction', u' http://blogs.discovermagazine.com/sciencenotfiction/wp-rss.php')]

comment:12 Changed 2 years ago by kovidgoyal

Added recipes for discover and sciam, thanks.

comment:13 Changed 23 months ago by mccande

Is anyone planning a recipe for the London Times based on URL's like this rather than the rss feeds? This gives an almost complete copy of the newspaper.

Thanks in advance

 http://www.timesonline.co.uk/tol/newspapers/the_times/tuesday/?days=Tuesday

comment:14 Changed 18 months ago by Ezmegaz

hy! there is one hungaryan news site WWW.INDEX.HU, and if i download news from there "  http://index.hu/x?t=/24ora/rss/ " the fonts are incredible big..., can you slove the problem of that rss? THX

comment:15 Changed 18 months ago by kovidgoyal

a good place to ask for help with recipes is the custom recipes thread in the calibre forum

comment:16 in reply to: ↑ description Changed 17 months ago by GPThomson

I'd like to to have the news from the following website added to calibre. It is all Dutch speaking (Belgium):

Knack ( http://www.knack.be/)

Humo ( http://www.humo.be/)

De Gentenaar ( http://www.nieuwsblad.be/)

Thanks!

comment:17 Changed 16 months ago by pixnaps

Request news feed for 'Inside Higher Ed':  http://www.insidehighered.com/news/

comment:18 follow-up: ↓ 19 Changed 16 months ago by Ezmegaz

hungaryan news site "index.hu" fss feeds

class Advanced User Recipe?1242148664(Basic News Recipe?):

title = u'INDEX.HU' oldest_article = 3 max_articles_per_feed = 50

feeds = [(u'ALL', u' http://index.hu/24ora/rss/'), (u'BELF\xd6LD', u' http://index.hu/belfold/rss/default/'), (u'K\xdcLF\xd6LD', u' http://index.hu/kulfold/rss/default/'), (u'BULV\xc1R', u' http://index.hu/bulvar/rss/default/'), (u'GAZDAS\xc1G', u' http://index.hu/gazdasag/rss/default/'), (u'TECH', u' http://index.hu/tech/rss/main/'), (u'KULT\xdaRA', u' http://index.hu/kultur/rss/main/'), (u'TUDOM\xc1NY', u' http://index.hu/tudomany/rss/main/'), (u'SPORT', u' http://sportgeza.hu/24ora/'), (u'V\xc9LEM\xc9NY', u' http://index.hu/velemeny/rss/default/')]

comment:19 in reply to: ↑ 18 Changed 16 months ago by Ezmegaz

(u'SPORT', u' http://sportgeza.hu/24ora/')

please remove this feed from sources

comment:20 Changed 16 months ago by Ezmegaz

pcworld.hu feeds -hungaryan news site

class Advanced User Recipe?1242149390(Basic News Recipe?):

title = u'PCWORLD.HU' oldest_article = 3 max_articles_per_feed = 50

feeds = [(u'H\xedrek', u' http://pcworld.hu/rss/rss.xml'), (u'Hardver h\xedrek', u' http://www.pcworld.hu/rss/rss_hardverhirek.xml'), (u'Szoftver h\xedrek', u' http://www.pcworld.hu/rss/rss_szoftverhirek.xml'), (u'Hardver cikkek', u' http://www.pcworld.hu/rss/rss_hardvercikkek.xml'), (u'Szoftver cikkek', u' http://www.pcworld.hu/rss/rss_szoftvercikkek.xml'), (u'Mobil h\xedrek', u' http://www.pcworld.hu/rss/rss_mobil.xml'), (u'\xdczleti h\xedrek', u' http://www.pcworld.hu/rss/rss_uzlet.xml'), (u'Let\xf6lt\xe9sek', u' http://www.pcworld.hu/rss/rss_letoltes.xml'), (u'PC World TV', u' http://tv.pcworld.hu/rss/rss_hun_pcw.xml'), (u'Tudta-e...?', u' http://pcworld.hu/rss/rss_tudtae.xml')]

comment:21 Changed 16 months ago by floweros

  • Cc flower_os@… added

 Slashdot.org - News for nerds. Stuff that matters.

English

Slashdot.py Download

class Advanced User Recipe?1238825681(Basic News Recipe?):

title = u'Slashdot.org' oldest_article = 7 max_articles_per_feed = 100 no_stylesheets = True keep_only_tags = [dict(name='div',attrs={'id':'article'})] remove_tags = [

dict(name='div',attrs={'id':'userlogin-title'}), dict(name='div',attrs={'id':'userlogin-content'}), dict(name='div',attrs={'id':'commentwrap'}), dict(name='span',attrs={'id':'more_comments_num_a'}), ]

feeds = [

(u'Slashdot', u' http://rss.slashdot.org/Slashdot/slashdot?m=5072'), (u'/. IT', u' http://rss.slashdot.org/Slashdot/slashdotIT'), (u'/. Hardware', u' http://rss.slashdot.org/Slashdot/slashdotHardware'), (u'/. Linux', u' http://rss.slashdot.org/Slashdot/slashdotLinux'), (u'/. Your Rights Online', u' http://rss.slashdot.org/Slashdot/slashdotYourRightsOnline') ]

comment:22 Changed 16 months ago by Ezmegaz

hungarian rss feeds

class Advanced User Recipe?1242561863(Basic News Recipe?):

title = u'168\xf3ra' oldest_article = 4 max_articles_per_feed = 50

feeds = [(u'Itthon', u' http://www.168ora.hu/static/rss/cikkek_itthon.xml'), (u'Gl\xf3busz', u' http://www.168ora.hu/static/rss/cikkek_globusz.xml'), (u'Punch', u' http://www.168ora.hu/static/rss/cikkek_punch.xml'), (u'Arte', u' http://www.168ora.hu/static/rss/cikkek_arte.xml'), (u'Buxa', u' http://www.168ora.hu/static/rss/cikkek_buxa.xml'), (u'Sebess\xe9g', u' http://www.168ora.hu/static/rss/cikkek_sebesseg.xml'), (u'Tud\xe1s', u' http://www.168ora.hu/static/rss/cikkek_tudas.xml'), (u'Sport', u' http://www.168ora.hu/static/rss/cikkek_sport.xml'), (u'V\xe9lem\xe9ny', u' http://www.168ora.hu/static/rss/cikkek_velemeny.xml'), (u'Dolce Vita', u' http://www.168ora.hu/static/rss/cikkek_dolcevita.xml'), (u'R\xe1di\xf3', u' http://www.168ora.hu/static/rss/radio.xml')]

class Advanced User Recipe?1242561928(Basic News Recipe?):

title = u'ATV' oldest_article = 5 max_articles_per_feed = 50

feeds = [(u'H\xedrek', u' http://atv.hu/rss/1'), (u'Cikkek', u' http://atv.hu/rss/2')]

class Advanced User Recipe?1242561945(Basic News Recipe?):

title = u'H\xedrszerz\u0151' oldest_article = 5 max_articles_per_feed = 50

feeds = [(u'Belf\xf6ld', u' http://www.hirszerzo.hu/rss.belfold.xml'), (u'K\xfclf\xf6ld', u' http://www.hirszerzo.hu/rss.kulfold.xml'), (u'Profit', u' http://www.hirszerzo.hu/rss.profit.xml'), (u'Shake', u' http://www.hirszerzo.hu/rss.shake.xml'), (u'Publicisztika', u' http://www.hirszerzo.hu/rss.publicisztika.xml'), (u'Elemz\xe9s', u' http://www.hirszerzo.hu/rss.elemzes.xml'), (u'Sorok k\xf6z\xf6tt', u' http://www.hirszerzo.hu/rss.sorok_kozott.xml'), (u'Gal\xe9ria', u' http://www.hirszerzo.hu/rss.galeria.xml'), (u'Patro', u' http://www.hirszerzo.hu/rss.patro.xml')]

comment:23 Changed 16 months ago by TMF

This is a news feed for Le Temps (see  Wikipedia article), Switzerland's leading French-language newspaper.

class Advanced User Recipe?1243078936(Basic News Recipe?):

title = u'Le Temps' oldest_article = 7 max_articles_per_feed = 100 no_stylesheets = True remove_tags = [dict(name='div', attrs={'id':'footer'})] remove_tags = [dict(name='div', attrs={'class':'box links'})] remove_tags = [dict(name='script')] extra_css = '.heading {font-size: 13px; line-height: 15px; margin: 20px 0;} \n h2 {font-size: 24px; line-height: 25px; margin-bottom: 14px;} \n .author {font-size: 11px; margin: 0 0 5px 0;} \n .lead {font-weight: 700; margin: 10px 0;} \n p {margin: 0 0 10px 0;}'

feeds = [

('Actualité', ' http://www.letemps.ch/rss/site/'), ('Monde', ' http://www.letemps.ch/rss/site/actualite/monde'), ('Suisse & Régions', ' http://www.letemps.ch/rss/site/actualite/suisse_regions'), ('Sciences & Environnement', ' http://www.letemps.ch/rss/site/actualite/sciences_environnement'), ('Société', ' http://www.letemps.ch/rss/site/actualite/societe'), ('Economie & Finance', ' http://www.letemps.ch/rss/site/economie_finance'), ('Economie & Finance - Finance', ' http://www.letemps.ch/rss/site/economie_finance/finance'), ('Economie & Finance - Fonds de placement', ' http://www.letemps.ch/rss/site/economie_finance/fonds_placement'), ('Economie & Finance - Carrières', ' http://www.letemps.ch/rss/site/economie_finance/carrieres'), ('Culture', ' http://www.letemps.ch/rss/site/culture'), ('Culture - Cinéma', ' http://www.letemps.ch/rss/site/culture/cinema'), ('Culture - Musiques', ' http://www.letemps.ch/rss/site/culture/musiques'), ('Culture - Scènes', ' http://www.letemps.ch/rss/site/culture/scenes'), ('Culture - Arts plastiques', ' http://www.letemps.ch/rss/site/culture/arts_plastiques'), ('Livres', ' http://www.letemps.ch/rss/site/culture/livres'), ('Opinions', ' http://www.letemps.ch/rss/site/opinions'), ('Opinions - Editoriaux', ' http://www.letemps.ch/rss/site/opinions/editoriaux'), ('Opinions - Invités', ' http://www.letemps.ch/rss/site/opinions/invites'), ('Opinions - Chroniques', ' http://www.letemps.ch/rss/site/opinions/chroniques'), ('Life Style?', ' http://www.letemps.ch/rss/site/lifestyle'), ('Life Style? - Luxe', ' http://www.letemps.ch/rss/site/lifestyle/luxe'), ('Life Style? - Horlogerie & Joaillerie', ' http://www.letemps.ch/rss/site/lifestyle/horlogerie_joaillerie'), ('Life Style? - Design', ' http://www.letemps.ch/rss/site/lifestyle/design'), ('Life Style? - Voyages', ' http://www.letemps.ch/rss/site/lifestyle/voyages'), ('Life Style? - Gastronomie', ' http://www.letemps.ch/rss/site/lifestyle/gastronomie'), ('Life Style? - Architecture & Immobilier', ' http://www.letemps.ch/rss/site/lifestyle/architecture_immobilier'), ('Life Style? - Automobile', ' http://www.letemps.ch/rss/site/lifestyle/automobile'), ('Sports', ' http://www.letemps.ch/rss/site/actualite/sports'),

]

def print_version(self, url):

return url.replace('Page', 'Facet/print')

comment:24 Changed 16 months ago by TMF

This is a news feed for WOZ Die Wochenzeitung ( German Wikipedia article, a German language Swiss left-wing weekly.

class Advanced User Recipe?1243078936(Basic News Recipe?):

title = u'WOZ Die Wochenzeitung' oldest_article = 7 max_articles_per_feed = 100 no_stylesheets = True remove_tags = [dict(name='p', attrs={'class':'arrow_top'})] remove_tags = [dict(name='p', attrs={'class':'bottom_right'})] remove_tags = [dict(name='script')] extra_css = '#print_titel{vertical-align: bottom; text-align: left; color: #666666; background-color: white; padding-top: 30px; padding-bottom: 10px; border-bottom: 1px solid #999999;} \n #title{text-align: left; font-size: large; font-weight: 600; padding-top: 0px; padding-bottom: 6px;} \n h3 {text-align: left; font-size: large; font-weight: 600; padding-top: 0px; padding-bottom: 6px;} \n #lead{font-weight: 600; padding-bottom: 6px;} \n h2{font-weight: 600; padding-bottom: 6px;} #author{color: #666666; padding-top: 0px; padding-bottom: 0px;} \n h4{color: #666666; padding-top: 0px; padding-bottom: 0px;} \n #author2 {color: #666666; padding-top: 0px; padding-bottom: 0px;} \n .dotted_line{padding-top: 0px; margin-bottom: 18px; border-bottom: 1px dotted #666666;} \n .intro{margin: 0 auto; font-weight: 600; padding-bottom: 18px;} \n h5{margin: 0 auto; font-weight: 600; padding-bottom: 18px;} \n .intro2{margin: 0 auto; font-weight: 600;} \n .text{padding-bottom: 18px;} \n .subtitle{margin: 0 auto; font-weight: 600; padding-bottom: 10px;} \n .articletitle{margin: 0 auto; font-weight: 600; padding-bottom: 10px;} \n #content_infobox{margin-top: 20px; margin-left: 0px; margin-right: 0px; margin-bottom: 10px; text-align: left; border-bottom: 1px solid #999999;} \n .content_infobox_titel{padding-top: 6px; padding-bottom: 8px; padding-left: 8px; padding-right: 8px; font-weight: 600; border-top: 1px solid #999999; border-bottom: 1px dotted #999999;} \n .content_infobox_text{padding-top: 6px; padding-bottom: 12px; padding-left: 8px; padding-right: 8px;} \n .box_gray{padding-top: 4px; padding-left: 7px; padding-right: 7px; padding-bottom: 4px;} \n .box_white { padding-top: 4px; padding-left: 7px; padding-right: 7px; padding-bottom: 4px;} \n .content_infobox_mehr{margin-top: 20px; margin-left: 0px; margin-right: 0px; margin-bottom: 10px; text-align: left; width: 600px; border-bottom: 1px solid #999999;}'

feeds = [('WOZ Die Wochenzeitung - Headlines', ' http://www.woz.ch/inhalt/headlinesRSS.php'),]

def print_version(self, url):

return url.replace('rss/', 'print_')

comment:25 Changed 14 months ago by kiodane

  • Keywords craigslist added
  • Component changed from web2lrf to News downloading

Best of for Craigslist (broken down by city)

class Advanced User Recipe?1247284744(Basic News Recipe?):

title = u'craigslist - Best Of' oldest_article = 365 max_articles_per_feed = 100

feeds = [(u'Best of craigslist', u' http://www.craigslist.org/about/best/all/index.rss'), (u'Ann Arbor', u' http://www.craigslist.org/about/best/aaa/index.rss'), (u'Asheville', u' http://www.craigslist.org/about/best/ash/index.rss'), (u'Austin', u' http://www.craigslist.org/about/best/aus/index.rss'), (u'Baltimore', u' http://www.craigslist.org/about/best/bal/index.rss'), (u'Birmingham', u' http://www.craigslist.org/about/best/bhm/index.rss'), (u'Boston', u' http://www.craigslist.org/about/best/bos/index.rss'), (u'Vermont', u' http://www.craigslist.org/about/best/brl/index.rss'), (u'Columbia', u' http://www.craigslist.org/about/best/cae/index.rss'), (u'Charlotte', u' http://www.craigslist.org/about/best/cha/index.rss'), (u'Chico', u' http://www.craigslist.org/about/best/chc/index.rss'), (u'Chicago', u' http://www.craigslist.org/about/best/chi/index.rss'), (u'Charleston', u' http://www.craigslist.org/about/best/chs/index.rss'), (u'Cleveland', u' http://www.craigslist.org/about/best/cle/index.rss'), (u'Calgary', u' http://www.craigslist.org/about/best/clg/index.rss'), (u'Colorado Springs', u' http://www.craigslist.org/about/best/cos/index.rss'), (u'Dallas', u' http://www.craigslist.org/about/best/dal/index.rss'), (u'Denver', u' http://www.craigslist.org/about/best/den/index.rss'), (u'Detroit Metro', u' http://www.craigslist.org/about/best/det/index.rss'), (u'Des Moines', u' http://www.craigslist.org/about/best/dsm/index.rss'), (u'Eau Claire', u' http://www.craigslist.org/about/best/eau/index.rss'), (u'Grand Rapids', u' http://www.craigslist.org/about/best/grr/index.rss'), (u'Hawaii', u' http://www.craigslist.org/about/best/hnl/index.rss'), (u'Jacksonville', u' http://www.craigslist.org/about/best/jax/index.rss'), (u'Knoxville', u' http://www.craigslist.org/about/best/knx/index.rss'), (u'Kansas City', u' http://www.craigslist.org/about/best/ksc/index.rss'), (u'South Florida', u' http://www.craigslist.org/about/best/mia/index.rss'), (u'Minneapolis', u' http://www.craigslist.org/about/best/min/index.rss'), (u'Maine', u' http://www.craigslist.org/about/best/mne/index.rss'), (u'Montreal', u' http://www.craigslist.org/about/best/mon/index.rss'), (u'Nashville', u' http://www.craigslist.org/about/best/nsh/index.rss'), (u'New York', u' http://www.craigslist.org/about/best/nyc/index.rss'), (u'Orange County', u' http://www.craigslist.org/about/best/orc/index.rss'), (u'Portland', u' http://www.craigslist.org/about/best/pdx/index.rss'), (u'Phoenix', u' http://www.craigslist.org/about/best/phx/index.rss'), (u'Pittsburgh', u' http://www.craigslist.org/about/best/pit/index.rss'), (u'Rhode Island', u' http://www.craigslist.org/about/best/prv/index.rss'), (u'Raleigh', u' http://www.craigslist.org/about/best/ral/index.rss'), (u'Rochester', u' http://www.craigslist.org/about/best/rcs/index.rss'), (u'San Antonio', u' http://www.craigslist.org/about/best/sat/index.rss'), (u'Santa Barbara', u' http://www.craigslist.org/about/best/sba/index.rss'), (u'San Diego', u' http://www.craigslist.org/about/best/sdo/index.rss'), (u'Seattle-Tacoma', u' http://www.craigslist.org/about/best/sea/index.rss'), (u'Sf Bay Area', u' http://www.craigslist.org/about/best/sfo/index.rss'), (u'Salt Lake City', u' http://www.craigslist.org/about/best/slc/index.rss'), (u'Spokane', u' http://www.craigslist.org/about/best/spk/index.rss'), (u'St Louis', u' http://www.craigslist.org/about/best/stl/index.rss'), (u'Sydney', u' http://www.craigslist.org/about/best/syd/index.rss'), (u'Toronto', u' http://www.craigslist.org/about/best/tor/index.rss'), (u'Vancouver BC', u' http://www.craigslist.org/about/best/van/index.rss'), (u'Washington DC', u' http://www.craigslist.org/about/best/wdc/index.rss')]

comment:26 follow-up: ↓ 27 Changed 14 months ago by kovidgoyal

Thanks, added.

comment:27 in reply to: ↑ 26 Changed 14 months ago by kiodane

Replying to kovidgoyal:

Thanks, added.

On second thought, it should just be the first link, as the others are only the cities that were on the first page of results in the RSS on the day I pulled the data. The complete city list is cumbersome, so just the "best of/All" page should be covered.

title = u'craigslist - Best Of' oldest_article = 365 max_articles_per_feed = 100

feeds = [(u'Best of craigslist', u' http://www.craigslist.org/about/best/all/index.rss')]

comment:28 Changed 14 months ago by kovidgoyal

OK changed.

comment:29 Changed 13 months ago by devilinside

  • Keywords craigslist removed

Hungarian Tech Net? recipe

class Hun Tech Net?(Basic News Recipe?):

title = u'Tech Net?' oldest_article = 3 description = u'Az ut\xf3bbi 3 nap Tech Net? h\xedrei' language = _('Hungarian') lang = 'hu' encoding = 'utf-8' author = 'Devilinside' max_articles_per_feed = 30 timefmt = ' [%Y, %b %d, %a]' extra_css =

body{font-family:Arial,Helvetica,sans-serif; font-size:small;} h1{font-size:large;}

remove_tags_after = dict(name='ul', attrs={'class':'cikk_bottom box'}) remove_tags_before = dict(name='div', attrs={'id':'c-main'}) remove_tags = [dict(name='div', attrs={'class':'wrp clr'})]

feeds = [(u'C\xedmlap', u' http://www.technet.hu/rss/cimoldal/'), (u'Tech Tud?', u' http://www.technet.hu/rss/techtud/'), (u'PDA M\xe1nia', u' http://www.technet.hu/rss/pdamania/'), (u'Telefon', u' http://www.technet.hu/rss/telefon/'), (u'Sz\xe1m\xedt\xf3g\xe9p', u' http://www.technet.hu/rss/notebook/'), (u'GPS', u' http://www.technet.hu/rss/gps/')]

comment:30 Changed 13 months ago by kovidgoyal

thanks, added

comment:31 Changed 12 months ago by tarun

'The Times of India' is the most read english newspaper(daily) in India. I tried my best but could only get the headlines, content went missing and now I'm struggling. After customising the Sony reader itself resets.

Useful links: Website URL: www.timesofindia.com

Main RSS feed:  http://timesofindia.indiatimes.com/rssfeeds/-2128936835.cms

Replacement URL links: a). News Link  http://timesofindia.indiatimes.com/news/india/Karnataka-HC-judge-declares-his-assets-/articleshow/4933280.cms b). Print version of above  http://timesofindia.indiatimes.com/articleshow/msid-4933280,prtpage-1.cms

Kindly send me the code and incorporate in your new release, TOI is important in India.

Thanks, Tarun.

comment:32 in reply to: ↑ description Changed 12 months ago by bhandarisaurabh1987

  • Type changed from enhancement to task

Replying to kovidgoyal:

This is a meta bug o collect requests for new news feeds, so that someone can look at them:

Requests so far:

*The Business World  http://www.businessworld.in/ *business standard http://www.business-standard.com/india/

comment:33 Changed 12 months ago by devilinside

  • Priority changed from trivial to minor
  • Type changed from task to enhancement

Hungarian military news portal recipe:

#!/usr/bin/env python
# vim:fileencoding=UTF-8:ts=4:sw=4:sta:et:sts=4:ai
from __future__ import with_statement

__license__   = 'GPL v3'
__copyright__ = '2009, Kovid Goyal <kovid@kovidgoyal.net>'
__docformat__ = 'restructuredtext en'

from calibre.web.feeds.news import BasicNewsRecipe

class HunMilNews(BasicNewsRecipe):
    title          = u'Honvedelem.hu'
    oldest_article = 3
    description = u'Katonah\xedrek'
    language = _('Hungarian')
    lang = 'hu'
    encoding = 'windows-1250'
    category = 'news, military'

    no_stylesheets         = True


    __author__ = 'Devilinside'
    max_articles_per_feed = 16
    no_stylesheets = True



    keep_only_tags = [dict(name='div', attrs={'class':'cikkoldal_cikk_cim'}),
 dict(name='div', attrs={'class':'cikkoldal_cikk_alcim'}),
 dict(name='div', attrs={'class':'cikkoldal_datum'}),
 dict(name='div', attrs={'class':'cikkoldal_lead'}),
 dict(name='div', attrs={'class':'cikkoldal_szoveg'}),
 dict(name='img', attrs={'class':'ajanlo_kep_keretes'}),
        ]



    feeds          = [(u'Misszi\xf3k', u'http://www.honvedelem.hu/rss_b?c=22'), 
 (u'Aktu\xe1lis hazai h\xedrek', u'http://www.honvedelem.hu/rss_b?c=3'), 
 (u'K\xfclf\xf6ldi h\xedrek', u'http://www.honvedelem.hu/rss_b?c=4'), 
 (u'A h\xf3nap t\xe9m\xe1ja', u'http://www.honvedelem.hu/rss_b?c=6'), 
 (u'Riport', u'http://www.honvedelem.hu/rss_b?c=5'), 
 (u'Portr\xe9k', u'http://www.honvedelem.hu/rss_b?c=7'), 
 (u'Haditechnika', u'http://www.honvedelem.hu/rss_b?c=8'), 
 (u'Programok, esem\xe9nyek', u'http://www.honvedelem.hu/rss_b?c=12')
        ]

Regards

comment:34 Changed 12 months ago by nicholasdp

Would it be possible to make a recipe for "Harvard Business Review" with access to subscriber content?

 http://hbr.harvardbusiness.org/

Thanks!

comment:35 Changed 12 months ago by kovidgoyal

I will need your account information for that: send it via email to kovid at kovidgoyal dot net

comment:36 Changed 12 months ago by bhandarisaurabh1987

can anyone make recipe for business standard ,it's a business news daily  http://www.business-standard.com/india/

comment:37 Changed 11 months ago by irishdaze

  • Type changed from enhancement to task

Please add to the next release the Ars Technica Recipe (by Michael Warner) if it works after testing? (props to Michael Warner!)

comment:38 follow-up: ↓ 39 Changed 11 months ago by kovidgoyal

  • Type changed from task to enhancement

calibre already has an Ars Technica recipe

comment:39 in reply to: ↑ 38 Changed 11 months ago by irishdaze

  • Status changed from new to closed
  • Resolution set to worksforme

Replying to kovidgoyal:

calibre already has an Ars Technica recipe

I just looked again and found it! My apologies, I have no idea how I missed it the first time! Thanks for setting me straight. :-)

comment:40 Changed 10 months ago by dueyfinster

Irish Independent:

__license__   = 'GPL v3'
__copyright__ = '2009 Neil Grogan'
# 
# Irish Independent Recipe
#
import re 
from calibre.web.feeds.news import BasicNewsRecipe

class IrishIndependent(BasicNewsRecipe):
    title          = u'Irish Independent'
    description = 'Irish and World news from Irelands Bestselling Daily Broadsheet'
    __author__	= 'Neil Grogan'
    oldest_article = 7
    max_articles_per_feed = 100
    remove_tags_before = dict(id='article')
    remove_tags_after  = [dict(name='div', attrs={'class':'toolsBottom'})]
    remove_tags    = [
	        dict(name='div',attrs={'class':'toolsBottom'}),
	        dict(name='div',attrs={'class':'toolsTop'}),
	        dict(name='div',attrs={'class':'boxRelated'}),
	        dict(name='div',attrs={'class':'section first'}),
	        dict(name='div',attrs={'class':'tabIt'}),
	        dict(name='div',attrs={'class':'inner'})
	        ]

    feeds	= [
    (u'Frontpage News', u'http://www.independent.ie/rss'),
    (u'World News', u'http://www.independent.ie/world-news/rss'),
    (u'Technology', u'http://www.independent.ie/business/technology/rss'),
    (u'Sport', u'http://www.independent.ie/sport/rss'),
    (u'Entertainment', u'http://www.independent.ie/entertainment/rss'),
    (u'Independent Woman', u'http://www.independent.ie/lifestyle/independent-woman/rss'),
    (u'Education',u'http://www.independent.ie/education/rss'),
    (u'Lifestyle', u'http://www.independent.ie/lifestyle/rss'),
    (u'Travel', u'http://www.independent.ie/travel/rss'), 
    (u'Letters', u'http://www.independent.ie/opinion/letters/rss'),
    (u'Weather', u'http://www.independent.ie/weather/rss')
    ]
    
# If text only articles are desired
#def print_version(self, url):
#    return '%s?service=Print' % url

Just a small thanks for an awesome program ;)

comment:41 Changed 9 months ago by davotibarna

The first Norwegian news source - the most popular newspaper in Norway:

class Aftenposten(Basic News Recipe?):

title = u'Aftenposten' author = 'davotibarna' description = 'Norske nyheter' language = 'no' oldest_article = 5 max_articles_per_feed = 100 no_stylesheets = True encoding = 'ISO-8859-1'

feeds = [(u'Aftenposten', u' http://www.aftenposten.no/eksport/rss-1_0/')]

def print_version(self, url):

return url.replace('#xtor=RSS-3', '?service=print')

comment:42 Changed 9 months ago by davotibarna

Better using a block for the code:

class Aftenposten(BasicNewsRecipe):
    title          = u'Aftenposten'
    __author__     = 'davotibarna'
    description    = 'Norske nyheter'
    language = 'no'
    oldest_article = 5
    max_articles_per_feed = 100
    no_stylesheets = True
    encoding = 'ISO-8859-1'

    feeds          = [(u'Aftenposten', u'http://www.aftenposten.no/eksport/rss-1_0/')]

    def print_version(self, url):
        return url.replace('#xtor=RSS-3', '?service=print')

comment:43 Changed 9 months ago by BarryTX

  • Keywords Houston, Chronicle added
  • Status changed from closed to reopened
  • Resolution worksforme deleted

Calibre = great program. I hope I can get some help for a custom recipe for The Houston Chronicle. Houston is one of the biggest cities in the US and only has one major newspaper, the Chronicle.

I have a basic recipe that will pull feeds but no pictures. I tried to modify the LA Times recipe but have not had any success. Would it be possible to get a robust Houston Chronicle recipe added to Calibre?

The list of Houston Chronicle news feeds are at www.chron.com/news/rss. If I look at the raw feeds in a section, the links follow the format of  http://feeds.chron.com/houstonchronicle/<section>.

An example of an individual article link looks like this:

 http://www.chron.com/disp/story.mpl/...n+%26+Texas%29

Thanks for all your support!

comment:44 Changed 8 months ago by kovidgoyal

Houston Chronicle will be in the next release.

comment:45 Changed 8 months ago by very-simple

  • Cc sam@… added
  • Keywords New York added; Houston, Chronicle removed

I would love to see a recipe for New York Magazine - here is the page of RSS feeds:  http://nymag.com/newyork/rss/

Note that, for some reason, some feeds (sections) are headline-only and some are full feed (like vulture and daily intelligencer).

I started trying to put this together myself, but I'm not very good at it.

Thanks in advance!

comment:46 Changed 8 months ago by stanfieldm

I would love a feed for the Minneapolis Star Tribune:

 http://www.startribune.com/help/12374506.html

Thanks!

comment:47 Changed 8 months ago by kovidgoyal

  • Status changed from reopened to closed
  • Resolution set to fixed

Fixesed in branch trunk. The fix will be in the next release.

comment:48 Changed 8 months ago by rogeliodh

Hi, I've created a recipe for "La Jornada", a Mexican newspaper.

__license__   = 'GPL v3'
__copyright__ = '2009, Rogelio Dominguez <rogelio.dominguez at gmail.com>'
'''
www.jornada.unam.mx
'''

from calibre.web.feeds.news import BasicNewsRecipe

import re

class LaJornada(BasicNewsRecipe):
    title          = u'La Jornada'
    oldest_article = 1
    max_articles_per_feed = 100
    keep_only_tags = [dict(id=['article-cont'])]
    remove_tags    = [dict(name='div', attrs={'class':['go gui','go gui top']})]
    no_stylesheets = True
    # Remove capitalized initial letter
    preprocess_regexps = [
                         (re.compile(r'<div class="inicial">(.*)</div>.*<p class="s-s">', re.DOTALL|re.IGNORECASE),
                          lambda match: match.group(1)),
                         ]
    feeds          = [(u'Edicion completa', u'http://www.jornada.unam.mx/rss/edicion.xml')]

comment:49 Changed 8 months ago by pablofunes

Hi Rogelio, funny I was also working on a recipe for La Jornada! Mine uses separate feeds, so it creates a table of contents with each section in the newspaper which is nice.

I don't know how to make the pictures go to the top, not the bottom of each article.

 http://bugs.calibre-ebook.com/attachment/wiki/UserRecipes/LaJornada.recipe

Cheers, Pablo Funes.

comment:50 Changed 7 months ago by qwe

Changed 7 months ago by mellian

Volkskrant paid version

comment:51 Changed 7 months ago by spqr

Hi, I'd really love to see a recipe for the german  Scinexx.de. They offer a quite simple Print-Version, so it should be not too complicated, to compose a recipe (for someone more familiar w/ python than me...

E.g. from Article " http://scinexx.de/wissen-aktuell-11161-2010-02-01.html" go to print version " http://scinexx.de/inc/artikel_drucken.php?id=11161&a_flag=1"

Thx in advance, Josef

comment:52 Changed 7 months ago by kovidgoyal

best place to ask for custom recipes is the custom recipes thread in the calibre forum, where lots of recipe writers hang out.

Closing this ticket, in the future requests should go to that thread.

 http://www.mobileread.com/forums/showthread.php?t=32543

comment:53 Changed 4 months ago by RedTie45

comment:54 Changed 4 months ago by kovidgoyal

best place to request custom recipes is  http://www.mobileread.com/forums/showthread.php?t=32543

Changed 4 months ago by ipaschke

Der Tagesspiegel, German

comment:55 Changed 4 months ago by admins

Really interesting wiki! Now i have new  source for reading in Russian.

comment:56 Changed 3 months ago by Limon460

Looking for the perfect job? If you are, then you first need to consider the quality of your resume that you plan on submitting to your dream company. Whatever you submit as basis for your credentials will create the companys impression of who you are. Thus, there is a need to learn the proper way of writing resumes, curriculum vitae and cover letters. These three main documents reveal basically reveal your personal information and your desire to become a part of the company.

  Resume Writing

comment:57 Changed 3 months ago by admins

Best place to ask for custom recipes is the custom recipes thread in the calibre forum, where lots of recipe writers hang out.

Regards from:  darba piedāvājumi

comment:58 Changed 3 months ago by Satarsa

  • Summary changed from Request for new news feeds to New news feed

The recipe for the Russian close-to-science newspaper 'Troitskiy variant' (Троицкий вариант).

class Trv(BasicNewsRecipe):

__license__   = 'GPL v3'
__copyright__ = '2010, Vadim Dyadkin dyadkin@lns.pnpi.spb.ru'


    title          = u'\u0422\u0440\u043e\u0438\u0446\u043a\u0438\u0439 \u0432\u0430\u0440\u0438\u0430\u043d\u0442'
    oldest_article = 30
    max_articles_per_feed = 100
    recursion = 4
    no_stylesheets = True
    simultaneous_downloads = 1

    keep_only_tags = [dict(name='h1'),
                      dict(name='div', attrs={'id' : 'content'})
                     ]

    remove_tags = [dict(name='div', attrs={'class' : ['dateright', 'postmeta', 'adsense-post', 'comments', 'nocomments', 'widgetarea', 'breadcrumb']}), {'id' : ['sidebar', 'l_sidebar', 'r_sidebar', 'footer', 'homepageright0']}, {'style' : 'clear:both;'},
                   dict(name='ul'),
                   dict(name='h2')
                ]

    feeds          = [(u'\u0422\u0440\u043e\u0438\u0446\u043a\u0438\u0439 \u0432\u0430\u0440\u0438\u0430\u043d\u0442', u'http://trv-science.ru/feed/')]
Note: See TracTickets for help on using tickets.