feeds2disk

Version: April 2009 (ubuntu - 07/07/09)

Section: 1 (User commands)

NAME

feeds2disk - part of calibre

SYNOPSIS

feeds2disk [options] ARG

DESCRIPTION

feeds2disk parses an online source of articles, such as an RSS or Atom feed, and fetches the article contents, organized in a hierarchy.

ARG can be one of:

file name - feeds2disk will try to load a recipe from the file

builtin recipe title - feeds2disk will load the builtin recipe and use it to fetch the feed. For example: Newsweek, "The BBC", or "The New York Times".

recipe as a string - feeds2disk will load the recipe directly from the string arg.

Available builtin recipes are: 'The Age', 'Al Jazeera in English', 'Ambito.com', 'The American Spectator', 'Ars Technica', 'Associated Press', 'The Atlantic', 'The Australian', 'B92', 'The BBC', "Barron's", 'Blic', 'Borba Online', 'Business Week', 'CNN', 'Chicago Breaking News', 'Chicago Tribune', 'Christian Science Monitor', 'Cincinnati Enquirer', 'Clarin', 'Common Dreams', 'Courrier International', 'Critica de la Argentina', 'Cyberpresse', 'DNA India', 'Daily Telegraph', 'Danas', 'De Morgen', 'De Standaard', 'Diario Granma', 'Die Zeit Nachrichten', 'Discover Magazine', 'E-Novine', 'EL PAIS', 'ESPN', 'The Economist', 'El Cronista', 'El Mercurio online', 'El Mundo', 'El Universal', 'ElArgentino.com', 'Engadget', 'Exiled Online', 'FAZ NET', 'FTD', 'Financial Times', 'Forbes', 'Freakonomics Blog', 'Fudzilla', 'Glasgow Herald', 'Globe and Mail', 'Google Reader', 'The Guardian', "Harper's Magazine", "Harper's Magazine - articles from printed edition", 'The Hindu', 'Honolulu Advertiser', 'Honolulu Star-Bulletin', 'Indianapolis Star', 'Infobae.com', 'The International Herald Tribune', 'The Irish Times', 'The Japan Times', 'Jerusalem Post', 'Joel on Software', 'Jornal Brasileiro Online', 'Jutarnji', 'Juventud Rebelde', 'Juventud Rebelde in english', 'La Cuarta', 'La Mujer de mi Vida', 'La Nacion', 'La Nacion Chile', 'La Prensa', 'La Segunda', 'La Tercera', 'LeMonde.fr', 'Liberation', 'Linux Magazine', 'Linuxdevices', 'London Review of Books', 'The Los Angeles Times', 'The Market Ticker', 'The Moscow Times', 'NASA', 'NIN online', 'NSPM in English', 'The Nation', 'New Scientist - Online News', 'New York Review of Books', 'The New York Times', 'The New York Times (subscription)', 'The New Yorker', 'Newstimes', 'Newsweek', 'Nova srpska politicka misao', 'O Estado de S. Paulo', 'O Globo', 'The Onion', 'Outlook India', 'Pagina/12', 'Pescanik', 'Physicstoday', 'Physicsworld', 'Pobjeda Online', 'Politika Online', 'Portfolio', 'Press Online', 'Reuters', 'San Francisco Chronicle', 'Science AAAS', 'ScienceDaily', 'ScienceNews', 'Scientific American', 'The Scotsman', 'Shacknews', 'Soldiers', 'Spiegel Online', 'Spiegel Online International', 'The St. Petersburg Times', 'Sueddeutsche', 'Supersite for Windows', 'Sydney Morning Herald', 'Telegraph.co.uk', 'Telepolis', 'Teleread Blog', 'Time', 'The Times Online', "Tom's Hardware German", "Tom's Hardware US", 'USA Today', 'United Press International', 'Utne reader', 'Vecernje Novosti', 'Vijesti', 'Vreme', 'The Wall Street Journal', 'Washington Post', 'Wired.com', 'heise', 'la Repubblica', 'securitywatch', 'xkcd', 'zdnet'

Whenever you pass arguments to feeds2disk that have spaces in them, enclose the arguments in quotation marks.
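
The quoting rule above can be sketched in Python for anyone launching feeds2disk from a script. This is a minimal illustration, not part of calibre: build_command is a hypothetical helper, and it relies only on the standard library's shlex.join to add quotation marks where an argument contains spaces.

```python
import shlex


def build_command(arg, *options):
    """Return a shell-safe feeds2disk command line (hypothetical helper).

    The layout follows the SYNOPSIS: feeds2disk [options] ARG.
    """
    argv = ["feeds2disk", *options, arg]
    # shlex.join quotes any element that contains spaces or other
    # shell-special characters and leaves plain words untouched.
    return shlex.join(argv)


# A recipe title with spaces is quoted; --test is passed through as-is.
command = build_command("The New York Times", "--test")
print(command)
```

When the command is started with subprocess.run and a list of arguments instead of a shell string, no quoting is needed, because each list element is passed to the program as one argument.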

OPTIONS

--version
show program's version number and exit
-h, --help
show this help message and exit
--feeds=FEEDS
Specify a list of feeds to download. For example: "['http://feeds.newsweek.com/newsweek/TopNews', 'http://feeds.newsweek.com/headlines/politics']". If you specify this option, any argument to feeds2disk is ignored and a default recipe is used to download the feeds.
--verbose
Be more verbose while processing.
--title=TITLE
The title for this recipe. Used as the title for any ebooks created from the downloaded feeds.
--username=USERNAME
Username for sites that require a login to access content.
--password=PASSWORD
Password for sites that require a login to access content.
--lrf
Optimize fetching for subsequent conversion to LRF.
--recursions=RECURSIONS
Number of levels of links to follow on webpages that are linked to from feeds. Default: 0
--output-dir=OUTPUT_DIR
The directory in which to store the downloaded feeds. Defaults to the current directory.
--no-progress-bar
Don't show the progress bar.
--debug
Very verbose output, useful for debugging.
--test
Useful for recipe development. Forces max_articles_per_feed to 2 and downloads at most 2 feeds.
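
The value given to --feeds in the example above looks like a Python list literal, so one way to produce it from a script is sketched below. This is an assumption-laden illustration: how feeds2disk parses the value internally is not stated here, the title "Newsweek digest" and the directory "out" are made-up placeholders, and the sketch omits ARG on the basis that any argument is ignored when --feeds is given.

```python
import ast
import shlex

feeds = [
    "http://feeds.newsweek.com/newsweek/TopNews",
    "http://feeds.newsweek.com/headlines/politics",
]

# repr() of a list of strings gives the bracketed, single-quoted form
# used in the --feeds example.
feeds_value = repr(feeds)

# Sanity check: the text parses back into the same list of URLs.
assert ast.literal_eval(feeds_value) == feeds

# Placeholder title and output directory, chosen only for illustration.
argv = [
    "feeds2disk",
    "--feeds=" + feeds_value,
    "--title=Newsweek digest",
    "--output-dir=out",
]

# shlex.join escapes the spaces and embedded quotes for a POSIX shell.
command = shlex.join(argv)
print(command)
```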
WEB2DISK OPTIONS:
Options to control web2disk (used to fetch websites linked from feeds)
-t TIMEOUT, --timeout=TIMEOUT
Timeout in seconds to wait for a response from the server. Default: 10.0 s
--delay=DELAY
Minimum interval in seconds between consecutive fetches. Default: 0 s
--encoding=ENCODING
The character encoding for the websites you are trying to download. The default is to try to guess the encoding.
--match-regexp=MATCH_REGEXPS
Only links that match this regular expression will be followed. This option can be specified multiple times, in which case as long as a link matches any one regexp, it will be followed. By default all links are followed.
--filter-regexp=FILTER_REGEXPS
Any link that matches this regular expression will be ignored. This option can be specified multiple times, in which case as long as any regexp matches a link, it will be ignored. By default, no links are ignored. If both --filter-regexp and --match-regexp are specified, then --filter-regexp is applied first.
--dont-download-stylesheets
Do not download CSS stylesheets.
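
The interaction between --match-regexp and --filter-regexp described above can be sketched in Python. This is an illustration of the documented rules, not web2disk's actual code: should_follow is a hypothetical function, and the use of re.search (a match anywhere in the link) is an assumption, since the matching mode is not stated here.

```python
import re


def should_follow(link, match_regexps=(), filter_regexps=()):
    """Decide whether a link would be followed (hypothetical sketch)."""
    # --filter-regexp is applied first: any matching pattern rejects the link.
    if any(re.search(pattern, link) for pattern in filter_regexps):
        return False
    # With no --match-regexp given, all remaining links are followed.
    if not match_regexps:
        return True
    # Otherwise, matching any one of the patterns is enough.
    return any(re.search(pattern, link) for pattern in match_regexps)


# Example patterns and URLs below are invented for illustration.
match = [r"/articles/", r"/news/"]
ignore = [r"\.pdf$"]

print(should_follow("http://example.com/news/today.html", match, ignore))
print(should_follow("http://example.com/news/report.pdf", match, ignore))
print(should_follow("http://example.com/about.html", match, ignore))
```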

Created by Kovid Goyal <kovid@kovidgoyal.net>

SEE ALSO

http://calibre.kovidgoyal.net