68d4dd463a
This brings us back to only honouring robots.txt on page downloads, not on image downloads. Rationale: Dosage is not a "robot" in the classical sense. It's not designed to spider huge amounts of web sites in search for some content to index, it's only intended to help users keep a personal archive of comics he is interested in. We try very hard to never download any image twice. This fixes #24. (Precedent for this rationale: Google Feedfetcher: https://support.google.com/webmasters/answer/178852?hl=en#robots) |
||
---|---|---|
.. | ||
plugins | ||
__init__.py | ||
ansicolor.py | ||
colorama.py | ||
comic.py | ||
configuration.py | ||
decorators.py | ||
director.py | ||
events.py | ||
fileutil.py | ||
helpers.py | ||
languages.py | ||
loader.py | ||
output.py | ||
rss.py | ||
scraper.py | ||
singleton.py | ||
updater.py | ||
util.py |