Commit graph

462 commits

Author SHA1 Message Date
Bastian Kleineidam
6ca4eaa492 Code cleanup. 2013-04-11 18:27:43 +02:00
Bastian Kleineidam
7e593cf7e8 Add firstStripUrls. 2013-04-10 23:57:09 +02:00
Bastian Kleineidam
a0c7f54871 Fix zwarwald 2013-04-10 20:14:43 +02:00
Bastian Kleineidam
fb05c10808 Sort entries. 2013-04-10 18:36:33 +02:00
Bastian Kleineidam
54283775a8 Add ForLackOfABetterComic 2013-04-10 18:20:39 +02:00
Bastian Kleineidam
d00310f017 Add EverydayBlues 2013-04-10 18:20:08 +02:00
Bastian Kleineidam
8b99b59056 Added DamnLol 2013-04-10 18:19:38 +02:00
Bastian Kleineidam
5127d4c895 Use re.escape and add some firstStripUrl. 2013-04-10 18:19:11 +02:00
Bastian Kleineidam
3213eebd75 Added ZenPencils. 2013-04-09 19:38:47 +02:00
Bastian Kleineidam
e040dd0d50 Added Science. 2013-04-09 19:38:16 +02:00
Bastian Kleineidam
68f14971e8 Added RealmOfAtland. 2013-04-09 19:37:47 +02:00
Bastian Kleineidam
f9179e9de5 Added GoblinsComic 2013-04-09 19:37:24 +02:00
Bastian Kleineidam
f71961acbc Added ExtraOrdinary. 2013-04-09 19:36:51 +02:00
Bastian Kleineidam
190ffcd390 Use str() for robotparser. 2013-04-09 19:36:00 +02:00
Bastian Kleineidam
b9dc385ff2 Implemented voting 2013-04-09 19:33:50 +02:00
Bastian Kleineidam
4528281ddd Voting part 2 2013-04-08 21:20:01 +02:00
Bastian Kleineidam
e762f269b7 First part of voting stuff. 2013-04-08 20:19:10 +02:00
Bastian Kleineidam
7584f0b647 Add version update check. 2013-04-08 20:17:02 +02:00
Bastian Kleineidam
781bac0ca2 Feed text content instead of binary to robots.txt parser. 2013-04-07 18:11:29 +02:00
Bastian Kleineidam
bd1d41b83c Write encoded data in binary format. 2013-04-05 19:27:30 +02:00
Bastian Kleineidam
0fbc005377 A Python3 fix. 2013-04-05 18:57:44 +02:00
Bastian Kleineidam
97522bc5ae Use tuples rather than lists. 2013-04-05 18:55:19 +02:00
Bastian Kleineidam
adb31d84af Use HTMLParser.unescape instead of rolling our own function. 2013-04-05 18:53:19 +02:00
Bastian Kleineidam
1c9f64bc27 Better name for Sketchesnatched. 2013-04-05 18:47:51 +02:00
Bastian Kleineidam
9e26640407 Augment SketcheSnatched 2013-04-05 07:31:22 +02:00
Bastian Kleineidam
50b742721b SketcheSnatched 2013-04-05 07:20:50 +02:00
Bastian Kleineidam
3936cfa9ce Another fix. 2013-04-05 06:56:33 +02:00
Bastian Kleineidam
6aa588860d Code cleanup 2013-04-05 06:36:05 +02:00
Bastian Kleineidam
fabe872d1d Fix SnowFlame 2013-04-04 18:32:37 +02:00
Bastian Kleineidam
8150dabfa6 Remove SarahZero 2013-04-04 18:32:29 +02:00
Bastian Kleineidam
5d6e210c98 Fix Curvy 2013-04-04 18:30:27 +02:00
Bastian Kleineidam
b3cbad37bc Remove CaribbeanBlue 2013-04-04 18:30:16 +02:00
Bastian Kleineidam
80c24a10c0 Fix WebDesignerCOTW 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
62af4b875e Fix Precocious 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
421e31c961 Fix Oglaf 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
d794919e73 Fix LasLindas 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
08a3587df6 Fix KatzenfutterGeleespritzer 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
c57226f1c0 Remove GreystoneInn 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
3078c5ec73 Fix ExtraLife and EyeOfRamalach 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
3fd4cfea0d Fix DasLebenIstKeinPonyhof 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
460c5be689 Add POST support to urlopen(). 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
44c3fb9f16 Remove broken scripted plugins. 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
0054ebfe0b Some Python3 fixes. 2013-04-03 20:32:43 +02:00
Bastian Kleineidam
2c0ca04882 Fix warning for scrapers with multiple image patterns. 2013-04-03 20:32:19 +02:00
Bastian Kleineidam
f53a516219 Use output logging instead of print statement. 2013-04-03 20:31:10 +02:00
Bastian Kleineidam
a972729c0d Add WebDesignerCOTW 2013-04-03 20:30:51 +02:00
Bastian Kleineidam
fdab3b7b35 Add StuffNoOneToldMe 2013-04-03 20:30:29 +02:00
Bastian Kleineidam
43255872c3 Added SnowFlakes. 2013-04-03 20:30:16 +02:00
Bastian Kleineidam
6303a1cb20 Updated scripted plugins. 2013-04-03 20:27:12 +02:00
Bastian Kleineidam
f737486754 Fix hagar. 2013-03-26 20:12:26 +01:00
Bastian Kleineidam
a2f343226f Remove duplicate dilbert. 2013-03-26 20:02:13 +01:00
Bastian Kleineidam
1d7f7a8517 Fix genre list 2013-03-26 19:58:22 +01:00
Bastian Kleineidam
141d5113de Fix hagarthehorrible 2013-03-26 19:54:00 +01:00
Bastian Kleineidam
b62f1ba69d Code cleanup. 2013-03-26 17:36:06 +01:00
Bastian Kleineidam
31d95d1d03 Remove DerFlix 2013-03-26 17:35:58 +01:00
Bastian Kleineidam
3dd2daf223 Updated scripted plugins. 2013-03-26 17:35:47 +01:00
Bastian Kleineidam
92150ddbda Add HagarTheHorrible 2013-03-26 17:35:10 +01:00
Bastian Kleineidam
75f3d59e85 Fix Eriadan 2013-03-26 17:34:56 +01:00
Bastian Kleineidam
de3ce2ec95 Fix WormWorldSaga* 2013-03-26 17:34:27 +01:00
Bastian Kleineidam
a3d74c5a0e Fix BratHalla and BrentalFloss 2013-03-26 17:33:51 +01:00
Bastian Kleineidam
10985ae614 Add genre tags. 2013-03-26 17:33:27 +01:00
Bastian Kleineidam
fcdc67ef92 Fix documentation. 2013-03-26 17:29:20 +01:00
Bastian Kleineidam
110a67cda4 Retry failed page content downloads (eg. timeouts). 2013-03-25 19:49:09 +01:00
Bastian Kleineidam
ec33276fd7 Print stacktrace on image errors. 2013-03-25 19:48:47 +01:00
Bastian Kleineidam
1a7dfc02d2 Add Schuelert 2013-03-25 19:48:32 +01:00
Bastian Kleineidam
af10385da1 Add firstStripUrl for KevinAndKell. 2013-03-25 19:48:19 +01:00
Bastian Kleineidam
940a04b499 Fix comic searching. 2013-03-25 19:48:01 +01:00
Bastian Kleineidam
bafe981917 Add DrMcNinja. 2013-03-25 19:47:44 +01:00
Bastian Kleineidam
9d1f286424 Improved documentation. 2013-03-25 19:47:29 +01:00
Bastian Kleineidam
c99827935b Updated plugins with scripts. 2013-03-25 19:40:38 +01:00
Bastian Kleineidam
01c2afc264 Print exception tracebacks. 2013-03-25 19:39:37 +01:00
Tobias Gruetzmacher
a1b5bfb68f Add a simple event consumer to write JSON metadata.
This drops a file named dosage.json in each comic directory. This is
still not perfect, but something to build upon.
2013-03-24 16:55:30 +01:00
Tobias Gruetzmacher
0a218c0283 Add event comicPageLink for every previous link.
This event allows a listener to build connections between pages.
2013-03-24 16:36:02 +01:00
Bastian Kleineidam
9f08b21a7e Get correct images of gocomic strips. 2013-03-24 14:13:33 +01:00
Bastian Kleineidam
179ba7f49f Add release info. 2013-03-21 19:04:59 +01:00
Bastian Kleineidam
2b98cf0079 CucumberQuest fixes. 2013-03-21 18:38:40 +01:00
Bastian Kleineidam
3f6df92fef Added some comics, fixed some. 2013-03-21 18:33:16 +01:00
Bastian Kleineidam
448e80eaed Added MyCartoons 2013-03-20 21:42:04 +01:00
Bastian Kleineidam
2e3907d942 Add Katzenfuttergeleespritzer and ParallelUniversum 2013-03-20 17:39:49 +01:00
Bastian Kleineidam
3937cfba4b Added SandraAndWooGerman 2013-03-19 20:54:16 +01:00
Bastian Kleineidam
78fb63859c Add DemolitionSquad. 2013-03-19 20:45:59 +01:00
Bastian Kleineidam
17fe58b864 Fix some comics. 2013-03-19 20:45:18 +01:00
Bastian Kleineidam
178d8f80b2 Fix Dilbert image naming. 2013-03-18 18:15:19 +01:00
Bastian Kleineidam
88224fe21a Add DogHouseDiaries, update changelog. 2013-03-15 07:04:19 +01:00
Bastian Kleineidam
6a2f55ddef Dont stop on image regex errors. 2013-03-15 07:03:54 +01:00
Bastian Kleineidam
e88cf514a7 Fixes for FonFlatter 2013-03-13 18:31:58 +01:00
Bastian Kleineidam
e739eb7992 Added CucumberQuest 2013-03-12 21:36:13 +01:00
Bastian Kleineidam
79f0f4b36c Added OrnerBoy 2013-03-12 21:23:26 +01:00
Bastian Kleineidam
502a35166f Added KickInTheHead 2013-03-12 21:16:17 +01:00
Bastian Kleineidam
7c4ac0df7b Add and fix some comics. 2013-03-12 20:49:46 +01:00
Bastian Kleineidam
43f20270d0 Allow a list of regular expressions for image and previous link search. 2013-03-12 20:48:26 +01:00
Bastian Kleineidam
6de26aeeaa Updated keenspot scraper and its comic list. 2013-03-12 20:47:52 +01:00
Bastian Kleineidam
2bf7d16090 Updated comicgenesis comic list. 2013-03-12 20:47:38 +01:00
Bastian Kleineidam
58abcb282d Added GeeksNextDoor. 2013-03-11 22:51:45 +01:00
Bastian Kleineidam
737f1e189d Added FullFrontalNerdity. 2013-03-11 22:45:30 +01:00
Bastian Kleineidam
538523f86c Add keenspot. 2013-03-11 22:03:17 +01:00
Bastian Kleineidam
a16bf6c16b Rename keenspot to comicgenesis and enable it. 2013-03-11 21:50:49 +01:00
Bastian Kleineidam
f0eaba0f69 Reenable comicgenesis comics. 2013-03-11 20:33:56 +01:00
Bastian Kleineidam
950a958e30 Updated for release. [ci skip] 2013-03-11 20:14:27 +01:00
Bastian Kleineidam
7eaf12caf6 Fix LookingForGroup 2013-03-11 19:56:37 +01:00
Bastian Kleineidam
7ee73caf3c Allow multiple event output and improve HTML output. 2013-03-11 17:33:59 +01:00
Bastian Kleineidam
75e576f2de Embed images in html output. 2013-03-09 21:39:43 +01:00
Bastian Kleineidam
8b0a523f77 Page comic listings. 2013-03-09 09:00:50 +01:00
Bastian Kleineidam
5ccf44c36a Embed images in html output. 2013-03-08 22:38:11 +01:00
Bastian Kleineidam
88e28f3923 Fix some comics and add language tag. 2013-03-08 22:33:05 +01:00
Bastian Kleineidam
b368f125bc Fix some comics. 2013-03-08 06:47:00 +01:00
Bastian Kleineidam
4c344765ff Add option to wait before downloading. 2013-03-08 06:46:50 +01:00
Bastian Kleineidam
0ee0822e00 Fix some comics. 2013-03-08 00:06:55 +01:00
Bastian Kleineidam
2bdf0d588d Simplify exception handling. 2013-03-08 00:06:50 +01:00
Bastian Kleineidam
1d7410c038 Added Zwarwald and AhoiPolloi 2013-03-07 23:51:55 +01:00
Bastian Kleineidam
8259a01d64 Fix URLs with no content type header. 2013-03-07 23:08:37 +01:00
Bastian Kleineidam
1cc7d39047 Fix some comics. 2013-03-07 23:08:17 +01:00
Bastian Kleineidam
0215ae82af Fix some comics. 2013-03-07 19:54:18 +01:00
Bastian Kleineidam
e96c68c378 Fix dorktower. 2013-03-07 18:24:12 +01:00
Bastian Kleineidam
7d8786c1d2 Code cleanup. 2013-03-07 18:22:49 +01:00
Bastian Kleineidam
736d9aa8cf Code cleanup. 2013-03-07 18:22:39 +01:00
Bastian Kleineidam
23c20bfe32 Fix some comics. 2013-03-07 18:22:24 +01:00
Bastian Kleineidam
d1e5ad2696 Set proper HTML5 doctype and encoding for HTML output. 2013-03-07 18:21:05 +01:00
Bastian Kleineidam
9f13af7750 Retry empty downloads and don't set a manual modification time. 2013-03-07 18:20:38 +01:00
Bastian Kleineidam
6f2aebe8c0 Updated copyright. 2013-03-07 18:19:50 +01:00
Bastian Kleineidam
106a15b6c3 Add missing attribute. 2013-03-06 20:23:43 +01:00
Bastian Kleineidam
10eb1ff5ec Fix dilbert filenames. 2013-03-06 20:21:20 +01:00
Bastian Kleineidam
d7925ba4a2 Sort comics. 2013-03-06 20:21:10 +01:00
Bastian Kleineidam
bae2a96d8b Added some comic strips and cleanup the scraper code. 2013-03-06 20:00:30 +01:00
Bastian Kleineidam
3a22c05050 Catch WindowsError when initializing colorama. 2013-03-05 21:15:25 +01:00
Bastian Kleineidam
01177e25f0 Updated generated comic lists. 2013-03-05 19:06:00 +01:00
Bastian Kleineidam
c13aa323d8 Code cleanup [ci skip] 2013-03-04 21:44:26 +01:00
Bastian Kleineidam
61a02630b8 Remove duplicate comic entries. 2013-03-04 19:40:10 +01:00
Bastian Kleineidam
4047859c5b Fix BrentalFlossFit 2013-03-04 19:37:26 +01:00
wummel
c0440266cf Merge pull request #12 from TobiX/some-new-comics
Some new comics
2013-03-04 10:13:51 -08:00
Bastian Kleineidam
3712799ee0 Add imageUrlModifier() for scrapers. 2013-03-04 19:10:27 +01:00
Bastian Kleineidam
44d696c4af Flush file contents to disk and check for empty files. 2013-03-04 19:10:26 +01:00
Bastian Kleineidam
60b160bcdf Prevent double slash in support url 2013-03-04 19:10:26 +01:00
Tobias Gruetzmacher
fc3fab8500 Add Namesake. 2013-03-03 22:41:11 +01:00
Tobias Gruetzmacher
bf13b13ab6 Add StickyDillyBuns.
One of the Pixie Trix Comix.
2013-03-03 22:03:27 +01:00
Tobias Gruetzmacher
1af8a99594 Added MenageA3 (ma3comics.com).
One of the Pixie Trix Comix.
2013-03-03 21:52:08 +01:00
Tobias Gruetzmacher
d668f5fc1e Add DangerouslyChloe.
One of the Pixie Trix Comix.
2013-03-03 21:31:44 +01:00
Tobias Gruetzmacher
4036ce06ef Add MagickChicks.
One of the Pixie Trix Comix.
2013-03-03 20:50:21 +01:00
Tobias Gruetzmacher
af57e018a1 Add ShadowGirls. 2013-03-03 18:59:16 +01:00
Tobias Gruetzmacher
5c85e9a2f2 Add BrentalFloss.
"Flossed in Time" does not work ATM since there are errors in the image
URL.
2013-03-03 18:58:21 +01:00
Tobias Gruetzmacher
89f1170ff4 Add AlphaLuna and AlphaLuna/Spanish. 2013-03-03 15:58:40 +01:00
Bastian Kleineidam
fba7f6e527 Updated comic plugins. 2013-03-01 20:55:55 +01:00
Bastian Kleineidam
d7daf67e08 Fix some comics. 2013-02-27 19:40:54 +01:00
Bastian Kleineidam
09df20cd1f Fix some comics and increase travis test number. 2013-02-26 06:12:46 +01:00
Bastian Kleineidam
41c954b309 Another try on URL quoting. 2013-02-23 09:08:08 +01:00
Bastian Kleineidam
953dc62ffd Fix some comics. 2013-02-23 09:07:44 +01:00
Bastian Kleineidam
ec6e59e53c Fix Chucklebrain 2013-02-22 20:29:05 +01:00
Bastian Kleineidam
6793aecbd3 Fix OneQuestion. 2013-02-22 20:23:47 +01:00
Bastian Kleineidam
889056b8e3 Fix PicPakDog 2013-02-22 19:43:33 +01:00
Bastian Kleineidam
2eb7b43dd2 Remove drunkduck awards. 2013-02-21 19:51:10 +01:00
Bastian Kleineidam
f36ed46d6a Fix tests which hit the first URL. 2013-02-21 19:48:21 +01:00
Bastian Kleineidam
d0c3492cc7 Catch robots.txt errors. 2013-02-21 19:48:04 +01:00
Bastian Kleineidam
b453c442c2 Fix some comics. 2013-02-21 19:47:37 +01:00
Bastian Kleineidam
1a84431456 Add Caggage 2013-02-21 19:47:21 +01:00
Bastian Kleineidam
292c58633c Fix AstronomyPOTD 2013-02-20 20:52:37 +01:00
Bastian Kleineidam
725001155a Updated generated comics. 2013-02-20 20:52:23 +01:00
Bastian Kleineidam
ae0e9feea1 Remember skipped URLs. 2013-02-20 20:51:39 +01:00
Bastian Kleineidam
91c32515d5 Fix some comics. 2013-02-19 20:58:04 +01:00
Bastian Kleineidam
8e2a01f19f Fix some comics. 2013-02-18 20:55:54 +01:00
Bastian Kleineidam
79795115f0 Do not sort module lists. 2013-02-18 20:40:35 +01:00
Bastian Kleineidam
be1694592e Do not stream page content URLs. 2013-02-18 20:38:59 +01:00
Bastian Kleineidam
96edb60e01 Fix some comics. 2013-02-18 20:38:44 +01:00
Bastian Kleineidam
17f1988197 Fix Catalyst 2013-02-18 20:03:54 +01:00
Bastian Kleineidam
270510bdc5 Fix AstronomyPOTD 2013-02-18 20:03:42 +01:00
Bastian Kleineidam
6155b022a6 Allow selected strips without images. 2013-02-18 20:03:27 +01:00
Bastian Kleineidam
4f03963b9e Code cleanup. 2013-02-18 20:02:16 +01:00
Bastian Kleineidam
c4191158ec Sort scrapers only when listing them. 2013-02-18 20:01:50 +01:00
Bastian Kleineidam
dc9334cca9 Fix scraperclass function. Closes issue #7. 2013-02-18 19:59:16 +01:00
Bastian Kleineidam
495b6d006d Fix some comics. 2013-02-16 14:54:08 +01:00
Bastian Kleineidam
a99fbbcf45 Fix ASofterWorld 2013-02-16 14:18:43 +01:00
Bastian Kleineidam
da9eee3bc0 Updated copyright. 2013-02-15 18:32:36 +01:00
Bastian Kleineidam
deae84d8fa Updated comicfury. 2013-02-14 21:28:34 +01:00
Bastian Kleineidam
40de445d8c Allow multiple comic name matches. 2013-02-13 22:18:05 +01:00
Bastian Kleineidam
8a33871df8 Fix some comicfury stuff. 2013-02-13 22:17:39 +01:00
Bastian Kleineidam
93c48fb7e2 Make _BasicScraper hashable. 2013-02-13 20:00:16 +01:00
Bastian Kleineidam
23a1acd398 Add firstStripUrl to scrapers. 2013-02-13 19:59:59 +01:00
Bastian Kleineidam
312d117ff3 Rename get_scrapers to get_scraperclasses 2013-02-13 19:59:13 +01:00
Bastian Kleineidam
96bf9ef523 Recognize internal server errors. 2013-02-13 17:54:10 +01:00
Bastian Kleineidam
752bf1c6ef Updated plugins. 2013-02-13 17:53:25 +01:00
Bastian Kleineidam
e3722c1220 Add SandraAndWoo, SupernormalStep 2013-02-13 17:53:11 +01:00
Bastian Kleineidam
c422a23e27 Add ManlyGuysDoingManlyThings 2013-02-13 17:52:49 +01:00
Bastian Kleineidam
7da45ffe11 Fix LasLindas 2013-02-13 17:52:32 +01:00
Bastian Kleineidam
f16e860f1e Only cache robots.txt URL on memoize. 2013-02-13 17:52:07 +01:00
Bastian Kleineidam
7a98cf7599 Updated copyright. 2013-02-13 06:28:35 +01:00
Bastian Kleineidam
67af7bd115 Fix GUComics 2013-02-13 06:27:46 +01:00
Bastian Kleineidam
e38a766db3 Updated generated plugins. 2013-02-12 21:54:56 +01:00
Bastian Kleineidam
49ddcecb72 Fix Petitesymphony. 2013-02-12 21:14:57 +01:00
Bastian Kleineidam
093c2dcddc Fix EyeOfRamalach 2013-02-12 21:14:44 +01:00
Bastian Kleineidam
7375fa042f Fix AlienShores 2013-02-12 21:14:32 +01:00
Bastian Kleineidam
9ec4a44953 Remove universal strips since they are almost all duplicated and the rest is useless. 2013-02-12 20:56:02 +01:00
Bastian Kleineidam
10f6a1caa1 Correct path quoting. 2013-02-12 17:55:33 +01:00
Bastian Kleineidam
ebfc6cba70 Fix LookingForGroup. 2013-02-12 17:55:13 +01:00
Bastian Kleineidam
6d0fffd825 Always use connection pooling. 2013-02-12 17:55:13 +01:00
Bastian Kleineidam
82ada5fba0 Updated copyright. 2013-02-11 19:54:50 +01:00
Bastian Kleineidam
a35c54525d Work around a bug in python requests. 2013-02-11 19:52:59 +01:00
Bastian Kleineidam
14f0a6fe78 Do not prefetch content with requests >= 1.0 2013-02-11 19:45:21 +01:00
Bastian Kleineidam
67836942d8 Simplify the fetchUrl code. 2013-02-11 19:43:46 +01:00
Bastian Kleineidam
3f0816efe2 Updated copyright 2013-02-10 18:25:21 +01:00
Bastian Kleineidam
9fa9af639b Remove duplicate comic. 2013-02-10 18:24:21 +01:00
Bastian Kleineidam
1c24fca199 Updated comic from generated lists. 2013-02-10 15:07:21 +01:00