Commit graph

462 commits

Author SHA1 Message Date
Bastian Kleineidam
f36ed46d6a Fix tests which hit the first URL. 2013-02-21 19:48:21 +01:00
Bastian Kleineidam
d0c3492cc7 Catch robots.txt errors. 2013-02-21 19:48:04 +01:00
Bastian Kleineidam
b453c442c2 Fix some comics. 2013-02-21 19:47:37 +01:00
Bastian Kleineidam
1a84431456 Add Caggage 2013-02-21 19:47:21 +01:00
Bastian Kleineidam
292c58633c Fix AstronomyPOTD 2013-02-20 20:52:37 +01:00
Bastian Kleineidam
725001155a Updated generated comics. 2013-02-20 20:52:23 +01:00
Bastian Kleineidam
ae0e9feea1 Remember skipped URLs. 2013-02-20 20:51:39 +01:00
Bastian Kleineidam
91c32515d5 Fix some comics. 2013-02-19 20:58:04 +01:00
Bastian Kleineidam
8e2a01f19f Fix some comics. 2013-02-18 20:55:54 +01:00
Bastian Kleineidam
79795115f0 Do not sort module lists. 2013-02-18 20:40:35 +01:00
Bastian Kleineidam
be1694592e Do not stream page content URLs. 2013-02-18 20:38:59 +01:00
Bastian Kleineidam
96edb60e01 Fix some comics. 2013-02-18 20:38:44 +01:00
Bastian Kleineidam
17f1988197 Fix Catalyst 2013-02-18 20:03:54 +01:00
Bastian Kleineidam
270510bdc5 Fix AstronomyPOTD 2013-02-18 20:03:42 +01:00
Bastian Kleineidam
6155b022a6 Allow selected strips without images. 2013-02-18 20:03:27 +01:00
Bastian Kleineidam
4f03963b9e Code cleanup. 2013-02-18 20:02:16 +01:00
Bastian Kleineidam
c4191158ec Sort scrapers only when listing them. 2013-02-18 20:01:50 +01:00
Bastian Kleineidam
dc9334cca9 Fix scraperclass function. Closes issue #7. 2013-02-18 19:59:16 +01:00
Bastian Kleineidam
495b6d006d Fix some comics. 2013-02-16 14:54:08 +01:00
Bastian Kleineidam
a99fbbcf45 Fix ASofterWorld 2013-02-16 14:18:43 +01:00
Bastian Kleineidam
da9eee3bc0 Updated copyright. 2013-02-15 18:32:36 +01:00
Bastian Kleineidam
deae84d8fa Updated comicfury. 2013-02-14 21:28:34 +01:00
Bastian Kleineidam
40de445d8c Allow multiple comic name matches. 2013-02-13 22:18:05 +01:00
Bastian Kleineidam
8a33871df8 Fix some comicfury stuff. 2013-02-13 22:17:39 +01:00
Bastian Kleineidam
93c48fb7e2 Make _BasicScraper hashable. 2013-02-13 20:00:16 +01:00
Bastian Kleineidam
23a1acd398 Add firstStripUrl to scrapers. 2013-02-13 19:59:59 +01:00
Bastian Kleineidam
312d117ff3 Rename get_scrapers to get_scraperclasses 2013-02-13 19:59:13 +01:00
Bastian Kleineidam
96bf9ef523 Recognize internal server errors. 2013-02-13 17:54:10 +01:00
Bastian Kleineidam
752bf1c6ef Updated plugins. 2013-02-13 17:53:25 +01:00
Bastian Kleineidam
e3722c1220 Add SandraAndWoo, SupernormalStep 2013-02-13 17:53:11 +01:00
Bastian Kleineidam
c422a23e27 Add ManlyGuysDoingManlyThings 2013-02-13 17:52:49 +01:00
Bastian Kleineidam
7da45ffe11 Fix LasLindas 2013-02-13 17:52:32 +01:00
Bastian Kleineidam
f16e860f1e Only cache robots.txt URL on memoize. 2013-02-13 17:52:07 +01:00
Bastian Kleineidam
7a98cf7599 Updated copyright. 2013-02-13 06:28:35 +01:00
Bastian Kleineidam
67af7bd115 Fix GUComics 2013-02-13 06:27:46 +01:00
Bastian Kleineidam
e38a766db3 Updated generated plugins. 2013-02-12 21:54:56 +01:00
Bastian Kleineidam
49ddcecb72 Fix Petitesymphony. 2013-02-12 21:14:57 +01:00
Bastian Kleineidam
093c2dcddc Fix EyeOfRamalach 2013-02-12 21:14:44 +01:00
Bastian Kleineidam
7375fa042f Fix AlienShores 2013-02-12 21:14:32 +01:00
Bastian Kleineidam
9ec4a44953 Remove universal strips since they are almost all duplicated and the rest is useless. 2013-02-12 20:56:02 +01:00
Bastian Kleineidam
10f6a1caa1 Correct path quoting. 2013-02-12 17:55:33 +01:00
Bastian Kleineidam
ebfc6cba70 Fix LookingForGroup. 2013-02-12 17:55:13 +01:00
Bastian Kleineidam
6d0fffd825 Always use connection pooling. 2013-02-12 17:55:13 +01:00
Bastian Kleineidam
82ada5fba0 Updated copyright. 2013-02-11 19:54:50 +01:00
Bastian Kleineidam
a35c54525d Work around a bug in python requests. 2013-02-11 19:52:59 +01:00
Bastian Kleineidam
14f0a6fe78 Do not prefetch content with requests >= 1.0 2013-02-11 19:45:21 +01:00
Bastian Kleineidam
67836942d8 Simplify the fetchUrl code. 2013-02-11 19:43:46 +01:00
Bastian Kleineidam
3f0816efe2 Updated copyright 2013-02-10 18:25:21 +01:00
Bastian Kleineidam
9fa9af639b Remove duplicate comic. 2013-02-10 18:24:21 +01:00
Bastian Kleineidam
1c24fca199 Updated comic from generated lists. 2013-02-10 15:07:21 +01:00
wummel
a61c4b4096 Merge pull request #6 from TobiX/for-upstream-2013-02-08
Fix for Spinnerette, 2 new comics
2013-02-09 23:03:45 -08:00
Bastian Kleineidam
e9b63210f9 Add encoding, inline images and guid tags to RSS output. 2013-02-10 08:00:32 +01:00
Bastian Kleineidam
77f3d152c0 Fix imageSearch pattern. 2013-02-08 21:03:23 +01:00
Tobias Gruetzmacher
e67b86c32f Add ParadigmShift.
The file names for this are a bit inconsistent...
2013-02-07 23:57:34 +01:00
Tobias Gruetzmacher
4b6d7c54af Add SkinDeep.
Filenames for this are all over the place :(
2013-02-07 23:57:34 +01:00
Tobias Gruetzmacher
b32dc6fd40 Fix Spinnerette.
The old expression was matching "Previous issue" first and skipping all
comics.
2013-02-07 23:57:34 +01:00
Bastian Kleineidam
419ae5fbcf Raise ValueError when HTML file already exists. 2013-02-07 20:48:03 +01:00
Bastian Kleineidam
1a0cd1ee6b Print HTTP client headers. 2013-02-07 18:28:56 +01:00
Bastian Kleineidam
e16b86d768 Allow debug level to be set. 2013-02-07 18:28:40 +01:00
Bastian Kleineidam
68d58640e8 Added some comics. 2013-02-06 22:27:40 +01:00
Bastian Kleineidam
c19cb93a14 Added some comics. 2013-02-06 22:08:36 +01:00
Bastian Kleineidam
137e30b3ac Added Nedroid comic strip. 2013-02-06 07:03:29 +01:00
Bastian Kleineidam
052e510085 Added HijinksEnsue comic strip. 2013-02-06 06:58:06 +01:00
Bastian Kleineidam
af9d8e90f0 Add missing url variable. 2013-02-06 06:36:50 +01:00
Bastian Kleineidam
a90875f018 Updated copyright. 2013-02-05 19:52:10 +01:00
Bastian Kleineidam
f18b5d5542 Fix Arcamax comics. 2013-02-05 19:51:55 +01:00
Bastian Kleineidam
1451047877 Rename latestUrl in url 2013-02-05 19:51:46 +01:00
Bastian Kleineidam
7f78bea1af Always have an url attribute in comic scrapers. 2013-02-04 21:00:26 +01:00
Bastian Kleineidam
77b8daf2f9 Add Spinnerette comic. 2013-01-29 21:52:26 +01:00
Bastian Kleineidam
4a63f5d561 Add GrrlPower comic. 2013-01-29 21:42:10 +01:00
Bastian Kleineidam
f727a0132f Add VampireCheerleader comic. 2013-01-29 21:23:59 +01:00
Bastian Kleineidam
a96b527f98 Add SequentialArt comic. 2013-01-29 21:23:32 +01:00
Bastian Kleineidam
8b04602c7b Fix GunnerkrigCourt 2013-01-29 19:00:29 +01:00
Bastian Kleineidam
4ab5b67f2e Improve comic strip message. 2013-01-29 18:51:35 +01:00
Bastian Kleineidam
0095f17b4e Updated copyright. 2013-01-28 18:52:26 +01:00
Bastian Kleineidam
e6d35c6494 Updated comic lists. 2013-01-28 06:53:12 +01:00
Bastian Kleineidam
73700e66f0 Cleanup 2013-01-24 21:42:27 +01:00
Bastian Kleineidam
4b35d332dc Fix DrFun image regex. 2013-01-24 07:53:47 +01:00
Bastian Kleineidam
51d0176f53 Fix CyanideAndHappiness image regex - really. 2013-01-23 21:53:34 +01:00
Bastian Kleineidam
399cda21e5 Fix CyanideAndHappiness image regex. 2013-01-23 21:35:36 +01:00
Bastian Kleineidam
f1356a9ff8 Fix URL norming, See issue #2. 2013-01-23 21:16:22 +01:00
Bastian Kleineidam
9ad4477d1f Retrieve more than one strip in index mode. 2013-01-23 20:21:52 +01:00
Bastian Kleineidam
95e1fbbe55 Fix LeastICouldDo. See issue #1. 2013-01-23 20:01:18 +01:00
Bastian Kleineidam
6477e570e1 Fix UnboundLocalError on indexed retrieval. See bug #4 2013-01-23 19:51:08 +01:00
Bastian Kleineidam
0e438b864e Add comic strips from Arcamax. 2013-01-23 19:34:11 +01:00
Bastian Kleineidam
d54d787af1 Better image filename for CyanideAndHappiness 2013-01-23 19:33:10 +01:00
Bastian Kleineidam
5479627d86 Updated copyright. 2013-01-09 22:21:19 +01:00
Bastian Kleineidam
39c83b8968 Truncate generated comic names. 2013-01-09 22:20:03 +01:00
Bastian Kleineidam
0da595963c Added AmazingSuperPowers comic. 2012-12-28 05:41:26 +01:00
Bastian Kleineidam
ad5c7667f1 Add PandyLand comic. 2012-12-28 05:37:08 +01:00
Bastian Kleineidam
a59b984414 Updated generated comic modules. 2012-12-19 20:43:32 +01:00
Bastian Kleineidam
6a2f57b132 Support requests module >= 1.0 2012-12-19 20:43:18 +01:00
Bastian Kleineidam
5f9e5ae3ca Various comics are fixed. 2012-12-13 21:05:27 +01:00
Bastian Kleineidam
de1b80fa4d Fix .zip file module loading. 2012-12-12 23:27:03 +01:00
Bastian Kleineidam
fcbace28b4 Load modules from .zip file. 2012-12-12 23:22:36 +01:00
Bastian Kleineidam
8d92a8f334 Updated colorama and info level of file messages. 2012-12-12 22:00:17 +01:00
Bastian Kleineidam
e5a04931d3 Various fixes and additions. 2012-12-12 17:41:29 +01:00
Bastian Kleineidam
6c0ad90c15 Cleanup 2012-12-09 20:15:22 +01:00
Bastian Kleineidam
3a3a800798 Make output thread-safe. 2012-12-09 18:12:41 +01:00
Bastian Kleineidam
75de0bc662 Added two more comics. 2012-12-09 18:12:21 +01:00
Bastian Kleineidam
5c34baa663 Capitalize comic names. 2012-12-09 07:51:49 +01:00
Bastian Kleineidam
0d9e1b4ef6 Added comics. 2012-12-08 21:30:51 +01:00
Bastian Kleineidam
4def4b81bd Add cookie feature. 2012-12-08 21:30:23 +01:00
Bastian Kleineidam
faba7b0bca Fix more comics. 2012-12-08 00:45:18 +01:00
Bastian Kleineidam
1b74e304c0 Updated comics. 2012-12-05 22:36:16 +01:00
Bastian Kleineidam
e5d9002f09 Fix more comics. 2012-12-05 21:52:52 +01:00
Bastian Kleineidam
387dff79a9 Fix comics. 2012-12-04 07:02:40 +01:00
Bastian Kleineidam
45df462a47 Fix some comics. 2012-12-02 18:35:06 +01:00
Bastian Kleineidam
bcae1b018c Add comic excludes in scripts. 2012-11-29 06:46:58 +01:00
Bastian Kleineidam
451fd982d9 Add comic scripts, add fixes and other stuff. 2012-11-28 18:15:12 +01:00
Bastian Kleineidam
a52e5ae575 Add more comics. 2012-11-26 19:41:25 +01:00
Bastian Kleineidam
0556ffd30a Fix comics, improve tests, use python-requests. 2012-11-26 18:44:31 +01:00
Bastian Kleineidam
d4eee7719d Dynamic type generation helpers. 2012-11-26 07:14:02 +01:00
Bastian Kleineidam
4528894c05 Fix some comics. 2012-11-26 07:13:32 +01:00
Bastian Kleineidam
7e91c83753 Improved comic test. 2012-11-25 07:56:46 +01:00
Bastian Kleineidam
958a788550 Fix some comics. 2012-11-21 21:57:26 +01:00
Bastian Kleineidam
54eaadf4fc Updated documentation and fix some comics. 2012-11-20 18:53:53 +01:00
Bastian Kleineidam
64d9fd6ac2 Require python 2.7, use importlib. 2012-11-19 21:20:50 +01:00
Bastian Kleineidam
7dbd14f934 Fix syntax errors. 2012-11-19 21:20:34 +01:00
Bastian Kleineidam
7e39b291dc Fix some comics 2012-11-14 20:23:30 +01:00
Bastian Kleineidam
31e7ddbd7c Fix some comics. 2012-11-14 06:22:08 +01:00
Bastian Kleineidam
eba2f0089d Fix some comics. 2012-11-13 19:12:28 +01:00
Bastian Kleineidam
5006ed7f40 Rename imageUrl to stripUrl. 2012-11-13 19:10:19 +01:00
Bastian Kleineidam
e4600df1bd Fix some comics. 2012-11-13 06:51:54 +01:00
Bastian Kleineidam
30a28cdb3e Fix some comics. 2012-11-12 18:59:19 +01:00
Bastian Kleineidam
a5108dfb45 Fix copyright. 2012-10-12 22:15:40 +02:00
Bastian Kleineidam
1f1a5f2b3c Improve error message. 2012-10-12 22:10:26 +02:00
Bastian Kleineidam
7a70cd79ca Fix event handling. 2012-10-12 22:07:50 +02:00
Bastian Kleineidam
7bf54255f0 Fix some comics 2012-10-12 21:47:57 +02:00
Bastian Kleineidam
b3e51ddc93 Simplify tagre regex. 2012-10-12 21:47:41 +02:00
Bastian Kleineidam
9c032c9006 Match before and after a tag. 2012-10-12 21:11:44 +02:00
Bastian Kleineidam
67939b3d71 Fix comics. 2012-10-11 21:32:15 +02:00
Bastian Kleineidam
da2b13822d Remove stray print statement. 2012-10-11 19:58:10 +02:00
Bastian Kleineidam
06008d4266 Fix indexed retrieval. 2012-10-11 19:53:37 +02:00
Bastian Kleineidam
78f44e9d9c Improve URL retrieval. 2012-10-11 19:53:10 +02:00
Bastian Kleineidam
65d881eee5 Code cleanup. 2012-10-11 19:52:52 +02:00
Bastian Kleineidam
c0ad053647 Prevent empty URL matching. 2012-10-11 18:16:29 +02:00
Bastian Kleineidam
21de295168 Remove progress stuff. 2012-10-11 18:08:59 +02:00
Bastian Kleineidam
3d96adc3ff Remove progress stuff. 2012-10-11 18:08:18 +02:00
Bastian Kleineidam
194d1e28b1 Add more documentation. 2012-10-11 18:02:29 +02:00
Bastian Kleineidam
4470e13b1a Fix some comics. 2012-10-11 17:39:38 +02:00
Bastian Kleineidam
979c97901b Fix tagre tests. 2012-10-11 17:02:40 +02:00
Bastian Kleineidam
ecfc88faf1 Fix comics. 2012-10-11 16:06:45 +02:00
Bastian Kleineidam
db1df21b58 Fix comic filename. 2012-10-11 15:58:54 +02:00
Bastian Kleineidam
17a40d4fda Make tagre quote configurable. 2012-10-11 15:43:29 +02:00
Bastian Kleineidam
9d30a7004e Only warn about missing images. 2012-10-11 15:17:08 +02:00
Bastian Kleineidam
a7036beef7 Sort loaded plugins. 2012-10-11 14:45:06 +02:00
Bastian Kleineidam
30a64ab1e5 Remove unused imports. 2012-10-11 14:17:37 +02:00
Bastian Kleineidam
c707aa893d A lot of refactoring. 2012-10-11 12:03:12 +02:00
Bastian Kleineidam
c1dc5892c8 Only import colorama on windows systems. 2012-10-01 18:01:56 +02:00
Bastian Kleineidam
a53e1f63bc Improve console size guessing. 2012-09-27 21:59:11 +02:00
Bastian Kleineidam
86e3aa6c1a Add Fredo And Pidjin 2012-09-27 21:55:16 +02:00
Bastian Kleineidam
3ab526f7d8 Fix 2012-09-27 21:54:56 +02:00
Bastian Kleineidam
f3365f6a5e Code cleanup. 2012-09-27 21:24:28 +02:00
Bastian Kleineidam
1333be7225 HTTP improvements. 2012-09-26 16:52:45 +02:00
Bastian Kleineidam
cc2a8df98f Document some functions. 2012-09-26 16:47:39 +02:00
Bastian Kleineidam
4a53639e79 Use tagre matching function. 2012-09-26 14:42:28 +02:00
Bastian Kleineidam
58c4cffcc8 Match end bracket in tagre function. 2012-09-26 14:42:05 +02:00
Bastian Kleineidam
e79649a9ad Fix AGirlAndHerFed. 2012-09-24 20:52:08 +02:00
Bastian Kleineidam
a17782428b Updated copyright for all source files. 2012-06-20 22:41:04 +02:00
Bastian Kleineidam
c9082aee42 Improved terminal functions. 2012-06-20 22:33:26 +02:00
Bastian Kleineidam
f91fb80a39 Initial commit to Github. 2012-06-20 21:58:13 +02:00