Commit graph

162 commits

Author SHA1 Message Date
Tobias Gruetzmacher
3f9feec041 Allow modules to ignore some HTTP error codes.
This is neccessary since it seems some webservers out there are
misconfigured to deliver actual content with an HTTP error code...
2016-11-01 18:25:02 +01:00
Tobias Gruetzmacher
69e6318f87 Remove ScurryAndCover, too much JavaScript. 2016-10-31 07:04:00 +01:00
Tobias Gruetzmacher
47e2502ec7 Fix a bunch of comic modules. 2016-10-31 06:57:47 +01:00
Tobias Gruetzmacher
9a6a310b76 Fixup copyright years. 2016-10-29 00:21:41 +02:00
Peter Brunner
19445a83ae Fix smbc 2016-10-18 21:28:42 -04:00
Tobias Gruetzmacher
06be2a026b Move some ex-KeenSpot comics to shorter names. 2016-10-14 14:23:33 +02:00
Hubert Figuière
81821dc450 Added Space Junk Arlia comic 2016-08-08 09:18:33 -04:00
André-Patrick Bubel
2b8e948868 Add String Theory comic 2016-06-01 11:19:17 +00:00
Tobias Gruetzmacher
77ed0218e0 Fix some comic modules. 2016-05-05 20:55:14 +02:00
Tobias Gruetzmacher
1d94439715 Fix some more comic modules. 2016-04-27 00:31:27 +02:00
Tobias Gruetzmacher
6574997e01 Refactor: All the other class methods.
Turns out, it would have been better if all methods had been instance
methods and not class methods. This finished a big chunk of the rework
needed for #42.
2016-04-21 23:52:31 +02:00
Tobias Gruetzmacher
c3f32dfef7 Refactor: Make namer a method.
When #42 is realized, the naming of files might differ between comic
modules, so the namer's logical location is the instance, not the class.
2016-04-21 08:20:49 +02:00
Tobias Gruetzmacher
0468f2f31a Refactor: Convert starter to simple method. 2016-04-13 20:01:51 +02:00
Tobias Gruetzmacher
42e43fa4e6 Read starter parameters from class.
This allows to specify starters in a more declarative and dynamic way.
2016-04-12 23:11:39 +02:00
Damjan Košir
af2e57d850 Added comic ScurryAndCover...
- Yay, funky JavaScript parsing!
- Start page isn't latest comic...

Updated-by: Tobias Gruetzmacher <tobias-git@23.gs>
2016-04-11 00:09:53 +02:00
Tobias Gruetzmacher
fa98f6ddbf Move more comics to common WordPressScraper. 2016-04-10 23:04:34 +02:00
Tobias Gruetzmacher
9feaf245f2 Fixed & removed some comics in s.py. 2016-04-06 00:40:13 +02:00
Tobias Gruetzmacher
0bcfb8a82e Move ComicControl into common module.
- Move all comics using ComicControl into alphabetical files.
- Add BalderDash & Picklewhistle
2016-04-04 00:12:53 +02:00
Peter Janes
759bd0c360 Fix Sherman's Lagoon 2016-04-03 14:54:41 -04:00
Tobias Gruetzmacher
bb1f20d867 Remove make_scraper for most WordPress comics.
- Dropped KatzenfutterGeleespritzer, because robots.txt.
- Move all WordPress/ComicPress scrapers into alphabetical files.
- Move _WordPressScraper & _ComicPress scraper into common.py.
- Some smaller PEP8 fixes.
2016-04-02 00:19:53 +02:00
Tobias Gruetzmacher
7f1e136d8b Sort comics alphabetically & PEP8 style fixes. 2016-03-31 23:13:54 +02:00
Damjan Košir
fd9c480d9c adding bonus panel to SWBC and multiple images flag to ParserScraper 2015-08-03 22:58:44 +12:00
Tobias Gruetzmacher
6a70bf4671 Enable some comics based on current policy. 2015-07-18 01:21:29 +02:00
Tobias Gruetzmacher
88e387ad15 Add Sleepless Domain. 2015-07-12 18:31:21 +02:00
Damjan Košir
119a3cd13a added text to ScandinaviaAndTheWorld 2015-07-07 19:48:25 +12:00
Damjan Košir
6412e6e542 fixed Spinnerette 2015-06-08 20:31:13 +12:00
Damjan Košir
e2d01e4924 fixed ScandinaviaAndTheWorld 2015-06-04 18:58:59 +12:00
Damjan Košir
5569439c43 fixed 16 comics 2015-05-25 21:57:06 +12:00
Marc Winkelmann
69e5b8ad93 Shermans Lagoon and On The Fastrack working again. Also corrected name. 2015-05-02 22:27:08 +02:00
DirkReiners
1438330a94 Fixes and Additions...
Fixed SabrinaOnline
Fixed SMBC
Added StandStillStaySilent (partial, prevsearch not working yet)
2015-04-29 10:37:14 -05:00
Tobias Gruetzmacher
ff21df596b Remove descriptions and genres (closes #9).
Maintaining the descriptions creates quite a bit of overhead (finding
them, copying them, checking if they are still correct) for a minimal
user benefit.

PS: Viewing this diff should be easier in a difftool that shows changes
in a line, for example kdiff3.
2015-04-20 20:29:09 +02:00
Tobias Gruetzmacher
5934f03453 Merge branch 'htmlparser' - I think it's ready.
This closes pull request #70.
2015-04-01 22:13:55 +02:00
Tobias Gruetzmacher
b5368b366a Merge Gaia(German), SandraAndWoo(German) into common base.
This also fixes #97 by correcting the imageSearch regex.
2015-02-04 19:41:52 +01:00
Tobias Gruetzmacher
f8531eca57 Move SinFest back to KeenSpot namespace. 2015-01-16 00:16:28 +01:00
Alex Rozenshteyn
51996e45ed Update SinFest to work with new website. 2014-12-16 12:01:54 -05:00
Tobias Gruetzmacher
32265c99d7 Merge branch 'zac9/patch-1' 2014-12-10 00:00:51 +01:00
mbrandis
25cf4888ae - Adapted ShermansLagoon
- Better version of OnTheFastTrack
2014-11-14 20:37:06 +01:00
Tobias Gruetzmacher
6769e1eb36 Add StrongFemaleProtagonist.
This uses the _ParserScraper and CSS selectors.
2014-10-13 23:39:50 +02:00
zac9
6ca200419a Update s.py 2014-09-28 19:48:26 -07:00
zac9
491b5457b2 Added comic ShotgunShuffle 2014-09-28 06:29:02 -07:00
Bastian Kleineidam
950dd2932c Remove stray print statement. 2014-07-21 20:20:15 +02:00
Tobias Gruetzmacher
ea5d533e30 Fix index lookups for SnowFlame and SnowFlakes. 2014-07-19 13:23:42 +02:00
Bastian Kleineidam
4c2a339e25 Fix some comics. 2014-07-02 19:51:53 +02:00
Peter B
8f1c864ec3 Added Safely Endangered 2014-06-17 01:05:11 -04:00
Peter B
78954da9d7 fix StandStillStaySilent, strip urls when downloading 2014-06-04 01:58:16 -04:00
Bastian Kleineidam
875e431edc Provide page data in shouldSkipUrl() function 2014-02-10 21:58:09 +01:00
Tobias Gruetzmacher
44ef1831bf Sluggy Freelance has some pages with multiple comics.
See for example SluggyFreelance:010422
2014-01-28 19:08:39 +01:00
Peter B
d0031b65c8 Added "Stand Still. Stay Silent." comic. 2014-01-08 11:08:19 -05:00
Bastian Kleineidam
4d63920434 Updated copyright. 2014-01-05 16:50:57 +01:00
Bastian Kleineidam
c3078ed855 Added EdmundFinney, Gaia, GaiaGerman, InternetWebcomic,
NotInventedHere, RedsPlanet, RomanticallyApocalyptic,
  ScandinaviaAndTheWorld, TheGamerCat, Weregeek
2013-12-10 19:50:21 +01:00
Bastian Kleineidam
7760985601 Fix broken comics 2013-11-12 18:33:14 +01:00
Bastian Kleineidam
f74b18c2e5 Remove unused import. 2013-11-07 21:22:49 +01:00
Bastian Kleineidam
86257c8364 Remove duplicate variable 2013-08-28 20:50:07 +02:00
Bastian Kleineidam
ef4ae435a5 Fix several comics. 2013-07-18 20:39:53 +02:00
Bastian Kleineidam
d5172074d5 Fix some comics. 2013-07-09 22:21:17 +02:00
Bastian Kleineidam
02132893b2 Fix shermanslagoon namer. 2013-07-04 20:20:26 +02:00
mbrandis
ccf50cad89 Corrected description. 2013-06-24 22:46:39 +02:00
mbrandis
3b0393ccf6 Update s.py
Added Sherman's Lagoon, namer is not perfect.
2013-06-23 23:30:57 +03:00
Bastian Kleineidam
1c1b0aaf18 Comic fixes. 2013-05-25 23:24:33 +02:00
Bastian Kleineidam
d6ca5aa7fd SnowFlakes is end-of-life. 2013-04-29 20:31:07 +02:00
Bastian Kleineidam
dcacbf0b9a Fix some comics. 2013-04-28 19:58:38 +02:00
Bastian Kleineidam
cafa37fcb1 All scrapers must have an URL. 2013-04-26 06:53:05 +02:00
Bastian Kleineidam
8418ea471d Added SabrinaOnline. 2013-04-25 21:14:32 +02:00
Bastian Kleineidam
dbdbdd09de Fix SMBC 2013-04-25 20:32:21 +02:00
wummel
1c29f22270 Merge pull request #27 from dromaludaire/master
Fix some SMBC download links
2013-04-22 10:40:24 -07:00
Benjamin Sigonneau
f1da47edef [SMBC] Fix regexp, mainly concerns older strips 2013-04-21 22:39:06 +02:00
Sven Hartge
8e34239b27 Fix typo in regex for SandraOnTheRocks. 2013-04-20 19:59:01 +02:00
Sven Hartge
40f2aed8f0 Add Sandra on the Rocks. 2013-04-20 18:51:06 +02:00
Bastian Kleineidam
e37a80fdc1 Add some descriptions. 2013-04-14 09:02:14 +02:00
Bastian Kleineidam
f15f993851 s/baseurl/baseUrl/g 2013-04-13 20:58:00 +02:00
Bastian Kleineidam
522af89af5 Add some descriptions. 2013-04-13 08:00:03 +02:00
Bastian Kleineidam
35c031ca81 Fixed some comics. 2013-04-11 18:27:43 +02:00
Bastian Kleineidam
7e593cf7e8 Add firstStripUrls. 2013-04-10 23:57:09 +02:00
Bastian Kleineidam
fb05c10808 Sort entries. 2013-04-10 18:36:33 +02:00
Bastian Kleineidam
5127d4c895 Use re.escape and add some firstStripUrl. 2013-04-10 18:19:11 +02:00
Bastian Kleineidam
e040dd0d50 Added Science. 2013-04-09 19:38:16 +02:00
Bastian Kleineidam
1c9f64bc27 Better name for Sketchesnatched. 2013-04-05 18:47:51 +02:00
Bastian Kleineidam
9e26640407 Augment SketcheSnatched 2013-04-05 07:31:22 +02:00
Bastian Kleineidam
50b742721b SketcheSnatched 2013-04-05 07:20:50 +02:00
Bastian Kleineidam
fabe872d1d Fix SnowFlame 2013-04-04 18:32:37 +02:00
Bastian Kleineidam
8150dabfa6 Remove SarahZero 2013-04-04 18:32:29 +02:00
Bastian Kleineidam
fdab3b7b35 Add StuffNoOneToldMe 2013-04-03 20:30:29 +02:00
Bastian Kleineidam
43255872c3 Added SnowFlakes. 2013-04-03 20:30:16 +02:00
Bastian Kleineidam
1a7dfc02d2 Add Schuelert 2013-03-25 19:48:32 +01:00
Bastian Kleineidam
3937cfba4b Added SandraAndWooGerman 2013-03-19 20:54:16 +01:00
Bastian Kleineidam
d7925ba4a2 Sort comics. 2013-03-06 20:21:10 +01:00
Bastian Kleineidam
bae2a96d8b Added some comic strips and cleanup the scraper code. 2013-03-06 20:00:30 +01:00
Tobias Gruetzmacher
bf13b13ab6 Add StickyDillyBuns.
One of the Pixie Trix Comix.
2013-03-03 22:03:27 +01:00
Tobias Gruetzmacher
af57e018a1 Add ShadowGirls. 2013-03-03 18:59:16 +01:00
Bastian Kleineidam
d7daf67e08 Fix some comics. 2013-02-27 19:40:54 +01:00
Bastian Kleineidam
e3722c1220 Add SandraAndWoo, SupernormalStep 2013-02-13 17:53:11 +01:00
Tobias Gruetzmacher
4b6d7c54af Add SkinDeep.
Filenames for this are all over the place :(
2013-02-07 23:57:34 +01:00
Tobias Gruetzmacher
b32dc6fd40 Fix Spinnerette.
The old expression was matching "Previous issue" first and skipping all
comics.
2013-02-07 23:57:34 +01:00
Bastian Kleineidam
c19cb93a14 Added some comics. 2013-02-06 22:08:36 +01:00
Bastian Kleineidam
1451047877 Rename latestUrl in url 2013-02-05 19:51:46 +01:00
Bastian Kleineidam
7f78bea1af Always have an url attribute in comic scrapers. 2013-02-04 21:00:26 +01:00
Bastian Kleineidam
77b8daf2f9 Add Spinnerette comic. 2013-01-29 21:52:26 +01:00
Bastian Kleineidam
a96b527f98 Add SequentialArt comic. 2013-01-29 21:23:32 +01:00
Bastian Kleineidam
5f9e5ae3ca Various comics are fixed. 2012-12-13 21:05:27 +01:00
Bastian Kleineidam
e5a04931d3 Various fixes and additions. 2012-12-12 17:41:29 +01:00