117 commits

Author SHA1 Message Date
Tobias Gruetzmacher 5508539ca8 Merge all Shivae Studio comics into one file
And they should all work again.
2021-02-02 00:41:08 +01:00
Tobias Gruetzmacher 3d05e59c36 Fix some modules (a-c) 2021-01-18 01:25:34 +01:00
Tobias Gruetzmacher a5bfbdb07b Merge Flowerlark Studios comics into one file 2021-01-18 01:25:34 +01:00
Tobias Gruetzmacher e64635e86b Stricter style checking & related style fixes 2020-10-11 20:15:27 +02:00
Tobias Gruetzmacher 383e759a0e Switch Achewood to ParserScraper 2020-09-26 09:42:48 +02:00
Tobias Gruetzmacher fd779475c1 Fix first comic for Achewood (fixes #175) 2020-09-26 09:31:31 +02:00
Techwolf cac269e0bc Fix AntiheroForHire 2020-04-29 01:40:27 -07:00
Tobias Gruetzmacher 27d28b8eef Update file headers
The default encoding for source files is UTF-8 since Python 3, so we can
drop all encoding headers. While we are at it, just replace them with
SPDX headers.
2020-04-18 13:45:44 +02:00
Tobias Gruetzmacher 44791439a5 Drop Python 2 support: Obsolete future statements 2020-02-04 01:06:19 +01:00
Techwolf eb69210e58 Add self to authors list, update copyright headers 2020-01-28 00:23:15 +01:00
Tobias Gruetzmacher fc4655157a Move some more modules to HTTPS and fix canonical URL 2020-01-12 22:41:06 +01:00
Techwolf 241732bbba Add common handler for mgsisk's Wordpress Webcomic plugin 2020-01-12 21:29:47 +01:00
Techwolf 163b252776 Add ArtificialIncident 2020-01-12 21:19:56 +01:00
Techwolf 0fed6c2c36 Add site engine for NamirDeiter comics and migrate existing comics 2020-01-12 21:19:56 +01:00
Tobias Gruetzmacher 3fe40326e0 Fix AbstruseGoose 2020-01-09 22:59:12 +01:00
Tobias Gruetzmacher ff3b7f2cbe Fix AltermetaOld 2020-01-09 17:41:07 +01:00
Tobias Gruetzmacher 752525c3e9 Fix some old modules using the Internet Archive 2020-01-09 17:38:13 +01:00
Tobias Gruetzmacher e5cb27ffa4 Minor URL fixes, switched some modules to https 2019-12-31 01:44:19 +01:00
Techwolf 7e9ca4daba Add AntiheroForHire 2019-12-27 22:40:27 +01:00
Techwolf c5f57288ec Refactor WP_LATEST_SEARCH into class variable of WordPressScraper 2019-12-27 22:20:02 +01:00
Techwolf 44d2198c34 Add AmbersNoBrainers 2019-12-27 22:20:02 +01:00
Techwolf b383a92f65 Add AdventuresOfFifne 2019-12-27 18:41:49 +01:00
Techwolf 24ee2e2594 Add AbbysAgency 2019-12-17 01:28:02 +01:00
Techwolf becc3b98f6 Add AlienDice and AlienDice/Legacy 2019-12-17 01:28:02 +01:00
Techwolf 56573c41ae Add Anaria 2019-12-17 01:28:02 +01:00
Techwolf 57046e23ca Add A&HClub 2019-12-17 01:28:02 +01:00
Techwolf c9e3923442 Fix AlphaLuna 2019-12-17 01:28:02 +01:00
Techwolf 6cfde34663 Fix Altermeta 2019-12-17 01:28:02 +01:00
Techwolf 2916b39c98 Add ApartmentForTwo 2019-12-17 01:28:02 +01:00
Techwolf 15a5953120 Add ATaleOfTails 2019-11-03 23:35:06 +01:00
Techwolf f5b7b067b7 Switch AGirlAndHerFed to parser scraper 2019-11-03 21:37:05 +01:00
Tobias Gruetzmacher fbb3a18c91 Enable warnings and fix some of them 2018-05-23 00:54:40 +02:00
Tobias Gruetzmacher 75aa7207ea Some minor fixes to make some modules work again. 2017-11-27 01:04:35 +01:00
Tobias Gruetzmacher 7e0adf1d96 Unify more WordPress-based modules. 2017-05-22 01:17:05 +02:00
Tobias Gruetzmacher b8484cde50 Fix some more modules. 2017-05-15 00:27:28 +02:00
Tobias Gruetzmacher ebbb27d05d Move xpath_class to helpers module. 2017-02-13 22:41:17 +01:00
Tobias Gruetzmacher 20ca5d7fc2 Fix some modules. 2017-02-06 00:05:05 +01:00
Tobias Gruetzmacher c4a184d173 Remove some vanished modules. 2017-01-12 02:01:10 +01:00
Tobias Gruetzmacher 3f9feec041 Allow modules to ignore some HTTP error codes.
This is necessary since it seems some webservers out there are
misconfigured to deliver actual content with an HTTP error code...
2016-11-01 18:25:02 +01:00
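The idea in the commit above can be sketched roughly as follows. This is only an illustration, not the project's actual code: the names `HTTPStatusError`, `check_status` and `allow_errors` are hypothetical, and the sketch works on a bare status code so it stays independent of any HTTP library.

```python
class HTTPStatusError(Exception):
    """Raised when a response carries an error status that is not tolerated."""


def check_status(status, allow_errors=()):
    """Return the status unless it is an error code the module did not allow.

    `allow_errors` is a hypothetical per-module collection of HTTP status
    codes (for example (404,)) that should be treated as if they were
    successful, because the server sends real content with them.
    """
    if status >= 400 and status not in allow_errors:
        raise HTTPStatusError("unexpected HTTP status %d" % status)
    return status


# A module for a misconfigured site could opt in to tolerating 404:
check_status(404, allow_errors=(404,))
```

A module that does not opt in keeps the strict behaviour, so a genuine 404 on a well-behaved site is still reported as an error.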
Tobias Gruetzmacher 9a6a310b76 Fixup copyright years. 2016-10-29 00:21:41 +02:00
Tobias Gruetzmacher b1d2650615 Fix some modules (a&b). 2016-09-29 01:29:01 +02:00
Tobias Gruetzmacher 4006ced43d Move all HijinksEnsue comics into alphabetic files. 2016-05-02 01:25:34 +02:00
Tobias Gruetzmacher c3f32dfef7 Refactor: Make namer a method.
When #42 is realized, the naming of files might differ between comic
modules, so the namer's logical location is the instance, not the class.
2016-04-21 08:20:49 +02:00
Tobias Gruetzmacher 0468f2f31a Refactor: Convert starter to simple method. 2016-04-13 20:01:51 +02:00
Tobias Gruetzmacher 42e43fa4e6 Read starter parameters from class.
This allows starters to be specified in a more declarative and dynamic way.
2016-04-12 23:11:39 +02:00
Tobias Gruetzmacher fa98f6ddbf Move more comics to common WordPressScraper. 2016-04-10 23:04:34 +02:00
Tobias Gruetzmacher bb5b6ffcec Fix comics in module a.py. 2016-04-07 23:21:31 +02:00
Tobias Gruetzmacher 8768ff07b6 Fix AhoiPolloi, be a bit smarter about encoding.
HTML character encoding in the context of HTTP is quite tricky to get
right and honestly, I'm not sure if I did get it right this time. But I
think the current behaviour best matches what web browsers try to do:

1. Let Requests figure out the encoding from the HTTP header. This
   overrides everything else. We need to "trick" LXML to accept our
   decision if the document contains an XML declaration which might
   disagree with the HTTP header.
2. If the HTTP headers don't specify any encoding, let LXML guess the
   encoding and be done with it.
2016-04-06 22:22:22 +02:00
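The two steps described in the commit above can be sketched as follows. This is a minimal illustration under stated assumptions, not the code from the commit: it uses only the Python standard library in place of Requests and LXML, the names `header_charset` and `prepare_for_parser` are hypothetical, and stripping the XML declaration is one assumed way to keep a parser from disagreeing with the HTTP header.

```python
import re
from email.message import Message

# Matches a leading XML declaration such as <?xml version="1.0" encoding="..."?>
XML_DECL = re.compile(r'^\s*<\?xml.*?\?>', re.DOTALL)


def header_charset(content_type):
    """Return the charset named in a Content-Type header value, or None."""
    msg = Message()
    msg['Content-Type'] = content_type
    return msg.get_content_charset()


def prepare_for_parser(body, content_type):
    """Return decoded text if the HTTP header names an encoding, else raw bytes.

    `body` is the raw response body (bytes); `content_type` is the value of
    the Content-Type header, or None if the server sent none.
    """
    charset = header_charset(content_type) if content_type else None
    if charset:
        try:
            # Step 1: the HTTP header overrides everything else.
            text = body.decode(charset, errors='replace')
        except LookupError:
            # Unknown charset name: fall through to step 2.
            return body
        # Drop the XML declaration so it cannot contradict the header.
        return XML_DECL.sub('', text, count=1)
    # Step 2: no encoding in the header, hand over bytes and let the
    # parser guess the encoding itself.
    return body
```

With a real HTML parser, the returned `str` would be parsed as already-decoded text, while returned `bytes` would leave encoding detection to the parser.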
Tobias Gruetzmacher 0d453a6858 Move Flowerlark Studios into alphabetical files. 2016-04-03 22:58:01 +02:00
Tobias Gruetzmacher bb1f20d867 Remove make_scraper for most WordPress comics.
- Dropped KatzenfutterGeleespritzer, because robots.txt.
- Move all WordPress/ComicPress scrapers into alphabetical files.
- Move _WordPressScraper & _ComicPress scraper into common.py.
- Some smaller PEP8 fixes.
2016-04-02 00:19:53 +02:00