Commit graph

1351 commits

Author SHA1 Message Date
Tobias Gruetzmacher
ac0d2d2095 Skip video pages for MonkeyUser 2021-02-01 00:29:36 +01:00
Tobias Gruetzmacher
2ecfcaec17 Update GoComics 2021-02-01 00:19:22 +01:00
Tobias Gruetzmacher
87f4049347 Try to fix some more comics using the Internet Archive 2021-01-31 23:40:21 +01:00
Techwolf
9c87c214b6 Fix VGCats, VGCats/Adventure, VGCats/Super 2021-01-19 01:46:53 +01:00
Tobias Gruetzmacher
c43bc0cef4 Fix duplicate module detection 2021-01-19 01:19:07 +01:00
Tobias Gruetzmacher
6835d490d8 Remove 3 MangaDex modules which were taken down 2021-01-19 01:07:36 +01:00
Tobias Gruetzmacher
0067114110 Remove some ComicFury modules which are gone 2021-01-19 00:55:52 +01:00
Tobias Gruetzmacher
3d05e59c36 Fix some modules (a-c) 2021-01-18 01:25:34 +01:00
Tobias Gruetzmacher
a5bfbdb07b Merge Flowerlark Studios comics into one file 2021-01-18 01:25:34 +01:00
toonn
1040d4a976
Add Bird-boy (#183)
Co-authored-by: Tobias Gruetzmacher <tobias-git@23.gs>
2021-01-17 21:48:40 +01:00
Tobias Gruetzmacher
890f116179 Remove SmackJeeves
It was closed at the end of 2020...
2021-01-10 19:18:45 +01:00
Tobias Gruetzmacher
c3227c33f5 Fix WebToons cookies 2021-01-09 20:33:36 +01:00
Tobias Gruetzmacher
5966e8c523 Show Python version in "--version" output 2020-12-24 14:05:48 +01:00
Arwarld
651bd62267
Fix for Ruthe.de and NichtLustig.de (now joscha.com) (#181)
NichtLustig moved to joscha.com, new layout and image-names.

Co-authored-by: Marius Knabben <marius.knabben@rwth-aachen.de>
2020-12-02 22:37:18 +01:00
Techwolf
740abed621 Add MangaDex/JahySamaWaKujikenai 2020-11-09 00:54:20 +01:00
Techwolf
ed00c29ce8 Add TekMage 2020-11-09 00:54:20 +01:00
Techwolf
e6260b6cc5 Add ComicFury/Jack 2020-11-09 00:54:20 +01:00
Techwolf
5345a5a276 Add KemonoCafe/KnuckleUp 2020-11-09 00:54:20 +01:00
Techwolf
a3eb2ec9f3 Fix BetterDays and OriginalLife 2020-11-09 00:54:20 +01:00
Techwolf
e82c8f0f8d Mark CarryOn alt comics as endOfLife 2020-11-09 00:54:20 +01:00
Techwolf
ba214da8b0 Add MangaDex/Lv2KaraCheatDattaMotoYuushaKouhoNoMattariIsekaiLife 2020-11-09 00:54:20 +01:00
Techwolf
25efa82388 Add MangaDex/MaouNoOreGaDoreiElfWoYomeNiShitandaGaDouMederebaIi 2020-11-09 00:54:20 +01:00
Techwolf
5d62183806 Add WebToons/TokyoThreatDocumentationProject 2020-11-09 00:54:20 +01:00
Techwolf
348434e182 Add MangaDex/SousouNoFrieren 2020-11-09 00:54:20 +01:00
Techwolf
588597837a Add MangaDex/HowToOpenATriangularRiceball 2020-11-09 00:54:20 +01:00
Techwolf
6611b1e430 Fix WebToons completion detection 2020-11-09 00:54:20 +01:00
Techwolf
4caef255b4 Add WebToons/OVERPOWERED 2020-11-09 00:54:20 +01:00
Techwolf
f6eb2f088e Add WebToons/Defects 2020-11-09 00:54:20 +01:00
Techwolf
722df37080 Fix DeerMe 2020-11-09 00:54:20 +01:00
Techwolf
7d36a2e552 Add Unconventional 2020-11-09 00:54:20 +01:00
Tobias Gruetzmacher
2fcd8d92f2 Fix GeeksNextDoor (& switch to ParserScraper) 2020-10-18 20:44:47 +02:00
Tobias Gruetzmacher
fdf8164d22 Fix KuroShouri (switch to ParserScraper) 2020-10-18 20:28:00 +02:00
Tobias Gruetzmacher
e64635e86b Stricter style checking & related style fixes 2020-10-11 20:15:27 +02:00
Ruthalas
e84bbe2667
Add three new comics (#177)
* Add BloopBloopRelationshipComic to Webtoons
* Add Todd Allison & the Petunia Violet to Smack Jeeves
* Add Oversimplified SCP to MangaDex

Co-authored-by: Tobias Gruetzmacher <tobias-git@23.gs>
2020-10-11 11:30:02 +02:00
Tobias Gruetzmacher
d487485815 Read scraper modules from user data directory
This allows users to add scrapers without setting up a complete Python
development environment.
2020-10-04 23:46:57 +02:00
Tobias Gruetzmacher
0bdf3dd94b Allow adding external directories to the plugin package 2020-10-04 22:28:51 +02:00
Tobias Gruetzmacher
3256f9fdc2 Hardcode the "plugins" package name 2020-10-04 22:28:51 +02:00
Tobias Gruetzmacher
9237bd62b2 Convert scraper cache to a class
This should make it easier to extend with additional entries.
2020-10-04 22:28:51 +02:00
Tobias Gruetzmacher
26f63fd994 Remove unused property in Output class 2020-10-04 22:12:34 +02:00
Tobias Gruetzmacher
d93fed567c Distribute PyInstaller hook
This uses a feature introduced in PyInstaller 4.0 to register hooks as
entry points and automatically discover them whenever a module is used.
2020-10-01 15:51:09 +02:00
Tobias Gruetzmacher
7c2c9c3778 Fix VGCats & switch to HTTPS 2020-09-30 21:37:29 +02:00
Tobias Gruetzmacher
bb71adbe72 Remove VictimsOfTheSystem
Domain seems to be gone for some month and archive.org doesn't have
enough to switch.
2020-09-30 21:29:58 +02:00
Tobias Gruetzmacher
23a3e3b101 Switch WastedTalent to ParserScraper 2020-09-30 21:24:01 +02:00
Tobias Gruetzmacher
dfa412c03e Skip videos for WapsiSquare 2020-09-30 21:17:46 +02:00
Tobias Gruetzmacher
bbe3d9f9be Remove WebDesignerCOTW 2020-09-30 21:16:55 +02:00
Tobias Gruetzmacher
437afc685d Add GenshinImpact to Webtoons module 2020-09-30 18:24:33 +02:00
Tobias Gruetzmacher
e4f1f59432 Fix ButtercupFestival 2020-09-29 23:31:38 +02:00
Tobias Gruetzmacher
dc41c318e9 Fix ButterSafe 2020-09-29 23:04:56 +02:00
Tobias Gruetzmacher
f31c0c681d Fix BusinessCat and mark as endOfLife 2020-09-29 23:00:58 +02:00
Tobias Gruetzmacher
2364e8bab3 Fix Brink and mark as endOfLife 2020-09-29 23:00:53 +02:00
Tobias Gruetzmacher
5afab71e7f Fix BoredAndEvil 2020-09-29 22:35:09 +02:00
Tobias Gruetzmacher
5863ddb73b Fix BookOfBiff 2020-09-29 22:27:32 +02:00
Tobias Gruetzmacher
bbac2f99a7 Fix BetweenFailures 2020-09-29 22:22:55 +02:00
Tobias Gruetzmacher
3330162c6b Fix BetterDays with archive.org 2020-09-29 21:35:41 +02:00
Tobias Gruetzmacher
e98a1601ca Remove workaround for libxml2 older 2.9.3 (2015)
This workaround was written in 2016 while that version was still found
on many systems. Addionally, this workaround needs to be enabled by the
developer, who might not even be aware that they need to enable it for a
specific module. We still throw a warning to the user if running with
such an old libxml version.
2020-09-29 21:16:48 +02:00
Tobias Gruetzmacher
c065a820ed Fix BadMachinery and mark as endOfLife 2020-09-29 21:01:33 +02:00
Tobias Gruetzmacher
24efafdf96 Fix BMovieComic 2020-09-29 20:30:44 +02:00
Tobias Gruetzmacher
0fe32e7562 Remove unused f-string
Since we still want to support Python 3.5 for a bit, we should avoid
f-strings until we finally drop support for that.
2020-09-28 22:19:48 +02:00
Tobias Gruetzmacher
7e040086b6 Try to inform the user about geo-blocks
Instead of letting the crawler run into "random" error messages, throw a
specific "geoblocked" exception instead.
2020-09-28 13:11:34 +02:00
Tobias Gruetzmacher
e34a0b539c Don't rethrow RequestException as IOError
Since RequestException already is an IOError, nothing of value is lost.
2020-09-28 12:05:01 +02:00
Tobias Gruetzmacher
8d7fd8b884 Update GoComics modules
The usual: GoComics removed some comics, added some and renamed some...
2020-09-28 01:15:07 +02:00
Tobias Gruetzmacher
b090d86c9b Switch Drowtales to parser scraper (also fixes it) 2020-09-27 20:56:26 +02:00
Tobias Gruetzmacher
a7f9692ffe Add StalkerXStalker to WebToons module (fixes #173) 2020-09-27 19:21:28 +02:00
Tobias Gruetzmacher
ccb4c32c6b Fix DumbingOfAge 2020-09-27 19:16:34 +02:00
Tobias Gruetzmacher
346b3b4abd Fix ShipInABottle 2020-09-27 18:34:36 +02:00
Tobias Gruetzmacher
632433d896 Move CrapIDrewOnMyLunchBreak to WebToons
Unfortunatly, the original website is broken.
2020-09-27 18:25:33 +02:00
Tobias Gruetzmacher
85c40a942f Remove StuffNoOneToldMe 2020-09-27 15:50:48 +02:00
Tobias Gruetzmacher
8cc916b292 Fix CynWolf 2020-09-27 15:50:25 +02:00
Tobias Gruetzmacher
193b3da801 Repair EarthsongSaga (fixes #172)
Verified as fetching the complete history right now.
2020-09-26 11:29:45 +02:00
Tobias Gruetzmacher
383e759a0e Switch Achewood to ParserScraper 2020-09-26 09:42:48 +02:00
Tobias Gruetzmacher
fd779475c1 Fix first comic for Achewood (fixes #175) 2020-09-26 09:31:31 +02:00
garbled1
598cdd092c Add TheGlassScientists 2020-09-13 08:28:19 -07:00
garbled1
c870f71715 Add TheBoyWhoFell 2020-09-13 08:18:42 -07:00
garbled1
f223bc09fb Add TheForgottenOrder 2020-09-13 08:14:52 -07:00
garbled1
5fca596216 Add SisterClaire 2020-09-13 08:11:05 -07:00
garbled1
4dd52cd160 Add NeverSatisfied 2020-09-13 08:04:02 -07:00
garbled1
3b27a530d7 Add GuildedAge 2020-09-13 07:52:38 -07:00
garbled1
6fa835c928 Add ParaNatural 2020-09-13 07:48:32 -07:00
garbled1
10b419331a Add SaffronAndSage 2020-09-13 07:37:24 -07:00
garbled1
6eb0087c9f Add DaughterOfTheLilies 2020-09-13 07:25:05 -07:00
Techwolf
d4fb75b844 Add MangaDex/TheWolfAndRedRidingHood 2020-08-19 00:04:55 -07:00
Techwolf
073890c237 Add MangaDex/HangingOutWithAGamerGirl 2020-08-15 21:11:40 -07:00
Techwolf
191df7d5ed Mark SchlockMercenary as endOfLife 2020-08-14 20:51:34 -07:00
Techwolf
70c030ca32 Add MangaDex/OokamiShounenWaKyouMoUsoOKasaneru 2020-08-14 20:51:34 -07:00
Techwolf
49c02541f4 Add MangaDex/KawaiiJoushiWoKomarasetai 2020-08-14 20:51:34 -07:00
Techwolf
711d8d5e23 Fix BlackTapestries and TheRealmOfKaerwyn 2020-08-14 20:51:34 -07:00
Techwolf
103520e586 Add LazJonesAndTheMayfieldRegulators/SideStories 2020-08-14 20:51:34 -07:00
Techwolf
2c05fbb78f Fix Tamberlane 2020-08-14 20:51:34 -07:00
Techwolf
4a24afa652 Mark OriginalLife as endOfLife 2020-08-14 20:51:34 -07:00
Techwolf
88e54258d0 Fix StudioKhimera/UberQuest 2020-08-14 20:51:34 -07:00
Techwolf
1e4c291145 Add ComicFury/LustAndIre 2020-08-14 20:51:34 -07:00
Techwolf
5d27402346 Add TheNightBelongsToUs 2020-08-14 20:51:34 -07:00
Techwolf
a191f7e909 Add SoloLeveling 2020-08-14 20:51:34 -07:00
Techwolf
2ef6ebb6f8 Fix Nightshift and StarfireAgency 2020-08-14 20:51:33 -07:00
Techwolf
2688ffad91 Fix DesertFox 2020-08-14 20:51:33 -07:00
Techwolf
9764b82101 Fix Moonlace and Bethellium 2020-08-14 20:51:33 -07:00
Techwolf
c29bd2f069 Add ProjectFuture/Emily 2020-08-14 20:51:33 -07:00
Techwolf
de3b396d11 Add RayFox 2020-08-14 20:51:33 -07:00
Techwolf
4726497c06 Update WebToons comics 2020-08-14 20:50:54 -07:00
Techwolf
c362f39e3a Fix FurryExperience 2020-08-14 20:50:54 -07:00