715 commits

Author SHA1 Message Date
Tobias Gruetzmacher
d495d95ee0 Refactor: Move repeated check into its own function. 2014-10-13 21:29:54 +02:00
Tobias Gruetzmacher
3235b8b312 Pass unicode strings to lxml.
This reverts commit fcde86e9c0 & some
more. This lets python-requests do all the encoding stuff and leaves
LXML with (hopefully) clean unicode HTML to parse.
2014-10-13 19:39:48 +02:00
zac9
6ca200419a Update s.py 2014-09-28 19:48:26 -07:00
zac9
5b7ab5a711 Update o.py 2014-09-28 19:41:29 -07:00
zac9
491b5457b2 Added comic ShotgunShuffle 2014-09-28 06:29:02 -07:00
Bastian Kleineidam
731291979d Fixed RedMeat. 2014-09-22 22:14:31 +02:00
Bastian Kleineidam
e43694c156 Don't crash on multiple HTML output runs per day. 2014-09-22 22:00:16 +02:00
Bastian Kleineidam
e87f5993b8 Merge branch 'master' into htmlparser 2014-08-07 18:10:15 +02:00
Tobias Gruetzmacher
08175d28c9 Fix Ruthe (see #73). 2014-07-31 21:27:49 +02:00
Tobias Gruetzmacher
ca2d722d39 Fix DieFruehreifen (closes #73). 2014-07-31 21:18:15 +02:00
Tobias Gruetzmacher
6c7fb176b1 Add Blade Kitten as an example for the new parser. 2014-07-26 11:28:43 +02:00
Tobias Gruetzmacher
f9f0b75d7c Create new HTML parser based scraper class. 2014-07-26 11:28:43 +02:00
Tobias Gruetzmacher
fcde86e9c0 Change getPageContent to (optionally) return raw text.
This allows LXML to do its own "magic" encoding detection
2014-07-26 11:28:43 +02:00
Tobias Gruetzmacher
0e03eca8f0 Move all regular expression operations into the new class.
- Move fetchUrls, fetchUrl and fetchText.
- Move base URL handling.
2014-07-26 11:28:43 +02:00
Tobias Gruetzmacher
fde1fdced6 Fix some typos. 2014-07-26 11:28:43 +02:00
Tobias Gruetzmacher
2567bd4e57 Convert starters and other helpers to new interface.
This allows those starters to work with future scrapers.
2014-07-26 11:28:43 +02:00
Tobias Gruetzmacher
4265053846 Refactor: Move regular expression scraping into a new class.
- This also makes "<base href>" handling an internal detail of the regular
  expression scraper; future scrapers might not need that or might handle it in
  another way.
2014-07-26 11:28:43 +02:00
Bastian Kleineidam
3a929ceea6 Allow comic text to be optional. Patch from TobiX 2014-07-24 20:49:57 +02:00
Bastian Kleineidam
950dd2932c Remove stray print statement. 2014-07-21 20:20:15 +02:00
Tobias Gruetzmacher
ea5d533e30 Fix index lookups for SnowFlame and SnowFlakes. 2014-07-19 13:23:42 +02:00
Bastian Kleineidam
4d49d4394b Fix doc 2014-07-03 18:42:06 +02:00
Bastian Kleineidam
f194e430bc TheThinHLine: fetch bigger images and name image files from sequence number. 2014-07-03 18:41:25 +02:00
Bastian Kleineidam
4845a4ccc1 Merge branch 'master' of github.com:wummel/dosage 2014-07-03 17:12:42 +02:00
Bastian Kleineidam
641daa738b Updated list of comics 2014-07-03 17:12:25 +02:00
Bastian Kleineidam
93fe5d5987 Minor useragent refactoring 2014-07-03 17:12:25 +02:00
Bastian Kleineidam
4c2a339e25 Fix some comics. 2014-07-02 19:51:53 +02:00
Luc Fouin
cb76198da7 added the thin H line, fixes #67 2014-07-02 17:14:33 +02:00
Luc Fouin
763f9b02a2 added the thin H line 2014-07-02 17:11:33 +02:00
Bastian Kleineidam
b03ba158ef Fixed LookingForGroup 2014-07-01 23:44:01 +02:00
Bastian Kleineidam
3485e2ac54 Added Whomp. 2014-06-24 20:48:49 +02:00
wummel
a0086bfcd8 Merge pull request #63 from sehrgut/master
Updated GirlGenius to new markup
2014-06-24 20:40:15 +02:00
Peter B
8f1c864ec3 Added Safely Endangered 2014-06-17 01:05:11 -04:00
Keith Beckman
236b840363 Updated GirlGenius to new markup
GG markup has changed, so I fixed the prevSearch regex to find the
"previous" button on the redesigned page.

As well, I set multipleImagesPerStrip to true, since there are quite a
few comics with multiple images that were being discarded.
2014-06-13 16:43:40 -04:00
Bastian Kleineidam
68afeaf82d Make appname lowercase. 2014-06-09 13:24:58 +02:00
Bastian Kleineidam
00e424aed0 Fix zenpencils. 2014-06-08 13:40:42 +02:00
Bastian Kleineidam
687d27d534 Stripping should be done in normaliseUrl. 2014-06-08 10:12:33 +02:00
Bastian Kleineidam
c528fd1822 Merge branch 'master' of github.com:wummel/dosage 2014-06-08 10:07:36 +02:00
Bastian Kleineidam
0ee5c08771 Match zoom image for GoComics pages. 2014-06-08 10:06:34 +02:00
Peter B
78954da9d7 fix StandStillStaySilent, strip urls when downloading 2014-06-04 01:58:16 -04:00
Peter B
71ed9ad69d fixed foul language 2014-06-04 01:35:40 -04:00
Bastian Kleineidam
62a3a55b82 Fixed LoadingArtist 2014-03-26 19:59:42 +01:00
Bastian Kleineidam
813e6876fc Add missing @classmethod 2014-03-26 19:59:42 +01:00
Bastian Kleineidam
c2cf58560e Remove unused import. 2014-03-26 19:59:42 +01:00
Bastian Kleineidam
4bb31953ad Fix PennyArcade 2014-03-26 19:59:42 +01:00
Freestila
0faf4a722b Update o.py
Removed procedure for "I am over 18" button, since this button no longer exists
2014-03-05 09:28:34 +01:00
Bastian Kleineidam
348dd5e6c0 Add documentation 2014-03-04 20:53:19 +01:00
Bastian Kleineidam
3108c9124a Fix thread import for py3 2014-03-04 20:50:34 +01:00
Bastian Kleineidam
18972d3830 Remove old waitSeconds parameter. 2014-03-04 18:38:46 +01:00
Bastian Kleineidam
15ef59262a Make threads interruptable. 2014-03-04 18:38:46 +01:00
Tobias Gruetzmacher
33801376f9 Fix indentation. 2014-02-27 22:31:21 +01:00
Tobias Gruetzmacher
1bcac66c03 Mark MonsieurLeChien as french. 2014-02-27 22:30:02 +01:00
Tobias Gruetzmacher
8e2ba15410 Merge pull request #60 from Freestila/master
Added comics - looks good
2014-02-27 22:24:57 +01:00
Luc Fouin
da9f518a7a add french commit M. Le Chien 2014-02-27 17:45:29 +01:00
Freestila
53ebb51b10 Added comic DungeonsAndDenizens 2014-02-27 15:08:07 +01:00
Freestila
b8fefb37c0 Added comic Underling 2014-02-20 12:54:40 +01:00
Freestila
3d19d45e81 Added 1 sec wait because of permanent timeout / connection pool exceeded errors from server 2014-02-20 12:54:13 +01:00
Freestila
67c31284f1 Added comic GrimTales from Down Below 2014-02-18 21:12:29 +01:00
Freestila
de0bb1c9d5 Added comic "The Landscaper" 2014-02-18 21:00:43 +01:00
Freestila
96f61542ee Added comic "Die Fruehreifen" 2014-02-18 21:00:19 +01:00
Peter B
b44b751efa Fixed EvilInc comics. Closes #58 2014-02-14 19:33:13 -05:00
Bastian Kleineidam
f50ef910be Skip CyanideAndHappiness videos 2014-02-10 21:58:26 +01:00
Bastian Kleineidam
875e431edc Provide page data in shouldSkipUrl() function 2014-02-10 21:58:09 +01:00
Bastian Kleineidam
73e1af7aba Fixed FredoAndPidjin 2014-02-06 19:57:56 +01:00
Peter B
d86442efed Added Oh Joy Sex Toy. 2014-01-30 22:45:50 -05:00
Peter B
add63d6d6c Added The Gentleman's Armchair Comic. 2014-01-30 22:32:46 -05:00
Tobias Gruetzmacher
44ef1831bf Sluggy Freelance has some pages with multiple comics.
See for example SluggyFreelance:010422
2014-01-28 19:08:39 +01:00
wummel
6b8854e7b2 Merge pull request #55 from Lugoues/upstream
Added MrLovenstein Comic
2014-01-26 05:49:50 -08:00
Bastian Kleineidam
cc5ee572fb Fix some comics 2014-01-24 23:17:21 +01:00
Peter B
66f6b08163 Added MrLovenstein Comic 2014-01-23 20:23:24 -05:00
Bastian Kleineidam
1a56fbb3dd Fix DemolitionSquad 2014-01-20 19:01:47 +01:00
Bastian Kleineidam
8b0f149c2b Updated copyright 2014-01-19 13:16:22 +01:00
Peter B
740bcb72ce Added Eat That Toast 2014-01-12 19:08:02 -05:00
Peter B
124cf99665 Added Poorly Drawn lines replacing GoComic's version. 2014-01-12 19:08:02 -05:00
Bastian Kleineidam
e738454cb1 Correct drunkduck disablement comment. 2014-01-11 20:04:52 +01:00
Peter B
d0031b65c8 Added "Stand Still. Stay Silent." comic. 2014-01-08 11:08:19 -05:00
Bastian Kleineidam
69bffc9c92 Fix invalid description. 2014-01-06 16:25:42 +01:00
Bastian Kleineidam
264a20a4db Disable disallowed drunkduck comics. 2014-01-06 09:58:24 +01:00
Bastian Kleineidam
3f4be55332 Merge branch 'upstream' of https://github.com/Lugoues/dosage into Lugoues-upstream 2014-01-06 09:38:25 +01:00
Bastian Kleineidam
d98c2a52dd Skip phdcomic video URL. 2014-01-06 08:20:58 +01:00
Peter B
ceca4ba102 Added FoulLanguage Comic 2014-01-06 00:34:37 -05:00
Peter B
1de57ea1fe added Camp Comic 2014-01-05 23:09:19 -05:00
Bastian Kleineidam
ef17268ace Fix comic list output. 2014-01-05 17:37:13 +01:00
Bastian Kleineidam
5fe48d013a Increase wait interval. 2014-01-05 17:14:19 +01:00
Bastian Kleineidam
4d63920434 Updated copyright. 2014-01-05 16:50:57 +01:00
Bastian Kleineidam
b6c913e2d5 Wait some time between requests. 2014-01-05 16:23:45 +01:00
Bastian Kleineidam
1affe58370 Use thread name in log output. 2014-01-05 16:17:34 +01:00
Bastian Kleineidam
bb18295798 Use realpath to detect symlinked instances. 2014-01-05 11:16:57 +01:00
Bastian Kleineidam
d9edeb1343 Limit cyanideandhappiness filename length 2014-01-05 11:08:15 +01:00
Bastian Kleineidam
9172aba146 Remove stray print 2014-01-05 10:50:25 +01:00
Bastian Kleineidam
1f38895681 Ensure only one instance of dosage is running to prevent accidental DoS on sites with multiple comics. 2014-01-05 10:36:22 +01:00
Bastian Kleineidam
732b50811d Only ensure the maximum width. 2013-12-22 13:38:29 +01:00
Bastian Kleineidam
f488935072 Fix AbstruseGoose and QuestionableContent. 2013-12-22 08:01:58 +01:00
Bastian Kleineidam
a1a773dd52 Fix loader in frozen executables. 2013-12-18 20:55:23 +01:00
Bastian Kleineidam
5c5aa166c7 Fix gocomic image matcher 2013-12-12 22:54:03 +01:00
Bastian Kleineidam
799d3040f0 Refactoring 2013-12-11 17:54:39 +01:00
Bastian Kleineidam
f23aa86a2c Get larger Gocomic images. 2013-12-11 17:53:52 +01:00
Bastian Kleineidam
b5d973e2d4 Only resize really big images. 2013-12-11 00:01:29 +01:00
Bastian Kleineidam
5ad423c15e Limit image size also in HTML. 2013-12-10 19:59:19 +01:00
Bastian Kleineidam
c3078ed855 Added EdmundFinney, Gaia, GaiaGerman, InternetWebcomic,
NotInventedHere, RedsPlanet, RomanticallyApocalyptic,
ScandinaviaAndTheWorld, TheGamerCat, Weregeek
2013-12-10 19:50:21 +01:00
Damjan Košir
4e40f02642 added comic Gaia in German 2013-12-10 18:02:20 +13:00
Damjan Košir
4e5717be57 added comic Gaia 2013-12-10 17:08:15 +13:00
Damjan Košir
f48b22b512 added comic Not Invented Here 2013-12-10 16:40:44 +13:00
Damjan Košir
e181b287c9 added comic Romantically Apocalyptic 2013-12-10 16:39:30 +13:00
Damjan Košir
58b62dbad3 added comic Scandinavia and the World 2013-12-10 16:37:35 +13:00
Damjan Košir
5982e27c7b added comic Red's Planet 2013-12-10 16:34:47 +13:00
Damjan Košir
4f47792dee added comic The Gamer Cat 2013-12-10 16:33:07 +13:00
Damjan Košir
b53ca04ee7 added comic Internet Webcomic 2013-12-10 16:32:16 +13:00
Damjan Košir
f095f6309e added comic Edmund Finney's Quest to Find the Meaning of Life 2013-12-10 16:31:03 +13:00
Bastian Kleineidam
67c2203e7e Ensure maximum aspect ratio in RSS images. 2013-12-08 15:55:39 +01:00
Bastian Kleineidam
df9a381ae4 Document getfp() function. 2013-12-08 11:46:26 +01:00
Bastian Kleineidam
03fff069ee Apply same file checks files as for image files. 2013-12-05 18:29:15 +01:00
Bastian Kleineidam
599672acbf Fix xkcd text regex. Closes #46 2013-12-05 18:29:15 +01:00
Bastian Kleineidam
7343932a5a Strip whitespace from image text. 2013-12-04 18:07:13 +01:00
wummel
0378c9d855 Merge pull request #45 from Lugoues/master
Store alt text from AbstruseGoose
2013-12-04 09:01:50 -08:00
Bastian Kleineidam
c583e8717e Store large xkcd images. 2013-12-04 17:56:54 +01:00
Bastian Kleineidam
0e5c59133c Provide HTML page data for image URL modifier function. 2013-12-04 17:54:55 +01:00
Peter B
36dcadc7d4 Store alt text from AbstruseGoose 2013-12-03 21:56:54 -05:00
Bastian Kleineidam
3c5424c2ef Add text in RSS and HTML output. 2013-11-29 20:32:54 +01:00
Bastian Kleineidam
142c418dc0 Store alt text from xkcd comics. 2013-11-29 20:27:11 +01:00
Bastian Kleineidam
0eaf9a3139 Add text search in comic strips. 2013-11-29 20:26:49 +01:00
Bastian Kleineidam
468b34034b cyanideandhappiness skip URL 2013-11-29 18:31:34 +01:00
Bastian Kleineidam
9514a8eeae Fixed ForLackOfABetterComic 2013-11-27 20:49:35 +01:00
Bastian Kleineidam
7d05b666da Updated RSS link name 2013-11-25 21:20:48 +01:00
Bastian Kleineidam
01085d56c2 Regenerated. 2013-11-24 12:19:54 +01:00
Bastian Kleineidam
48e417c647 Fixed some comics. 2013-11-18 22:01:30 +01:00
Bastian Kleineidam
f6fc604745 Fix GoComics image URL. 2013-11-14 21:30:51 +01:00
Bastian Kleineidam
44f8c81111 Updated from edits. 2013-11-12 20:13:58 +01:00
Bastian Kleineidam
7760985601 Fix broken comics 2013-11-12 18:33:14 +01:00
Bastian Kleineidam
45a5ef9064 Removed AetheriaEpics 2013-11-07 21:23:15 +01:00
Bastian Kleineidam
f74b18c2e5 Remove unused import. 2013-11-07 21:22:49 +01:00
Bastian Kleineidam
ca17332942 Call self.starter() on indexed comics since it might set cookies. 2013-11-07 20:48:10 +01:00
Bastian Kleineidam
74cca6bac3 Fixed oglaf comic skipping. 2013-11-07 20:47:31 +01:00
Bastian Kleineidam
1f282147dc Fix drunkduck comics. 2013-11-07 17:12:38 +01:00
Bastian Kleineidam
3e6414e0e5 Updated plugins 2013-11-07 07:28:47 +01:00
Bastian Kleineidam
86257c8364 Remove duplicate variable 2013-08-28 20:50:07 +02:00
Faldrian
93318c1d0c Added DarthsAndDroids 2013-08-19 20:14:47 +02:00
Bastian Kleineidam
ef4ae435a5 Fix several comics. 2013-07-18 20:39:53 +02:00
Bastian Kleineidam
eb4ee1a251 Add EatLiver and JimBenton 2013-07-16 18:01:44 +02:00
Bastian Kleineidam
934546954b Added MarriedToTheSea, NatalieDee 2013-07-10 18:43:53 +02:00
Bastian Kleineidam
d5172074d5 Fix some comics. 2013-07-09 22:21:17 +02:00
Bastian Kleineidam
8d5ae7b1bb Updated plugins. 2013-07-09 22:21:12 +02:00
Bastian Kleineidam
38f2e9e625 Fix typo. 2013-07-04 21:00:06 +02:00
wummel
dc6c90dbf3 Merge pull request #36 from pataluc/master
Added Go Get A Roomie
2013-07-04 11:56:31 -07:00
Bastian Kleineidam
327cb35aee Add dosagelib.__version__ 2013-07-04 20:55:43 +02:00
Bastian Kleineidam
02132893b2 Fix shermanslagoon namer. 2013-07-04 20:20:26 +02:00
Bastian Kleineidam
f78d28fba8 Merge branch 'master' of https://github.com/mbrandis/dosage into mbrandis-master 2013-07-04 20:03:11 +02:00
Bastian Kleineidam
8559184d69 Updated plugins 2013-07-04 12:22:36 +02:00
Bastian Kleineidam
a27ab5460b Add ICanBarelyDraw 2013-07-04 12:22:20 +02:00
Luc Fouin
495b1149bd added GoGetARoomie 2013-07-04 11:08:16 +02:00
mbrandis
ccf50cad89 Corrected description. 2013-06-24 22:46:39 +02:00
Bastian Kleineidam
da957ce329 Updated linuxcom 2013-06-24 20:27:43 +02:00
Bastian Kleineidam
36b8dcea04 Merge branch 'patch-2' of https://github.com/mbrandis/dosage into mbrandis-patch-2 2013-06-24 20:23:10 +02:00
Bastian Kleineidam
6bd534eaed Fix OnTheFasttrack 2013-06-24 20:19:33 +02:00
mbrandis
3b0393ccf6 Update s.py
Added Sherman's Lagoon, namer is not perfect.
2013-06-23 23:30:57 +03:00
mbrandis
265c03fc82 Update l.py
Added Linux.com Friday Funnies.
2013-06-23 22:28:08 +02:00
mbrandis
ef5ac2128b Update o.py
Please consider adding this comic.
2013-06-23 22:26:32 +02:00
Bastian Kleineidam
1c1b0aaf18 Comic fixes. 2013-05-25 23:24:33 +02:00
Bastian Kleineidam
66dccef537 Fix keenspot description type. 2013-05-22 22:29:20 +02:00
Bastian Kleineidam
ef878eed7c Updated plugins. 2013-05-22 07:19:16 +02:00
Bastian Kleineidam
b41fdf99ae Fix LookingForGroup. 2013-05-21 18:48:03 +02:00
Bastian Kleineidam
1478f22099 Output fixes. 2013-04-30 20:26:36 +02:00
Bastian Kleineidam
5f6ed7e05d More output stuff. 2013-04-30 07:24:54 +02:00
Bastian Kleineidam
1a6416eb1b Updated wormworld chapter. 2013-04-30 06:42:49 +02:00
Bastian Kleineidam
ebdc1e6359 More unicode output fixes. 2013-04-30 06:41:19 +02:00
Bastian Kleineidam
d6ca5aa7fd SnowFlakes is end-of-life. 2013-04-29 20:31:07 +02:00
Bastian Kleineidam
cec08b86dd DrFun is end-of-life. 2013-04-29 20:29:56 +02:00
Bastian Kleineidam
79273deb23 Correct output encoding. 2013-04-29 20:25:05 +02:00
Bastian Kleineidam
80d7defcd2 Unicode descriptions. 2013-04-29 07:35:56 +02:00
Bastian Kleineidam
459156fc1a Description must be unicode. 2013-04-29 07:27:59 +02:00
Bastian Kleineidam
64bf618b87 xkcd 2013-04-29 07:19:58 +02:00
Bastian Kleineidam
dcacbf0b9a Fix some comics. 2013-04-28 19:58:38 +02:00
Bastian Kleineidam
f9a48e6cb9 Updated scripted comic plugins. 2013-04-27 07:47:17 +02:00
Bastian Kleineidam
8783b53012 Fix GirlGenius strip url. 2013-04-26 19:52:45 +02:00
Bastian Kleineidam
cafa37fcb1 All scrapers must have a URL. 2013-04-26 06:53:05 +02:00
Bastian Kleineidam
05dbc51d3e Detect completed end-of-life comics. 2013-04-25 22:40:06 +02:00
Bastian Kleineidam
871de6a8ce Prefer GoComics over Creators since they have a better naming scheme. 2013-04-25 21:50:45 +02:00
Bastian Kleineidam
4716ecd71d Carciphona description. 2013-04-25 21:50:27 +02:00
Bastian Kleineidam
ba6e0c09a4 Added Unsound. 2013-04-25 21:38:18 +02:00
Bastian Kleineidam
725824f067 Added TwoGuysAndGuy 2013-04-25 21:23:31 +02:00
Bastian Kleineidam
382c4c05ad Added TheDreamlandChronicles 2013-04-25 21:20:48 +02:00
Bastian Kleineidam
8418ea471d Added SabrinaOnline. 2013-04-25 21:14:32 +02:00
Bastian Kleineidam
3e74dc9956 Fix MadamAndEve. 2013-04-25 21:09:42 +02:00
Bastian Kleineidam
c95a447305 Updated docs. 2013-04-25 21:06:20 +02:00
Bastian Kleineidam
6c773e21c7 Added Lackadaisy. 2013-04-25 21:06:12 +02:00
Bastian Kleineidam
1ae674782d Added GirlGenius 2013-04-25 20:58:24 +02:00
Bastian Kleineidam
96fc129fea Add GeneralProtectionFault (disallowed by robots.txt) 2013-04-25 20:54:48 +02:00
Bastian Kleineidam
f20df8b692 Added Curtailed. 2013-04-25 20:46:05 +02:00
Bastian Kleineidam
c114a834dd Added Carciphona 2013-04-25 20:40:15 +02:00
Bastian Kleineidam
51d84131eb Added ARedTailsDream 2013-04-25 20:37:27 +02:00
Bastian Kleineidam
dbdbdd09de Fix SMBC 2013-04-25 20:32:21 +02:00
Bastian Kleineidam
52ee7228ef Fix DorkTower image regex. 2013-04-25 19:01:38 +02:00
Bastian Kleineidam
aca3c959af Improve CtrlAltDel image names. 2013-04-25 19:01:21 +02:00
wummel
1c29f22270 Merge pull request #27 from dromaludaire/master
Fix some SMBC download links
2013-04-22 10:40:24 -07:00
Benjamin Sigonneau
f1da47edef [SMBC] Fix regexp, mainly concerns older strips 2013-04-21 22:39:06 +02:00
Sven Hartge
8e34239b27 Fix typo in regex for SandraOnTheRocks. 2013-04-20 19:59:01 +02:00
Sven Hartge
40f2aed8f0 Add Sandra on the Rocks. 2013-04-20 18:51:06 +02:00
Bastian Kleineidam
4988e79e6e Added some descriptions. 2013-04-19 06:31:12 +02:00
Bastian Kleineidam
e37a80fdc1 Add some descriptions. 2013-04-14 09:02:14 +02:00
Bastian Kleineidam
f15f993851 s/baseurl/baseUrl/g 2013-04-13 20:58:00 +02:00
Bastian Kleineidam
c246b41d64 Code formatting. 2013-04-13 08:00:11 +02:00
Bastian Kleineidam
522af89af5 Add some descriptions. 2013-04-13 08:00:03 +02:00
Bastian Kleineidam
3a03554d26 Ensure unicode output to fix encoding errors. 2013-04-12 21:02:31 +02:00
Bastian Kleineidam
35c031ca81 Fixed some comics. 2013-04-11 18:27:43 +02:00
Bastian Kleineidam
6ca4eaa492 Code cleanup. 2013-04-11 18:27:43 +02:00
Bastian Kleineidam
7e593cf7e8 Add firstStripUrls. 2013-04-10 23:57:09 +02:00
Bastian Kleineidam
a0c7f54871 Fix zwarwald 2013-04-10 20:14:43 +02:00
Bastian Kleineidam
fb05c10808 Sort entries. 2013-04-10 18:36:33 +02:00
Bastian Kleineidam
54283775a8 Add ForLackOfABetterComic 2013-04-10 18:20:39 +02:00
Bastian Kleineidam
d00310f017 Add EverydayBlues 2013-04-10 18:20:08 +02:00
Bastian Kleineidam
8b99b59056 Added DamnLol 2013-04-10 18:19:38 +02:00
Bastian Kleineidam
5127d4c895 Use re.escape and add some firstStripUrl. 2013-04-10 18:19:11 +02:00
Bastian Kleineidam
3213eebd75 Added ZenPencils. 2013-04-09 19:38:47 +02:00
Bastian Kleineidam
e040dd0d50 Added Science. 2013-04-09 19:38:16 +02:00
Bastian Kleineidam
68f14971e8 Added RealmOfAtland. 2013-04-09 19:37:47 +02:00
Bastian Kleineidam
f9179e9de5 Added GoblinsComic 2013-04-09 19:37:24 +02:00
Bastian Kleineidam
f71961acbc Added ExtraOrdinary. 2013-04-09 19:36:51 +02:00
Bastian Kleineidam
190ffcd390 Use str() for robotparser. 2013-04-09 19:36:00 +02:00
Bastian Kleineidam
b9dc385ff2 Implemented voting 2013-04-09 19:33:50 +02:00
Bastian Kleineidam
4528281ddd Voting part 2 2013-04-08 21:20:01 +02:00
Bastian Kleineidam
e762f269b7 First part of voting stuff. 2013-04-08 20:19:10 +02:00
Bastian Kleineidam
7584f0b647 Add version update check. 2013-04-08 20:17:02 +02:00
Bastian Kleineidam
781bac0ca2 Feed text content instead of binary to robots.txt parser. 2013-04-07 18:11:29 +02:00
Bastian Kleineidam
bd1d41b83c Write encoded data in binary format. 2013-04-05 19:27:30 +02:00
Bastian Kleineidam
0fbc005377 A Python3 fix. 2013-04-05 18:57:44 +02:00
Bastian Kleineidam
97522bc5ae Use tuples rather than lists. 2013-04-05 18:55:19 +02:00
Bastian Kleineidam
adb31d84af Use HTMLParser.unescape instead of rolling our own function. 2013-04-05 18:53:19 +02:00
Bastian Kleineidam
1c9f64bc27 Better name for Sketchesnatched. 2013-04-05 18:47:51 +02:00
Bastian Kleineidam
9e26640407 Augment SketcheSnatched 2013-04-05 07:31:22 +02:00
Bastian Kleineidam
50b742721b SketcheSnatched 2013-04-05 07:20:50 +02:00
Bastian Kleineidam
3936cfa9ce Another fix. 2013-04-05 06:56:33 +02:00
Bastian Kleineidam
6aa588860d Code cleanup 2013-04-05 06:36:05 +02:00
Bastian Kleineidam
fabe872d1d Fix SnowFlame 2013-04-04 18:32:37 +02:00
Bastian Kleineidam
8150dabfa6 Remove SarahZero 2013-04-04 18:32:29 +02:00
Bastian Kleineidam
5d6e210c98 Fix Curvy 2013-04-04 18:30:27 +02:00
Bastian Kleineidam
b3cbad37bc Remove CaribbeanBlue 2013-04-04 18:30:16 +02:00
Bastian Kleineidam
80c24a10c0 Fix WebDesignerCOTW 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
62af4b875e Fix Precocious 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
421e31c961 Fix Oglaf 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
d794919e73 Fix LasLindas 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
08a3587df6 Fix KatzenfutterGeleespritzer 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
c57226f1c0 Remove GreystoneInn 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
3078c5ec73 Fix ExtraLife and EyeOfRamalach 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
3fd4cfea0d Fix DasLebenIstKeinPonyhof 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
460c5be689 Add POST support to urlopen(). 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
44c3fb9f16 Remove broken scripted plugins. 2013-04-04 18:30:02 +02:00
Bastian Kleineidam
0054ebfe0b Some Python3 fixes. 2013-04-03 20:32:43 +02:00
Bastian Kleineidam
2c0ca04882 Fix warning for scrapers with multiple image patterns. 2013-04-03 20:32:19 +02:00
Bastian Kleineidam
f53a516219 Use output logging instead of print statement. 2013-04-03 20:31:10 +02:00
Bastian Kleineidam
a972729c0d Add WebDesignerCOTW 2013-04-03 20:30:51 +02:00
Bastian Kleineidam
fdab3b7b35 Add StuffNoOneToldMe 2013-04-03 20:30:29 +02:00