8768ff07b6
HTML character encoding in the context of HTTP is quite tricky to get right and honestly, I'm not sure if I did get it right this time. But I think, the current behaviour matches best what web browsers try to do: 1. Let Requests figure out the content from the HTTP header. This overrides everything else. We need to "trick" LXML to accept our decision if the document contains an XML declaration which might disagree with the HTTP header. 2. If the HTTP headers don't specify any encoding, let LXML guess the encoding and be done with it. |
||
---|---|---|
.. | ||
__init__.py | ||
a.py | ||
arcamax.py | ||
b.py | ||
c.py | ||
clonemanga.py | ||
comicfury.py | ||
comicgenesis.py | ||
common.py | ||
creators.py | ||
d.py | ||
drunkduck.py | ||
e.py | ||
f.py | ||
footloosecomic.py | ||
g.py | ||
gocomics.py | ||
h.py | ||
i.py | ||
j.py | ||
k.py | ||
keenspot.py | ||
kindofnormal.py | ||
l.py | ||
m.py | ||
n.py | ||
nitrocosm.py | ||
nuklearpower.py | ||
num.py | ||
o.py | ||
p.py | ||
pensandtales.py | ||
petitesymphony.py | ||
q.py | ||
r.py | ||
s.py | ||
sandraandwoo.py | ||
smackjeeves.py | ||
snafu.py | ||
t.py | ||
u.py | ||
v.py | ||
w.py | ||
webcomiceu.py | ||
webcomicfactory.py | ||
webcomicnation.py | ||
wlpcomics.py | ||
wordpress.py | ||
x.py | ||
y.py | ||
z.py |