~alcinnz/hurl: ISSUES/charset-sniffing.md - Argonaut Constellation git

~alcinnz/hurl

hurl/ISSUES/charset-sniffing.md -rw-r--r-- 298 bytes

41ee21d2 — Adrian Cochrane Broaden base dependency bounds, fix readStrict regression. 9 months ago

View Rendered
View Source

#Optimize Charset Sniffing

Almost all charsets are supersets of ASCII, so when sniffing the charset for files which don't specify the encoding in their MIMEtype I can treat all the preceding text as ASCII. Though I suppose for this trick to work on UTF16 or UTF32 I'd need to remove any 0 bytes.