~alcinnz/hurl

ref: 4136c1ee4bd3fb92dfa1b21357d3ce7d6649e825 hurl/ISSUES/charset-sniffing.md -rw-r--r-- 298 bytes
4136c1ee — Adrian Cochrane Merge branch 'main' of adrian.geek.nz:/srv/git/hurl into main 2 years ago
# Optimize Charset Sniffing

Almost all charsets are supersets of ASCII, so when sniffing the charset of a file whose MIME type doesn't specify the encoding, I can treat all the text preceding the in-file charset declaration as ASCII. Though I suppose for this trick to work on UTF-16 or UTF-32 I'd need to strip out any zero bytes first.
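
The idea above could be sketched roughly as follows. This is only an illustration in Python, not Hurl's actual code: the function name `sniff_charset`, the `limit` parameter, and the declaration pattern are all assumptions made up for the example. It drops zero bytes from the start of the file, so that ASCII-range text in UTF-16 or UTF-32 collapses to plain ASCII, and then searches the result for a `charset=` or `encoding=` declaration.

```python
import re
from typing import Optional

# Hypothetical pattern for the example: matches declarations such as
# <meta charset="..."> or <?xml ... encoding="..."?>.
_DECLARATION = re.compile(
    rb"""(?:charset|encoding)\s*=\s*["']?([A-Za-z0-9._:-]+)""",
    re.IGNORECASE,
)


def sniff_charset(data: bytes, limit: int = 1024) -> Optional[str]:
    """Guess a charset from an in-file declaration, or return None.

    Only the first `limit` bytes are inspected (an arbitrary choice here).
    """
    # Removing zero bytes turns ASCII-range UTF-16/UTF-32 text into ASCII.
    prefix = data[:limit].replace(b"\x00", b"")
    match = _DECLARATION.search(prefix)
    if match is None:
        return None
    # The captured name contains only ASCII characters by construction.
    return match.group(1).decode("ascii").lower()
```

Stripping zero bytes only recovers characters in the ASCII range; any other UTF-16 or UTF-32 code units turn into meaningless bytes, which should not matter as long as the declaration itself is ASCII.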