Bug 153611 - Automatic encoding detection fails for UTF-8 HTML with byte order mark
Summary: Automatic encoding detection fails for UTF-8 HTML with byte order mark
Status: RESOLVED WORKSFORME
Alias: None
Product: konqueror
Classification: Applications
Component: general (show other bugs)
Version: 3.5
Platform: unspecified Linux
: NOR normal
Target Milestone: ---
Assignee: Konqueror Developers
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2007-12-07 14:52 UTC by Tap
Modified: 2018-06-11 16:43 UTC (History)
1 user (show)

See Also:
Latest Commit:
Version Fixed In:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Tap 2007-12-07 14:52:42 UTC
Version:           3.5.8 (using KDE 3.5.8, compiled sources)
Compiler:          Target: i486-slackware-linux
OS:                Linux (i686) release 2.6.21.1

An HTML file with a byte-order mark and the encoding set in a META tag is not correctly detected as UTF-8. The layout breaks very noticeably unless the encoding is manually set to "Unicode (utf8)".

Example URL: http://www.brandonsanderson.com/book/Alcatraz/

(This example may not be great because it doesn't validate and doesn't have an <?xml?> tag despite claiming to be XHTML.)

There are other bugs similar to this one in the database, but I'm reporting it because they all seem to have been closed as resolved years ago.
Comment 1 Jaime Torres 2008-05-30 18:06:55 UTC
It renders the page as firefox 3 rc1 with konqueror 4 trunk 813735
Comment 2 Rui G. 2008-06-04 02:19:52 UTC
I confirm this in 3.5.9 . In trunk r816330 there is no need to set the encoding, it renders fine.
Comment 3 Maksim Orlovich 2008-08-03 19:01:04 UTC
Works fine in 4.x indeed.