Version: (using KDE KDE 3.5.5) Installed from: Gentoo Packages OS: Linux UTF-8 text files labelled as containing "very long lines" (more than 300 bytes, in KDE -`file`, however, counts any Unicode character as one character, but KDE do not inform the user that the file contains "very long lines", so it does not really matter), in KDE (in Konqueror or KMail, for example), are typed as "Unknown". It means problems with file associations, no preview, possible problems with antivirus/antispam filters (for emails -these files are sent with the "application/octet-stream" MIME type (without a charset, by the way, though it does not matter much, as the files cannot be previewed from the email client)), and general confusion which results from these problems. There is no problem with ASCII files, even if they contain "very long lines" (they are properly typed as "Plain Text Document"). UTF-8 text files, without "very long lines", are properly typed too. If you want to test, simply create a file, with a single line, containing less than 300 bytes, and some UTF-8 non-ASCII characters (like "é"), and another file, with a line containing more than 300 bytes (and some UTF-8 non-ASCII characters). Then, just check the type in Konqueror or KMail. KDE should properly detect the MIME type of UTF-8 text files, containing "very long lines" (that is, "text/plain").
This occurs both with Konqueror 3.4.0 using kde 3.4.0, Suse 9.3 and konq 3.5.6 Kubuntu 7.0.4 Perry
Reproduced in kde 4.0.3 (f9). The file program reports UTF-8 Unicode text, with very long lines.
The problem is still there in Opensuse 11.0 kde 3.5.9 and 4.1 .
Please provide test files (and reopen the bug). My own tests work. $ kmimetypefinder testfile text/plain $ kmimetypefinder testfile_long text/plain $ file testfile* testfile: UTF-8 Unicode text testfile_long: UTF-8 Unicode text, with very long lines