Software-generated .DOC files open fine in MS Office, and their text is also displayed by the catdoc utility, but Calligra Words (CW) shows garbage when opening them. I have linked one such file in the URL field above. The "file" utility reports this about it: Composite Document File V2 Document, Little Endian, Os: Windows, Version 5.1, Code page: 1250, Title: Likvida, Subject: Tiskova sestava BYZNYS Win/VR, Author: UNIT PLUS s.r.o., Template: Normal, Last Saved By: jsvabova, Revision Number: 2, Name of Creating Application: Microsoft Word 9.0, Create Time/Date: Mon Jun 18 22:37:00 2012, Last Saved Time/Date: Mon Jun 18 22:37:00 2012, Number of Pages: 1, Number of Words: 42, Number of Characters: 240, Security: 0 (In reality the document contains about 150 words and roughly 1100 characters.) When I try to open it in CW: 1) Opening this 78 kB file takes about 13.5 minutes (!) on a dual-core PC with 4 GB RAM. For the whole time the progress bar stays at about 40% and CW keeps the CPU at 99.7% load. 2) The text lines are displayed incorrectly; the bottom part of the characters appears to be cut off. Reproducible: Always
That does sound like far too long. For me it even crashes in the import filter. However, files saved by applications other than MS Word may well be invalid, and the other readers may simply be better at coping with malformed files. Either way, it is something we need to investigate.
These .doc files are probably generated using some MS tools or libraries. That does not guarantee that the generated files are valid, of course, but is there any software for validating them? I do not know of any.
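On the validation question, a complete validator is beyond a short example, but a basic sanity check of the container is easy to sketch. The "file" output above identifies the document as a Composite Document File V2 (the OLE2 / Compound File Binary container), whose first 512 bytes form a fixed header. The Python sketch below checks only a few of those header fields; the function names `check_cfb_header` and `check_file` are made up for illustration, and a clean result says nothing about the Word data stored inside the container.

```python
import struct

# Signature that starts every Compound File Binary (OLE2) file.
CFB_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")


def check_cfb_header(data):
    """Return a list of problems found in the 512-byte container header.

    Hypothetical helper: it inspects only the signature, version,
    byte-order mark and sector shift, nothing else.
    """
    if len(data) < 512:
        return ["file is shorter than the 512-byte header"]

    problems = []
    if data[:8] != CFB_MAGIC:
        problems.append("signature is not D0 CF 11 E0 A1 B1 1A E1")

    # Offsets 24..31: minor version, major version, byte order, sector shift.
    minor, major, byte_order, sector_shift = struct.unpack_from("<HHHH", data, 24)
    if major not in (3, 4):
        problems.append("unexpected major version %d" % major)
    if byte_order != 0xFFFE:
        problems.append("byte-order mark is 0x%04X, expected 0xFFFE" % byte_order)

    # Version 3 files use 512-byte sectors (shift 9), version 4 uses 4096 (shift 12).
    expected_shift = {3: 9, 4: 12}.get(major)
    if expected_shift is not None and sector_shift != expected_shift:
        problems.append("sector shift %d does not match version %d" % (sector_shift, major))
    return problems


def check_file(path):
    """Read the header of the file at `path` and check it."""
    with open(path, "rb") as handle:
        return check_cfb_header(handle.read(512))
```

Calling `check_file("report.doc")` (the file name is a placeholder) returns an empty list when these header fields look sane, and a list of messages otherwise.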
Now that the newer Calligra Words 2.5.1 has appeared in my distro's repository, I tested opening this .doc file with it (instead of the previous 2.5.0). The result is the same; the problem persists. For completeness: opening this document in the recently released OpenOffice 3.4.1 and LibreOffice 3.6.1 (both on i686 Linux) gave an even worse result. There was no significant delay during import, but only graphical boxes, lines and tables appeared on screen; the text was missing entirely.
I reported this problem to the (Apache) OpenOffice Bugzilla too. The reaction was quick, and it was found that when my document is opened in MS Word 2010 and saved again without changes, it then opens correctly in OpenOffice. See: https://issues.apache.org/ooo/show_bug.cgi?id=120790 But when I try to open this re-saved document in Calligra Words 2.5.1, the import still takes a very long time (about 14 minutes, with the progress bar staying at about 40%, as described above). After that the document appears almost correct, except for the upper left corner, where there is some garbage. The rest of the page looks fine.
I just tested this issue with Calligra Words 2.6.1; the result is still just as bad.
Dear Bug Submitter, This bug has been stagnant for a long time. Could you help us out and re-test if the bug is valid in the latest version? I am setting the status to NEEDSINFO pending your response, please change the Status back to REPORTED when you respond. Thank you for helping us make KDE software even better for everyone!
Dear Bug Submitter, This is a reminder that this bug has been stagnant for a long time. Could you help us out and re-test if the bug is valid in the latest version? This bug will be moved back to REPORTED Status for manual review later, which may take a while. If you are able to, please lend us a hand. Thank you for helping us make KDE software even better for everyone!
Thank you for reporting this issue in KDE software. As it has been a while since this issue was reported, can we please ask you to see if you can reproduce the issue with a recent software version? If you can reproduce the issue, please change the status to "REPORTED" when replying. Thank you!
Dear Bug Submitter, This bug has been in NEEDSINFO status with no change for at least 15 days. Please provide the requested information as soon as possible and set the bug status as REPORTED. Due to regular bug tracker maintenance, if the bug is still in NEEDSINFO status with no change in 30 days the bug will be closed as RESOLVED > WORKSFORME due to lack of needed information. For more information about our bug triaging procedures please read the wiki located here: https://community.kde.org/Guidelines_and_HOWTOs/Bug_triaging If you have already provided the requested information, please mark the bug as REPORTED so that the KDE team knows that the bug is ready to be confirmed. Thank you for helping us make KDE software even better for everyone!
This bug has been in NEEDSINFO status with no change for at least 30 days. The bug is now closed as RESOLVED > WORKSFORME due to lack of needed information. For more information about our bug triaging procedures please read the wiki located here: https://community.kde.org/Guidelines_and_HOWTOs/Bug_triaging Thank you for helping us make KDE software even better for everyone!