Bug 305883 - doc import takes several tens of seconds, text in tex boxes is corrupted/unreadable
Summary: doc import takes several tens of seconds, text in tex boxes is corrupted/unre...
Status: RESOLVED WORKSFORME
Alias: None
Product: calligrawords
Classification: Applications
Component: doc (other bugs)
Version First Reported In: 2.5.0
Platform: Fedora RPMs Linux
: NOR normal
Target Milestone: ---
Assignee: Calligra Words Bugs
URL: http://www.hanzlici.cz//bugrep/textbo...
Keywords:
Depends on:
Blocks:
 
Reported: 2012-08-27 14:25 UTC by Franta Hanzlik
Modified: 2023-01-30 05:07 UTC (History)
1 user (show)

See Also:
Latest Commit:
Version Fixed In:
Sentry Crash Report:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Franta Hanzlik 2012-08-27 14:25:00 UTC
SW-generated .DOC files are well opened in MS Office, text is also displayed in catdoc utility, but it display garbage when trying open it in calligrawords. I add reference to such one file in above URL.
"file" utility print about this:
Composite Document File V2 Document, Little Endian, Os: Windows, Version 5.1, Code page: 1250, Title: Likvida, Subject: Tiskova sestava BYZNYS Win/VR, Author: UNIT PLUS s.r.o., Template: Normal, Last Saved By: jsvabova, Revision Number: 2, Name of Creating Application: Microsoft Word 9.0, Create Time/Date: Mon Jun 18 22:37:00 2012, Last Saved Time/Date: Mon Jun 18 22:37:00 2012, Number of Pages: 1, Number of Words: 42, Number of Characters: 240, Security: 0
(really, number of words in document is about 150 and number of characters is ~ 1100)

When trying open it in CW, then:
1) Opening this 78kB file takes about 13.5 minutes((!) at dual-core/4GB RAM PC. All this time progress bar stay at cca 40% and CPU is burdened by CW at 99,7 %.
2) text lines are displayed incorrectly, it seems as bottom part of chracters is cutted off.


Reproducible: Always
Comment 1 Camilla Boemann 2012-08-27 14:49:47 UTC
That sounds like way too long time indeed. for me it even crashes in the filter.

However files saved by non-msword may very well be invalid, and those other readers are maybe just better at coping with wrong files. Anyways it's something we need to investigate.
Comment 2 Franta Hanzlik 2012-08-27 17:51:47 UTC
These .doc files are probably generated by utilizing some MS tools/libraries. This indeed not means that generated files are valid - but is there some SW for their validations? I know no one...
Comment 3 Franta Hanzlik 2012-09-01 12:30:25 UTC
As newer version Calligra Words 2.5.1 now appear in my distro repo, I test this .doc opening in it (instead in previous 2.5.0) - but with same result, problem persist.
For completeness, trying open this documents in recently released OpenOffice 3.4.1 and LibreOffice 3.6.1
(both on i686 Linux) did ever worst result: there was not any important delay in document import, but at screen were only graphical boxes, lines and tables; text has be missing entirely.
Comment 4 Franta Hanzlik 2012-09-01 20:34:48 UTC
I reported this problem to (Apache) Openoffice bugzilla too. There was quick reaction and ascertainment, that my document, when opened in MS Word 2010 and without changes saved, then it is without problem correctly opened in Openoffice. See:
https://issues.apache.org/ooo/show_bug.cgi?id=120790

But when I try open this re-saved document in Calligra Words 2.5.1, then import still take very long time (~ 14 minutes,  progress bar stay at cca 40%, ... - as described above), but then document appear almost OK - with exception upper left corner of document, where are some garbage. But remainder of page look well.
Comment 5 Franta Hanzlik 2013-03-30 18:21:22 UTC
I just tested this issue with Calligra Words version 2.6.1, still same bad result.
Comment 6 Andrew Crouthamel 2018-11-10 03:14:09 UTC
Dear Bug Submitter,

This bug has been stagnant for a long time. Could you help us out and re-test if the bug is valid in the latest version? I am setting the status to NEEDSINFO pending your response, please change the Status back to REPORTED when you respond.

Thank you for helping us make KDE software even better for everyone!
Comment 7 Andrew Crouthamel 2018-11-20 03:58:09 UTC
Dear Bug Submitter,

This is a reminder that this bug has been stagnant for a long time. Could you help us out and re-test if the bug is valid in the latest version? This bug will be moved back to REPORTED Status for manual review later, which may take a while. If you are able to, please lend us a hand.

Thank you for helping us make KDE software even better for everyone!
Comment 8 Justin Zobel 2022-12-31 00:24:34 UTC
Thank you for reporting this issue in KDE software. As it has been a while since this issue was reported, can we please ask you to see if you can reproduce the issue with a recent software version?

If you can reproduce the issue, please change the status to "REPORTED" when replying. Thank you!
Comment 9 Bug Janitor Service 2023-01-15 05:10:46 UTC
Dear Bug Submitter,

This bug has been in NEEDSINFO status with no change for at least
15 days. Please provide the requested information as soon as
possible and set the bug status as REPORTED. Due to regular bug
tracker maintenance, if the bug is still in NEEDSINFO status with
no change in 30 days the bug will be closed as RESOLVED > WORKSFORME
due to lack of needed information.

For more information about our bug triaging procedures please read the
wiki located here:
https://community.kde.org/Guidelines_and_HOWTOs/Bug_triaging

If you have already provided the requested information, please
mark the bug as REPORTED so that the KDE team knows that the bug is
ready to be confirmed.

Thank you for helping us make KDE software even better for everyone!
Comment 10 Bug Janitor Service 2023-01-30 05:07:27 UTC
This bug has been in NEEDSINFO status with no change for at least
30 days. The bug is now closed as RESOLVED > WORKSFORME
due to lack of needed information.

For more information about our bug triaging procedures please read the
wiki located here:
https://community.kde.org/Guidelines_and_HOWTOs/Bug_triaging

Thank you for helping us make KDE software even better for everyone!