Bug 165796 - Akregator parses plain text containing html entities as a clickable link.
Summary: Akregator parses plain text containing html entities as a clickable link.
Status: RESOLVED WORKSFORME
Alias: None
Product: akregator
Classification: Applications
Component: feed parser (show other bugs)
Version: unspecified
Platform: Fedora RPMs Linux
: NOR normal
Target Milestone: ---
Assignee: kdepim bugs
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2008-07-05 16:57 UTC by Chris Fritz
Modified: 2011-03-21 22:10 UTC (History)
2 users (show)

See Also:
Latest Commit:
Version Fixed In:


Attachments
A copy of the HTML of the mis-parsed post. (9.69 KB, text/html)
2008-07-05 16:59 UTC, Chris Fritz
Details
Akregator's rendering of the HTML. (95.74 KB, image/png)
2008-07-05 17:03 UTC, Chris Fritz
Details
Opera's rendering of the HTML. (68.68 KB, image/png)
2008-07-05 17:06 UTC, Chris Fritz
Details
View of page in Konqueror (within Akregator) (67.79 KB, image/png)
2008-07-05 20:48 UTC, Chris Fritz
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Chris Fritz 2008-07-05 16:57:52 UTC
Version:           1.2.9 (using KDE 3.5.9)
Installed from:    Fedora RPMs

The following texts are improperly parsed when viewed in Akregator (not using the Konqueror view).  Hopefully nothing in this text is parsed as HTML, or it won't make sense.

<a href="http://particletree.com">Particletree</a>

This becomes a link with the text: Particletree

<a href="http://particletree.com&qout;>Particletree</a>

This becomes the text: <a href="http://particletree.com&qout>Particletree</a>

&lt;a href="http://particletree.com"&gt;Particletree&lt;/a&gt;

This becomes a link with the text: Particletree

&lt;a href="http://particletree.com"&gt;Particletree&lt;/a&gt;

This becomes a link with the text: Particletree

&lt;a href="&lt;a href="http://particletree.com"&gt;http://particletree.com&lt;/a&gt;"&gt;Particletree&lt;/a&gt;

This becomes a link with the text: http://particletree.com">Particletree
(Note that this text is linked up to and excluding the " text.

The expected behavior is for every line to be plain text HTML.  The expected behavior is not for the HTML entities to be parsed as HTML.
Comment 1 Chris Fritz 2008-07-05 16:59:50 UTC
Created attachment 25856 [details]
A copy of the HTML of the mis-parsed post.

I forgot to add the URL of the mis-parsed post.  It is
http://particletree.com/notebook/smarter-auto-linking/	I've attached a copy of
the page in case it's deleted or edited in the future.
Comment 2 Chris Fritz 2008-07-05 17:03:37 UTC
Created attachment 25857 [details]
Akregator's rendering of the HTML.

This screenshots shows how the text is being parsed in Akregator.
Comment 3 Chris Fritz 2008-07-05 17:06:26 UTC
Created attachment 25858 [details]
Opera's rendering of the HTML.

Here is the proper rendering for the HTML.  Although Opera was used for this
screenshot, Konqueror shows the same.  This shows how the HTML entitied text
should appear (unparsed) in content.
Comment 4 Christophe Marin 2008-07-05 19:21:34 UTC
Works as expected in Akregator for KDE4.

Try the page you mentioned in konqueror, how is it rendered ? 
Comment 5 Chris Fritz 2008-07-05 20:48:20 UTC
Created attachment 25862 [details]
View of page in Konqueror (within Akregator)

The view in Konqueror matches Opera's view.  This screenshot shows it in
Konqueror within Akregator (maybe I have the naming wrong, is it more properly
said as a Konqueror KPart within Akregator?)  So, Konqueror is parsing the HTML
entities properly as HTML entities, whereas somewhere along the line
Akregator's own view of the content is decoding entities, replacing them with
the characters they represent (i.e. replacing &gt; with > and &quot; with ")

I'd check to see if this is already fixed in Akregator for KDE4, but I can't
run that version due to some crashing bugs (which have already been reported
here by others).
Comment 6 Frank Osterfeld 2011-03-21 22:10:25 UTC
That should be fixed in KDE4 (reopen if you have counter-example).