Bug 144540

Summary: Akregator unable to parse RSS/XML on empty header
Product: [Applications] akregator Reporter: Ido Kanner <idokan>
Component: feed parser Assignee: kdepim bugs <kdepim-bugs>
Status: REPORTED ---    
Severity: wishlist CC: de.meyer.maarten, gianogli
Priority: NOR    
Version: unspecified   
Target Milestone: ---   
Platform: Ubuntu   
OS: Linux   
Latest Commit: Version Fixed In:
Attachments: XML file

Description Ido Kanner 2007-04-22 23:51:49 UTC
Version:           1.2.6 (using KDE 3.5.6)
Installed from:    Ubuntu Packages
OS:                Linux

When an RSS/XML feed is read and the document starts with an empty line instead of the XML declaration "<?xml version="1.0" encoding="UTF-8"?>" (which then appears on the second line or later), Akregator fails to read and parse the feed.

According to the standards, Akregator's behaviour is valid. However, free hosting providers such as "Go Daddy" attempt to place banners on the first lines of served pages, and that breaks the feed.
Please note that when the request is for XML, the "Go Daddy" server does not insert the banner, but it still inserts empty lines (\r\n, \n, or another newline sequence) before the XML declaration.
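The failure described above can be reproduced outside Akregator with any strict XML parser. The following is a minimal sketch in Python, using the standard library's expat-based `xml.etree.ElementTree` as a stand-in for the XML libraries Akregator uses (an assumption; Akregator's own parser is not shown here). The `FEED` content and the `parses` helper are hypothetical names made up for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical minimal feed; the declaration is on the very first line.
FEED = (
    b'<?xml version="1.0" encoding="UTF-8"?>\n'
    b'<rss version="2.0"><channel><title>Example</title></channel></rss>'
)


def parses(data: bytes) -> bool:
    """Return True if a strict XML parser accepts the document."""
    try:
        ET.fromstring(data)
        return True
    except ET.ParseError:
        return False


# Well-formed: the XML declaration is the first thing in the document.
print(parses(FEED))

# Same feed with an empty line in front, as served by the affected hosts.
# The XML declaration is no longer at the start, so the parser rejects it.
print(parses(b"\r\n" + FEED))
```

The second call fails because the XML specification only allows the declaration at the very start of the document, which is why a standards-conforming parser refuses the feed.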
Comment 1 Eckhart Wörner 2007-04-23 02:27:08 UTC
You'd better ask Go Daddy to fix this at their servers. It's quite unlikely this will be fixed in Akregator at all (as Akregator uses standard XML libraries to access the XML data).
Comment 2 Ido Kanner 2007-04-23 10:31:37 UTC
I agree with you that it's a bug on GoDaddy's side. That's why I'm reporting it as a wishlist.

However, I do believe that XML parsers should be able to cope with this type of bug.
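As an illustration of what coping with this could look like, here is a minimal sketch in Python. It is not Akregator code: the `parse_lenient` helper and the sample feed are hypothetical, and the standard library's `xml.etree.ElementTree` stands in for whatever XML library a reader uses. It only drops leading whitespace before handing the data to the strict parser, so it would not help when a banner or other non-whitespace content precedes the declaration.

```python
import xml.etree.ElementTree as ET


def parse_lenient(data: bytes) -> ET.Element:
    """Parse a feed, tolerating blank lines before the XML declaration.

    Assumption: the only defect is leading ASCII whitespace (\r, \n,
    spaces, tabs). Anything else is still left to the strict parser.
    """
    return ET.fromstring(data.lstrip())


# Hypothetical feed served with empty lines in front of the declaration.
broken = (
    b"\r\n\n"
    b'<?xml version="1.0" encoding="UTF-8"?>\n'
    b'<rss version="2.0"><channel><title>Example</title></channel></rss>'
)

root = parse_lenient(broken)
print(root.tag, root.findtext("channel/title"))
```

Stripping the input before parsing keeps the parser itself strict, so genuinely malformed feeds are still rejected.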
Comment 3 Maarten De Meyer 2014-01-15 19:26:01 UTC
Hi,
Could you please give an example of a feed with an empty header?
A quick search for a feed from a GoDaddy site turned up nothing.
Comment 4 gianogli 2020-07-14 12:13:37 UTC
Created attachment 130112 [details]
XML file
Comment 5 gianogli 2020-07-14 12:14:31 UTC
I have the same issue with the RSS/XML page of the "Computer Security Incident Response Team - Italy".
You can find it at https://csirt.gov.it/data/indexer/rss

It starts without the header, but it works in other readers. I attached the XML file.