Bug 171368 - konqueror refuses to show pages claiming they are cyclic links
Status: RESOLVED WORKSFORME
Alias: None
Product: konqueror
Classification: Applications
Component: khtml
Version: 4.1.1
Platform: unspecified Linux
Importance: NOR normal
Target Milestone: ---
Assignee: Konqueror Developers
URL:
Keywords:
Duplicates: 166941
Depends on:
Blocks:
 
Reported: 2008-09-20 11:06 UTC by Anders Lund
Modified: 2011-05-04 23:08 UTC
CC List: 8 users

See Also:
Latest Commit:
Version Fixed In:


Description Anders Lund 2008-09-20 11:06:29 UTC
Version:           4.1.1 (KDE 4.1.1) (using 4.1.1 (KDE 4.1.1), Kubuntu packages)
Compiler:          gcc
OS:                Linux (x86_64) release 2.6.24-16-generic

Quite often, Konqueror displays a message about having detected a cyclic link. That would be fine if it were actually the case, but other browsers, such as Firefox or Konqueror from KDE 3, show those pages fine.

Example URL: http://www.bikedreams.dk/shop/saet+diverse-222c1.html
Comment 1 Rui G. 2008-09-28 02:53:53 UTC
I confirm it in trunk rev.864904 as well.
Comment 2 Maksim Orlovich 2008-09-28 03:04:03 UTC
More URL encoding goodness(?):
GET /shop/saet+diverse-222c1.html HTTP/1.1 
Connection: Keep-Alive 
User-Agent: Mozilla/5.0 (compatible; Konqueror/4.1; Linux) KHTML/4.1.2 (like Gecko) 
Accept: text/html, image/jpeg, image/png, text/*, image/*, */* 
Accept-Encoding: x-gzip, x-deflate, gzip, deflate 
Accept-Charset: utf-8, utf-8;q=0.5, *;q=0.5 
Accept-Language: en-US, en 
Host: www.bikedreams.dk 
 
HTTP/1.1 301 Moved Permanently 
Cache-Control: no-cache 
Pragma: no-cache 
Content-Length: 0 
Content-Type: text/html 
Expires: Sun, 28 Sep 2008 01:01:42 GMT 
Location: http://www.bikedreams.dk/shop/saet%2bdiverse-222c1.html 
Server: Microsoft-IIS/6.0 
X-Powered-By: ASP.NET 
P3P: CP='OUR PSA PSD BUS UNI NID DSP NOI COR' 
Set-Cookie: ASPSESSIONIDSCBCRCRA=BLNNMLGDGHGPHPOOLCOEMALH; path=/ 
Date: Sun, 28 Sep 2008 01:02:42 GMT 
 
GET /shop/saet+diverse-222c1.html HTTP/1.1 
Connection: Keep-Alive 
User-Agent: Mozilla/5.0 (compatible; Konqueror/4.1; Linux) KHTML/4.1.2 (like Gecko) 
Accept: text/html, image/jpeg, image/png, text/*, image/*, */* 
Accept-Encoding: x-gzip, x-deflate, gzip, deflate 
Accept-Charset: utf-8, utf-8;q=0.5, *;q=0.5 
Accept-Language: en-US, en 
Host: www.bikedreams.dk 
Cookie: ASPSESSIONIDSCBCRCRA=BLNNMLGDGHGPHPOOLCOEMALH 
 
HTTP/1.1 301 Moved Permanently 
Cache-Control: no-cache 
Pragma: no-cache 
Content-Length: 0 
Content-Type: text/html 
Expires: Sun, 28 Sep 2008 01:01:42 GMT 
Location: http://www.bikedreams.dk/shop/saet%2bdiverse-222c1.html 
Server: Microsoft-IIS/6.0 
X-Powered-By: ASP.NET 
P3P: CP='OUR PSA PSD BUS UNI NID DSP NOI COR' 
Date: Sun, 28 Sep 2008 01:02:42 GMT 
 
GET /shop/saet+diverse-222c1.html HTTP/1.1 
Connection: Keep-Alive 
User-Agent: Mozilla/5.0 (compatible; Konqueror/4.1; Linux) KHTML/4.1.2 (like Gecko) 
Accept: text/html, image/jpeg, image/png, text/*, image/*, */* 
Accept-Encoding: x-gzip, x-deflate, gzip, deflate 
Accept-Charset: utf-8, utf-8;q=0.5, *;q=0.5 
Accept-Language: en-US, en 
Host: www.bikedreams.dk 
Cookie: ASPSESSIONIDSCBCRCRA=BLNNMLGDGHGPHPOOLCOEMALH 
 
HTTP/1.1 301 Moved Permanently 
Cache-Control: no-cache 
Pragma: no-cache 
Content-Length: 0 
Content-Type: text/html 
Expires: Sun, 28 Sep 2008 01:01:42 GMT 
Location: http://www.bikedreams.dk/shop/saet%2bdiverse-222c1.html 
Server: Microsoft-IIS/6.0 
X-Powered-By: ASP.NET 
P3P: CP='OUR PSA PSD BUS UNI NID DSP NOI COR' 
Date: Sun, 28 Sep 2008 01:02:42 GMT 
 
GET /shop/saet+diverse-222c1.html HTTP/1.1 
Connection: Keep-Alive 
User-Agent: Mozilla/5.0 (compatible; Konqueror/4.1; Linux) KHTML/4.1.2 (like Gecko) 
Accept: text/html, image/jpeg, image/png, text/*, image/*, */* 
Accept-Encoding: x-gzip, x-deflate, gzip, deflate 
Accept-Charset: utf-8, utf-8;q=0.5, *;q=0.5 
Accept-Language: en-US, en 
Host: www.bikedreams.dk 
Cookie: ASPSESSIONIDSCBCRCRA=BLNNMLGDGHGPHPOOLCOEMALH 
 
HTTP/1.1 301 Moved Permanently 
Cache-Control: no-cache 
Pragma: no-cache 
Content-Length: 0 
Content-Type: text/html 
Expires: Sun, 28 Sep 2008 01:01:43 GMT 
Location: http://www.bikedreams.dk/shop/saet%2bdiverse-222c1.html 
Server: Microsoft-IIS/6.0 
X-Powered-By: ASP.NET 
P3P: CP='OUR PSA PSD BUS UNI NID DSP NOI COR' 
Date: Sun, 28 Sep 2008 01:02:43 GMT 
 
GET /shop/saet+diverse-222c1.html HTTP/1.1 
Connection: Keep-Alive 
User-Agent: Mozilla/5.0 (compatible; Konqueror/4.1; Linux) KHTML/4.1.2 (like Gecko) 
Accept: text/html, image/jpeg, image/png, text/*, image/*, */* 
Accept-Encoding: x-gzip, x-deflate, gzip, deflate 
Accept-Charset: utf-8, utf-8;q=0.5, *;q=0.5 
Accept-Language: en-US, en 
Host: www.bikedreams.dk 
Cookie: ASPSESSIONIDSCBCRCRA=BLNNMLGDGHGPHPOOLCOEMALH 
 
HTTP/1.1 301 Moved Permanently 
Cache-Control: no-cache 
Pragma: no-cache 
Content-Length: 0 
Content-Type: text/html 
Expires: Sun, 28 Sep 2008 01:01:43 GMT 
Location: http://www.bikedreams.dk/shop/saet%2bdiverse-222c1.html 
Server: Microsoft-IIS/6.0 
X-Powered-By: ASP.NET 
P3P: CP='OUR PSA PSD BUS UNI NID DSP NOI COR' 
Date: Sun, 28 Sep 2008 01:02:43 GMT 
 
GET /shop/saet+diverse-222c1.html HTTP/1.1 
Connection: Keep-Alive 
User-Agent: Mozilla/5.0 (compatible; Konqueror/4.1; Linux) KHTML/4.1.2 (like Gecko) 
Accept: text/html, image/jpeg, image/png, text/*, image/*, */* 
Accept-Encoding: x-gzip, x-deflate, gzip, deflate 
Accept-Charset: utf-8, utf-8;q=0.5, *;q=0.5 
Accept-Language: en-US, en 
Host: www.bikedreams.dk 
Cookie: ASPSESSIONIDSCBCRCRA=BLNNMLGDGHGPHPOOLCOEMALH 
 
HTTP/1.1 301 Moved Permanently 
Cache-Control: no-cache 
Pragma: no-cache 
Content-Length: 0 
Content-Type: text/html 
Expires: Sun, 28 Sep 2008 01:01:43 GMT 
Location: http://www.bikedreams.dk/shop/saet%2bdiverse-222c1.html 
Server: Microsoft-IIS/6.0 
X-Powered-By: ASP.NET 
P3P: CP='OUR PSA PSD BUS UNI NID DSP NOI COR' 
Date: Sun, 28 Sep 2008 01:02:43 GMT 
 
GET /shop/saet+diverse-222c1.html HTTP/1.1 
Connection: Keep-Alive 
User-Agent: Mozilla/5.0 (compatible; Konqueror/4.1; Linux) KHTML/4.1.2 (like Gecko) 
Accept: text/html, image/jpeg, image/png, text/*, image/*, */* 
Accept-Encoding: x-gzip, x-deflate, gzip, deflate 
Accept-Charset: utf-8, utf-8;q=0.5, *;q=0.5 
Accept-Language: en-US, en 
Host: www.bikedreams.dk 
Cookie: ASPSESSIONIDSCBCRCRA=BLNNMLGDGHGPHPOOLCOEMALH 
 
HTTP/1.1 301 Moved Permanently 
Cache-Control: no-cache 
Pragma: no-cache 
Content-Length: 0 
Content-Type: text/html 
Expires: Sun, 28 Sep 2008 01:01:44 GMT 
Location: http://www.bikedreams.dk/shop/saet%2bdiverse-222c1.html 
Server: Microsoft-IIS/6.0 
X-Powered-By: ASP.NET 
P3P: CP='OUR PSA PSD BUS UNI NID DSP NOI COR' 
Date: Sun, 28 Sep 2008 01:02:43 GMT 
 
Comment 3 Rui G. 2008-09-28 03:24:22 UTC
I would guess it's a good idea to mark the original kurl encoding bug (?) a major one, because a lot of websites are breaking needlessly. 
Comment 4 Thiago Macieira 2008-09-29 18:11:39 UTC
Unfortunately, KUrl and QUrl are strictly compliant with RFC 3986. They are correct in their parsing. This is a bug in the server.

We have to think now whether we should add a workaround and, if so, how.
Comment 5 Anders Lund 2008-09-29 18:45:13 UTC
Without a workaround, KDE 4 is not functional in the real world, however RFC 3986 compliant it is. For me it would mean having to dismiss Konqueror as my web browser, something I would hate to do; I have got to like it over the years ;)
 
Comment 6 Thiago Macieira 2008-09-29 23:12:59 UTC
What's unusable for one is perfectly functional for others. I haven't seen this problem in real life. And I have been using Konqueror as my main and only browser.

For me it's just "oh, that's interesting".

In any case, you should *also* contact the website.
Comment 7 Rui G. 2008-09-30 00:59:51 UTC
In bug 170745 there is another possible example of what is going on here, and it's similar, characters being replaced in the GET command because of encoding.

Konqueror requests  http://www.sejlsport.dk/graphics/ds/DSUngdom/PDF/Pr%F8v%20noget%20nyt%20dokumenter/Invitation_Kerteminde_11.07.08.pdf 

as

http://www.sejlsport.dk/graphics/ds/DSUngdom/PDF/Pr%EF%BF%BDv%20noget%20nyt%20dokumenter/Invitation_Kerteminde_11.07.08.pdf
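The change from %F8 to %EF%BF%BD is consistent with the Latin-1 byte 0xF8 ("ø") being decoded as UTF-8, replaced by U+FFFD, and then re-encoded. That is only an assumption about the cause, not something confirmed in this report; a minimal Python sketch of the round trip, using only the standard library:

```python
from urllib.parse import quote, unquote

# Path fragment as published by the server: "ø" sent as the single Latin-1 byte 0xF8.
original = "Pr%F8v"

# 0xF8 is not valid UTF-8, so decoding with errors="replace" yields U+FFFD.
decoded = unquote(original, encoding="utf-8", errors="replace")

# Re-encoding U+FFFD as UTF-8 produces the three bytes EF BF BD.
reencoded = quote(decoded, encoding="utf-8")

# Decoding and re-encoding as Latin-1 leaves the escape unchanged.
latin1_roundtrip = quote(unquote(original, encoding="latin-1"), encoding="latin-1")

print(reencoded)
print(latin1_roundtrip)
```

The first printed value matches the mangled request above, the second matches the original link.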
Comment 8 Thiago Macieira 2008-09-30 08:06:27 UTC
Hi Rui. That's a different issue. And that one we already know how to solve.
Comment 9 Anders Lund 2008-09-30 20:34:20 UTC
Since it is not a bug, I will mark it a wish: I wish for (konqueror of) KDE 4 to be real world compliant
Comment 10 Thiago Macieira 2008-09-30 20:58:55 UTC
I didn't say it's not a bug. I said it's not a showstopper for me.

But, at this point, I really don't know how to solve this bug.
Comment 11 squan 2008-10-28 22:31:16 UTC
Same problem with most links on ftd.de.
konqueror3, firefox and w3m (and probably many less common browsers) seem not to care too much about RFC compliance and render these without complaints.
So I too wish that you could do as they do and work around this annoying server behaviour for the sake of a better user experience.
Comment 12 Thiago Macieira 2008-11-08 12:00:21 UTC
I'm working on some changes to QUrl that will make it keep the original encoding (as long as it's valid) and a transformation is not requested.
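As a rough illustration of what "keeping the original encoding" can mean (this is not QUrl's code; the function name is hypothetical), RFC 3986 section 6.2.2.2 only calls for decoding escapes of unreserved characters (letters, digits, "-", ".", "_", "~"). A sketch in Python under that assumption:

```python
import re
import string

# Unreserved characters per RFC 3986 section 2.3.
UNRESERVED = set(string.ascii_letters + string.digits + "-._~")

def decode_unreserved_only(url: str) -> str:
    """Decode %XX escapes only when they stand for an unreserved character."""
    def repl(match: re.Match) -> str:
        ch = chr(int(match.group(1), 16))
        # Leave every other escape exactly as the server wrote it.
        return ch if ch in UNRESERVED else match.group(0)
    return re.sub(r"%([0-9A-Fa-f]{2})", repl, url)

# %2D ("-") is unreserved and gets decoded; %2b ("+") is kept untouched.
print(decode_unreserved_only("/shop/saet%2bdiverse%2D222c1.html"))
```

With this kind of conservative handling, the Location header from comment 2 would be requested as sent.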
Comment 13 Thiago Macieira 2008-11-10 15:32:23 UTC
Fixed in Qt 4.5.0. See 
http://trolltech.com/developer/task-tracker/index_html?method=entry&id=234580

However, KDE (KIO or Konqueror) is still triggering a conversion to the wrong URL.
Comment 14 Thiago Macieira 2008-11-10 17:40:43 UTC
This is a problem in Konqueror now.

There's code in Konqueror that sets the URL in the location bar to the "pretty" format. That means it converts anything that isn't specifically reserved to its actual characters. That's how %2b becomes + now.

I don't see how to solve this, aside from showing "ugly URLs". Even if we keep the actual, original URL somewhere besides what we show, the moment that the user clicks on the location bar and presses Enter, all bets are off.
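To make the information loss concrete, here is a small Python sketch. The prettify function is a hypothetical, simplified stand-in for the "pretty" display step (it decodes every escape) and is not Konqueror's code:

```python
from urllib.parse import unquote

def prettify(path: str) -> str:
    """Hypothetical 'pretty URL' step: decode all percent-escapes for display."""
    return unquote(path)

# The redirect target sent by the server in comment 2.
redirect_target = "/shop/saet%2bdiverse-222c1.html"
shown = prettify(redirect_target)
print(shown)

# Two different request paths collapse to the same displayed text, so the
# original spelling cannot be recovered from the location bar alone.
print(prettify("/a+b") == prettify("/a%2bb"))
```

Once the displayed text is what gets requested, the server sees the "+" form again and answers with the same redirect.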
Comment 15 Maksim Orlovich 2008-11-10 18:51:14 UTC
re: comment #14: why would what konqueror does matter, considering the cycle detection is in KIO?
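For readers unfamiliar with redirect cycle detection, the following Python sketch shows the general idea and how a URL rewrite between hops can trigger it. It is not KIO's implementation; follow_redirects, fake_server, and the assumption that the server answers 200 for the "%2b" form are all hypothetical:

```python
def follow_redirects(start, fetch, normalize=lambda u: u, max_hops=20):
    """Follow redirects until a final response, failing on a repeated URL."""
    seen = set()
    url = start
    for _ in range(max_hops):
        if url in seen:
            raise RuntimeError(f"cyclic link detected at {url}")
        seen.add(url)
        status, location = fetch(url)
        if status not in (301, 302, 303, 307, 308):
            return url
        # Any rewriting of the redirect target happens here.
        url = normalize(location)
    raise RuntimeError("too many redirects")

def fake_server(url):
    """Assumed server behaviour: redirect the '+' form to the '%2b' form."""
    if "%2b" in url:
        return 200, None
    return 301, url.replace("+", "%2b")

start = "/shop/saet+diverse-222c1.html"

# Target used as sent: one redirect, then the page loads.
print(follow_redirects(start, fake_server))

# Target rewritten back to '+': the same URL is requested again.
try:
    follow_redirects(start, fake_server, normalize=lambda u: u.replace("%2b", "+"))
except RuntimeError as err:
    print(err)
```

In this model the cycle check itself behaves correctly; the loop only appears when the redirect target is rewritten before the next request.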
Comment 16 Thiago Macieira 2008-11-10 20:06:49 UTC
Ok, then I need to investigate some more.
Comment 17 Dawit Alemayehu 2011-05-04 21:52:40 UTC
Is this still an issue in KDE 4.6? I am unable to reproduce any of the reports above. None of the links I tried gave me any error message in either konqueror + khtml or konqueror + kwebkitpart.
Comment 18 squan 2011-05-04 22:34:57 UTC
> Same problem with most links on ftd.de.
No longer reproducible with ftd.de for some months now.
Comment 19 Dawit Alemayehu 2011-05-04 22:55:10 UTC
(In reply to comment #18)
> > Same problem with most links on ftd.de.
> No longer reproducible with ftd.de for some months now.

Great. Then I am going to close it. Any one of you can feel free to reopen the ticket if the problem persists for you in KDE versions 4.6 or newer.
Comment 20 Dawit Alemayehu 2011-05-04 23:08:59 UTC
*** Bug 166941 has been marked as a duplicate of this bug. ***