Bug 35130 - web archiver: please include original URL in .war archives (PATCH)
Summary: web archiver: please include original URL in .war archives (PATCH)
Status: RESOLVED FIXED
Alias: None
Product: konqueror
Classification: Applications
Component: webarchiverplugin (show other bugs)
Version: unspecified
Platform: Debian testing Linux
: NOR wishlist
Target Milestone: ---
Assignee: Malte Starostik
URL:
Keywords:
: 46818 (view as bug list)
Depends on:
Blocks:
 
Reported: 2001-11-20 07:48 UTC by Ben Burton
Modified: 2004-02-22 21:45 UTC (History)
1 user (show)

See Also:
Latest Commit:
Version Fixed In:


Attachments
implement the feature and fix a bug (619 bytes, patch)
2004-02-22 21:40 UTC, Daniel Naber
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Ben Burton 2001-11-20 07:33:36 UTC
(*** This bug was imported into bugs.kde.org ***)

Package:           konqueror
Version:           KDE 2.2.1 
Severity:          wishlist
Installed from:    Debian Packages
Compiler:          Not Specified
OS:                Linux
OS/Compiler notes: Not Specified


Hi.  Received this through the debian BTS. - Ben.

Package: konq-plugins
Version: 2.2.1-4

Subject: konq-plugins: It would be useful if there would be an <base
href="http://www.source.com/directory/"> tag added within the web
archive (*.war) index.html file so that the relative links from the
archived document work properly.


(Submitted via bugs.kde.org)
Comment 1 Ben Burton 2002-10-30 23:25:35 UTC
Further discussion on this topic between myself and the original bug submitter:  
  
> Hmm, so on further thought I'm going to close this bug.  A web archive  
> includes all images on a web page, so as I understand it, adding a base tag  
> would break the image URLs when viewing the web archive offline.  It should  
> be more important to keep the archive working with itself than have it work  
> with external web sources.  
  
My idea was to have some means of absolute information about the original 
website URL included in the (.war) archive. This is valuable information 
for future reference, as most web pages do not have any absolute URL coded into 
them. 
 
 
There are two other solutions to this problem: 
 
1) As it it done in M$ IE: 
include an additional line after the <!DOCTYPE ...> declaration i.e. line with a 
comment about the original source address.: 
 
<!-- saved from 
url=(0049)http://osnews.com/story.php?news_id=2016&page=all --> 
 
 
2) Include a 
 
<LINK rel="START" 
title="Comment by Konqueror: The original WWW address of this page while 
downloaded" 
type="text/html" 
href="www.source.com/somewhwere/somepage.html"> 
 
reference that was not present in the original page. That would suit the 
purpose of an reference for future needs. But we modify the original page 
which might not be acceptable. 
 
Comment 2 undisclosed 2003-06-21 07:38:45 UTC
I suppose that including original URL in .war archive can be useful for another reason. 
Sometimes I can't find the origin URL of some of my .war files.  
Including original URL as standalone file (supposedly in .kdelink format) in .war give 
possibility quickly returns to those URL. 
 
Comment 3 Stephan Binner 2003-07-21 12:44:24 UTC
*** Bug 46818 has been marked as a duplicate of this bug. ***
Comment 4 Stephan Binner 2003-07-24 12:21:22 UTC
undo confirm 
Comment 5 Manuel Amador (Rudd-O) 2003-12-04 16:24:52 UTC
I second the LINK rel start idea.  When konq gets a navigation bar like mozilla (next, back, forward, up, and the like), it'll be extremely useful.  For now, it can be used directly from the war archive, or perhaps displayed in the URL bar or in a floating CSS box on top or on bottom of the page.
Comment 6 Daniel Naber 2004-02-22 17:42:20 UTC
I'm attaching a simple patch which adds the URL. It also removes
the doctype -- it doesn't make sense, as the page isn't guaranteed
to comply with that doctype.
Comment 7 Daniel Naber 2004-02-22 21:40:16 UTC
Created attachment 4845 [details]
implement the feature and fix a bug
Comment 8 Daniel Naber 2004-02-22 21:45:14 UTC
implementein HEAD (<!-- saved from: ... -->), will backport to 3.2 if it's okay.