It should be possible to remove duplicates. Some sites seem to edit entries, and the next time the feed is updated a duplicate appears with only minor changes.
Created attachment 8502: Patch to implement feature
Created attachment 8503: updated patch - better heuristics for title, also does URL
Any reason why this is not yet committed and also not in the feature plan?
On Saturday 18 December 2004 10:01, Stephan Binner wrote:
> Any reason why this is not yet committed and also not in the feature plan?
I think the patch still needs more work, and we're not sure it's the best approach.
The dupes problem was reduced significantly by fixing it for RSS 1.0 feeds. Apart from that, I don't think we should delete dupes (semi-)automatically using a heuristic that is not 100% reliable.
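For illustration only, here is a rough sketch of what a title and URL heuristic of the kind discussed in this report might look like. It is not the attached patch, and every name in it (normalize_url, title_similarity, find_probable_duplicates, the 0.9 threshold, the dict layout of an article) is a hypothetical choice made for this example. In line with the concern about unsafe heuristics, it only reports candidate pairs and never deletes anything.

```python
from difflib import SequenceMatcher
from urllib.parse import urlsplit


def normalize_url(url):
    """Lowercase the host, drop the scheme, fragment and trailing slash."""
    parts = urlsplit(url.strip())
    return parts.netloc.lower() + parts.path.rstrip("/") + "?" + parts.query


def title_similarity(a, b):
    """Return a ratio in [0, 1] after lowercasing and collapsing whitespace."""
    norm_a = " ".join(a.lower().split())
    norm_b = " ".join(b.lower().split())
    return SequenceMatcher(None, norm_a, norm_b).ratio()


def find_probable_duplicates(articles, threshold=0.9):
    """Return index pairs (i, j), i < j, that look like the same entry.

    Assumption: each article is a dict with "title" and "url" keys.
    Nothing is deleted; the pairs are only candidates for review.
    """
    pairs = []
    for i in range(len(articles)):
        for j in range(i + 1, len(articles)):
            a, b = articles[i], articles[j]
            same_url = normalize_url(a["url"]) == normalize_url(b["url"])
            similar = title_similarity(a["title"], b["title"]) >= threshold
            if same_url and similar:
                pairs.append((i, j))
    return pairs
```

Requiring both a matching URL and a near-identical title keeps false positives down, at the cost of missing edits that change the link. A threshold like this can still misfire, which is why the sketch stops at flagging pairs.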