Bug 134076 - invalid xml when loading playlist (utf 8 related)
Summary: invalid xml when loading playlist (utf 8 related)
Status: RESOLVED DUPLICATE of bug 133613
Alias: None
Product: amarok
Classification: Applications
Component: general (show other bugs)
Version: 1.4.3
Platform: Mandriva RPMs Linux
: NOR normal
Target Milestone: ---
Assignee: Amarok Developers
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2006-09-14 17:30 UTC by Nicolas Pomarede
Modified: 2007-02-11 01:51 UTC (History)
0 users

See Also:
Latest Commit:
Version Fixed In:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Nicolas Pomarede 2006-09-14 17:30:19 UTC
Version:           1.4.3 (using KDE KDE 3.5.4)
Installed from:    Mandriva RPMs
OS:                Linux

Hello,

it seems that 1.4.3 has a regression regarding restoring the current.xml
playlist file.

With 1.4.2, I had no problem with my playlist. But when 1.4.3 starts, I get the
message "invalid XML, please contact the authors" and the playlist is truncated after loading.

When looking at the last line loaded, the problem seems to be that the artist
tag doesn't accept non utf8 encoded strings in some case.

The problem is that these files are added through Amarok, so it should be
Amarok's jobs to convert the ID3 tags to proper UTF8 before saving them in
the XML file, else it seems it will break amarok's xml parser at 
the next start.

Here's the corresponding section of the xml file :

 <item url="file:///home/npomarede/Music/Chloe%20-%20I%20Hate%20Dancing/
09%20-%20Chlo%E9%20-%20Take%20Care.mp3" uniqueid="66371db2cee2d986b17b407ac99a
683c" >
  <Filename>09 - Chloé - Take Care.mp3</Filename>
  <Title>Take Care</Title>
  <Artist>Chloé</Artist>
  <Composer></Composer>
  <Year>2004</Year>
  <Album>Chloé - I Hate Dancing</Album>
  <DiscNumber>0</DiscNumber>
  <Track>9</Track>
...
  </item>


It seems that the parsing is aborted if an accent (é in that case) is used as
the last character of the value field of an xml tag (Artist in that case).

After loading the playlist, the last line will contain "Chloé</Artist>" in the
Artist column, instead of "Chloé".

I don't think it's really easy to ask users to reencode all their mp3's ID3
fields with utf8, so is there a way to workaround this (as it worked with 1.4.2) ?

In case ID3 tag are encoded using latin1, shouldn't amarok use UTF8 to save this tag
in the current.xml file which is marked as UTF8 instead of mixing latin1 which
causes problems ?


Regards
Comment 1 Mark Kretschmann 2006-09-14 17:45:58 UTC

*** This bug has been marked as a duplicate of 133613 ***
Comment 2 ahmed 2007-02-11 01:51:07 UTC
hello every body 
im using suse linux 10.1  im a newbie and i dont know much english
for my mp3 files the properties of the file such as artist name when i try to enter it or display it , it does not appear in the right way ( just some strange symbols) i guess you should know that it is not written in english (those properties i mean) please fix that by supporting utf8 or what ever you see good 
to display my local language in proper way
thanks for being patient