Bug 455089

Summary: Certain characters (like ü,ö,ä,ß but also ' ) in CD Metadata replaced with ## in ripped .flac files
Product: [Applications] k3b Reporter: Firlaev-Hans <firlaevhans.fiete>
Component: CopyingAssignee: k3b developers <k3b>
Status: REPORTED ---    
Severity: normal CC: aacid, michalm, trueg
Priority: NOR    
Version: 22.04.1   
Target Milestone: ---   
Platform: Arch Linux   
OS: Linux   
Latest Commit: Version Fixed In:
Sentry Crash Report:

Description Firlaev-Hans 2022-06-09 16:16:28 UTC
SUMMARY
I ripped a bunch of audio CDs to flac and noticed that while the filenames of the resulting files where normal, their Metadata had all sorts of characters replaced, e. g. "Don't Stop Me Now" became "Don##t stop me now".
I was able to correct these mistakes by editing the metadata in other software, so clearly this is not a limitation of Flac's Metadata format.

STEPS TO REPRODUCE
1. Find an audio CD that has characters such as ' or üöäß in its album title, artist name or track names
2. Rip it with K3b
3. Take a look at the metadata of the resulting files using e. g. VLC

OBSERVED RESULT
The characters are replaced with ## in the metadata but the filename is usually fine.

EXPECTED RESULT
The metadata should look normal

SOFTWARE/OS VERSIONS
Operating System: Arch Linux
KDE Plasma Version: 5.24.5
KDE Frameworks Version: 5.94.0
Qt Version: 5.15.4
Kernel Version: 5.18.2-zen1-1-zen (64-bit)
Graphics Platform: Wayland
Comment 1 Albert Astals Cid 2022-06-14 09:57:20 UTC
Can't seem to reproduce :/

tsdgeos@xps:~/musica/AC_DC - Iron Man 2:$ ls
02 - Rock 'n' Roll Damnation.flac

tsdgeos@xps:~/musica/AC_DC - Iron Man 2:$ metaflac --list 02\ -\ Rock\ \'n\'\ Roll\ Damnation.flac 
METADATA block #0
  type: 0 (STREAMINFO)
  is last: false
  length: 34
  minimum blocksize: 4096 samples
  maximum blocksize: 4096 samples
  minimum framesize: 16 bytes
  maximum framesize: 13711 bytes
  sample_rate: 44100 Hz
  channels: 2
  bits-per-sample: 16
  total samples: 9630852
  MD5 signature: 24d7c6bcd32b693b32f544ec7d71d931
METADATA block #1
  type: 4 (VORBIS_COMMENT)
  is last: false
  length: 140
  vendor string: reference libFLAC 1.3.4 20220220
  comments: 5
    comment[0]: ARTIST=AC/DC
    comment[1]: TITLE=Rock 'n' Roll Damnation
    comment[2]: TRACKNUMBER=02
    comment[3]: DATE=2010
    comment[4]: ALBUM=Iron Man 2
METADATA block #2
  type: 1 (PADDING)
  is last: true
  length: 8192
Comment 2 Firlaev-Hans 2022-06-14 11:37:42 UTC
(In reply to Albert Astals Cid from comment #1)
> Can't seem to reproduce :/

Weird. I wonder what's going wrong on my system then, because it happens literally every time there is a special character in the metadata.

I do seem to recall that this hasn't always been the case on my system either though, as I'm sure I have ripped CDs with '-characters before without problems. Maybe it has something to do with the tools / libraries used in the background?

Or maybe it is in some way related to the locale? My system is set to german.