386105 – K3b FFmpeg decode plugin seems to be broken, produces noise

Bug 386105 - K3b FFmpeg decode plugin seems to be broken, produces noise

Summary: K3b FFmpeg decode plugin seems to be broken, produces noise

Status:	RESOLVED FIXED

Alias:	None

Product:	k3b
Classification:	Applications
Component:	Plugins (show other bugs)
Version:	17.08.0
Platform:	Kubuntu Linux

Importance:	NOR normal
Target Milestone:	---
Assignee:	k3b developers

URL:
Keywords:

Depends on:
Blocks:

Reported:	2017-10-23 12:40 UTC by blaze
Modified:	2018-04-03 14:50 UTC (History)
CC List:	5 users (show)

See Also:
Latest Commit:	https://commits.kde.org/k3b/059d0da2e2c6865c93b7d5726d8f19830074be57
Version Fixed In:
Sentry Crash Report:

Attachments
Add an attachment

Note You need to log in before you can comment on or make changes to this bug.

Description blaze 2017-10-23 12:40:30 UTC

The output file contains noise. See the screenshot https://i.imgur.com/SByZogg.png
The amount of noise increases if input file has lower bitrate.
The audio itself has twice(?) higher pitch than original and also seems to be faster.
Timing is ok.

Also ffmpeg plugin currently uses deprecated API. This has to be fixed too at some point.



Distro: Kubuntu 17.10 (Artful)

$ ffmpeg -version
ffmpeg version 3.3.4-2 Copyright (c) 2000-2017 the FFmpeg developers
built with gcc 7 (Ubuntu 7.2.0-8ubuntu2)
configuration: --prefix=/usr --extra-version=2 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --enable-gpl --disable-stripping --enable-avresample --enable-avisynth --enable-gnutls --enable-ladspa --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libmp3lame --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-omx --enable-openal --enable-opengl --enable-sdl2 --enable-libdc1394 --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libopencv --enable-libx264 --enable-shared
libavutil      55. 58.100 / 55. 58.100
libavcodec     57. 89.100 / 57. 89.100
libavformat    57. 71.100 / 57. 71.100
libavdevice    57.  6.100 / 57.  6.100
libavfilter     6. 82.100 /  6. 82.100
libavresample   3.  5.  0 /  3.  5.  0
libswscale      4.  6.100 /  4.  6.100
libswresample   2.  7.100 /  2.  7.100
libpostproc    54.  5.100 / 54.  5.100

Comment 1 blaze 2017-10-23 13:02:27 UTC

This is example FFmpeg decoding code
https://www.ffmpeg.org/doxygen/trunk/demuxing_decoding_8c-example.html

It has some differences from what is present in k3b ffmpeg plugin.

Comment 2 blaze 2017-10-27 13:35:54 UTC

Another very characteristic screenshot:
https://i.imgur.com/l9wpxYG.png
comparison between corrupted file produced by k3b ffmpeg plugin (above) and normal (below)

Comment 3 blaze 2017-10-27 13:41:27 UTC

plugins/decoder/ffmpeg/k3bffmpegwrapper.cpp:

int K3bFFMpegFile::read( char* buf, int bufLen )
{

// ...

    // TODO: only swap if needed
    for( int i = 0; i < len-1; i+=2 ) {
        char a = buf[i];
        buf[i] = buf[i+1];
        buf[i+1] = a;
    }

// ...

}

What this code actually does? It looks the most suspicious.
And what "if needed" does mean?

Comment 4 Leslie Zhai 2017-10-30 02:13:01 UTC

Hi blaze,

Thanks for your report!

> Also ffmpeg plugin currently uses deprecated API

I migrated libAV before https://github.com/isoft-linux/libKeyFrame/blob/master/libKeyFrame.c#L295  and welcome the patch about migration :)

> https://i.imgur.com/l9wpxYG.png

That is very good DIFF! it is clear that K3B ffmpeg plugin bring in the noise. but it is not easy to use FFT and Band-pass filter to filter it ...

> What this code actually does? It looks the most suspicious. And what "if needed" does mean?

I have no idea why need to SWAP, perhaps it is better to ask the original author :) https://github.com/KDE/k3b/commit/97f72a28ef87ef62f89967e237f441e03d3b5b3b#diff-edc47aa8fac740c80253cd4bec4ba15dR207


You can test to comment the SWAP code, then rebuild K3B to verify whether or not produce the noise.

Regards,
Leslie Zhai

Comment 5 blaze 2017-10-30 07:09:18 UTC

> You can test to comment the SWAP code, then rebuild K3B to verify whether or not produce the noise

Did it but with that it's even worse. So it's not the case. I'm worriyng however about the "if needed" part.

The general idea is that decoding algorythm is broken (see the example usages of ffmpeg libavcodec). It decodes packets but does nothing to frames.

Also why K3bFFMpegFile::fillOutputBuffer() method is being called recursively? It's better to use while loop here.

Also it would be reasonable to get rid of all the legacy ffmpeg support code. The most part of it exists since KDE3 times. No point in having it now.

Comment 6 blaze 2018-03-10 06:36:19 UTC

I found that every frame produced by fillOutputBuffer() contains (some patch of) corrupted data just right after the middle.

Also citing the FFMPEG docs:

>Some decoders (those marked with AV_CODEC_CAP_DELAY) have a delay between input and output. This means that for some packets they will not immediately produce decoded output and need to be flushed at the end of decoding to get all the decoded data. Flushing is done by calling this function (avcodec_decode_audio4()) with packets with avpkt->data set to NULL and avpkt->size set to 0 until it stops returning samples. It is safe to flush even those decoders that are not marked with AV_CODEC_CAP_DELAY, then no samples will be returned.

Comment 7 blaze 2018-03-10 08:34:51 UTC

Experimenting further I found that if the calculated buffer size returned by av_samples_get_buffer_size() is divided by two then the resulting waveform looks normal (no noise), but the sound is twice faster than it should be. So now the question is how to slow down or stretch it. :D

Comment 8 blaze 2018-03-10 08:57:32 UTC

One more important addition: I think that the second half of every frame is just some memory garbage as I found in previous step. But you not supposed to divide the frame size by two. The actual string where things go wrong is:

> d->outputBufferPos = reinterpret_cast<char*>( d->frame->data[0] );

This is not the way how you should read the stereo data. Maybe the second channel can be accessed with d->frame->data[1]? Not really sure, just guessing.

Comment 9 blaze 2018-03-10 10:41:48 UTC

And one more: this plugin does not decode wma1, wma2 and aac at all, produces total garbage. And these formats supposed to be "tested", lol, according to the source comments.

Comment 10 blaze 2018-03-24 09:53:46 UTC

Created a phabricator diff here https://phabricator.kde.org/D11634
It fixes decoding for some lossless formats.
Compressed formats like WMA and AAC still have to be attended (probably need flushing).

Comment 11 blaze 2018-03-31 06:59:34 UTC

So here's my code which converts float audio to 16bit (for WMA and AAC):

>        for(int sample=0; sample<nb_s; sample++) {
>            for(int ch=0; ch<nb_ch; ch++) {
>                float smpl = *(reinterpret_cast<float*>(
>                    d->frame->extended_data[ch] + sample*4)) * 32767.0;
>                smpl = smpl > 32767.0 ? 32767.0 : smpl;
>                smpl = smpl < -32767.0 ? -32767.0 : smpl;
>                int16_t smpl2 = (((int16_t)(smpl + 32768.5)) - 32768);
>                ::memcpy(d->outputBufferPos + (sample * nb_ch + ch) * 2,
>                         &smpl2,
>                         2);
>            }
>        }

It works nicely: no noise or clipping. Made it just out of curiosity. But it's better to use libswresample. When it's done right it will cover all possible cases for the input audio. I'll get to that later. Stay tuned for patches.

Comment 12 blaze 2018-04-02 15:49:03 UTC

After accepting the latest revision of https://phabricator.kde.org/D11634 this bug can be considered as fixed in the upstream for future releases.

Comment 13 Leslie Zhai 2018-04-03 14:50:00 UTC

Git commit 059d0da2e2c6865c93b7d5726d8f19830074be57 by Leslie Zhai.
Committed on 03/04/2018 at 14:45.
Pushed by lesliezhai into branch 'master'.

[plugins] K3b FFMpeg Decoder Plugin Fix

A patch by Mikołaj Płomieński!

Reviewers: lesliezhai, anthonyfieroni

Differential Revision: https://phabricator.kde.org/D11634

M  +56   -23   plugins/decoder/ffmpeg/k3bffmpegwrapper.cpp

https://commits.kde.org/k3b/059d0da2e2c6865c93b7d5726d8f19830074be57