Bug 317078 - sse _mm_madd_epi16, undefined * 0 should not yield uninitialised result
Summary: sse _mm_madd_epi16, undefined * 0 should not yield uninitialised result
Status: REPORTED
Alias: None
Product: valgrind
Classification: Developer tools
Component: memcheck (show other bugs)
Version: 3.8.0
Platform: Other Linux
: NOR normal
Target Milestone: ---
Assignee: Julian Seward
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2013-03-20 14:23 UTC by ttyridal
Modified: 2016-03-17 10:33 UTC (History)
1 user (show)

See Also:
Latest Commit:
Version Fixed In:
Sentry Crash Report:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description ttyridal 2013-03-20 14:23:48 UTC
memcheck seems to not understand that 0 multiplied with whatever is still 0 when using simd instructions. 

valgrind version: 3.8.1

example program to provoke the error:

#include "emmintrin.h"
#include <stdio.h>
#include <valgrind/memcheck.h>
int main(void)
{
    __m128i a;
    __m128i r = _mm_madd_epi16(a,_mm_setzero_si128());
    
    if (VALGRIND_CHECK_MEM_IS_DEFINED(&r,16))
        printf("\n\nVALGRIND BUG! Memory is defined and 0 (undefined*0 == 0)\n\n");
    return _mm_cvtsi128_si32(r);
}   

gcc -msse -o test test.c && valgrind ./test
Observe that VALGRIND_CHECK_MEM_IS_DEFINED will report an error,  as well as the return value is deemed uninitialised.

expected result: no printf and no valgrind/memcheck errors.

A typical use-case is a filter function with padded data and 0 coefficients for edge handling.

Reproducible: Always
Comment 1 Julian Seward 2013-03-26 09:30:37 UTC
True, but probably difficult (impossible?) to fix without some performance loss,
since we'd need to insert a run time check for zero-ness of the arguments, and
special-case the output if they are found to be zero.
Comment 2 Dirk Farin 2016-03-17 10:33:15 UTC
Note that this bug also shows for vectors with only some elements being zero. I am affected by this bug too, multiplying two vectors of length 15 (last coefficient in one vector is 0). Valgrind reports this as error even though it is not.