Bug 238745

Summary: 3.5.0 Make fails on PPC Altivec opcodes, though configure says "Altivec off"
Product: [Developer tools] valgrind Reporter: Steve M <sm>
Component: generalAssignee: Julian Seward <jseward>
Status: RESOLVED FIXED    
Severity: normal CC: bart.vanassche+kde
Priority: NOR    
Version: 3.5.0   
Target Milestone: ---   
Platform: Compiled Sources   
OS: Linux   
Latest Commit: Version Fixed In:
Sentry Crash Report:

Description Steve M 2010-05-25 00:35:13 UTC
Version:           3.5.0 (using KDE 1.2) 
OS:                Linux

There seems to be no way to turn off generation of Altivec opcodes for non-Altivec PPC chips in 3.5.0 (downloaded from site).  Whether cross-compiling on x86 or native-compiling on the target MPC5200 PPC system (with gcc 4.2.2, kernel 2.6.29.3), configure runs, but make fails with:

===========================================================
gcc -I.. -I../include -I../VEX/pub -DVGA_ppc32=1 -DVGO_linux=1 -DVGP_ppc32_linux=1 -I../coregrind -DVG_LIBDIR="\"/usr/local/lib/valgrind"\" -DVG_PLATFORM="\"ppc32-linux\"" -m32 -g -Wno-long-long -c -o libcoregrind_ppc32_linux_a-dispatch-ppc32-linux.o `test -f 'm_dispatch/dispatch-ppc32-linux.S' || echo './'`m_dispatch/dispatch-ppc32-linux.S
m_dispatch/dispatch-ppc32-linux.S: Assembler messages:
m_dispatch/dispatch-ppc32-linux.S:142: Error: Unrecognized opcode: `stvx'
m_dispatch/dispatch-ppc32-linux.S:144: Error: Unrecognized opcode: `stvx'
m_dispatch/dispatch-ppc32-linux.S:146: Error: Unrecognized opcode: `stvx'
m_dispatch/dispatch-ppc32-linux.S:148: Error: Unrecognized opcode: `stvx'
m_dispatch/dispatch-ppc32-linux.S:150: Error: Unrecognized opcode: `stvx'
m_dispatch/dispatch-ppc32-linux.S:152: Error: Unrecognized opcode: `stvx'
m_dispatch/dispatch-ppc32-linux.S:154: Error: Unrecognized opcode: `stvx'
m_dispatch/dispatch-ppc32-linux.S:156: Error: Unrecognized opcode: `stvx'
m_dispatch/dispatch-ppc32-linux.S:158: Error: Unrecognized opcode: `stvx'
m_dispatch/dispatch-ppc32-linux.S:160: Error: Unrecognized opcode: `stvx'
m_dispatch/dispatch-ppc32-linux.S:162: Error: Unrecognized opcode: `stvx'
m_dispatch/dispatch-ppc32-linux.S:164: Error: Unrecognized opcode: `stvx'
m_dispatch/dispatch-ppc32-linux.S:221: Error: Unrecognized opcode: `vspltisw'
m_dispatch/dispatch-ppc32-linux.S:222: Error: Unrecognized opcode: `mtvscr'
m_dispatch/dispatch-ppc32-linux.S:420: Error: Unrecognized opcode: `vspltisw'
m_dispatch/dispatch-ppc32-linux.S:421: Error: Unrecognized opcode: `vspltisw'
m_dispatch/dispatch-ppc32-linux.S:422: Error: Unrecognized opcode: `vsldoi'
m_dispatch/dispatch-ppc32-linux.S:424: Error: Unrecognized opcode: `mfvscr'
m_dispatch/dispatch-ppc32-linux.S:425: Error: Unrecognized opcode: `vand'
m_dispatch/dispatch-ppc32-linux.S:426: Error: Unrecognized opcode: `vspltw'
m_dispatch/dispatch-ppc32-linux.S:427: Error: Unrecognized opcode: `vcmpequw.'
m_dispatch/dispatch-ppc32-linux.S:511: Error: Unrecognized opcode: `lvx'
m_dispatch/dispatch-ppc32-linux.S:513: Error: Unrecognized opcode: `lvx'
m_dispatch/dispatch-ppc32-linux.S:515: Error: Unrecognized opcode: `lvx'
m_dispatch/dispatch-ppc32-linux.S:517: Error: Unrecognized opcode: `lvx'
m_dispatch/dispatch-ppc32-linux.S:519: Error: Unrecognized opcode: `lvx'
m_dispatch/dispatch-ppc32-linux.S:521: Error: Unrecognized opcode: `lvx'
m_dispatch/dispatch-ppc32-linux.S:523: Error: Unrecognized opcode: `lvx'
m_dispatch/dispatch-ppc32-linux.S:525: Error: Unrecognized opcode: `lvx'
m_dispatch/dispatch-ppc32-linux.S:527: Error: Unrecognized opcode: `lvx'
m_dispatch/dispatch-ppc32-linux.S:529: Error: Unrecognized opcode: `lvx'
m_dispatch/dispatch-ppc32-linux.S:531: Error: Unrecognized opcode: `lvx'
m_dispatch/dispatch-ppc32-linux.S:533: Error: Unrecognized opcode: `lvx'
===========================================================

I have persuaded configure to report "checking for Altivec... no" (it originally succeeded, even though the MPC5200 CPU doesn't support it.  Honestly, the only way I've been able to achieve this is to rename "altivec.h" to another name.  It's probably blindingly obvious, but I found no combination of command line options or environmental variables that achieved a "no Altivec" or avoided the assembler error.

Reproducible: Always

Steps to Reproduce:
* Decompress valgrind-3.5.0.tar.bz2.
* On a PPC (MPC5200) system, gcc 4.2.2, and kernel 2.6.29.3...
* Rename the file "altivec.h" to something else, so the Altivec test fails.
* Invoke ./configure --disable-tls
* Invoke make.
* See the error in the Details section.

Actual Results:  
The assembler error listed in the Details section.

Expected Results:  
A successful build.

Thanks.
Comment 1 Steve M 2010-05-25 01:38:48 UTC
It turns out the offending opcodes are in what appears to be hand-rolled code in 'm_dispatch/dispatch-ppc32-linux.S', for instance:

        /* VRSAVE save word : 32 bytes */
        mfspr   5,256         /* vrsave reg is spr number 256 */
        stw     5,244(1)

        /* Alignment padding : 4 bytes */

        /* Vector reg save area (quadword aligned) : 192 bytes */
        li      5,224
        stvx    31,5,1
        li      5,208
        stvx    30,5,1
        li      5,192
        stvx    29,5,1
        li      5,176
        stvx    28,5,1
        li      5,160
        stvx    27,5,1
        li      5,144
        stvx    26,5,1
        li      5,128
        stvx    25,5,1
        li      5,112
        stvx    25,5,1
        li      5,96
        stvx    23,5,1
        li      5,80
        stvx    22,5,1
        li      5,64
        stvx    21,5,1
        li      5,48
        stvx    20,5,1

These and the other SIMD instructions in this file make the gcc 4.2.2 PPC assembler choke.
Comment 2 Bart Van Assche 2010-05-25 08:40:09 UTC
Valgrind works fine on the PPC440GX, which doesn't support Altivec either. Which assembler are you using (as -v </dev/null) ?
Comment 3 Steve M 2010-05-25 16:10:20 UTC
as -v:
GNU assembler version 2.17.90-1 (ppc-linux) using BFD version (GNU Binutils) 2.17.90-1.20070806

This PPC Linux version was cross-compiled with the ELDK (Embedded Linux Dev Kit - http://ftp.denx.de/pub/eldk/4.2/ppc-linux-x86/).

Freescale's MPC5200 uses an e300 core, while their e600 core is required for Altivec support.  Your Applied Micro PPC440GX seems to be a much nicer chip, and Googling "PPC440gx stvx" turns up enough hits to suggest that the opcode is supported.

I just found another report of this issue on the ELDK site.  I'm off to read (it's a huge page).

Thanks.
Comment 4 Steve M 2010-05-25 16:42:04 UTC
I only found that others are currently having the same problem, with no solution.

I found a Freescale publication listing the instruction set of the e300 core (found in the MPC5200) at http://www.freescale.com/files/32bit/doc/ref_manual/e300coreRM.pdf.  "stvx" is not a supported instruction.  I tried a few others like "vand" and "vspltw"... also not supported on the MPC5200.

It seems that 'm_dispatch/dispatch-ppc32-linux.S' is not friendly to non-Altivec cores.
Comment 5 Bart Van Assche 2010-05-25 17:15:51 UTC
(In reply to comment #3)
> Freescale's MPC5200 uses an e300 core, while their e600 core is required for
> Altivec support.  Your Applied Micro PPC440GX seems to be a much nicer chip,
> and Googling "PPC440gx stvx" turns up enough hits to suggest that the opcode is
> supported.

If you are familiar enough with PPC assembly you can replace the stvx instructions by something that is understood by the e300. Valgrind is an open source project - contributions are welcome.
Comment 6 Steve M 2010-05-25 18:25:32 UTC
I diffed "dispatch-ppc32-linux.S" with "dispatch-ppc32-aix5.S".  The latter has four blocks of code commented out with the very helpful notation "// Sigh.  AIX 5.2 has no idea that Altivec exists."  Moving those four changes to the former file allowed a successful build on my non-Altivec target.  (It would appear that the Altivec code is merely saving and restoring the state of the Altivec unit.)

I haven't been able to test the built code yet.  I tried "valgrind ls" and got:

==21128== Memcheck, a memory error detector
==21128== Copyright (C) 2002-2009, and GNU GPL'd, by Julian Seward et al.
==21128== Using Valgrind-3.5.0 and LibVEX; rerun with -h for copyright info
==21128== Command: ls
==21128==

valgrind:  Fatal error at startup: a function redirection
valgrind:  which is mandatory for this platform-tool combination
valgrind:  cannot be set up.  Details of the redirection are:
valgrind:
valgrind:  A must-be-redirected function
valgrind:  whose name matches the pattern:      strlen
valgrind:  in an object with soname matching:   ld.so.1
valgrind:  was not found whilst processing
valgrind:  symbols from the object with soname: ld.so.1
valgrind:
valgrind:  Possible fixes: (1, short term): install glibc's debuginfo
valgrind:  package on this machine.  (2, longer term): ask the packagers
valgrind:  for your Linux distribution to please in future ship a non-
valgrind:  stripped ld.so (or whatever the dynamic linker .so is called)
valgrind:  that exports the above-named function using the standard
valgrind:  calling conventions for this platform.
valgrind:
valgrind:  Cannot continue -- exiting now.  Sorry.


Option (1) isn't available with this distro, so I'm working on (2).
Comment 7 Bart Van Assche 2010-05-25 18:33:01 UTC
(In reply to comment #6)
> [ ... ]
> valgrind:  Fatal error at startup: a function redirection
> valgrind:  which is mandatory for this platform-tool combination
> valgrind:  cannot be set up.  Details of the redirection are:
> valgrind:
> valgrind:  A must-be-redirected function
> valgrind:  whose name matches the pattern:      strlen
> valgrind:  in an object with soname matching:   ld.so.1
> valgrind:  was not found whilst processing
> valgrind:  symbols from the object with soname: ld.so.1
> valgrind:
> valgrind:  Possible fixes: (1, short term): install glibc's debuginfo
> valgrind:  package on this machine.  (2, longer term): ask the packagers
> valgrind:  for your Linux distribution to please in future ship a non-
> valgrind:  stripped ld.so (or whatever the dynamic linker .so is called)
> valgrind:  that exports the above-named function using the standard
> valgrind:  calling conventions for this platform.
> valgrind:
> valgrind:  Cannot continue -- exiting now.  Sorry.
> 
> 
> Option (1) isn't available with this distro, so I'm working on (2).

Regarding option (1): you don't have to copy the debuginfo packages on a local storage medium of the embedded device. Mounting these packages over NFS is also fine.
Comment 8 Steve M 2010-05-25 19:20:33 UTC
Thanks, that helped.  I ran the "example" test program from valgrind.org:

============================
#include <stdlib.h>

  void f(void)
  {
     int* x = malloc(10 * sizeof(int));
     x[10] = 0;        // problem 1: heap block overrun
  }                    // problem 2: memory leak -- x not freed

  int main(void)
  {
     f();
     return 0;
  }
============================

... and got the expected output:

============================
==21326== Memcheck, a memory error detector
==21326== Copyright (C) 2002-2009, and GNU GPL'd, by Julian Seward et al.
==21326== Using Valgrind-3.5.0 and LibVEX; rerun with -h for copyright info
==21326== Command: example
==21326==
==21326== Invalid write of size 4
==21326==    at 0x1000045C: f (example.c:6)
==21326==    by 0x1000048F: main (example.c:11)
==21326==  Address 0x4060050 is 0 bytes after a block of size 40 alloc'd
==21326==    at 0xFF5A290: malloc (vg_replace_malloc.c:195)
==21326==    by 0x10000447: f (example.c:5)
==21326==    by 0x1000048F: main (example.c:11)
==21326==
Example program exiting now...
==21326==
==21326== HEAP SUMMARY:
==21326==     in use at exit: 40 bytes in 1 blocks
==21326==   total heap usage: 1 allocs, 0 frees, 40 bytes allocated
==21326==
==21326== LEAK SUMMARY:
==21326==    definitely lost: 40 bytes in 1 blocks
==21326==    indirectly lost: 0 bytes in 0 blocks
==21326==      possibly lost: 0 bytes in 0 blocks
==21326==    still reachable: 0 bytes in 0 blocks
==21326==         suppressed: 0 bytes in 0 blocks
==21326== Rerun with --leak-check=full to see details of leaked memory
==21326==
==21326== For counts of detected and suppressed errors, rerun with: -v
==21326== ERROR SUMMARY: 1 errors from 1 contexts (suppressed: 5 from 3)
============================

Now to run it on some production code...
Comment 9 Bart Van Assche 2010-06-13 16:50:32 UTC
Should have been fixed on the trunk in r11173. Please verify.
Comment 10 Bart Van Assche 2010-06-18 12:34:25 UTC
Closed this issue since no feedback has been received in time.
Comment 11 Steve M 2010-06-18 14:29:32 UTC
I have been trying to test the code in the repository.  My target doesn't have autoconf, so I've been getting a cross-compiler configured and figuring out how to do an RPM.  Sorry it's taking too long.
Comment 12 Bart Van Assche 2010-06-18 19:27:06 UTC
(In reply to comment #11)
> I have been trying to test the code in the repository.  My target doesn't have
> autoconf, so I've been getting a cross-compiler configured and figuring out how
> to do an RPM.  Sorry it's taking too long.

The configure script generated by autogen.sh depends on the autoconf and automake versions but neither on the CPU type nor on the configuration details of the system it has been generated on. So an alternative is to generate the configure script on a system where autotools are available and then copy the configure script to the target system.
Comment 13 Steve M 2010-06-21 17:49:21 UTC
Thanks for the suggestion.  Unfortunately, it didn't work for me.  Whether I copied just the "configure" script into a clean copy of the latest repository on the target, or if I ran "autogen.sh" and brought the whole valgrind directory over, I got the error:

> configure: error: cannot find install-sh or install.sh in "." "./.." "./../.."

I am very interested in trying this latest version, because I'm getting a reliable segfault and (seemingly unjustified) buffer warnings in the syscall sendto() in my program.  I'm curious if they would remain with the latest build.

So, I'm back to the cross-compiler and figuring out how to make the package.  It's time I learned.
Comment 14 Steve M 2010-06-22 22:23:23 UTC
I can confirm two things:  First, valgrind r11182 cross-compiles without a problem on my Red Hat i386 system; and second, it is better behaved on the target, no longer segfaulting on a particular sendto() call at app launch.  Since I'm cross-compiling this time, I overrode the ac_have_altivec flag, so I don't know if the built-in Altivec test is still failing.  Still, the flag is honored, and the code works fine.

One minor thing, and this could be me: the RPM I built has a bunch of seemingly spurious dependencies (like libc.so.6, which is present in /lib).  One I couldn't explain or satisfy is "rltd(GNU_HASH)".  I installed the RPM on the target with --nodeps, and it seems to be running fine.