The vmsummbm instruction needs to truncate each result to 32 bits. The current implementation does not do this correctly, which causes errors in the adjacent 32-bit fields.
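For reference, a minimal sketch of the expected per-lane behaviour, based on my reading of the ISA description (signed bytes from the first operand times unsigned bytes from the second, summed with a signed word accumulator, result taken modulo 2^32). The helper name vmsummbm_lane is hypothetical and is not taken from the patch; it only models one 32-bit field in plain C.

```c
#include <stdint.h>

/* Hypothetical reference model of ONE 32-bit lane of vmsummbm.
 * Assumption: a[] holds the four signed bytes of the lane from the first
 * operand, b[] the four unsigned bytes from the second, and c the signed
 * 32-bit accumulator word. */
static uint32_t vmsummbm_lane(const int8_t a[4], const uint8_t b[4], int32_t c)
{
    /* Accumulate in 64 bits so the intermediate sum cannot overflow. */
    int64_t sum = (int64_t)c;
    for (int i = 0; i < 4; i++)
        sum += (int64_t)a[i] * (int64_t)b[i];

    /* Keep only the low 32 bits (modulo 2^32). Anything above bit 31
     * must be dropped rather than carried into the neighbouring lane. */
    return (uint32_t)sum;
}
```

The final cast is the truncation step: if the wider intermediate value were written back without it, the upper bits would land in the adjacent 32-bit field.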
Created attachment 118972: fix for vmsummbm instruction
The attached patch fixes the issue.
No regressions found by automated regression testing. Closing.