Bug 415136 - ARMv8.1 Compare-and-Swap instructions are not supported
Summary: ARMv8.1 Compare-and-Swap instructions are not supported
Status: RESOLVED FIXED
Alias: None
Product: valgrind
Classification: Developer tools
Component: vex (show other bugs)
Version: 3.14.0
Platform: Other Linux
: NOR normal
Target Milestone: ---
Assignee: Julian Seward
URL:
Keywords:
Depends on:
Blocks: 414270
  Show dependency treegraph
 
Reported: 2019-12-13 13:55 UTC by ahashmi
Modified: 2020-03-09 08:24 UTC (History)
0 users

See Also:
Latest Commit:
Version Fixed In:
Sentry Crash Report:


Attachments
Patch implements and tests ARM v8.1 CAS instructions (842.43 KB, text/plain)
2019-12-19 19:02 UTC, ahashmi
Details
Patch implements ARM v8.1 CAS instructions (4.31 KB, text/plain)
2020-02-19 16:17 UTC, ahashmi
Details
Patch tests ARM v8.1 CAS instructions (836.25 KB, text/plain)
2020-02-19 16:17 UTC, ahashmi
Details
Patch implements ARM v8.1 CAS instructions (6.18 KB, text/plain)
2020-02-19 16:31 UTC, ahashmi
Details

Note You need to log in before you can comment on or make changes to this bug.
Description ahashmi 2019-12-13 13:55:51 UTC
SUMMARY
The following Compare-and-Swap instructions are not currently supported:
CAS, CASA, CASAL, CASL: Compare and Swap word or doubleword in memory.
CASB, CASAB, CASALB, CASLB: Compare and Swap byte in memory.
CASH, CASAH, CASALH, CASLH: Compare and Swap halfword in memory.
CASP, CASPA, CASPAL, CASPL: Compare and Swap Pair of words or doublewords in memory.

STEPS TO REPRODUCE
Compile with -march=armv8.1-a and if the compiler generates any of the above, valgrind will exit with unknown instruction error.


EXPECTED RESULT


SOFTWARE/OS VERSIONS
Windows: 
macOS: 
Linux/KDE Plasma: 
(available in About System)
KDE Plasma Version: 
KDE Frameworks Version: 
Qt Version: 

ADDITIONAL INFORMATION
Comment 1 ahashmi 2019-12-19 19:02:25 UTC
Created attachment 124590 [details]
Patch implements and tests ARM v8.1 CAS instructions

This patch adds support for the following ARM v8.1 instructions:
CAS, CASA, CASAL, CASL
CASB, CASAB, CASALB, CASLB
CASH, CASAH, CASALH, CASLH

(Note this patch does not add support for the pairwise CAS instructions:
 CASP, CASPA, CASPAL, CASPL)
Comment 2 Julian Seward 2019-12-27 09:42:42 UTC
Thanks for the patch.  I looked at it in some detail.

An administrative request: please could you attach the test changes as a
separate patch?  With it all in one patch, I have to scroll through the entire
giant patch to check there aren't some extra implementation bits somewhere in
it.

I am going to need some context to make sense of this.  In particular, why is
ARM64in_CAS emitted with a loop, where it did not have a loop before?  It is
true that a correct usage of CAS in general requires a loop to retry in case
of failure, but that loop must exist in the guest code surrounding the
incoming CAS instruction.  The role of the IR representation and its host-side
translation back into arm64 code is merely to attempt the CAS and return a
success/failure indication to the guest state.

Also, what is the reason for changing the hardwired register numbers?  They
now don't match the ARM64in_CAS case in getRegUsage_ARM64Instr.
Comment 3 ahashmi 2020-02-19 16:17:07 UTC
Created attachment 126174 [details]
Patch implements ARM v8.1 CAS instructions
Comment 4 ahashmi 2020-02-19 16:17:53 UTC
Created attachment 126175 [details]
Patch tests ARM v8.1 CAS instructions
Comment 5 ahashmi 2020-02-19 16:22:28 UTC
Thanks for reviewing in detail.

Good point about splitting up the large patch. I've
uploaded the implementation and tests as 2 separate patches:
arm64_v81_cas_impl.patch
arm64_v81_cas_test.patch

> I am going to need some context to make sense of this...

The loop in this context is not looping for the same reason guest code
using a CAS instruction would loop. In this case the cbne w8, loop
instruction is checking to see if the previous store exclusive worked.
The spec says that store exclusives return 0 in the w register if the
store updated memory, 1 if it failed to update. This way we always make
sure that memory has been updated before returning from the CAS.
Previously it would have returned even if the store failed.

> Also, what is the reason for changing the hardwired register numbers?

Adding the store check loop means we need to compare the store's status
value in a w register and since w8 is the scratch register we use that,
leaving w1/x1 untouched after the load.
Comment 6 ahashmi 2020-02-19 16:31:04 UTC
Created attachment 126176 [details]
Patch implements ARM v8.1 CAS instructions
Comment 7 Julian Seward 2020-03-09 08:24:43 UTC
Committed:
implementation 2281c8c8675969537dc734237b60157e8a0fe0d6
test cases     3b067535056d7da12d6ca44685e696a99269585e

Thank you for the patches, and apologies for the slowness in 
processing them.