ARM DSP: Halving parallel add/sub and multiply add/sub #535
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
First of this PR restrict everything to Cortex-A/R devices till We found a better way to manage
thumb*
targets.Then add a set of halving parallel add/sub and multiply add/sub widely used in DSP algorithms:
shsub8
: Signed halving parallel byte-wise subtraction.shsub16
: Signed halving parallel halfword-wise subtraction.shadd8
: Signed halving parallel byte-wise add.shadd16
: Signed halving parallel halfword-wise add.smuad
: Signed dual multiply add.smuadx
: Signed dual multiply add reversed.smusd
: Signed dual multiply subtract.smusdx
: Signed dual multiply subtract reversed.