51a1abb918
NEON fixes/tweaks This merge request fixes some issues and adds some tweaks to NEON code: * SPLIT(16,4) ALTMAP implementation was broken as it only processed half the amount of data. As such, this fixed implementation is significantly slower than the old code (which is to be expected). Fixes #2 * SPLIT(16,4) implementations now merge the ARMv8 and older code path, similar to SPLIT(32,4). This fixes the ALTMAP variant, and also enables the non-ALTMAP version to have consistent sizing * Unnecessary VTRN removed in non-ALTMAP SPLIT(16,4) as NEON allows (de)interleaving during load/store; because of this, ALTMAP isn't so useful in NEON * This can also be done for SPLIT(32,4), but I have not implemented it * I also pulled the `if(xor)` conditional from non-ALTMAP SPLIT(16,4) to outside the loop. It seems to improve performance a bit on my Cortex A7 * It probably should be implemented everywhere else, but I have not done this * CARRY_FREE was incorrectly enabled on all sizes of w, when it's only available for w=4 and w=8 See merge request !16 |
||
---|---|---|
examples | ||
include | ||
m4 | ||
manual | ||
src | ||
test | ||
tools | ||
.gitignore | ||
AUTHORS | ||
COPYING | ||
ChangeLog | ||
License.txt | ||
Makefile.am | ||
NEWS | ||
README | ||
README.txt | ||
autogen.sh | ||
compile | ||
configure.ac | ||
depcomp |
README.txt
This is GF-Complete, Revision 1.03. January 1, 2015. Authors: James S. Plank (University of Tennessee) Ethan L. Miller (UC Santa Cruz) Kevin M. Greenan (Box) Benjamin A. Arnold (University of Tennessee) John A. Burnum (University of Tennessee) Adam W. Disney (University of Tennessee, Allen C. McBride (University of Tennessee) The user's manual is in the file Manual.pdf. The online home for GF-Complete is: - http://jerasure.org/jerasure/gf-complete To compile, do: ./configure make sudo make install