bec15359de
Optimisations for the 4,4 split table region multiplication and carry less multiplication using NEON's polynomial long multiplication. arm: w8: NEON carry less multiplication Selected time_tool.sh results for a 1.7GHz cortex-a9: Region Best (MB/s): 375.86 W-Method: 8 -m CARRY_FREE - Region Best (MB/s): 142.94 W-Method: 8 -m TABLE - Region Best (MB/s): 225.01 W-Method: 8 -m TABLE -r DOUBLE - Region Best (MB/s): 211.23 W-Method: 8 -m TABLE -r DOUBLE -r LAZY - Region Best (MB/s): 160.09 W-Method: 8 -m LOG - Region Best (MB/s): 123.61 W-Method: 8 -m LOG_ZERO - Region Best (MB/s): 123.85 W-Method: 8 -m LOG_ZERO_EXT - Region Best (MB/s): 1183.79 W-Method: 8 -m SPLIT 8 4 -r SIMD - Region Best (MB/s): 177.68 W-Method: 8 -m SPLIT 8 4 -r NOSIMD - Region Best (MB/s): 87.85 W-Method: 8 -m COMPOSITE 2 - - Region Best (MB/s): 428.59 W-Method: 8 -m COMPOSITE 2 - -r ALTMAP - |
||
---|---|---|
examples | ||
include | ||
m4 | ||
src | ||
test | ||
tools | ||
.gitignore | ||
AUTHORS | ||
COPYING | ||
ChangeLog | ||
License.txt | ||
Makefile.am | ||
NEWS | ||
README | ||
README.txt | ||
autogen.sh | ||
compile | ||
configure.ac | ||
depcomp | ||
test-driver |
README.txt
This is GF-Complete, Revision 1.02. January 1, 2014. Authors: James S. Plank (University of Tennessee) Ethan L. Miller (UC Santa Cruz) Kevin M. Greenan (Box) Benjamin A. Arnold (University of Tennessee) John A. Burnum (University of Tennessee) Adam W. Disney (University of Tennessee, Allen C. McBride (University of Tennessee) The user's manual is in the file Manual.pdf. You may also get a copy of that manual at http://www.cs.utk.edu/~plank/plank/papers/GF-Complete-Manual-1.02.pdf. The online home for GF-Complete is: - https://bitbucket.org/jimplank/gf-complete If you want to cite GF-Complete in a paper, I suggest citing the technical report version. The precise citation information for that is in http://www.cs.utk.edu/~plank/plank/papers/CS-13-716.html. To compile, do: ./configure make sudo make install