Commit Graph

3 Commits

Author SHA1 Message Date
Chad Kunde
96035f78ec asm/f64: add asm implementation of l2distance function
L2 Distance kernel is a slight modification of the L2 Norm kernel,
taking two vectors and calculating the difference before summing squares.

Same overflow protections as the netlib form of the norm calculation.
2019-11-23 16:39:34 +08:00
Chad Kunde
f237e064fc asm/f64: translate l2norm with increment to asm
Netlib algorithm reduces overflow while calculating the l2norm of a
vector.

Translated incremented vector calculation to asm while reducing
branches in NaN and Inf checks. Overflow protection is equivalent to
the Netlib standard implementation.
2019-11-23 16:39:34 +08:00
Chad Kunde
4e1ef9c972 translate netlib l2norm algorithm to asm and remove branches
Netlib algorithm reduces overflow while calculating the l2norm of a
vector.

Translated to asm while reducing branches in NaN and Inf checks.
Overflow protection is equivalent to the Netlib standard implementation.
2019-11-15 18:20:45 +08:00