Name Description Size
IntegerGemmIntrinsic.cpp 20123
IntegerGemmIntrinsic.h Interface for integer matrix multiplication followed by addition of bias. C = A * B + Bias Input matrix A: - A 2-D matrix that typically represents activations as floating point values - no. of rows should be a positive integer - no. of columns should be a positive integeral multiple of 64 - is represented as array (contiguous memory locations) in row-major format Input matrix B: - A 2-D matrix that typically represents fixed model parameters as floating point values - no. of rows should be: -- equal to no. of columns of Input matrix A -- a positive integeral multiple of 64 - no. of columns should be a positive integeral multiple of 8 - is represented as array (contiguous memory locations) in row-major format Please note that it is also possible to pass Input matrix B in 2 more forms: - One that is already a quantized and transposed version of Input matrix B - Other that is already a transposed version of Input matrix B Input Bias: - is an array (contiguous memory locations) that represents bias - size of the array should be equal to the no. of columns of Input matrix B Output matrix C: - is a 2-D matrix that represents the result (= A * B + Bias) - no. of rows = no. of rows of Input matrix A - no. of columns = no. of columns of Input matrix B (in untransposed form) - is represented as array (contiguous memory locations) in row-major format Please note that most of the functions in this interface might have architecture specific implementations. Conventions followed for the interface: - Unless explicitly mentioned, Input matrix B refers to an unquantized (i.e. float values) and non-transposed version - no. of rows of Input matrix A = `rowsA` - no. of columns of Input matrix A (`colsA`) = no. of rows of Input matrix B (`rowsB`) = `width` - no. of columns of Input matrix B = `colsB` 19336
moz.build 2878
README_MOZILLA 665