third_party/aom/av1/common/x86

Name	Description	Size	Coverage
av1_convolve_horiz_rs_sse4.c		9922	0 %
av1_convolve_scale_sse4.c		21556	0 %
av1_inv_txfm_avx2.c		89957	54 %
av1_inv_txfm_avx2.h		2616	100 %
av1_inv_txfm_ssse3.c		113960	27 %
av1_inv_txfm_ssse3.h		8965	89 %
av1_txfm_sse2.h		13143	54 %
av1_txfm_sse4.c		892	0 %
av1_txfm_sse4.h		2393	0 %
cdef_block_avx2.c	partial A is a 16-bit vector of the form: [x8 - - x1 \| x16 - - x9] and partial B has the form: [0 y1 - y7 \| 0 y9 - y15]. This function computes (x1^2+y1^2)*C1 + (x2^2+y2^2)*C2 + ... + (x7^2+y7^2)*C7 + (x8^2+0^2)*C8 on each 128-bit lane. The constants C1..C8 are in const1 and const2.	15640	92 %
cdef_block_sse4.c		1673	0 %
cdef_block_ssse3.c		2095	-
cfl_avx2.c	4x4	21372	0 %
cfl_simd.h		14837	-
cfl_sse2.c		3573	0 %
cfl_ssse3.c	Adds 4 pixels (in a 2x2 grid) and multiplies the sum by 2, resulting in a more precise version of box-filter 4:2:0 pixel subsampling in Q3. The CfL prediction buffer is always of size CFL_BUF_SQUARE; however, the active area is specified using width and height. Note: there is no need to worry about going past the active area, as long as accesses stay inside the CfL prediction buffer.	16986	7 %
convolve_2d_avx2.c		6517	9 %
convolve_2d_sse2.c	Horizontal filter	23807	0 %
convolve_avx2.c	rounding code	39711	2 %
convolve_sse2.c	[4]	19767	0 %
filterintra_sse4.c	arbitrary pack arg	15287	0 %
highbd_convolve_2d_avx2.c	Horizontal filter	8312	0 %
highbd_convolve_2d_sse4.c		18237	0 %
highbd_convolve_2d_ssse3.c	Horizontal filter	17061	0 %
highbd_inv_txfm_avx2.c		169251	0 %
highbd_inv_txfm_sse4.c	SSE4.1	223876	0 %
highbd_jnt_convolve_avx2.c		36207	0 %
highbd_jnt_convolve_sse4.c	Vertical filter	16618	0 %
highbd_txfm_utility_sse4.h	SSE4.1	5597	0 %
highbd_warp_affine_avx2.c		29103	0 %
highbd_warp_plane_sse4.c		27683	0 %
highbd_wiener_convolve_avx2.c	Horizontal filter	11602	0 %
highbd_wiener_convolve_ssse3.c	Horizontal filter	8616	0 %
intra_edge_sse4.c		11606	0 %
jnt_convolve_avx2.c		53957	9 %
jnt_convolve_sse2.c		15331	0 %
jnt_convolve_ssse3.c	Horizontal filter	9983	0 %
reconinter_avx2.c		27617	0 %
reconinter_sse4.c	SSE4.1	6233	0 %
reconinter_ssse3.c		4755	0 %
resize_avx2.c	g0... g15 \| i0... i15	36682	0 %
resize_sse2.c	ah0 ah1 ... ah7	15713	0 %
resize_ssse3.c		39070	0 %
selfguided_avx2.c		28680	0 %
selfguided_hwy_avx512.cc		676	-
selfguided_sse4.c		26475	0 %
warp_plane_avx2.c		53875	0 %
warp_plane_hwy_avx2.cc		660	-
warp_plane_hwy_avx512.cc		665	-
warp_plane_hwy_sse4.cc		662	-
warp_plane_sse4.c	This is a modified version of 'av1_warped_filter' from warped_motion.c: each coefficient is stored in 8 bits instead of 16 bits, and the coefficients are rearranged in the column order 0, 2, 4, 6, 1, 3, 5, 7. This is done to avoid overflow: since the tap with the largest coefficient could be any of taps 2, 3, 4 or 5, we can't use the summation order ((0 + 1) + (4 + 5)) + ((2 + 3) + (6 + 7)) used in the regular convolve functions. Instead, we use the summation order ((0 + 2) + (4 + 6)) + ((1 + 3) + (5 + 7)). The coefficients are rearranged in this table so that they can be put into the correct order more quickly.	42048	0 %
wiener_convolve_avx2.c		9956	0 %
wiener_convolve_sse2.c	Horizontal filter	8848	0 %
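
The weighted sum described in the cdef_block_avx2.c row can be modelled in scalar C for a single 128-bit lane. This is only a sketch: the function name `cdef_partial_weighted_sum_sketch` and the flat-array layout (x[0] = x1, y[0] = y1, c[0] = C1) are hypothetical and not taken from the source file, and the constants are passed in rather than assumed.

```c
#include <stdint.h>

/* Scalar model of one lane (hypothetical helper, not the libaom function).
 * x holds x1..x8, y holds y1..y7, c holds C1..C8.
 * Computes (x1^2+y1^2)*C1 + ... + (x7^2+y7^2)*C7 + (x8^2+0^2)*C8. */
int64_t cdef_partial_weighted_sum_sketch(const int16_t x[8],
                                         const int16_t y[7],
                                         const int32_t c[8]) {
  int64_t sum = 0;
  for (int i = 0; i < 7; ++i) {
    const int64_t xx = (int64_t)x[i] * x[i];
    const int64_t yy = (int64_t)y[i] * y[i];
    sum += (xx + yy) * c[i];
  }
  /* The eighth term has no y partner: partial B carries a zero there. */
  sum += (int64_t)x[7] * x[7] * c[7];
  return sum;
}
```

A 64-bit accumulator is used here purely to keep the sketch free of overflow concerns; the vector code's actual accumulator width is not stated in the listing.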
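
The 4:2:0 subsampling described in the cfl_ssse3.c row can be written as a scalar loop. Averaging four pixels divides the sum by 4, and scaling to Q3 multiplies by 8, so the net effect is the sum times 2 with three fractional bits kept. The sketch below assumes 8-bit luma and even width and height; the function name and parameters are hypothetical, not the signature used in the source file.

```c
#include <stdint.h>

/* Hypothetical scalar model of 2x2 box subsampling into Q3.
 * width and height describe the active luma area and are assumed even. */
void cfl_subsample_420_q3_sketch(const uint8_t *luma, int luma_stride,
                                 int width, int height,
                                 uint16_t *out, int out_stride) {
  for (int j = 0; j < height; j += 2) {
    for (int i = 0; i < width; i += 2) {
      const uint8_t *top = luma + j * luma_stride + i;
      const uint8_t *bot = top + luma_stride;
      const int sum = top[0] + top[1] + bot[0] + bot[1];
      /* (sum / 4) * 8 == sum * 2, without discarding the fractional bits. */
      out[(j >> 1) * out_stride + (i >> 1)] = (uint16_t)(sum << 1);
    }
  }
}
```

For a single 2x2 block holding 10, 20, 30, 40 the output is 200, which is the average 25 expressed in Q3.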
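
The coefficient order and summation order described in the warp_plane_sse4.c row can be illustrated with two small scalar helpers. Both names are hypothetical. Plain 32-bit scalar arithmetic does not overflow here, so the sketch shows only the ordering, not the overflow behaviour of the 16-bit vector lanes.

```c
#include <stdint.h>

/* Rearranges 8 taps into the column order 0, 2, 4, 6, 1, 3, 5, 7. */
void warp_reorder_taps_sketch(const int8_t in[8], int8_t out[8]) {
  static const int order[8] = { 0, 2, 4, 6, 1, 3, 5, 7 };
  for (int k = 0; k < 8; ++k) out[k] = in[order[k]];
}

/* Applies 8 taps (in natural order) to 8 pixels, adding the products in
 * the order ((0 + 2) + (4 + 6)) + ((1 + 3) + (5 + 7)). */
int32_t warp_filter_even_odd_sketch(const uint8_t px[8],
                                    const int8_t taps[8]) {
  int32_t p[8];
  for (int k = 0; k < 8; ++k) p[k] = (int32_t)px[k] * taps[k];
  const int32_t even = (p[0] + p[2]) + (p[4] + p[6]);
  const int32_t odd = (p[1] + p[3]) + (p[5] + p[7]);
  return even + odd;
}
```

Grouping even taps with even and odd with odd means the two central taps of each group (2 and 4, or 3 and 5) are never added to each other in the first step, which is the property the row's comment relies on.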