minus-squarecollapse_already@lemmy.mltoOpen Source@lemmy.ml•FFmpeg devs boast of up to 94x performance boost after implementing handwritten AVX-512 assembly codelinkfedilinkEnglisharrow-up10·12 days agoAs someone who has done some hand coding of AVX-512, I appreciate their willingness to take this on. Getting the input vectors setup correctly for the instructions can be a hassle, especially when the input dataset is not an even multiple of 64. linkfedilink
As someone who has done some hand coding of AVX-512, I appreciate their willingness to take this on. Getting the input vectors setup correctly for the instructions can be a hassle, especially when the input dataset is not an even multiple of 64.