Float Product combine1: Maximum use of data abstraction:
28.91 cycles/element
Float Product combine2: Take vec_length() out of loop:
27.98 cycles/element
Float Product combine3: Array reference to vector data:
7.09 cycles/element
Float Product combine4: Array reference, accumulate in temporary:
7.00 cycles/element
Float Product combine4p: Pointer reference, accumulate in temporary:
7.67 cycles/element
Float Product Array code, unrolled by 2:
7.00 cycles/element
Float Product combine5p: Pointer code, unrolled by 3, for loop:
7.00 cycles/element
Float Product Array code, unrolled by 3, while loop:
7.00 cycles/element
Float Product Array code, unrolled by 4:
7.01 cycles/element
Float Product Array code, unrolled by 8:
7.00 cycles/element
Float Product Array code, unrolled by 16:
7.01 cycles/element
Float Product Pointer code, unrolled by 2:
7.00 cycles/element
Float Product Pointer code, unrolled by 3:
7.01 cycles/element
Float Product Pointer code, unrolled by 4:
7.00 cycles/element
Float Product Pointer code, unrolled by 8:
7.00 cycles/element
Float Product Pointer code, unrolled by 16:
6.99 cycles/element
Float Product combine6: Array code, unrolled by 2, Superscalar x2:
3.50 cycles/element
Float Product Array code, unrolled by 4, Superscalar x2:
3.50 cycles/element
Float Product Array code, unrolled by 8, Superscalar x2:
3.50 cycles/element
Float Product Array code, unrolled by 3, Superscalar x3:
2.34 cycles/element
Float Product Array code, unrolled by 4, Superscalar x4:
2.00 cycles/element
Float Product Array code, unrolled by 8, Superscalar x4:
2.00 cycles/element
Float Product Array code, unrolled by 6, Superscalar x6:
2.00 cycles/element
Float Product Array code, unrolled by 8, Superscalar x8:
2.00 cycles/element
Float Product Array code, unrolled by 10, Superscalar x10:
2.00 cycles/element
Float Product Array code, unrolled by 12, Superscalar x6:
2.00 cycles/element
Float Product Array code, unrolled by 12, Superscalar x12:
2.00 cycles/element
Float Product Pointer code, unrolled by 8, Superscalar x2:
3.50 cycles/element
Float Product Pointer code, unrolled by 8, Superscalar x4:
2.00 cycles/element
Float Product Pointer code, unrolled by 8, Superscalar x8:
2.01 cycles/element
Float Product Pointer code, unrolled by 9, Superscalar x3:
2.36 cycles/element
Float Product Array code, unrolled by 2, Superscalar x2, noninterleaved:
3.50 cycles/element
Float Product Array code, unrolled by 2, Different Associativity:
3.63 cycles/element
Float Product Array code, unrolled by 3, Different Associativity:
2.42 cycles/element
Float Product Array code, unrolled by 4, Different Associativity:
2.01 cycles/element
Float Product Array code, unrolled by 6, Different Associativity:
2.01 cycles/element
Float Product Array code, unrolled by 8, Different Associativity:
2.00 cycles/element
Float Product SSE code, 1*VSIZE-way parallelism:
1.76 cycles/element
Float Product SSE code, 2*VSIZE-way parallelism:
1.18 cycles/element
Float Product SSE code, 4*VSIZE-way parallelism:
0.88 cycles/element
Float Product SSE code, 8*VSIZE-way parallelism:
0.63 cycles/element
Float Product SSE code, 12*VSIZE-way parallelism:
0.49 cycles/element
Float Product SSE code, 2*VSIZE-way parallelism, reassociate:
1.20 cycles/element
Float Product SSE code, 4*VSIZE-way parallelism, reassociate:
0.85 cycles/element
Float Product SSE code, 8*VSIZE-way parallelism, reassociate:
0.56 cycles/element
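
The labels above name loop transformations without showing them. As a rough illustration only, the sketch below shows what "accumulate in temporary", "unrolled by 2, Superscalar x2", and "unrolled by 2, Different Associativity" could look like for a float product. The function names, signatures, and the mapping to the benchmark labels are assumptions made for illustration; this is not the code that produced the measurements.

```c
#include <stddef.h>

/* Hypothetical baseline: a single accumulator, so every multiply
   must wait for the result of the previous one. */
float product_serial(const float *data, size_t n)
{
    float acc = 1.0f;
    for (size_t i = 0; i < n; i++)
        acc = acc * data[i];
    return acc;
}

/* Hypothetical "unrolled by 2, Superscalar x2": two independent
   accumulators, so two multiply chains can be in flight at once. */
float product_unroll2_acc2(const float *data, size_t n)
{
    float acc0 = 1.0f;
    float acc1 = 1.0f;
    size_t i = 0;

    /* Main loop: consume two elements per iteration. */
    for (; i + 1 < n; i += 2) {
        acc0 = acc0 * data[i];
        acc1 = acc1 * data[i + 1];
    }
    /* Cleanup: handle the leftover element when n is odd. */
    for (; i < n; i++)
        acc0 = acc0 * data[i];

    return acc0 * acc1;
}

/* Hypothetical "unrolled by 2, Different Associativity": one accumulator,
   but the two elements are multiplied together first, so only one
   multiply per iteration depends on the previous iteration. */
float product_unroll2_reassoc(const float *data, size_t n)
{
    float acc = 1.0f;
    size_t i = 0;

    for (; i + 1 < n; i += 2)
        acc = acc * (data[i] * data[i + 1]);
    for (; i < n; i++)
        acc = acc * data[i];

    return acc;
}
```

Floating-point multiplication is not associative, so the three variants can differ in rounding or overflow behavior on general input, even though they agree on small exactly representable values.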