Fix undefined behaviour in dot product kernels #655

argilo · 2023-10-14T01:39:08Z

Fixes #651.
Fixes #659.

In tests, the dot product is a summation of 131071 terms, ~~so a fair amount of error can accumulate. From the past CI failures it can be seen that the error can reach a magnitude of 2, so I've set a tolerance of 10 to leave a safe margin.~~

Since the summation can be close to zero, it makes sense to check the absolute error instead of the relative error. The ccompare function did not have an absolute mode, so I added one.

Update: The test failures and large errors identified in #651 were due to undefined behaviour in the dot product kernels, which sometimes manifested in trailing samples/taps not being included in the calculation. I've updated the pull request to include fixes for that problem.

argilo · 2023-10-14T02:08:16Z

The test failure is unrelated; it's due to #646.

argilo · 2023-10-14T02:12:59Z

On my machine I can get away with an absolute tolerance of 1e-2 even if I change the order of floating point operations in the generic kernel by reversing the loop. So I'm confused why the errors have been so much larger in CI. There might be something more to the failures. Let's hold off on merging this until we know what's going on.

jdemel · 2023-10-14T08:21:44Z

@argilo you could convert this PR into a draft. This would clearly indicate that it is still being worked on.

argilo · 2023-10-15T01:01:17Z

Aha! The volk_32fc_32f_dot_prod_32fc kernels misbehave because they cast a float* to lv_32fc_t* and then dereference it, which is undefined behaviour.

volk/kernels/volk/volk_32fc_32f_dot_prod_32fc.h

Lines 64 to 65 in a26a1b8

    
           float res[2]; 
        
           float *realpt = &res[0], *imagpt = &res[1];

volk/kernels/volk/volk_32fc_32f_dot_prod_32fc.h

Line 78 in a26a1b8

*result = *(lv_32fc_t*)(&res[0]);

I got rid of the float array and replaced it with lv_32fc_t, following the model that the accumulator kernels use.

Tests now pass on all platforms, even with the tighter 1e-2 absolute tolerance.

argilo · 2023-10-15T01:02:24Z

The test failures are unrelated; both are due to #646.

argilo · 2023-10-15T01:04:13Z

volk_16i_32fc_dot_prod_32fc has the same bug, but I'll leave that one for later.

argilo · 2023-10-15T01:23:49Z

GNU Radio uses both volk_32fc_32f_dot_prod_32fc and volk_16i_32fc_dot_prod_32fc for FIR filtering, so it will be good to get them cleaned up and tested with tighter tolerances.

argilo · 2023-10-15T01:54:38Z

volk_16i_32fc_dot_prod_32fc has the same bug, but I'll leave that one for later.

Actually, I'll throw it in here because it will also need the new absolute tolerance code.

lib/kernel_tests.h

argilo · 2023-10-15T02:28:27Z

The test failure in the latest run is due to #650. At least there's some variety. 😄

argilo · 2023-10-15T14:32:09Z

Some information about why casting from float[] to std::complex is Undefined Behaviour:

https://stackoverflow.com/questions/23198943/is-it-legal-to-cast-float-to-stdcomplexfloat

jdemel

Almost LGTM. Only a very few comments. Thanks!

jdemel · 2023-10-22T12:29:30Z

kernels/volk/volk_16i_32fc_dot_prod_32fc.h

@@ -147,8 +147,7 @@ static inline void volk_16i_32fc_dot_prod_32fc_u_sse(lv_32fc_t* result,
    unsigned int number = 0;
    const unsigned int sixteenthPoints = num_points / 8;

-    float res[2];
-    float *realpt = &res[0], *imagpt = &res[1];
+    lv_32fc_t returnValue;


Compilers would potentially complain about uninitialized variables here.
I suggest: lv_32fc_t returnValue = lv_cmake(0.0f, 0.0f);
I'd update the other occurrences as well.

I'm not sure compilers would be confused by this particular case, but it would make the code easier to understand for humans. I'll add initializations and change all the updates to +=.

lib/kernel_tests.h

Signed-off-by: Clayton Smith <argilo@gmail.com>

argilo · 2023-10-22T13:09:52Z

Updated, and rebased on main. It might be a while until CI catches up. 😄

argilo · 2023-10-22T15:14:47Z

CI has now caught up. ✅

jdemel

LGTM

Fix undefined behaviour in dot product kernels

argilo force-pushed the fix-flaky-dot-prod branch from 7c7b097 to 946259f Compare October 14, 2023 01:41

argilo marked this pull request as draft October 14, 2023 14:08

argilo force-pushed the fix-flaky-dot-prod branch 2 times, most recently from d2e8603 to 43d8980 Compare October 15, 2023 00:45

argilo marked this pull request as ready for review October 15, 2023 01:01

argilo mentioned this pull request Oct 15, 2023

Undefined behaviour in volk_16i_32fc_dot_prod_32fc #659

Closed

argilo changed the title ~~Check absolute error of dot product calculation~~ Fix undefined behaviour in dot product kernel Oct 15, 2023

argilo force-pushed the fix-flaky-dot-prod branch from 43d8980 to d4c8688 Compare October 15, 2023 02:11

argilo changed the title ~~Fix undefined behaviour in dot product kernel~~ Fix undefined behaviour in dot product kernels Oct 15, 2023

argilo commented Oct 15, 2023

View reviewed changes

lib/kernel_tests.h Show resolved Hide resolved

jdemel requested changes Oct 22, 2023

View reviewed changes

argilo force-pushed the fix-flaky-dot-prod branch 2 times, most recently from 794ca91 to 56fba31 Compare October 22, 2023 13:03

argilo added 3 commits October 22, 2023 09:08

Fix undefined behaviour in dot product kernels

b9c4020

Signed-off-by: Clayton Smith <argilo@gmail.com>

Use an absolute tolerance to test the dot product kernels

db9b37c

Signed-off-by: Clayton Smith <argilo@gmail.com>

Always initialize returnValue

221138a

Signed-off-by: Clayton Smith <argilo@gmail.com>

argilo force-pushed the fix-flaky-dot-prod branch from 56fba31 to 221138a Compare October 22, 2023 13:08

jdemel approved these changes Oct 22, 2023

View reviewed changes

jdemel merged commit 42f57cd into gnuradio:main Oct 22, 2023
32 checks passed

argilo deleted the fix-flaky-dot-prod branch October 22, 2023 16:17

argilo mentioned this pull request Oct 23, 2023

Generate random integers with uniform_int_distribution #677

Merged

Alesha72003 pushed a commit to Alesha72003/volk that referenced this pull request May 15, 2024

Merge pull request gnuradio#655 from argilo/fix-flaky-dot-prod

0150e18

Fix undefined behaviour in dot product kernels

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix undefined behaviour in dot product kernels #655

Fix undefined behaviour in dot product kernels #655

argilo commented Oct 14, 2023 •

edited

Loading

argilo commented Oct 14, 2023

argilo commented Oct 14, 2023 •

edited

Loading

jdemel commented Oct 14, 2023

argilo commented Oct 15, 2023

argilo commented Oct 15, 2023

argilo commented Oct 15, 2023

argilo commented Oct 15, 2023

argilo commented Oct 15, 2023

argilo commented Oct 15, 2023

argilo commented Oct 15, 2023

jdemel left a comment

jdemel Oct 22, 2023

argilo Oct 22, 2023

argilo commented Oct 22, 2023

argilo commented Oct 22, 2023

jdemel left a comment

Fix undefined behaviour in dot product kernels #655

Fix undefined behaviour in dot product kernels #655

Conversation

argilo commented Oct 14, 2023 • edited Loading

argilo commented Oct 14, 2023

argilo commented Oct 14, 2023 • edited Loading

jdemel commented Oct 14, 2023

argilo commented Oct 15, 2023

argilo commented Oct 15, 2023

argilo commented Oct 15, 2023

argilo commented Oct 15, 2023

argilo commented Oct 15, 2023

argilo commented Oct 15, 2023

argilo commented Oct 15, 2023

jdemel left a comment

Choose a reason for hiding this comment

jdemel Oct 22, 2023

Choose a reason for hiding this comment

argilo Oct 22, 2023

Choose a reason for hiding this comment

argilo commented Oct 22, 2023

argilo commented Oct 22, 2023

jdemel left a comment

Choose a reason for hiding this comment

argilo commented Oct 14, 2023 •

edited

Loading

argilo commented Oct 14, 2023 •

edited

Loading