Add a testcase for pthreads race conditions #12258

kripken · 2020-09-17T15:45:04Z

This is a manual test for race conditions (disabled by default as it is
very long, and checks for a race condition so it is inherently flakey)
that are fixed in #12243 #12244 #12245

It passes after the fixes in those PRs. Without them, it tends to fail
after 100 iterations out of 1000, so at least for me locally it fails
pretty consistently before the fixes.

Note that that was with chrome. I saw the test fail on firefox too
but far more rarely. On node I never saw it fail. So it definitely is
sensitive to timing somehow.

sbc100 · 2020-09-17T15:50:44Z

tests/pthread/test_pthread_proxy_hammer.cpp

+    printf("%d %d\n", i, total);
+    for (int j = 0; j < 1024; j++) {
+      // allocation uses a mutex
+      auto* rd = new random_device();


Can't this be doing directly with the pthread_mutex APIs rather than indirectly depending on the implementation of "/dev/urandom"?

Of you could write the test directly against pthread.h we could also see if it occurs in musl's native configuration.

No, the use of /dev/random is not just for a mutex - it's also for proxying (all file I/O is proxied to the main thread). That involves more than just a mutex.

Exactly, I was hoping for something a little more precise .... there is so much going on here its hard to know what this is testing.

Oh, definitely... yeah, this is not a great testcase. But it's the smallest I've managed so far that shows the issue, which is really hard to reproduce (as shown by it existing since forever, apparently).

If I have time I can try to reduce this more. But it may be better to focus on figuring out the actual cause of the problem, as that may suggest a testcase. We don't need to merge this urgently and may never merge it I guess.

tlively · 2020-09-18T00:30:06Z

[Commenting just to bump any notifications on this higher in my inbox]

kripken · 2020-09-22T23:27:07Z

I have found the actual cause here, and will open a refactoring PR and then a fix PR shortly. The fix PR will contain a variant of this test, turned off by default.

kripken added 4 commits September 16, 2020 21:02

Add a testcase for #12243 #12244 #12245 [ci skip]

5617421

better

2f18199

fix

68874d2

more

36be7cd

kripken changed the title ~~Add a testcase for pthreads race conditions fixed in #12243 #12244 #12245~~ Add a testcase for pthreads race conditions Sep 17, 2020

sbc100 reviewed Sep 17, 2020

View reviewed changes

This was referenced Sep 17, 2020

Fix a race condition in pthread call targets not waking up. #12244

Closed

Fix a race condition in pthread_mutex_timedlock.c #12245

Closed

kripken marked this pull request as draft September 17, 2020 22:45

kripken closed this Sep 22, 2020

kripken deleted the pthread4 branch September 22, 2020 23:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add a testcase for pthreads race conditions #12258

Add a testcase for pthreads race conditions #12258

Uh oh!

kripken commented Sep 17, 2020 •

edited

Loading

Uh oh!

sbc100 Sep 17, 2020

Uh oh!

kripken Sep 17, 2020

Uh oh!

sbc100 Sep 17, 2020

Uh oh!

kripken Sep 17, 2020

Uh oh!

tlively commented Sep 18, 2020

Uh oh!

kripken commented Sep 22, 2020

Uh oh!

Uh oh!

Add a testcase for pthreads race conditions #12258

Add a testcase for pthreads race conditions #12258

Uh oh!

Conversation

kripken commented Sep 17, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sbc100 Sep 17, 2020

Choose a reason for hiding this comment

Uh oh!

kripken Sep 17, 2020

Choose a reason for hiding this comment

Uh oh!

sbc100 Sep 17, 2020

Choose a reason for hiding this comment

Uh oh!

kripken Sep 17, 2020

Choose a reason for hiding this comment

Uh oh!

tlively commented Sep 18, 2020

Uh oh!

kripken commented Sep 22, 2020

Uh oh!

Uh oh!

kripken commented Sep 17, 2020 •

edited

Loading