
Feature timeouts and stats #251

Open

wants to merge 4 commits into master

Conversation

seanmcevoy

This pull request allows us to configure separate queue and service timeouts instead of just the total timeout.

The problem it addresses is that each request is blocking, so if the client is used in a near-capacity application and the request queue starts to build up, it can become very inefficient.

The inefficiency in the current timeout mechanism comes from the fact that the timeout covers both the queue and the service time. If the request queue builds up, then the requests we put on the wire are the oldest ones, which are the most likely to expire while being serviced. If one expires while being serviced, the connection is broken and re-formed and the next-oldest request is taken. So if we're in a constant overloaded state, the client will spend all its time disconnecting and reconnecting to the server and will serve no requests at all.

To avoid this, the new mechanism allows us to configure separate queue and service timeouts.
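For illustration, here is a minimal sketch of the split-timeout idea, assuming a plain queue of timestamped requests; the module, function, and option names are made up and are not the API this pull request adds:

```erlang
%% Sketch only: not the API added by this PR. Requests are timestamped
%% when queued; a request that has already exceeded its queue timeout is
%% failed locally instead of being put on the wire, and the request that
%% is actually serviced gets a fresh service timeout.
-module(split_timeout_sketch).
-export([enqueue/2, dequeue_and_serve/3]).

%% Tag each request with the time it was queued.
enqueue(Queue, Request) ->
    queue:in({erlang:monotonic_time(millisecond), Request}, Queue).

dequeue_and_serve(Queue, QueueTimeout, ServiceTimeout) ->
    case queue:out(Queue) of
        {empty, _} ->
            empty;
        {{value, {QueuedAt, Request}}, Rest} ->
            Waited = erlang:monotonic_time(millisecond) - QueuedAt,
            case Waited > QueueTimeout of
                true ->
                    %% Expired while queued: reply with an error locally
                    %% and move on; the connection is left intact.
                    reply(Request, {error, queue_timeout}),
                    dequeue_and_serve(Rest, QueueTimeout, ServiceTimeout);
                false ->
                    %% The service timeout starts now, independent of how
                    %% long the request sat in the queue.
                    {serve(Request, ServiceTimeout), Rest}
            end
    end.

%% Placeholders for the reply path and the actual wire call.
reply(_Request, _Reply) -> ok.
serve(_Request, _ServiceTimeout) -> ok.
```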

This is demonstrated in overload_demo_test, where we set up a dummy server with a delay and then overload it with 200 requests. The old mechanism can only service the first 2 or 3 requests and will time out for the rest of the test, while the new mechanism services a constant ~60% of the requests.
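Roughly the shape of that kind of overload test, as a self-contained sketch; the dummy server is simulated with a sleep, every request here trivially succeeds, and none of the names come from overload_demo_test:

```erlang
%% Rough sketch of the overload scenario described above; not the real test.
-module(overload_sketch).
-export([run/0]).

run() ->
    NumRequests = 200,
    Parent = self(),
    %% Fire all requests concurrently so a queue builds up behind the
    %% (simulated) slow server.
    lists:foreach(
      fun(N) -> spawn(fun() -> Parent ! {done, N, fake_request(N)} end) end,
      lists:seq(1, NumRequests)),
    Results = [receive {done, _, R} -> R end || _ <- lists:seq(1, NumRequests)],
    Served = length([R || R <- Results, R =:= ok]),
    io:format("served ~p of ~p requests~n", [Served, NumRequests]).

%% Stand-in for a blocking client call against a slow dummy server.
%% Here every request "succeeds"; in the real test, how many succeed
%% depends on the client's timeout behaviour.
fake_request(_N) ->
    timer:sleep(50),
    ok.
```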

The old mechanism is retained and works exactly as before; backward compatibility is tested in timeout_conn_test and timeout_no_conn_test.

@seanmcevoy mentioned this pull request on Nov 20, 2015 (closed)
@lukebakken
Contributor

If you have time, could you do some research into the CI failures? Let me know if you can / can't reproduce them locally when you run the test suite.

@seanmcevoy
Author

The changes affect the timeout mechanism, so the tests make a lot of concurrent calls and check the time taken to return. So I have a dilemma of how tight to set the tolerances on the execution times. On my machine these tests pass >95% of the time, but on your CI box the failure seems consistent. So it's just a test issue; I'll see if I can figure out a better way to test it. Just loosening the tolerances until it passes feels like cheating!
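For context, the kind of timing assertion in question looks roughly like this; the timeout, the slack value, and the helper names are illustrative only and are not taken from the test suite:

```erlang
%% Illustrative only: the shape of a timing assertion with a tolerance.
-module(tolerance_sketch).
-include_lib("eunit/include/eunit.hrl").

request_timing_test() ->
    Timeout = 1000,
    Slack = 100,  %% arbitrary allowance for scheduler/CI jitter, in ms
    Start = erlang:monotonic_time(millisecond),
    {error, timeout} = slow_call(Timeout),
    Elapsed = erlang:monotonic_time(millisecond) - Start,
    %% The call must take at least the configured timeout, but not much
    %% longer; how much "not much longer" is, is the tolerance dilemma.
    ?assert(Elapsed >= Timeout),
    ?assert(Elapsed =< Timeout + Slack).

%% Stand-in for a client call that is expected to time out.
slow_call(Timeout) ->
    timer:sleep(Timeout),
    {error, timeout}.
```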


@seanmcevoy
Author

CI tests are passing now after tweaking the tolerances. They're still probabilistic, so unfortunately there will be a low failure rate if they are run repeatedly. They're possibly a little too high-level to be eunit tests, but I don't know of any better solutions.

@lukebakken
Contributor

@seanmcevoy - just so you know, we haven't had time to review this yet. I have added it to a future milestone and it won't be lost.
