
Busy hang #104

Closed
jwr opened this issue Oct 20, 2015 · 14 comments

@jwr (Contributor) commented Oct 20, 2015

I hit another problem while inserting large amounts of data: the main process eventually hangs at close to 400% CPU usage, while RethinkDB stops receiving new data.

This happened after a loop inserted 14,432,000 records. The total number to be inserted was 24,795,802.

Records were inserted in batches (vectors) of 1024 at a time.

Unfortunately, I have little to go on: the main process just continued spinning, and there were no exceptions to be seen. I captured a CPU sample using YourKit, which might point someone to the problem.

This is using [com.apa512/rethinkdb "0.11.0"], so it's not the memory leak I reported before.

[YourKit CPU sample screenshot, 2015-10-20 at 09:33]

@danielcompton (Collaborator) commented

Just to check, are you waiting for each result to return before running the next insert?

@jwr (Contributor) commented Oct 20, 2015

Hmm. The real answer is "I don't know" :-)

I just run a loop that builds the insert queries and calls r/run on them. The loop is a doseq, so I'm not holding onto the results of the inserts.
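Roughly, the shape of it is this (a simplified sketch, not the actual code; `records` and the table name are placeholders):

```clojure
(require '[rethinkdb.query :as r])

;; Simplified sketch of the insert loop. `records` and "my-table" are
;; placeholders for the real data source and table name.
(with-open [conn (r/connect :host "127.0.0.1" :port 28015)]
  (doseq [batch (partition-all 1024 records)]
    (-> (r/table "my-table")
        (r/insert (vec batch))
        (r/run conn))))
```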

@danielcompton (Collaborator) commented

r/run is blocking, so each insert should be waiting for its result before the next one runs. I'll take a look tomorrow.

@jwr (Contributor) commented Oct 20, 2015

Also, this is a fairly large table. Before that I was able to insert 1,878,859 records and 3,699,819 records into two other tables, so this is not something that happened immediately.

@danielcompton (Collaborator) commented

Hey, sorry I haven't had a chance to look into this just yet. As a temporary workaround, have you considered using multiple connections, e.g. closing the old connection and creating a new one every 100,000 inserts? Obviously not a great long-term solution, but it might help in the short term (see the sketch below).
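Something along these lines (a hypothetical illustration; the names and counts are placeholders):

```clojure
(require '[rethinkdb.query :as r])

;; Hypothetical illustration: cycle the connection every 100,000 records.
;; with-open closes each connection once its chunk of inserts is done.
(doseq [chunk (partition-all 100000 records)]
  (with-open [conn (r/connect :host "127.0.0.1" :port 28015)]
    (doseq [batch (partition-all 1024 chunk)]
      (-> (r/table "my-table")
          (r/insert (vec batch))
          (r/run conn)))))
```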

@jwr (Contributor) commented Oct 28, 2015

Thanks, I don't really need a workaround, because this isn't anything urgent — I'm just trying to use clj-rethinkdb for larger things. It works for my smaller application (https://partsbox.io/), but so far seems to break for larger amounts of data.

I'm increasingly worried about the complexity in the driver and the (inevitable) resulting problems.

@danielcompton (Collaborator) commented

I'm looking to move all of the connection and results handling to the official Java RethinkDB driver as soon as there is a release of it. That offloads the hard part to them and lets us focus on the query language.

@jwr (Contributor) commented Dec 20, 2015

FWIW, I think this is a very good idea. We are a small community for now, so we should keep our code as simple as possible.

@danielcompton (Collaborator) commented

Can you try this again? We ran into a similar-sounding issue (nothing was running, but no exceptions were being thrown). The root cause turned out to be that all of the go threads were blocked waiting on reads from the db. The connection code has been rewritten with manifold, which shouldn't suffer from this.

@jwr (Contributor) commented May 16, 2016

Of course. I will have to dig up that project and reproduce the problem again, which might take a while, though.

@jwr (Contributor) commented May 22, 2016

I retried this with 0.15.23. I was able to insert 24,795,801 records successfully with no hangs or any other problems. I'd say the bug can be closed!

I find it slightly alarming that performance drops with time when inserting larger numbers of records — from about 8-10k inserts/s at the beginning down to around 2k inserts/s after 23M records have been inserted (it's for a table with no indexes, and the data is fairly homogeneous). But I strongly suspect it's a RethinkDB issue. It seems to be reading about twice as much data as is being written to disk.

As for clj-rethinkdb, I noticed that its performance improved since 0.11.0 — I happened to have notes on how long a certain task took and compared them to the current results:

  • 0.11.0: 275s
  • 0.15.23: 193s

This is with a smaller task that doesn't hit RethinkDB's limitations. That's a 30% improvement! Nice!

@danielcompton (Collaborator) commented

Great to hear! The degraded performance after lots of inserts is interesting, although there's not really enough info to say whether that's RethinkDB, the driver, or something else entirely. If you felt like it, taking a VisualVM profile over time would be helpful to check whether there's a memory leak causing excessive GC.
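In the meantime, a cruder check (just an illustration using the plain JVM Runtime API, not anything driver-specific) is to log heap usage from the insert loop every so often and see whether it climbs without bound:

```clojure
;; Illustrative only: report used heap in MB. Calling this periodically
;; from the insert loop shows whether usage climbs steadily over time.
(defn log-heap! []
  (let [rt      (Runtime/getRuntime)
        used-mb (quot (- (.totalMemory rt) (.freeMemory rt))
                      (* 1024 1024))]
    (println "heap used:" used-mb "MB")))
```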

Thanks for following up!

@jwr (Contributor) commented May 31, 2016

I confirmed that the degraded performance is due to RethinkDB and is to be expected: rethinkdb/rethinkdb#5805

@danielcompton (Collaborator) commented

Thanks for that, I'll watch that issue with interest.
