sql: DROP DATABASE takes a looooong time #14279

danhhz · 2017-03-20T19:31:59Z

We have a teamcity build that backs up a production cluster and restores it into a new database _full. Then does an incremental backup from the first one and uses both to restore into a second new database _inc. Finally, it drops the two databases_full and _inc. In a recent run the backups each took about 1m, the restores took 10m, and the first drop database took 62 minutes. (As of the time of writing, the second one hadn't finished.) The amount of data being dropped was about 2.8GB.

https://teamcity.cockroachdb.com/viewLog.html?buildId=188347

The text was updated successfully, but these errors were encountered:

vivekmenezes · 2017-03-20T19:44:36Z

we should look at using rocksDBs DB::DeleteRange for faster deletion of both tables and indexes.

tamird · 2017-03-20T21:11:38Z

I don't think that's right. Deleting tables and indexes needs to go through our MVCC layer, we can't just obliterate the data with RocksDB's DeleteRange.

bdarnell · 2017-03-20T21:21:03Z

Well, we currently go through the MVCC layer, and this allows us to support ongoing queries (or new time-travel queries) across DROP TABLE boundaries, but in many cases this is not a requirement, and it might be nice to at least have the ability to opt in to a faster DROP TABLE that did not go through the MVCC layer (except to fix up the MVCC stats after the DeleteRange). This would both improve the performance of the drop itself and reduce the time before the disk space is freed (instead of waiting for a 24h GC cycle).

tamird · 2017-03-20T21:24:54Z

That's fine, but I'm not sure that's what this issue is about. It seems that the DROP TABLE operation is much slower than the restore, suggesting that there are lower-hanging fruit to be picked here.

bdarnell · 2017-03-20T21:29:54Z

I'm sure there's a lot of room for improvement while still using MVCC, but restore is as fast as it is because it bypasses the MVCC layer.

vivekmenezes · 2017-03-22T14:20:29Z

While we can work on the actual performance of drop table, a user preferred solution will be to just declare the drop as done once the name is available for reuse and run the actually drop in the background. This can be implemented by the schema changer being made aware if it is associated with a session, and running only the "release name" part of the code when run from the session, and the rest of the drop table code from the async schema changer.

spencerkimball · 2017-03-27T15:34:50Z

Perhaps it was a mistake to make DROP TABLE work like TRUNCATE TABLE. I agree with @vivekmenezes that we should start by removing the table name -> table ID mapping in the schema and let clients continue. But instead of doing the MVCC deletion in the background, it seems considerably more efficient to have a different path for the actual deletion of the table, which would schedule DeleteRange calls to delete the underlying data according to the zone config TTL.

The DROP TABLE is deemed complete as soon as the table name is no longer in use. The table data GC cleanup is executed asynchronously through the asynchronous schema change path, and can be made more performant later. fixes cockroachdb#14279 related to cockroachdb#2003

danhhz assigned vivekmenezes Mar 20, 2017

knz added this to the 1.0 milestone Mar 20, 2017

knz added C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. C-performance Perf of queries or internals. Solution not expected to change functional behavior. labels Mar 20, 2017

knz mentioned this issue Mar 20, 2017

sql: schema updates have problematic shortcomings #13804

Closed

22 tasks

spencerkimball added the high priority label Mar 27, 2017

vivekmenezes modified the milestones: 1.1, 1.0 Apr 19, 2017

vivekmenezes closed this as completed in 4628ab9 Jul 17, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sql: DROP DATABASE takes a looooong time #14279

sql: DROP DATABASE takes a looooong time #14279

danhhz commented Mar 20, 2017

vivekmenezes commented Mar 20, 2017

tamird commented Mar 20, 2017

bdarnell commented Mar 20, 2017

tamird commented Mar 20, 2017

bdarnell commented Mar 20, 2017

vivekmenezes commented Mar 22, 2017

spencerkimball commented Mar 27, 2017

sql: DROP DATABASE takes a looooong time #14279

sql: DROP DATABASE takes a looooong time #14279

Comments

danhhz commented Mar 20, 2017

vivekmenezes commented Mar 20, 2017

tamird commented Mar 20, 2017

bdarnell commented Mar 20, 2017

tamird commented Mar 20, 2017

bdarnell commented Mar 20, 2017

vivekmenezes commented Mar 22, 2017

spencerkimball commented Mar 27, 2017