-
Notifications
You must be signed in to change notification settings - Fork 146
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
scylla-advanced: Add a panel for scylla_io_queue_flow_ratio #2312
Conversation
} | ||
], | ||
"description": "This graph shows the ratio of dispatch rate to completion rate. It is expected to be 1.0, growing larger on reactor stalls or disk problems.\n\nscylla_io_queue_flow_ratio", | ||
"title": "I/O Group [[iogroup]] Queue flow ratio" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should have this graph collapsed by default similarly to Tombstones
and MVs
on the Detailed
dashboard.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A panel cannot be collapsed; I can collapse the row that contains this graph with scylla_io_queue_consumption
This patch adds a panel that shows scylla_io_queue_flow_ratio. Fixes scylladb#2306 Signed-off-by: Amnon Heiman <amnon@scylladb.com>
@vladzcloudius, how can we progress with this panel? The upcoming Scylla monitoring release will have the option to scroll through the tooltips. But I think it will not be enough and we should only show the average and remove anything that is close to 1, so the normal case will not be shown; I'll appreciate your thoughts. |
I agree that we should aggregate. STDEV on the other hand should be a more informative one.
@xemul WDYT? What we are looking for here is a clear way to work with this metric for Support people. |
@vladzcloudius I wrote multiple ideas, and none of them was average. |
The problem is that it may end up showing all shards values. If we can't find anything better - we can start with this. However I was hoping we WOULD find something better. |
Do you have some real life examples for what would be a safe threshold to remove? e.g. remove the range (0.9-1.1) |
You can pick any I/O heavy SC or FC cluster. |
This patch adds a panel that shows scylla_io_queue_flow_ratio.
data:image/s3,"s3://crabby-images/bf8d0/bf8d01ffe20b066bfd0a473133db5e3b03097687" alt="Screenshot_20240610_214114"
Fixes #2306