-
Notifications
You must be signed in to change notification settings - Fork 670
[Ray Observability] Disk usage in Dashboard #1152
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
architkulkarni
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me. Curious why this is opt-in instead of enabled by default (question for the Ray side I guess)
|
I found ray-project/ray#24416 and ray-project/ray#14676.
CC: @rkooo567 @DmitriGekhtman, could you please provide some insight on whether enabling disk usage for Kubernetes Pods by default would be advisable or not? Thanks! |
|
Also cc @scottsun94 |
|
IIUC, the problem with enabling it by default is: when multiple ray pods share the same k8s node, all these ray pods show the disk usage of that node. People might think that they are independent which might lead to confusion. I will still vote for enabling it by default:
|
|
Yeah I also think it is better enabling it, but we should have some tooltip or UI improvement to make sure people understand the problem. What @scottsun94 said is correct (disk usage in k8s is not set per pod, so the information could be misleading).
I don't think we have such best practices IIRC (pr does kuberay use daemonset to achieve this? @kevin85421 ). |
|
Thank @scottsun94 @rkooo567 for the reply.
Open an issue to track the progress ray-project/ray#36362.
Yes, this is the best practice, but a lot of users do not follow it, so the issue is still there.
No, KubeRay does not use DaemonSet. |
[Ray Observability] Disk usage in Dashboard
Why are these changes needed?
Related slack thread: https://ray-distributed.slack.com/archives/C01DLHZHRBJ/p1686170239143299
Without this PR, the values in the column

Disk(root)are0.0000B/1.0000B.With this PR, everything works fine.

Related issue number
Checks