-
Notifications
You must be signed in to change notification settings - Fork 670
Inject the --block option to ray start command automatically #932
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Inject the --block option to ray start command automatically #932
Conversation
Signed-off-by: Yicheng-Lu-llll <luyc58576@gmail.com>
220124b to
a689ce6
Compare
Signed-off-by: Yicheng-Lu-llll <luyc58576@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
-
Add an unit test
-
Open an issue to remove
--blockfrom YAML files after 0.5.0, and update #940.
Signed-off-by: Yicheng-Lu-llll <luyc58576@gmail.com>
kevin85421
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
Approve this PR without running by myself because: (1) This PR has enough unit tests (2) The screenshots in the PR description are reliable.
|
…oject#932) Inject the --block option to ray start command automatically
Why are these changes needed?
This PR:
--blockoption to the ray start command automatically if the user has not set the-- blockoption. See Remove ray-cluster.without-block.yaml #675 and [Feature][Docs] Explain how to specify container command for head pod #912 (comment)'parameters': 'true'or'parameters': 'false'(e.g--block,--disable-usage-stats) inrayStartParams. However, kuberay can not handle(will fail to run all the pods) if the user set 'parameters': 'false' in rayStartParams. This PR enables the user to set a 'false' value.Prove that Without --block, it may take a much longer time to detect the unhealthy condition and restart the pod:
You can find without_block.yaml and with_block.yaml here.
To summarize the result, In a single, healthy node environment:
Questions:
Should we enforce adding the
--blockoption (even if the user set'block': 'false'inrayStartParams)?'Because not every user will enable Fault Tolerance. But the readiness probe and Liveness probe will only be installed if Fault Tolerance is enabled. That is, if running without Fault Tolerance and ray processes somehow crash, kuberay will never detect the failure and restart the pod for the user.
Related issue number
Closes #915
Checks
I've made sure the tests are passing in various situations:
'block' : 'false'inrayStartParams'block' : 'true'inrayStartParamsblocknot set.Test convertParamMap