Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

pod status of tfjob always pending after #693 #727

Closed
merryzhou opened this issue Mar 1, 2020 · 5 comments
Closed

pod status of tfjob always pending after #693 #727

merryzhou opened this issue Mar 1, 2020 · 5 comments

Comments

@merryzhou
Copy link
Contributor

Is this a BUG REPORT or FEATURE REQUEST?:
/kind bug

What happened:
we use volcano to schedule tfjobs, but all pods status are always pending, after some research ,we found that in #693: volcano no longer support scheduling.v1alpha1 and scheduling.v1alpha2 API。
but podgroup created by tf-operator is scheduling.v1alpha1 version

so will volcano no longer compatible with tfjobs?

What you expected to happen:
volcano can scheduler tfjobs 、mpijobs。。

Environment:

  • Volcano Version:
    image
@merryzhou merryzhou changed the title pod status of tfjob always pending after #693 pod status of tfjob always pending after [#693](https://github.com/volcano-sh/volcano/pull/693) Mar 1, 2020
@merryzhou merryzhou changed the title pod status of tfjob always pending after [#693](https://github.com/volcano-sh/volcano/pull/693) pod status of tfjob always pending after #693 Mar 1, 2020
@k82cn
Copy link
Member

k82cn commented Mar 1, 2020

so will volcano no longer compatible with tfjobs?

We'll continue to support tfjobs: for current tfjobs, it's better to use volcano:0.3, and we'll raise PR in tf-operator to correct that.

@k82cn
Copy link
Member

k82cn commented Mar 1, 2020

/cc @hzxuzhonghu , please help to raise a PR in kubeflow/tf-operator for that.

@YuxiJin-tobeyjin
Copy link
Contributor

Not only tf-operator, other operators such as pytorch-operator, mpi-operator all have the same problems.

@hzxuzhonghu
Copy link
Collaborator

We plan to release 0.4 in the next week in order to support kubeflow. And by then, i can file a pr in kubeflow.

@k82cn
Copy link
Member

k82cn commented Jun 15, 2020

That was fixed in kubeflow/common v3.0 and main branch of kubeflow/tf-operator :)

@k82cn k82cn closed this as completed Jun 15, 2020
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants