-
Notifications
You must be signed in to change notification settings - Fork 164
Issues: tony-framework/TonY
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
ERROR ApplicationMaster:496 - Exception while preparing AM org.apache.hadoop.yarn.exceptions.YarnException: Can't resolve the ip of ubuntu at com.linkedin.tony.util.Utils.getHostNameOrIpFromTokenConf(Utils.java:365) at com.linkedin.tony.ApplicationMaster.prepare(ApplicationMaster.java:476) at com.linkedin.tony.ApplicationMaster.run(ApplicationMaster.java:368) at com.linkedin.tony.ApplicationMaster.main(ApplicationMaster.java:342)
#673
opened May 26, 2022 by
ckqqqq
Allow that one role of task executor could make other roles exit
enhancement
New feature or request
#636
opened Jan 19, 2022 by
zuston
Task executors that support specific roles are restarted when they fail
enhancement
New feature or request
#620
opened Nov 25, 2021 by
zuston
Provide tony-submit cli tool to submit app
enhancement
New feature or request
help wanted
Extra attention is needed
#578
opened Aug 3, 2021 by
zuston
There is a vulnerability in Protocol Buffers 0.8.1 ,upgrade recommended
good first issue
Good for newcomers
help wanted
Extra attention is needed
#552
opened May 17, 2021 by
QiAnXinCodeSafe
A little mistakes in tony-example README
good first issue
Good for newcomers
#528
opened Apr 17, 2021 by
daugraph
Support elastic Horovod on TonY
enhancement
New feature or request
help wanted
Extra attention is needed
#525
opened Apr 12, 2021 by
zuston
Check for app failure before updating task infos
good first issue
Good for newcomers
#464
opened Sep 14, 2020 by
hungj
Add more testing for Tony retry logic
good first issue
Good for newcomers
#452
opened Jun 26, 2020 by
goyalankit
Add retries id in the environment when we retry
good first issue
Good for newcomers
#434
opened May 14, 2020 by
oliverhu
fail the job instead of hanging if it's requesting GPU(s) on a host where it doesn't have enough GPU(s)
help wanted
Extra attention is needed
#432
opened Mar 21, 2020 by
burgerkingeater
Add option to return SUCCEED when training is completed with some failed job tasks
#420
opened Jan 16, 2020 by
charliechen211
Print tony version in tony client and tony AM
good first issue
Good for newcomers
#392
opened Oct 8, 2019 by
hungj
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.