Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

[feature] Expose more information about a job for debugging purposes #544

Open
fosterseth opened this issue Feb 9, 2022 · 1 comment
Open

Comments

@fosterseth
Copy link
Member

fosterseth commented Feb 9, 2022

What I'd like to be able to tell at a glance

  1. what was the exact work_submit command that started the work? this would be analogous to the job_args field in awx
  2. what ongoing threads/processes/file descriptors are open for a particular work unit?
    • i.e. is monitorRemoteUnit still running? we can't infer that from the status because monitorRemoteStatus and monitorRemoteStdout operate in separate threads
    • is GetResults spinning in a loop

Solutions:

  1. Logging data. This could be structured logging, similar to job_lifecycle in awx.
    Pros: doesn't require changes to work unit structs, or to the status file.
    Cons: Finding and parsing logs is tedious and not fun.

  2. Put this data in work unit structures. This can either be written to disk or kept just in memory.
    Pros: we can just narrow in on some culprit work unit by looking at its status file or by calling some receptorctl that gives us more information about the work unit
    Cons: more work to implement and get right

@fosterseth
Copy link
Member Author

fosterseth commented Feb 9, 2022

@AlanCoding maybe you have some other ideas of nice-to-haves when it comes to extra information about a receptor work unit

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants