Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Prevent ingestion of jobs with 0 start or end time #197

Merged
merged 2 commits into from
Aug 4, 2017

Conversation

jtpalmer
Copy link
Contributor

Description

Changes ingestor SQL query to exclude jobs with a start or end time that is 0.

Motivation and Context

Jobs with zero start time should not be included in the data warehouse.

Tests performed

Shredded and ingested data containing jobs with zero start time. Observed that before the change there were rows in modw.jobfact with start_time_ts zero before the change, but none after.

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

  • My code follows the code style of this project as found in the CONTRIBUTING document.
  • I have added tests to cover my changes.
  • All new and existing tests passed.

@jtpalmer jtpalmer added the bug Bugfixes label Jul 27, 2017
@jtpalmer jtpalmer force-pushed the fix-ingestor-zero-start-end-times branch from 4abcfe1 to 7b4ee6d Compare July 27, 2017 12:50
@jpwhite4
Copy link
Member

jpwhite4 commented Aug 3, 2017

Could you also add a line or two to the documentation (or FAQ perhaps?) to explain this new (desired) behaviour? I'm anticipating someone complaining that the job count is off!

@jtpalmer
Copy link
Contributor Author

jtpalmer commented Aug 4, 2017

Documentation has been added to the FAQ.

Copy link
Contributor

@smgallo smgallo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For XSEDE, we assume that the end time is the correct value and if start_time > end_time or start_time <=0 we calculate it based on the end _time and wall duration. Should we do something similar here? We still skip jobs with 0 end time.

@jtpalmer
Copy link
Contributor Author

jtpalmer commented Aug 4, 2017

@smgallo Open XDMoD will calculate a missing start time if both the end time and wall time are present. This PR documents that in the FAQ. I can move that note somewhere else if that would be more appropriate.

Edit: We could also move that logic out of the shredder and into an ingestor if that would be preferable.

@smgallo
Copy link
Contributor

smgallo commented Aug 4, 2017

Lets leave the logic where it is for now. As we move from the old ETL to ETLv2 we can see if it makes sense to move it elsewhere.

@jtpalmer jtpalmer merged commit 7826116 into ubccr:xdmod7.0 Aug 4, 2017
@jtpalmer jtpalmer deleted the fix-ingestor-zero-start-end-times branch August 4, 2017 14:35
@tyearke tyearke added this to the v7.0.0 milestone Aug 14, 2017
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
bug Bugfixes
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants