Run PR gpu utests/relvals on both CUDA and ROCm GPUs #2418

Merged
merged 1 commit on Mar 12, 2025


@iarspider
Contributor

iarspider commented Jan 22, 2025


Additional changes:

@cmsbuild
Contributor

cmsbuild commented Jan 22, 2025

cms-bot internal usage

@iarspider
Contributor Author

please test with cms-sw/cmssw#46579

to check that cpu tests are not broken

@cmsbuild
Contributor

-1

Failed Tests: ClangBuild
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-b157ff/43916/summary.html
COMMIT: 118dd7e
CMSSW: CMSSW_15_0_X_2025-01-22-1100/el8_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cms-bot/2418/43916/install.sh to create a dev area with all the needed externals and cmssw changes.

CMS deprecated warnings: 1 CMS deprecated warnings found, see summary page for details.

Clang Build

I found compilation warning while trying to compile with clang. Command used:

USER_CUDA_FLAGS='--expt-relaxed-constexpr' USER_CXXFLAGS='-Wno-register -fsyntax-only' /usr/bin/time -v scram build -k -j 32 COMPILER='llvm compile'

See details on the summary page.

@iarspider
Contributor Author

please test with cms-sw/cmssw#47163

@cmsbuild
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-b157ff/43917/summary.html
COMMIT: 118dd7e
CMSSW: CMSSW_15_0_X_2025-01-22-1100/el8_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cms-bot/2418/43917/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • You potentially added 1 lines to the logs
  • Reco comparison results: 1664 differences found in the comparisons
  • DQMHistoTests: Total files compared: 49
  • DQMHistoTests: Total histograms compared: 3819085
  • DQMHistoTests: Total failures: 149
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3818916
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 48 files compared)
  • Checked 214 log files, 184 edm output root files, 49 DQM output files
  • TriggerResults: found differences in 1 / 47 workflows

@iarspider
Contributor Author

iarspider commented Jan 23, 2025

test parameters:

  • addpkg = HeterogeneousTest, HeterogeneousCore
  • enable = gpu
  • workflows_gpu = 141.044406,141.044408,141.044412,141.044414,141.044422,141.044424,160.03502
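The parameter block above uses a simple `key = value` bullet layout. A minimal sketch of reading such a comment into a dictionary, assuming that layout and assuming comma-separated values form a list; `parse_test_parameters` is a hypothetical helper for illustration, not cms-bot's actual parser:

```python
def parse_test_parameters(comment: str) -> dict:
    """Parse 'key = value' bullet lines into {key: [values]} (illustrative only)."""
    params = {}
    for raw in comment.splitlines():
        # Drop indentation and a leading bullet character, if any.
        line = raw.strip().lstrip("•-*").strip()
        if "=" not in line:
            continue  # skips the 'test parameters:' header and blank lines
        key, _, value = line.partition("=")
        params[key.strip()] = [v.strip() for v in value.split(",") if v.strip()]
    return params


comment = """test parameters:

  • addpkg = HeterogeneousTest, HeterogeneousCore
  • enable = gpu
  • workflows_gpu = 141.044406,141.044408,141.044412,141.044414,141.044422,141.044424,160.03502
"""
params = parse_test_parameters(comment)
print(params["addpkg"])
print(len(params["workflows_gpu"]), "GPU workflows requested")
```

Workflow numbers are kept as strings here so that trailing digits are not altered by float conversion.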

@iarspider
Contributor Author

please test

@cmsbuild
Contributor

Pull request #2418 was updated.

@iarspider
Contributor Author

please test

@iarspider
Contributor Author

please abort

@iarspider
Contributor Author

please test

@cmsbuild
Contributor

cmsbuild commented Mar 3, 2025

Pull request #2418 was updated.

@cmsbuild
Contributor

cmsbuild commented Mar 3, 2025

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-b157ff/44767/summary.html
COMMIT: 1d0c56c
CMSSW: CMSSW_15_1_X_2025-03-03-1100/el8_amd64_gcc12
Additional Tests: CUDA
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cms-bot/2418/44767/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • You potentially removed 2 lines from the logs
  • Reco comparison results: 14 differences found in the comparisons
  • DQMHistoTests: Total files compared: 49
  • DQMHistoTests: Total histograms compared: 3920300
  • DQMHistoTests: Total failures: 98
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3920182
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 48 files compared)
  • Checked 214 log files, 184 edm output root files, 49 DQM output files
  • TriggerResults: found differences in 1 / 47 workflows

CUDA Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 24 differences found in the comparisons
  • DQMHistoTests: Total files compared: 10
  • DQMHistoTests: Total histograms compared: 57865
  • DQMHistoTests: Total failures: 1239
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 56626
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 9 files compared)
  • Checked 50 log files, 48 edm output root files, 10 DQM output files
  • TriggerResults: no differences found

@iarspider
Contributor Author

All links are fixed now. Reminder: 1d0c56c should be reverted before merging.

@iarspider
Contributor Author

please test

@cmsbuild
Contributor

Pull request #2418 was updated.

@cmsbuild
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-b157ff/44884/summary.html
COMMIT: e217f79
CMSSW: CMSSW_15_1_X_2025-03-09-2300/el8_amd64_gcc12
Additional Tests: CUDA
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cms-bot/2418/44884/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 6 differences found in the comparisons
  • DQMHistoTests: Total files compared: 49
  • DQMHistoTests: Total histograms compared: 3920300
  • DQMHistoTests: Total failures: 24
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3920256
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 48 files compared)
  • Checked 214 log files, 184 edm output root files, 49 DQM output files
  • TriggerResults: no differences found

CUDA Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 48 differences found in the comparisons
  • DQMHistoTests: Total files compared: 10
  • DQMHistoTests: Total histograms compared: 57865
  • DQMHistoTests: Total failures: 1928
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 55937
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 9 files compared)
  • Checked 50 log files, 48 edm output root files, 10 DQM output files
  • TriggerResults: no differences found

@smuzaffar
Contributor

please test

Let's run the tests with a few packages (e.g. HeterogeneousCore, HeterogeneousTest) checked out.

@iarspider
Contributor Author

iarspider commented Mar 11, 2025

@smuzaffar I have started the test. If you plan to run more tests, rebuild this job, since automatic tests won't work (the PR is ignored by the bot).

@cmsbuild
Contributor

-1

Failed Tests: cudaUnitTests
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-b157ff/44917/summary.html
COMMIT: e217f79
CMSSW: CMSSW_15_1_X_2025-03-11-1100/el8_amd64_gcc12
Additional Tests: CUDA
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cms-bot/2418/44917/install.sh to create a dev area with all the needed externals and cmssw changes.

CUDA Unit Tests

I found 1 errors in the following unit tests:

---> test cudaTimeMeasurement had ERRORS

Comparison Summary

Summary:

  • You potentially added 3 lines to the logs
  • Reco comparison results: 4 differences found in the comparisons
  • DQMHistoTests: Total files compared: 49
  • DQMHistoTests: Total histograms compared: 3920300
  • DQMHistoTests: Total failures: 3
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3920277
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 48 files compared)
  • Checked 214 log files, 184 edm output root files, 49 DQM output files
  • TriggerResults: no differences found

CUDA Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 47 differences found in the comparisons
  • DQMHistoTests: Total files compared: 10
  • DQMHistoTests: Total histograms compared: 57865
  • DQMHistoTests: Total failures: 3400
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 54465
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 9 files compared)
  • Checked 50 log files, 48 edm output root files, 10 DQM output files
  • TriggerResults: no differences found
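To put the three CUDA comparison summaries in this thread side by side, the DQM failure fraction can be computed directly from the reported counts. A minimal sketch, using only the numbers posted by cmsbuild (Jenkins runs 44767, 44884 and 44917); the variable and function names are illustrative:

```python
# (failures, successes, total) from the "CUDA Comparison Summary" blocks,
# keyed by the Jenkins run number that appears in each summary URL.
cuda_dqm = {
    44767: (1239, 56626, 57865),
    44884: (1928, 55937, 57865),
    44917: (3400, 54465, 57865),
}


def failure_fraction(failures, successes, total):
    # With zero skipped and zero null histograms, the counts must add up.
    assert failures + successes == total
    return failures / total


fractions = {run: failure_fraction(*counts) for run, counts in cuda_dqm.items()}
for run, frac in fractions.items():
    print(f"run {run}: {frac:.2%} of compared histograms failed the comparison")
```

This only restates the posted counts as fractions; it says nothing about which differences are significant, which still requires the linked summary pages.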

@iarspider
Contributor Author

@smuzaffar ping

@iarspider iarspider merged commit fcf6a5c into master Mar 12, 2025
7 checks passed
@iarspider iarspider deleted the pr-rocm-tests branch March 12, 2025 10:53