Fix "nprocesses>1" code generation (example: uutt within pptt gives "const int denominators = 36,36;") #343

valassi · 2022-01-24T18:25:53Z

Hi @oliviermattelaer, as discussed at today's meeting, I am assigning this to you.

This is a follow-up to #337 and #272. The problem: code generation for pptt now succeeds (after fixing the assert in #337), but the code does not build in the uutt subdirectory, because nprocesses=2 there (#272).

I checked that the standalone uutt and the uutt within pptt differ, for instance, in the following way:

/data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp> diff -r pp_tt.auto/ uu_tt.auto/
...
diff -r pp_tt.auto/SubProcesses/P1_Sigma_sm_uux_ttx/CPPProcess.cc uu_tt.auto/SubProcesses/P1_Sigma_sm_uux_ttx/CPPProcess.cc
...
502c497
<     const int denominators = 36,36; // FIXME: assume process.nprocesses == 1 for the moment (eventually denominators[nprocesses]?)
---
>     const int denominators = 36; // FIXME: assume process.nprocesses == 1 for the moment (eventually denominators[nprocesses]?)

From our discussions, I understand that this is because "uutt" within pptt is not only "uu" proper, but also includes some other combinations of quarks (as you mentioned today, it is probably that u+ubar to t+tbar and ubar+u to t+tbar are not exactly the same?). In any case, the ggtt within pptt and the standalone ggtt are essentially the same code, so it is really the "uutt" part that differs.

As discussed today, this probably comes from the base python that you developed, rather than from my plugin modifying your python, so it is best if you take a look. Thanks! Andrea

PS: I am instead merging PR #340, which contains ONLY the fix for the assert in #337.

…rocesses This completes the fix for the assert issue madgraph5#337 (pptt generation was failing). The second issue in pptt (generation succeeds but build fails with nprocesses=2) is moved to madgraph5#343

oliviermattelaer · 2022-01-27T12:41:59Z

I confirm that nprocesses=2 here is due to the symmetry between the u u~ and u~ u processes, as is quite clear from the code below:

        calculate_wavefunctions(perm, helicities[ihel]);
        t[0] = matrix_1_uux_ttx();
        // Mirror initial state momenta for mirror process
        perm[0] = 1;
        perm[1] = 0;
        // Calculate wavefunctions
        calculate_wavefunctions(perm, helicities[ihel]);
        // Mirror back
        perm[0] = 0;
        perm[1] = 1;
        // Calculate matrix elements
        t[1] = matrix_1_uux_ttx();

I will prevent this kind of matrix-element duplication for the gpu plugin (I do not see the point of it, to be honest).
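The mirroring pattern in the snippet above can be illustrated with a toy example. This is only a minimal sketch, not the generated MadGraph code: `toyMatrixElement`, `evaluateWithMirror` and the numeric inputs are hypothetical names and values chosen for illustration. The only part taken from the snippet is the idea of swapping the two initial-state entries of `perm`, evaluating a second time, and swapping back.

```cpp
#include <array>
#include <cassert>
#include <utility>

// Hypothetical stand-in for a matrix element. It is deliberately asymmetric in the
// two initial-state particles, so that the mirror evaluation gives a different value.
double toyMatrixElement( const std::array<double, 4>& pz, const std::array<int, 4>& perm )
{
  return 2.0 * pz[perm[0]] + pz[perm[1]];
}

// Evaluate the process and its mirror: t[0] uses the original ordering,
// t[1] uses the ordering with the two initial-state particles exchanged.
std::array<double, 2> evaluateWithMirror( const std::array<double, 4>& pz )
{
  std::array<int, 4> perm = { 0, 1, 2, 3 };
  std::array<double, 2> t{};
  t[0] = toyMatrixElement( pz, perm );
  std::swap( perm[0], perm[1] ); // mirror initial state momenta
  t[1] = toyMatrixElement( pz, perm );
  std::swap( perm[0], perm[1] ); // mirror back
  assert( perm[0] == 0 && perm[1] == 1 );
  return t;
}
```

In this sketch each call yields two values, one per ordering, which mirrors how the snippet above fills both `t[0]` and `t[1]` when nprocesses=2.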

oliviermattelaer · 2022-01-27T14:12:28Z

I have forbidden the increase of nprocesses due to the symmetry factor (we will see later whether other sources of multi-processes occur).

valassi · 2022-01-27T15:52:06Z

Thanks Olivier! I created #360 and assigned it to myself about moving to a more recent launchpad version

valassi · 2022-03-03T06:51:27Z

Just a note, I observed this again while looking at clang-format in #388, but I will ignore it there

valassi · 2022-03-07T18:12:20Z

Reopening. Olivier has fixed this in upstream bazaar, but I need to fix this in the cudacpp plugin.

I have tested that the bazaar "gpu" backend has changed between 270 and 311:

diff -r pp_tt.gpu270 pp_tt.gpu311
...
diff -r pp_tt.gpu270/SubProcesses/P1_Sigma_sm_uux_ttx/CPPProcess.cc pp_tt.gpu311/SubProcesses/P1_Sigma_sm_uux_ttx/CPPProcess.cc
3c3
< // MadGraph5_aMC@NLO v. 2.9.5, 2021-08-22
---
> // MadGraph5_aMC@NLO v. 3.3.1_lo_vect, 2022-01-30
297,298c297,298
<     const int nprocesses = 2;  // FIXME: assume process.nprocesses == 1
<     const int denominators[2] = {36, 36}; 
---
>     const int nprocesses = 1;  // FIXME: assume process.nprocesses == 1
>     const int denominators[1] = {36}; 
303a304
> 
345a347
>

Note in particular that my plugin produces bad code with both 270 and 311:

const int denominators = 36,36;

The fix is simple: the denominators should become an array sized by the number of subprocesses (which is 1 in the new version, but it still remains an array denominators[1], not a scalar denominators).
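For reference, `const int denominators = 36,36;` is rejected by a C++ compiler because the comma in a declaration separates declarators, and `36` is not a valid declarator. A minimal sketch of the array form is given below. It is not the actual plugin template: the helper `averageOverDenominator` and the `static_assert` are hypothetical additions for illustration, and only the names `nprocesses` and `denominators` and the value 36 come from the diff above.

```cpp
#include <cassert>

// Number of subprocesses (1 once mirror processes are disabled).
constexpr int nprocesses = 1;

// One denominator per subprocess: always an array, even when nprocesses == 1.
constexpr int denominators[nprocesses] = { 36 };

// Guard against a mismatch between the array length and nprocesses (illustrative check).
static_assert( sizeof( denominators ) / sizeof( denominators[0] ) == nprocesses,
               "denominators must have one entry per subprocess" );

// Hypothetical helper: divide a summed matrix element by the denominator of subprocess iproc.
double averageOverDenominator( double sumME, int iproc )
{
  assert( iproc >= 0 && iproc < nprocesses );
  return sumME / denominators[iproc];
}
```

With this layout, a generator that emits a comma-separated list of values fills the braces of the initializer, and the array size has to be emitted consistently with the number of values.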

….1.1_lo_vectorization/madgraph/iolibs/export_cpp.py (check tkdiff 2.7.0_gpu/madgraph/iolibs/export_cpp.py 3.1.1_lo_vectorization/madgraph/iolibs)

valassi · 2022-03-08T15:28:22Z

This has now been addressed in PR #396.

As mentioned in the PR, however, the code still assumes nprocesses==1. We should find another example with nprocesses>1 (#272), even if "mirror processes" are disabled.

valassi · 2022-03-08T17:40:05Z

This is fixed in #396, which I am about to merge. Closing again.

…1_gux_ttxux to P1_gu_ttxu

The gqttq tests fail anyway and will need to be fixed (madgraph5#630).
However, this completes the addition of gq_ttq as a new process to the repo.
In particular it includes proof that Olivier's "split_nonidentical_grouping" madgraph5#619 fixes the gqttq builds.
It also includes a lot of cleanup for "nprocesses" (madgraph5#272 and madgraph5#343)

Revert "[gqttq] retry the tmad gqttq test with the P1_gu_ttxu directory - the test continues to fail (madgraph5#630)"
This reverts commit 2dea1f7.

Revert "[gqttq] temporarely use P1_gu_ttxu instead of P1_gux_ttxux for gqttq tmad tests"
This reverts commit ea23a9a.

…esses as in 3.1.1_lo_vectorization/madgraph/iolibs/export_cpp.py (check tkdiff 2.7.0_gpu/madgraph/iolibs/export_cpp.py 3.1.1_lo_vectorization/madgraph/iolibs)

…dgraph5/madgraph4gpu#272 and madgraph5/madgraph4gpu#343 (see also PRs madgraph5/madgraph4gpu#619, madgraph5/madgraph4gpu#626, madgraph5/madgraph4gpu#360 and madgraph5/madgraph4gpu#396)

…matting, and especially the build will fail.

Codebase includes merging commit a6731bd (Olivier Wed Aug 23 13:23:12 2023 +0200)
This uses Olivier's 'fix_mirror' branch for PR madgraph5#754

In particular
a6731bd Olivier Mattelaer Wed Aug 23 13:23:12 2023 +0200 Merge branch 'fix_mirror'
2556cdd Olivier Mattelaer Wed Aug 23 09:27:38 2023 +0200 avoid that mirroring is reset by the plugin

These lines fail the build (as well as clang formatting)

[NOT OK] Check formatting in: pp_tt012j.mad/SubProcesses/P0_uux_ttx/CPPProcess.cc
786c786
< constexpr int helcolDenominators[1] = { 36,36 }; // assume nprocesses == 1 (madgraph5#272 and madgraph5#343)
---
> constexpr int helcolDenominators[1] = { 36, 36 }; // assume nprocesses == 1 (madgraph5#272 and madgraph5#343)

The same happens in each P subdirectory.

Build errors:
ccache /usr/local/cuda-12.0/bin/nvcc -Xcompiler -O3 -lineinfo -I. -I../../src -I/usr/local/cuda-12.0/include/ -DUSE_NVTX -gencode arch=compute_70,code=compute_70 -gencode arch=compute_70,code=sm_70 -use_fast_math -std=c++17 -ccbin /usr/lib64/ccache/g++ -DMGONGPU_FPTYPE_DOUBLE -DMGONGPU_FPTYPE2_DOUBLE -Xcompiler -fPIC -c gCPPProcess.cu -o gCPPProcess.o
gCPPProcess.cu(779): error: static assertion failed with "Assume nprocesses == 1"
gCPPProcess.cu(786): error: too many initializer values
2 errors detected in the compilation of "gCPPProcess.cu".

valassi assigned oliviermattelaer Jan 24, 2022

This was referenced Jan 24, 2022

wrong assert in cudacpp plugin (and test code generation for pptt) #337

Closed

Fix assert in pptt code generation #340

Merged

uudd generation: cIPC[0] should be excluded? #349

Closed

oliviermattelaer closed this as completed Jan 27, 2022

valassi mentioned this issue Jan 27, 2022

Port epochX CODEGEN from 270gpu to the 311lovec branch (and pick up the new features there!) #360

Closed

valassi changed the title ~~Fix "nprocesses>1" code generation (example: uutt within pptt)~~ Fix "nprocesses>1" code generation (example: uutt within pptt gives "const int denominators = 36,36;") Mar 3, 2022

valassi mentioned this issue Mar 3, 2022

clang-format (and related code changes) #388

Merged

valassi self-assigned this Mar 7, 2022

valassi reopened this Mar 7, 2022

valassi added a commit to valassi/madgraph4gpu that referenced this issue Mar 8, 2022

[pptt] attempt fix for madgraph5#343 in ggtt manual: replace scalar '…

88b8fa9

…denominators' by vector 'denominators[1]'

valassi added a commit to valassi/madgraph4gpu that referenced this issue Mar 8, 2022

[pptt] fix madgraph5#343: disable mirror processes *** NB HOWEVER STI…

7f434e4

…LL ASSUME NPROCESSES == 1 *** Fix codegen templates, regenerate ggtt auto, fix also ggtt manual

valassi mentioned this issue Mar 8, 2022

Disable mirror processes (nprocesses=2 in pptt) + Add process name to library names #396

Merged

valassi mentioned this issue Mar 8, 2022

Add an example of a calculation with nprocesses>1 #272

Closed

valassi closed this as completed Mar 8, 2022

valassi added a commit to valassi/madgraph4gpu that referenced this issue Apr 5, 2023

[gqttq] in CODEGEN, remove most (not all) of the comments about "assu…

f179c5c

…me process.nprocesses == 1" (madgraph5#272 and madgraph5#343)

valassi added a commit to valassi/madgraph4gpu that referenced this issue Apr 5, 2023

[gqttq] regenerate gqttq sa and mad after cleaning up "nprocesses==1"…

bcfc44c

… comments madgraph5#272 and madgraph5#343

valassi mentioned this issue Apr 6, 2023

Add SM gq to ttq (and gq to ttllq) - example of a process with DSIG1 and DSIG2 #626

Merged

valassi added a commit to valassi/madgraph4gpu that referenced this issue Apr 7, 2023

[gqttq] in CODEGEN, improve the comment on nprocesses>2 for issues ma…

0033615

…dgraph5#272 and madgraph5#343 (see also PRs madgraph5#619, madgraph5#626, madgraph5#360 and madgraph5#396)

valassi added a commit to mg5amcnlo/mg5amcnlo_cudacpp that referenced this issue Aug 16, 2023

[pptt] fix madgraph5/madgraph4gpu#343: disable mirror processes *** N…

e095afa

…B HOWEVER STILL ASSUME NPROCESSES == 1 *** Fix codegen templates, regenerate ggtt auto, fix also ggtt manual

valassi added a commit to mg5amcnlo/mg5amcnlo_cudacpp that referenced this issue Aug 16, 2023

[gqttq] in CODEGEN, remove most (not all) of the comments about "assu…

de11f92

…me process.nprocesses == 1" (madgraph5/madgraph4gpu#272 and madgraph5/madgraph4gpu#343)

valassi mentioned this issue Jul 24, 2024

WIP add pp_tt to repo (plus obsolete fixes for bug 872 via reset_cumulative_variable, now fixed by Olivier via a single helicity filter) #935

Draft

valassi mentioned this issue Aug 5, 2024

Add support for nprocesses>2 (i.e. beyond mirror processes) in cudacpp to speed up directory handling? #951

Open
