Resolve #1842 by setting OMP_NUM_THREADS=2 in toolkit showcase #1846

j-wags · 2024-03-25T23:58:51Z

Resolves The toolkit showcase stochastically fails with OMP_NUM_THREADS=1 and AmberToolsToolkitWrapper #1842 (doesn't solve the issue that certain combinations of molecule + n_cores + rdkit version + ambertools version leads to something going wrong in sqm, but at least gets examples CI running for another few months until we can replace this whole stack with NAGL)
Add tests
Update docstrings/documentation, if applicable
Lint codebase
Update changelog

codecov · 2024-03-26T00:04:11Z

Codecov Report

Merging #1846 (629a65d) into main (7c22c1e) will decrease coverage by 0.02%.
Report is 5 commits behind head on main.
The diff coverage is n/a.

Additional details and impacted files

j-wags · 2024-03-26T00:45:07Z

Oh goodness, the rabbit hole goes deeper.

j-wags · 2024-03-29T16:16:23Z

My current level of confusion.

review-notebook-app · 2024-03-29T17:02:51Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

for more information, see https://pre-commit.ci

…ith OMP_NUM_THREADS=1

j-wags · 2024-03-29T21:46:04Z

The original error was "charge assignment fails in sqm in linux/rdkit CI runs with the original atom ordering of the ligand and OMP_NUM_THREADS=1". @Yoshanuikabundi also told me that he was also able to reproduce this locally on his linux desktop.

I've tried reproducing this error in CI in this PR, locally on my M1 mac, and in a linux/amd64 docker container on the same mac. The only sqm failures I've seen are in macos/rdkit CI builds on this PR:

FAILED: reorder the ligand atoms
PASSED: running identical CI again
FAILED: the same as above, but also forced OMP_NUM_THREADS=1
FAILED: running identical CI again
PASSED: resetting the ligand atom order and keeping OMP_NUM_THREADS=1

(note that the failures in the openff-docs builds were linux/rdkit, NOT the macos/rdkit ones that I'm getting here)

I think this means the error is stochastic and may not actually be based on atom ordering or OS.

I'm toying with the idea of having the AmberTools charge assignment pathway generate two conformers and try the second one if the first fails. On one hand, the behavior change would seem acceptable here, since it changes a random error into a valid outcome. On the other, it takes a relatively long time to fail on one conformer (~5-10 mins) and if there were a legitimate problem with the input then this change would double the the amount of time before the user got an error message.

So, I'm unsure here. I'll talk with @Yoshanuikabundi on Monday to determine if there's some way to more consistently reproduce the error so we can have a better grasp on options moving forward.

mattwthompson · 2024-04-02T16:05:59Z

If the number of threads/cores/solar flares is important, it would be useful to work that into the error message. The current failures are heavily obfuscated by the ValueError that eventually gets to the user

canonically order showcase ligand

432f658

kick ci because I don't believe this

860b13e

Restrict CI matrix and update toolkit showcase to reproduce error

0c63e09

pre-commit-ci bot and others added 2 commits March 29, 2024 17:03

[pre-commit.ci] auto fixes from pre-commit.com hooks

bf63d7c

for more information, see https://pre-commit.ci

revert ligand to original ordering to see if we can reproduce error w…

0e6c359

…ith OMP_NUM_THREADS=1

j-wags changed the title ~~Resolve #1842 by canonically ordering showcase ligand~~ [DNM] Resolve #1842 by canonically ordering showcase ligand Mar 29, 2024

set OMP_NUM_THREADS=2 in showcase

1b4bd8e

j-wags mentioned this pull request Apr 5, 2024

The toolkit showcase stochastically fails with OMP_NUM_THREADS=1 and AmberToolsToolkitWrapper #1842

Closed

revert changes to CI matrix

ee73763

j-wags changed the title ~~[DNM] Resolve #1842 by canonically ordering showcase ligand~~ Resolve #1842 by setting OMP_NUM_THREADS=2 in toolkit showcase Apr 9, 2024

update releasehistory

629a65d

j-wags merged commit 13bebaf into main Apr 9, 2024
18 checks passed

j-wags deleted the bandaid-1842 branch April 9, 2024 19:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Resolve #1842 by setting OMP_NUM_THREADS=2 in toolkit showcase #1846

Resolve #1842 by setting OMP_NUM_THREADS=2 in toolkit showcase #1846

j-wags commented Mar 25, 2024 •

edited

Loading

codecov bot commented Mar 26, 2024 •

edited

Loading

j-wags commented Mar 26, 2024

j-wags commented Mar 29, 2024

review-notebook-app bot commented Mar 29, 2024

j-wags commented Mar 29, 2024

mattwthompson commented Apr 2, 2024

Resolve #1842 by setting OMP_NUM_THREADS=2 in toolkit showcase #1846

Resolve #1842 by setting OMP_NUM_THREADS=2 in toolkit showcase #1846

Conversation

j-wags commented Mar 25, 2024 • edited Loading

codecov bot commented Mar 26, 2024 • edited Loading

Codecov Report

j-wags commented Mar 26, 2024

j-wags commented Mar 29, 2024

review-notebook-app bot commented Mar 29, 2024

j-wags commented Mar 29, 2024

mattwthompson commented Apr 2, 2024

j-wags commented Mar 25, 2024 •

edited

Loading

codecov bot commented Mar 26, 2024 •

edited

Loading