Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Training data manifest or preprocessing scripts #2

Open
JacksonH44 opened this issue Jun 3, 2024 · 0 comments
Open

Training data manifest or preprocessing scripts #2

JacksonH44 opened this issue Jun 3, 2024 · 0 comments

Comments

@JacksonH44
Copy link

Hi, I'm interested in your work, and I'm wondering if you have information or resources related to the specific data preprocessing steps/scripts that you used, or the specific dataset you used in the experiment. I tried to re-create the data pipeline outlined in your paper, but I'm using a dataset from the paper "Widespread Expansion of Protein Interaction Capabilities by Alternative Splicing", that I found in the supplemental information, Table S2, sheet 2B. Is this not the data your group used, because after following the data processing methods you outlined in your paper, I only got 475 triplet pairs, instead of the 973 mentioned in your paper.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant