Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

another problem: Error: protect(): protection stack overflow #3

Open
crazyhottommy opened this issue Mar 24, 2017 · 12 comments
Open

another problem: Error: protect(): protection stack overflow #3

crazyhottommy opened this issue Mar 24, 2017 · 12 comments

Comments

@crazyhottommy
Copy link

Hi,

another problem with phantompeakqual or with R

Decompressing ChIP file
Decompressing control file
Loading required package: caTools
Reading ChIP tagAlign/BAM file Hs_940_temp//M940_Crep2.tagAlign.gz 
opened Hs_940_temp/M940_Crep2_spp_tmp//M940_Crep2.tagAlign144c7b15c3b9
done. read 42273553 fragments
ChIP data read length 37 
[1] TRUE
Reading Control tagAlign/BAM file Hs_940_.temp/control.tagAlign.gz 
opened Hs_940_.temp/M940_Crep2_spp_tmp//control.tagAlign144c62adab27
done. read 2922304 fragments
Error: protect(): protection stack overflow
Execution halted
Error: protect(): protection stack overflow
Execution halted

I googled and found http://stackoverflow.com/questions/28728774/how-to-set-max-ppsize-in-r

after adding one line

options("experssion" = 500000)

on top of the run_spp.R script, same error.
Thanks for looking into this.
Tommy

@crazyhottommy
Copy link
Author

any idea on this error?

@akundaje
Copy link
Contributor

akundaje commented Mar 28, 2017 via email

@crazyhottommy
Copy link
Author

I actually wrote a snakemake version of the pipeline to accommodate our own usage. I know the bolts and nuts of my own pipeline so I can easily extend it.

I will continue to debug...thanks though.

Tommy

@leepc12
Copy link
Collaborator

leepc12 commented Mar 28, 2017 via email

@crazyhottommy
Copy link
Author

@leepc12 I added options("experssion" = 500000) to the run_spp.R script, but still same error. puzzling.

@andreamariossi
Copy link

Hi,
having the same problem. I tried the suggested solutions Rscript --max-ppsize=500000 [RFILE] or adding options("exppression" = 500000) . I tried both on Xubuntu 16.04 LTS and Red Hat 4.4.7-3.
thanks,
Andrea

@seenstevo
Copy link

@leepc12
Hi,
I have run into the same problem recently while working with DNase-seq on a new species. I know the program is still working as old tagAlign files are processed fine. My working theory is that it may have something to do with the number of scaffolds/chromosomes (col 1). The reason is that this new species currently has a basic genome assembly so there are many very small scaffolds (something like >200000). I tested two files derived from one file that fails with "Error: protect(): protection stack overflow", one was produced from head -100000 and one from tail -100000. The head file worked fine while the tail one failed with the same error (Error: protect(): protection stack overflow). Given that they are the same size etc and the scaffolds are ordered by size (largest first), the head derived file only has one or two different scaffold/chr while the tail derived file has a few rows per scaffold/chr so has thousands of different ones so this is what I thought might be the culprit.

@andreakcl89 , @crazyhottommy Does this fit with your issue? Has anyone found a fix?
Thanks, Sean

@oshomroni
Copy link

I have the exact same issue using ATAC_seq workflow provided by the Kundaje lab. I ran it for Xenopus laevis version 9.2, which contains 18 chromosomes and 108015 scaffolds. In terms of proportions, non-mitochondrial chromosomes cover 92% of the genome, a very small percentage is mitochondrial DNA, and 7.7% of the genome is scaffolds. If, as @seenstevo mentioned, the issue is the massive number of scaffolds that the analysis has to run through, maybe it is not too bad to remove 7.7% of the genome to get the analysis done

@akundaje
Copy link
Contributor

akundaje commented Dec 6, 2017 via email

@crazyhottommy
Copy link
Author

@seenstevo no, I am processing the same human data. somehow in a different computing cluster, the error disappeared.

@oshomroni
Copy link

I added "Rscript --max-ppsize=500000" inside the postalign_xcor.bds script before ${RUN_SPP}, and it worked. Guess you need a good server with tons of memory for this to work.

@leepc12
Copy link
Collaborator

leepc12 commented Dec 7, 2017 via email

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants