
Error on very large fastq file #12

Closed
essere opened this issue Apr 29, 2020 · 5 comments

Comments

@essere

essere commented Apr 29, 2020

Hello,
I am analyzing very large fastq files (~240 Gb for each file of the pair).

I have this error when running fastq-pair.

"We cannot allocate the memory for a table size of -436581356. Please try a smaller value for -t"

Could you suggest a solution for this?

Thanks
Sam

@linsalrob
Owner

What is the command you are using?

If you are setting -t on the command line, you might be setting too large a number! The value is stored as a signed int, so if you set it too high it can overflow and be misinterpreted (which is why the table size shows up as a negative number).
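
As a rough, standalone illustration (not the actual fastq-pair source; the -t value here is made up), this is the kind of thing that happens when a table size above INT_MAX is forced into a signed 32-bit int, and it would match the negative number in the error message:

```c
#include <limits.h>
#include <stdio.h>
#include <stdlib.h>

int main(void) {
    /* Hypothetical -t value, larger than INT_MAX (2147483647). */
    const char *arg = "3000000000";
    long long requested = strtoll(arg, NULL, 10);

    /* Forcing the value into a signed 32-bit int is where it goes wrong:
     * on common platforms it wraps around and comes out negative
     * (strictly, the conversion is implementation-defined). */
    int table_size = (int) requested;

    printf("requested -t  : %lld\n", requested);
    printf("stored as int : %d\n", table_size);

    if (requested > INT_MAX) {
        fprintf(stderr, "value of -t does not fit in a signed int; "
                        "use a smaller table size\n");
        return EXIT_FAILURE;
    }
    return EXIT_SUCCESS;
}
```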

@essere
Author

essere commented Apr 29, 2020

Following your instructions, I got the number of lines with "wc -l".

wc -l Sample.R1.fastq
2548641872

Dividing that by 4 gives 637160468, so I tried this command:

fastq_pair -t 637160468 Sample.R1.fastq Sample.R2.fastq

Is it OK to use a number smaller than the actual number of reads?

@linsalrob
Owner

So that should have been OK (depending on the specific system you used), but yes, I suspect you have an integer overflow.

Yes, it is OK to use a smaller number than the actual number of reads. Try dividing the number by 4 (159290117). In this case you will end up with, on average, four sequences per bucket.
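
As a back-of-the-envelope sketch (plain C, not fastq-pair code) using the numbers from this thread: with 637160468 reads and -t 159290117 buckets, the average bucket holds about four reads, which is why a smaller table size still works, just with a little more searching per bucket.

```c
#include <stdio.h>

int main(void) {
    long long reads = 637160468LL;   /* 2548641872 lines / 4 */
    long long buckets = 159290117LL; /* the suggested -t value */

    /* Average reads per hash bucket; a smaller -t simply raises this number. */
    double avg_per_bucket = (double) reads / (double) buckets;
    printf("average reads per bucket: %.2f\n", avg_per_bucket); /* prints 4.00 */
    return 0;
}
```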

@essere
Author

essere commented Apr 29, 2020

Thank you. It worked!

This is the very program I have been looking for.
You are so great!!

@linsalrob
Owner

Excellent!
