Fix Segmentation Fault Under Certain Conditions #535

evolvedmicrobe · 2018-12-12T20:26:19Z

This PR was motivated by a SIGSEGV that STAR produced when aligning a particular read against the Rabbit genome. The problem is reproducible, and a file with the rabbit genome and one read that produces the error is available here.

After downloading, the following command will exit with a SIGSEGV:

STAR --genomeDir rabbit --outSAMmultNmax -1 --readFilesIn temp.fastq

I'd love to integrate a fix for this problem, and attempted one with this PR.

I diagnosed the issue as follows. While trying to map the read, the aligner routine ReadAlign::maxMappableLength2strands was looking for exact matches to the end of the read. The first step in this process is looking up a prefix of a given length in the suffix array index. This index contains minimum (and maximum) locations for prefixes up to size L_max (14 in this case). For this particular read segment, no prefix of size 14 was found, so the program searched the index again with a prefix of size 13, which it did find. Because a prefix of size 13 but not 14 was found, the program knew there was no match of size 14, and so it tried to set the end of the search range in the suffix array (indStartEnd[1]) using a short-cut that avoids the binary search. In particular, it set the end of the bounded range to the value of the next element in the index minus one.

However, it appears that not all elements prior to the start of the next prefix are guaranteed to match the current prefix. In this particular case, that next element pointed to a position 10 bases before the end of the genome, which meant it could not produce an alignment 13 bases in length. Later on, inside ReadAlign::stitchPieces and its call to sjSplitAlign, this led to the SIGSEGV: while calculating the alignment start position, the code inferred it to be 3 base pairs past the end of the genome, giving an offset of -3, which when converted to an unsigned integer has the value 18446744073709551613. Consequently, the value calculated for isj in the code line P->sjDstart[isj] referred to unmapped memory, leading to the SIGSEGV.
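To make the wrap-around concrete, here is a minimal sketch of how a signed offset of -3 becomes 18446744073709551613 once it is stored in an unsigned 64-bit integer. The function names are hypothetical and are not taken from STAR; the bounds check is only an illustration of the kind of guard that would catch such a value before it is used as an array index.

```cpp
#include <cstdint>

// Hypothetical helper (not STAR code): converts a signed offset to an
// unsigned 64-bit value, as happens implicitly when a negative result is
// assigned to an unsigned variable. The conversion is modulo 2^64.
std::uint64_t as_unsigned_offset(std::int64_t offset) {
    return static_cast<std::uint64_t>(offset);
}

// Hypothetical guard: true only when the index is safe to use for an
// array with array_size elements.
bool is_valid_index(std::uint64_t index, std::uint64_t array_size) {
    return index < array_size;
}
```

With these definitions, as_unsigned_offset(-3) equals 2^64 - 3, which is far outside any realistically sized array, so is_valid_index rejects it.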

I believe this can happen as the suffix array can be sorted as follows:

ACTGAAAA <- iSA1 / first prefix
ACTGNNNN <- iSA2
ACTTACTG <- next prefix

Since the short-cut that avoids the binary search did not seem reliable, I removed it with this commit. Since this also appeared to be a general problem, in that one cannot guarantee the initially guessed range holds matching prefixes up to the prefix length tested (but rather only up to the length tested minus 1), I also changed the binary search to account for this. Testing showed that, for this particular read, these changes restored the correct behavior in the inferred bounds (as determined by doing a full binary search).
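The difference between the short-cut and a checked search can be sketched on the three-entry example above. This is a toy illustration only: the suffix array is modeled as a sorted vector of strings rather than STAR's packed representation, the prefix "ACTGA" and all function names are assumptions made for the example, and it is not the code changed in this PR.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// True when `suffix` begins with `prefix`.
bool starts_with(const std::string& suffix, const std::string& prefix) {
    return suffix.compare(0, prefix.size(), prefix) == 0;
}

// Short-cut: assume every entry before the next prefix block matches the
// current prefix, so the range simply ends one entry before that block.
std::size_t shortcut_end(std::size_t next_block_start) {
    return next_block_start - 1;
}

// Checked alternative: binary search in [start, next_block_start) for the
// last entry that still starts with `prefix`. Assumes sa[start] matches and
// that, because sa is sorted, matching entries are contiguous from `start`.
std::size_t checked_end(const std::vector<std::string>& sa,
                        const std::string& prefix,
                        std::size_t start,
                        std::size_t next_block_start) {
    std::size_t lo = start;             // sa[lo] is known to match
    std::size_t hi = next_block_start;  // one past any possible match
    while (hi - lo > 1) {
        const std::size_t mid = lo + (hi - lo) / 2;
        if (starts_with(sa[mid], prefix)) {
            lo = mid;
        } else {
            hi = mid;
        }
    }
    return lo;
}
```

For the sorted entries ACTGAAAA, ACTGNNNN, ACTTACTG, with the next prefix block starting at index 2, the short-cut returns index 1 (ACTGNNNN), which does not start with ACTGA, while the checked search returns index 0.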


alexdobin · 2018-12-13T14:44:25Z

Hi Nigel,

thanks a lot for a very thorough investigation of this problem.
I am generally not happy with this part of the code, as it also produces seg-faults for really small genomes. As you pointed out, the real problem is in the way this index is generated in the indexing step: the locations very close to the genome ends should be excluded. I am reluctant to remove the part of the code that you suggested, as it may change its behavior unpredictably in other cases.
I will work on your test case to find a consistent solution.

Cheers
Alex

evolvedmicrobe · 2018-12-13T19:26:59Z

Hi Alex,

Thanks for taking a look and being willing to solve the problem. I thought you might have a better fix, and I look forward to it!

Thanks again,
Nigel

evolvedmicrobe · 2019-01-18T04:30:46Z

Hi Alex,

I hope you're doing well in the New Year! Just wanted to inquire quickly if you thought you might have come up with a better and more consistent solution.

All the best,
Nigel

alexdobin · 2019-01-24T21:23:38Z

Hi Nigel,

sorry for the delay, I was busy with the new release and could not look into this problem. I hope to get back to it within the next week or two.

Cheers
Alex

evolvedmicrobe · 2019-01-24T21:24:16Z

Wonderful, thank you Alex!

evolvedmicrobe · 2021-01-07T00:58:09Z

Hi @alexdobin,

Just wanted to ping on this issue in case you might have a chance to revisit it. We're continuing to see a lot of segfaults when people align things to the rabbit genome that we'd like to avoid. I would be happy to help out with anything needed if I can as well.

Cheers,
Nigel

alexdobin · 2021-01-09T20:54:50Z

Hi Nigel,

have you tried it with one of the latest versions?
The seg-fault issue for small (or masked) genomes was fixed in 2.7.4a. I would hope that this issue has been fixed as well?

Cheers
Alex

…seg-fault when mapping to the rabbit genome. Issue #1223: fixed the N_unmapped value reported in ReadsPerGene.out.tab. The single-end (i.e. partially mapped) alignments are not excluded from N_unmapped. dev_EoI_2.7.9a_2021-09-30

This was referenced Dec 12, 2018

Fix Segmentation Fault Under Certain Conditions 10XGenomics/STAR#1

Open

Segmentation fault (core dumped) in Masked genome #235

Closed

alexdobin added the complex enhancement label Aug 26, 2019

alexdobin added this to the 2.7.10 milestone Sep 28, 2021

alexdobin added the testing label Sep 28, 2021
