RNA-seq quality report sequence duplication level #415

Open
rezarahman12 opened this issue Jul 4, 2024 · 1 comment

Comments

@rezarahman12

Hi rMATS team,

I would like to run rMATS on our recent RNA-seq data. During a QC check of the FASTQ files with FastQC, I see a high level of sequence duplication (please see the attached figure). Do I need to remove the duplicates before running rMATS?

I appreciate your time.

Kind regards
Reza

[Attached image: FastQC summary showing the status of each section: entirely normal (green), slightly abnormal (orange), or very unusual (red).]

@EricKutschera
Contributor

The FastQC plot shows a high level of duplication, but it doesn't identify the duplicated sequences. Ideally, you would look at your files to see what those sequences are. The duplicates could reflect genuinely high abundance of certain sequences in your samples, or they could be due to a technical issue.
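
One way to look at the sequences is to count the most frequent reads directly from a FASTQ file. The following is only a minimal sketch, not part of rMATS or FastQC: the function name `top_duplicated_sequences` and its parameters are made up for illustration, it assumes standard four-line FASTQ records (no wrapped sequence lines), and it samples only the first `max_reads` records to keep memory use modest.

```python
import gzip
from collections import Counter
from itertools import islice


def top_duplicated_sequences(fastq_path, max_reads=200_000, top_n=10):
    """Return the most frequent read sequences among the first `max_reads` records.

    Hypothetical helper for illustration; assumes four-line FASTQ records.
    """
    # Handle both plain and gzip-compressed FASTQ files.
    opener = gzip.open if str(fastq_path).endswith(".gz") else open
    counts = Counter()
    with opener(fastq_path, "rt") as handle:
        # The sequence is the second line of every four-line record.
        sequence_lines = islice(handle, 1, max_reads * 4, 4)
        counts.update(line.strip() for line in sequence_lines)
    total = sum(counts.values())
    # Each entry is (sequence, count, fraction of sampled reads).
    top = [(seq, n, n / total) for seq, n in counts.most_common(top_n)]
    return top, total
```

For example, calling `top_duplicated_sequences("sample_R1.fastq.gz")` (a placeholder file name) returns the ten most common sequences with their counts and fractions. Comparing those sequences against known adapters, rRNA, or highly expressed transcripts may help distinguish a technical issue from genuine high abundance.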

rMATS should not give any errors related to duplicate sequences. You could try running rMATS, and if you get any significant output, look at those regions in the BAM files to see whether anything looks unusual.
