You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
My RNA-seq data were downloaded from the Internet, and the authors did not elaborate on the type of library built. for --libtype parameter, i try two ways.
one: python3 /public/home/xiaoxiong/miniconda3/envs/libgsl/rMATS/rmats.py --b1 b1.txt --b2 b2.txt --gtf /public/home/xiaoxiong/hTSC_STB96_RNA_seq/hTSC_STB96_rMATs_Ensembol/STB72_STB96/genes.gtf -t paired --libType fr-firststrand --readLength 100 --nthread 4 --od AS_libgsl_firststrand --tmp AS_libgsl_firststrand --variable-read-length
gtf: 153.78015661239624
There are 36601 distinct gene ID in the gtf file
There are 199138 distinct transcript ID in the gtf file
There are 14315 one-transcript genes in the gtf file
There are 1305354 exons in the gtf file
There are 5603 one-exon transcripts in the gtf file
There are 3566 one-transcript genes with only one exon in the transcript
Average number of transcripts per gene is 5.440780
Average number of exons per transcript is 6.555022
Average number of exons per transcript excluding one-exon tx is 6.715845
Average number of gene per geneGroup is 7.854318
statistic: 0.42818236351013184
read outcome totals across all BAMs
USED: 583925661
NOT_PAIRED: 5064400
NOT_NH_1: 270852260
NOT_EXPECTED_CIGAR: 6728411
NOT_EXPECTED_READ_LENGTH: 0
NOT_EXPECTED_STRAND: 0
EXON_NOT_MATCHED_TO_ANNOTATION: 45292289
JUNCTION_NOT_MATCHED_TO_ANNOTATION: 1269436
CLIPPED: 65970785
total: 979103242
outcomes by BAM written to: AS_libgsl_firststrand/2024-06-19-13_12_09_367167_read_outcomes_by_bam.txt
novel: 2397.6510169506073
The splicing graph and candidate read have been saved into AS_libgsl_firststrand/2024-06-19-13_12_09_367167_*.rmats
save: 28.220320224761963
loadsg: 17.720765352249146
==========
Done processing each gene from dictionary to compile AS events
Found 89553 exon skipping events
Found 11671 exon MX events
Found 18472 alt SS events
There are 11225 alt 3 SS events and 7247 alt 5 SS events.
Found 6930 RI events
another python3 /public/home/xiaoxiong/miniconda3/envs/libgsl/rMATS/rmats.py --b1 b1.txt --b2 b2.txt --gtf /public/home/xiaoxiong/hTSC_STB96_RNA_seq/hTSC_STB96_rMATs_Ensembol/hTSC_STB72/genes.gtf -t paired --libType fr-secondstrand --readLength 100 --nthread 4 --od AS_libgsl_second --tmp AS_libgsl_second --variable-read-length
gtf: 1452.7515552043915
There are 36601 distinct gene ID in the gtf file
There are 199138 distinct transcript ID in the gtf file
There are 14315 one-transcript genes in the gtf file
There are 1305354 exons in the gtf file
There are 5603 one-exon transcripts in the gtf file
There are 3566 one-transcript genes with only one exon in the transcript
Average number of transcripts per gene is 5.440780
Average number of exons per transcript is 6.555022
Average number of exons per transcript excluding one-exon tx is 6.715845
Average number of gene per geneGroup is 7.854318
statistic: 0.08252692222595215
read outcome totals across all BAMs
USED: 54658238
NOT_PAIRED: 5238074
NOT_NH_1: 266573095
NOT_EXPECTED_CIGAR: 6915951
NOT_EXPECTED_READ_LENGTH: 0
NOT_EXPECTED_STRAND: 0
EXON_NOT_MATCHED_TO_ANNOTATION: 443321716
JUNCTION_NOT_MATCHED_TO_ANNOTATION: 131377399
CLIPPED: 64958367
total: 973042840
outcomes by BAM written to: AS_libgsl_second/2024-06-19-13_19_27_769310_read_outcomes_by_bam.txt
novel: 2378.783494949341
The splicing graph and candidate read have been saved into AS_libgsl_second/2024-06-19-13_19_27_769310_*.rmats
save: 3.096289873123169
loadsg: 1.7018141746520996
==========
Done processing each gene from dictionary to compile AS events
Found 47791 exon skipping events
Found 3302 exon MX events
Found 16193 alt SS events
There are 9849 alt 3 SS events and 6344 alt 5 SS events.
Found 6783 RI events
The main differences are about 400 million EXON_NOT_MATCHED_TO_ANNOTATION and 130 million JUNCTION_NOT_MATCHED_TO_ANNOTATION when run with --libType fr-secondstrand. If the library was actually unstranded then I would expect to see similar numbers of NOT_MATCHED_TO_ANNOTATION for fr-firststrand and fr-secondstrand. Based on the output it seems that your data should be run with --libType fr-firststrand
My RNA-seq data were downloaded from the Internet, and the authors did not elaborate on the type of library built. for --libtype parameter, i try two ways.
one: python3 /public/home/xiaoxiong/miniconda3/envs/libgsl/rMATS/rmats.py --b1 b1.txt --b2 b2.txt --gtf /public/home/xiaoxiong/hTSC_STB96_RNA_seq/hTSC_STB96_rMATs_Ensembol/STB72_STB96/genes.gtf -t paired --libType fr-firststrand --readLength 100 --nthread 4 --od AS_libgsl_firststrand --tmp AS_libgsl_firststrand --variable-read-length
gtf: 153.78015661239624
There are 36601 distinct gene ID in the gtf file
There are 199138 distinct transcript ID in the gtf file
There are 14315 one-transcript genes in the gtf file
There are 1305354 exons in the gtf file
There are 5603 one-exon transcripts in the gtf file
There are 3566 one-transcript genes with only one exon in the transcript
Average number of transcripts per gene is 5.440780
Average number of exons per transcript is 6.555022
Average number of exons per transcript excluding one-exon tx is 6.715845
Average number of gene per geneGroup is 7.854318
statistic: 0.42818236351013184
read outcome totals across all BAMs
USED: 583925661
NOT_PAIRED: 5064400
NOT_NH_1: 270852260
NOT_EXPECTED_CIGAR: 6728411
NOT_EXPECTED_READ_LENGTH: 0
NOT_EXPECTED_STRAND: 0
EXON_NOT_MATCHED_TO_ANNOTATION: 45292289
JUNCTION_NOT_MATCHED_TO_ANNOTATION: 1269436
CLIPPED: 65970785
total: 979103242
outcomes by BAM written to: AS_libgsl_firststrand/2024-06-19-13_12_09_367167_read_outcomes_by_bam.txt
novel: 2397.6510169506073
The splicing graph and candidate read have been saved into AS_libgsl_firststrand/2024-06-19-13_12_09_367167_*.rmats
save: 28.220320224761963
loadsg: 17.720765352249146
==========
Done processing each gene from dictionary to compile AS events
Found 89553 exon skipping events
Found 11671 exon MX events
Found 18472 alt SS events
There are 11225 alt 3 SS events and 7247 alt 5 SS events.
Found 6930 RI events
ase: 5.3955464363098145
![4773f9cf07b10c90bd82196e3074a97](https://private-user-images.githubusercontent.com/166396847/340964386-2f93de86-4dff-4269-99f1-8f3327f2ed52.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjEzNDE3MTksIm5iZiI6MTcyMTM0MTQxOSwicGF0aCI6Ii8xNjYzOTY4NDcvMzQwOTY0Mzg2LTJmOTNkZTg2LTRkZmYtNDI2OS05OWYxLThmMzMyN2YyZWQ1Mi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzE4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcxOFQyMjIzMzlaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT03ZGQ5OGJjZTIyODgxZTA0ZDBkN2EzMGJkMzUyODU2MDYyYjg5YTQzNjQyNWE5YTlhMTExYjdiNjFiYWUxMjIxJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.I0qKlci8tlvdODiV1SCTrDgYNTDx38dfsFkDuraan5U)
![4](https://private-user-images.githubusercontent.com/166396847/340965919-733bec88-e118-463c-88c0-a244ab38ec03.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjEzNDE3MTksIm5iZiI6MTcyMTM0MTQxOSwicGF0aCI6Ii8xNjYzOTY4NDcvMzQwOTY1OTE5LTczM2JlYzg4LWUxMTgtNDYzYy04OGMwLWEyNDRhYjM4ZWMwMy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzE4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcxOFQyMjIzMzlaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1lYjVhMzc3YzBmNjBkY2YzMjQ5MzZkYTczODM2OTI1NTYyNTM3MTQ0OTYxNzhmYTc5MDA5OTA3NjAxYWM3YzBhJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.FO3-5_JY2fVaZYuz_DIfEXNXElRvNFQzLydammK1c1s)
count: 153.7568280696869
Processing count files.
Done processing count files.
another python3 /public/home/xiaoxiong/miniconda3/envs/libgsl/rMATS/rmats.py --b1 b1.txt --b2 b2.txt --gtf /public/home/xiaoxiong/hTSC_STB96_RNA_seq/hTSC_STB96_rMATs_Ensembol/hTSC_STB72/genes.gtf -t paired --libType fr-secondstrand --readLength 100 --nthread 4 --od AS_libgsl_second --tmp AS_libgsl_second --variable-read-length
gtf: 1452.7515552043915
There are 36601 distinct gene ID in the gtf file
There are 199138 distinct transcript ID in the gtf file
There are 14315 one-transcript genes in the gtf file
There are 1305354 exons in the gtf file
There are 5603 one-exon transcripts in the gtf file
There are 3566 one-transcript genes with only one exon in the transcript
Average number of transcripts per gene is 5.440780
Average number of exons per transcript is 6.555022
Average number of exons per transcript excluding one-exon tx is 6.715845
Average number of gene per geneGroup is 7.854318
statistic: 0.08252692222595215
read outcome totals across all BAMs
USED: 54658238
NOT_PAIRED: 5238074
NOT_NH_1: 266573095
NOT_EXPECTED_CIGAR: 6915951
NOT_EXPECTED_READ_LENGTH: 0
NOT_EXPECTED_STRAND: 0
EXON_NOT_MATCHED_TO_ANNOTATION: 443321716
JUNCTION_NOT_MATCHED_TO_ANNOTATION: 131377399
CLIPPED: 64958367
total: 973042840
outcomes by BAM written to: AS_libgsl_second/2024-06-19-13_19_27_769310_read_outcomes_by_bam.txt
novel: 2378.783494949341
The splicing graph and candidate read have been saved into AS_libgsl_second/2024-06-19-13_19_27_769310_*.rmats
save: 3.096289873123169
loadsg: 1.7018141746520996
==========
Done processing each gene from dictionary to compile AS events
Found 47791 exon skipping events
Found 3302 exon MX events
Found 16193 alt SS events
There are 9849 alt 3 SS events and 6344 alt 5 SS events.
Found 6783 RI events
ase: -3.3512372970581055
![12](https://private-user-images.githubusercontent.com/166396847/340965175-86759399-e035-460c-b72c-b8e532fa9dc5.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjEzNDE3MTksIm5iZiI6MTcyMTM0MTQxOSwicGF0aCI6Ii8xNjYzOTY4NDcvMzQwOTY1MTc1LTg2NzU5Mzk5LWUwMzUtNDYwYy1iNzJjLWI4ZTUzMmZhOWRjNS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzE4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcxOFQyMjIzMzlaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1mNDdlM2MwYzA4MGJjOWJhNDZiOWY3OTQxYTUwOTUxOTQxZTM0YzhkYWM5OTM4OGM1YTQzNTM0YzM5ZjkxNThkJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.VrWQ8r-DZI1WgfbSrJavcryvy_Izx3GZs1YcAkww9u8)
![3](https://private-user-images.githubusercontent.com/166396847/340965526-d2d1feb0-9ed0-4954-b7f7-5e77f52ee3ed.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjEzNDE3MTksIm5iZiI6MTcyMTM0MTQxOSwicGF0aCI6Ii8xNjYzOTY4NDcvMzQwOTY1NTI2LWQyZDFmZWIwLTllZDAtNDk1NC1iN2Y3LTVlNzdmNTJlZTNlZC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNzE4JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDcxOFQyMjIzMzlaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT03YzQ0MjRjYWMzMmRhZjFmMTRiYzlhOWViMGM0MzkxNjBmMDkyMTYwMDk4Yjg2YmRkYzQ1OWY5OGJiMDllNzkzJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.vwH95GasKa52mBhUfcRhMV77doi4DNZw9tQYMDHk4is)
count: 5.232464551925659
Processing count files.
Done processing count files.
so, I don't know if that is the right method, or if I should try it again -libType fr-unstranded.
do you have a good suggestion?
The text was updated successfully, but these errors were encountered: