CADD plugin not working as expected in v111 #703

Open
puva opened this issue Mar 10, 2024 · 1 comment

puva commented Mar 10, 2024

Hi,
We recently upgraded VEP from v107 to v111 (executable, cache and plugins) and found that some variants that were correctly annotated in v107 receive CADD scores in v111 only under certain conditions.
One example is the variant chr1 100660 A C:

Case 1 - 2500 variants, --pick => CADD missing
Input: VCF with ~2500 variants, option --pick
Command line:
vep --assembly GRCh38 --cache --dir_cache [PATH]/vep_111 --dir_plugins [PATH]/Plugins_111 --force_overwrite --format vcf --input_file in_full.vcf --offline --plugin CADD,snv=/cadd_1.6_hg38/whole_genome_SNVs.tsv.gz,indels=/cadd_1.6_hg38/gnomad.genomes.r3.0.indel.tsv.gz --vcf --out_vep111_full_pick.vcf --pick
Output:
chr1 100660 . A C 8.82 PASS [...];CSQ=C|intron_variant&non_coding_transcript_variant|MODIFIER||ENSG00000238009|Transcript|ENST00000477740|lncRNA||3/3||||||||||-1||||| [...]

Case 2 - 2500 variants, no --pick => CADD ok
Input: VCF with ~2500 variants, without --pick
Command line:
vep --assembly GRCh38 --cache --dir_cache [PATH]/vep_111 --dir_plugins [PATH]/Plugins_111 --force_overwrite --format vcf --input_file in_full.vcf --offline --plugin CADD,snv=/cadd_1.6_hg38/whole_genome_SNVs.tsv.gz,indels=/cadd_1.6_hg38/gnomad.genomes.r3.0.indel.tsv.gz --vcf --out_vep111_full.vcf
Output (the CADD scores are the last two fields of each CSQ entry, 4.519|0.319874):
chr1 100660 . A C 8.82 PASS [...];CSQ=C|intron_variant&non_coding_transcript_variant|MODIFIER||ENSG00000238009|Transcript|ENST00000466430|lncRNA||2/3||||||||||-1||||4.519|0.319874,C|intron_variant&non_coding_transcript_variant|MODIFIER||ENSG00000238009|Transcript|ENST00000477740|lncRNA||3/3||||||||||-1||||4.519|0.319874 [...]

Case 3 - single variant, --pick => CADD ok
Input: VCF with chr1 100660 A C only, option --pick
Command line:
vep --assembly GRCh38 --cache --dir_cache [PATH]/vep_111 --dir_plugins [PATH]/Plugins_111 --force_overwrite --format vcf --input_file in_single.vcf --offline --plugin CADD,snv=/cadd_1.6_hg38/whole_genome_SNVs.tsv.gz,indels=/cadd_1.6_hg38/gnomad.genomes.r3.0.indel.tsv.gz --vcf --out_vep111_single.vcf --pick
Output (the CADD scores are the last two fields of the CSQ entry, 4.519|0.319874):
chr1 100660 . A C 8.82 PASS [...];CSQ=C|intron_variant&non_coding_transcript_variant|MODIFIER||ENSG00000238009|Transcript|ENST00000477740|lncRNA||3/3||||||||||-1||||4.519|0.319874 [...]

The same input files are correctly annotated with VEP 107 (executable, cache and plugins).
I also tried the executable and cache from VEP 111 together with the CADD plugin from v107, and all variants were annotated.
This points to a problem in the latest CADD plugin.
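
To check how many variants are affected, the annotated VCFs from the cases above can be scanned for records whose CSQ entries carry no CADD value. The following is only a minimal sketch, not part of VEP: it assumes the plugin's fields appear in the CSQ header under the names CADD_PHRED and CADD_RAW, and the function names are hypothetical.

```python
import re


def csq_field_names(header_line):
    """Extract the CSQ field names from the '##INFO=<ID=CSQ,...' header line."""
    match = re.search(r'Format: ([^">]+)', header_line)
    if match is None:
        raise ValueError("CSQ header line has no 'Format:' section")
    return match.group(1).strip().split("|")


def variants_missing_cadd(lines, field="CADD_PHRED"):
    """Yield (chrom, pos, ref, alt) for records where no CSQ entry has `field` set.

    `lines` is any iterable of VCF lines (for example an open file handle).
    `field` is assumed to be the name the CADD plugin writes into the CSQ header.
    """
    idx = None
    for line in lines:
        if line.startswith("##INFO=<ID=CSQ"):
            idx = csq_field_names(line).index(field)
        if line.startswith("#") or not line.strip():
            continue
        if idx is None:
            raise ValueError("no CSQ header line seen before the first record")
        cols = line.rstrip("\n").split("\t")
        # INFO is the 8th VCF column; pull out the CSQ value if present.
        csq = next((kv[4:] for kv in cols[7].split(";") if kv.startswith("CSQ=")), "")
        entries = [entry.split("|") for entry in csq.split(",") if entry]
        # A record counts as missing when every transcript entry has an empty field.
        if not any(len(entry) > idx and entry[idx] for entry in entries):
            yield cols[0], int(cols[1]), cols[3], cols[4]
```

Running it over the `--pick` output and the output without `--pick` and comparing the two result sets would show which variants lose their scores only when `--pick` is used.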

I'm attaching input and output files.
cadd_debug.tar.gz

Best,
Paolo

@olaaustine olaaustine self-assigned this Mar 11, 2024
@olaaustine
Contributor

Hi @puva,
Thank you for your query and for the detailed information.
We have been able to recreate the issue.
We will update this ticket when a fix has been applied.
Thank you very much,
Ola.
