Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

write Dbxref instead of db_xref when converting genbank to GFF3 #102

Open
timflutre opened this issue Oct 26, 2015 · 4 comments
Open

write Dbxref instead of db_xref when converting genbank to GFF3 #102

timflutre opened this issue Oct 26, 2015 · 4 comments

Comments

@timflutre
Copy link

I am trying to convert a Genbank file to GFF3 following the latest version of the official specification. Here is an example of Genbank file I need to convert:
ftp://ftp.ncbi.nlm.nih.gov/genomes/Vitis_vinifera/ARCHIVE/BUILD.1.1/CHR_01/vvi_ref_chr1.gbs.gz

The script genbank_to_gff.py works but write db_xref instead of Dbxref. Same for note instead of Note. I also have other issues, e.g. exon being encoded as "feature mRNA", etc.

I can see that you often advise people to look at gffutils. But it doesn't handle the Genbank format. So should I start looking at your BCBio code? Is there any chance to include it at some point in Biopython?

@timflutre timflutre changed the title write Dbxref instead of db_xref when converting from genbank write Dbxref instead of db_xref when converting genbank to GFF3 Oct 26, 2015
@chapmanb
Copy link
Owner

chapmanb commented Nov 2, 2015

Timothée;
Thanks for the report and sorry to be slow in getting back with you. This script only handles formatting conversion -- taking what is in the GenBank file and converting over to GFF format. It doesn't try to do the work of massaging naming of attributes to match between the two. The code really only uses the internal Biopython representation to do the conversion between a shared object, but doesn't have any special cases. I'd be happy to accept a pull request to do that, or it's something you could ask about supporting at gffutils if you'd like a more forward-looking approach. gffutils has some support for Biopython interoperability now, although would also need work to handle these specific naming conversion cases as well.

Sorry to not have a ready solution for you but hope this helps.

@timflutre
Copy link
Author

Ok, thanks for getting back to me.

@lcscs12345
Copy link

Hi Timothée, have you tried annotwriter?

@timflutre
Copy link
Author

Thanks @lcscs12345 , I'll have a look!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants