Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Set up programmatic retrieval of most up to date (current) TAIR GAF file from GO #10

Open
lreiser opened this issue Nov 26, 2019 · 1 comment
Assignees

Comments

@lreiser
Copy link

lreiser commented Nov 26, 2019

iFAD needs to read the most current GO annotation data files for TAIR.
These can be retrieved either from the GO website
http://current.geneontology.org/products/pages/downloads.html
or the Zenodo repository
a description of how to programmatically access GAF files from Zenodo can be found here:
http://geneontology.org/docs/tools-guide/#programmatic-download-bdbag

TAIR updates OUR gaf files on a quarterly basis . These files are consumed by the GO consortium, post processed to remove errors and then merged with annotations from other sources.

@lreiser
Copy link
Author

lreiser commented Jan 14, 2020

We need to discuss the process of retrieving the unique gene IDS from the GO GAF file. Because some of the data comes from sources other than TAIR, in addition to searching in Column 11 for the AGI code (ATNGNNNNN) we need to look in Column 3 which is where UniProt puts the identifier.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants