-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Suggestion for making GTDB database #91
Comments
Hi Xinpeng, thanks for letting me know. Regards, |
Also forgot to mention: the threads function of the python program will give errors if using multiple threads(more than 1), I fixed manually and could send it later if needed. Best Regards, |
Oh, that's interesting, it works fine on my machine and others. I'm interested to see the error message (if you still have it) and the fix, thanks. |
I guess it might be my env/python version error if it works for you. Let me post it here: python $WORK/final_course_project/glacier_algae/script/OPERA-MS/src_utils/make_operams_db_from_gtdb.py all_genomes.txt.gz all_taxonomy_r220.tsv.gz --outdir test --threads 16 |
The old code: def multi_threads_seqinfos(fnames):
def multi_threads_seqinfos(fnames, threads):
|
by the way, you forget a "t" in the find command :) find -L OPERA-MS-DB/ -type f -name '*.fna.gz' > OPERA-MS-DB/genomes_list.tx find -L OPERA-MS-DB/ -type f -name '*.fna.gz' > OPERA-MS-DB/genomes_list.txt |
This is weird, I should have caught this issue before. Thanks for letting me know. |
my pleasure :) |
Hi,
I followed the wiki to create the GTDB-database and I noticed in the final step there might be some errors:
find OPERA-MS-DB/ -type f -name '*.fna.gz' > OPERA-MS-DB/genomes_list.tx
It will give a empty list file and I guess it should be:
find -L OPERA-MS-DB/ -type f -name '*.fna.gz' > OPERA-MS-DB/genomes_list.txt
Otherwise when run the strain cluster step, it may give the error and fail at that part.
Best,
The text was updated successfully, but these errors were encountered: