Skip to content

Releases: UPHL-BioNGS/Donut_Falls

1.6.20240510

09 May 18:41
e09c2a6
Compare
Choose a tag to compare

Moderate release (no param changes)

  • There's a new summary file
    • donut_falls/summary/donut_falls.tsv now exists for those who need tsv files.
  • More information is found in the final json file (donut_falls/summary/donut_falls.json)
  • Introduces a mash dist step that ensure that Illumina and Nanopore reads are from the same isolate
    • The workflow will continue as if there were no Illumina reads if they are "too distant"
  • Updated busco image
  • Moved all images to use staphb's repository on dockerhub
  • Updated default of pypolca to --careful
  • Updated read type of flye to --nano-hq
  • Sorts consensus fasta files in donut_falls/sample/consensus from largest to smallest contig

1.5.20240305

06 Mar 21:12
84d00e0
Compare
Choose a tag to compare
1.5.20240305 Pre-release
Pre-release

Nanopore's flow cells R10.4 have forced our hands and we really needed to adjust to the higher quality reads.

This is a pre-release for which all features have not been tested or finalized, but should work. We've been struggling to find test datasets, and it is time consuming to generate our own.

Things that needed adjusting

  1. Circlator hated us and failed 80% of the time (or more). We have switched out Circlator with DNAAPLER. This has made life better.
  2. Donut Falls is now a single file. This should make it easier to use as a subworkflow elsewhere (we never did finish the documentation about using git submodules). More information can be found here : https://github.com/UPHL-BioNGS/Donut_Falls/wiki/Linking
  3. Even though this file does not adhere to the NF-Core template, the labels and basic process template should be compatible with nf-core's config files. They follow a lot of the same logic.
  4. Filtlong has been replaced with fastp and rasusa. We were noticing irregularities with filtlong (i.e. we would specify coverage, but then not achieve the desired coverage, or we would specify read length but still have too-short reads go through). Fastp is now used as a hard cutoff: nanopore reads must be larger than 1,000 bases and have a quality score greater than Q12 to pass. This should not be a problem for R10.4 runs. Rasusa will then subsample these reads to 150X coverage (based on 5M genome size). We have some example config files for smaller genomes. We recommend on running unicycler on "older" datasets.
  5. The tests have changed. Many of the old datasets won't pass the fastp filter.
  6. Medaka has been reduced down to 1 CPU. We were noticing irregularities with medaka polishing at higher cpu usage.
  7. Multiple assemblers can be specified at once. We were finding that we needed to run flye and unicycler on most of our samples, so this simplified things for us.
  8. We needed more QC metrics, specifically we needed coverage estimates on plasmids. It helps tells if they are real or not.
  9. POLCA has been replaced with pypolca. It seems to perform as well, is faster, and the container is smaller.

We dropped support for the following:

  1. Trycycler : removing reads that don't conform wasn't best practice and we were getting a proliferation of plasmids.
  2. Dragonflye : the current version won't run on our local system (likely due to some perl weirdness) and we are wary of supporting options that we can't test.
  3. Miniasm : if Flye, Unicycler, or Raven couldn't close a genome, miniasm couldn't close it either

1.1.20230425

05 May 17:59
040a042
Compare
Choose a tag to compare

Mid - level release

  • adds support for dragonflye assembly
  • puts more files in the multiqc report
  • made porechop optional
  • fixed folding bugs
  • added circular and length to fasta header

1.0.20230425

21 Apr 18:35
1e0c29c
Compare
Choose a tag to compare

Simplifies things greatly with a input file, and a simplified polishing scheme.

This matches the wiki.

0.0.20220810

01 Aug 17:37
0de6c40
Compare
Choose a tag to compare

Donut Falls is a basic de novo assembly workflow for Nanopore (ONT) reads with or without corresponding Illumina reads.

Initial release