General comments/discussions/etc #1

Open
ieguinoa opened this issue Oct 12, 2019 · 0 comments

Collaborator

ieguinoa commented Oct 12, 2019

  • The idea is that if I generate a CWL-abstract file with this now, it could later be updated (that is, a new version generated) once the full galaxy-cwl project is working and producing executable CWL descriptions. So I should look carefully at what executable CWL the galaxy-cwl project produces from Galaxy, and how (see https://github.com/common-workflow-language/galaxy).

  • Should all steps in the CWL-abstract file have the same class, something like Galaxy_step? How does this affect the evolution of the generated CWL-abstract files into the real ones in the future? What is actually being printed as the class in the galaxy-cwl project?

  • The main thing to take into account when implementing this is that all three IDs (the one in the Galaxy workflow file and the ones I set for the CWL-abstract in each case, whether generated from a run or from a static workflow) are the same for each step. I also need to take into account the different Galaxy workflow format versions and the CWL versions: should I create different branches for the versions/combinations?

  • Datatypes: when extracting input/output datatypes from Galaxy workflow steps, can I assume these datatypes exist in CWL? If not, is there any way to convert them to CWL types, even if it is not a 1-to-1 correspondence? What is the intersection between Galaxy datatypes and CWL types? And what about Galaxy tools that define their own specific types?
    CWL uses ontologies to define formats. They can come from IANA (https://www.iana.org/assignments/media-types/media-types.xhtml) or EDAM (https://www.ebi.ac.uk/ols/ontologies/edam/terms?iri=http%3A%2F%2Fedamontology.org%2Fformat_1915).
    See also https://www.commonwl.org/user_guide/16-file-formats/
    For example, to indicate a BAM file, the parameter should be:
    type: File
    label: Aligned sequences in BAM format
    format: edam:format_2572

  • Something similar happens with the tool/job information about the operation being performed. Ideally I would like to tag that using EDAM, but that is only available in a handful of Galaxy tools. It should be widely available if I look in bio.tools, but there is no direct link between Galaxy tools and bio.tools entries.
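
As a rough sketch of the datatype question above: one option would be a small hand-maintained lookup from Galaxy datatype extensions to EDAM format identifiers, falling back to a plain `File` with no `format` when there is no known match. The table `GALAXY_TO_EDAM` and the helper `cwl_file_param` are hypothetical names made up for illustration, not part of Galaxy or any CWL library, and the table only lists a few well-known formats.

```python
# Hypothetical lookup table (an assumption for illustration, not an official
# Galaxy or CWL mapping): Galaxy datatype extension -> EDAM format identifier.
GALAXY_TO_EDAM = {
    "bam": "edam:format_2572",
    "sam": "edam:format_2573",
    "fasta": "edam:format_1929",
    "fastq": "edam:format_1930",
}


def cwl_file_param(galaxy_ext, label=None):
    """Build a CWL-style File parameter as a dict from a Galaxy extension.

    If the extension has no entry in GALAXY_TO_EDAM, the 'format' key is
    omitted, so the parameter is still a File but carries no format claim.
    """
    param = {"type": "File"}
    if label:
        param["label"] = label
    fmt = GALAXY_TO_EDAM.get(galaxy_ext.lower())
    if fmt is not None:
        param["format"] = fmt
    return param


if __name__ == "__main__":
    print(cwl_file_param("bam", label="Aligned sequences in BAM format"))
    print(cwl_file_param("some_tool_specific_type"))
```

Omitting `format` for unknown extensions keeps the output valid for the non-1-to-1 case, at the cost of losing information for tool-specific Galaxy datatypes; those would need either more table entries or a different strategy.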
