[Aug/ Sep] [Pubs] Curation Workflow Tracking #137

Open
8 of 11 tasks
aditigopalan opened this issue Sep 4, 2024 · 1 comment

aditigopalan commented Sep 4, 2024

This ticket tracks curation workflow progression.

Note: Work in the three sections can overlap, so curation workflows for different months may run concurrently.

1. Curation and Annotation

  • Run PubMed crawler to generate PublicationView manifest [205 publications generated, long sprint anticipated]
  • Send Amber and Jineta a copy of the PublicationView manifest from latest crawl to review for MC2 Center Newsletter publication highlights
  • Send Amber "News from CCKP" for MC2 Center Newsletter
  • Annotate publications in PublicationView manifest [In progress]
  • Generate ToolView and DatasetView manifests based on PublicationView manifest
  • Run the automated curation workflow to upload publications, datasets, and tools [This includes splitting manifests, processing and validating manifests, generating target Synapse IDs for upload, schema updates, upload to Synapse, and (in progress) a validation check for uploads]
  • Generate UNION tables
  • QC of staging tables
  • Perform automated portal sync to CCKP
  • Validate data on the CCKP
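
For anyone unfamiliar with the "splitting manifests" and "processing and validating manifests" steps above, here is a minimal sketch of what they could look like. It is not the actual workflow code: the function names, the column names (`pubmed_id`, `grant_number`, `title`), and the sample rows are all hypothetical and exist only for illustration.

```python
import csv
import io
from collections import defaultdict


def read_manifest(text):
    """Parse CSV manifest text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def split_manifest(rows, key):
    """Group manifest rows by the value found in column `key`."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    return dict(groups)


def validate_rows(rows, required):
    """Return (row_index, column) pairs where a required value is missing or blank."""
    problems = []
    for index, row in enumerate(rows):
        for column in required:
            if not (row.get(column) or "").strip():
                problems.append((index, column))
    return problems


# Hypothetical sample manifest; the column names and values are made up.
sample = (
    "pubmed_id,grant_number,title\n"
    "1,CA000001,Paper A\n"
    "2,CA000002,Paper B\n"
    "3,CA000001,\n"
)

rows = read_manifest(sample)
groups = split_manifest(rows, "grant_number")
problems = validate_rows(rows, ("pubmed_id", "title"))
```

In this sketch each group would become its own manifest, and any entry in `problems` would block the upload until the annotation is fixed.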

Status check [Plan to report numbers for each category following the PubMed crawl]:

  • Publication upload [ ]

No data model updates

@aclayton555

24-9 - all have been annotated, up to the Sept crawl. Action needed is to push sync.

Orion has run union/QC checks. Have found a couple of new values that will necessitate a small data model release, but this can be decoupled from pushing the sync (should be done by tomorrow). Sync will also include some of the datasets (e.g. light sheet)
