Skip to content

Code used to generate summaries, models and figures for article "A field-wide assessment of differential high throughput sequencing reveals widespread bias".

License

Notifications You must be signed in to change notification settings

rstats-tartu/geo-htseq-paper

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Geo-htseq-paper

We analyzed the field of expression profiling by high throughput sequencing, or RNA-seq, in terms of replicability and reproducibility, using data from the GEO (Gene Expression Omnibus) repository. Our work puts an upper bound of 56% to field-wide reproducibility, based on the types of files submitted to GEO.

Getting data

Got to https://doi.org/10.5281/zenodo.7529832 for the latest version and download data archive, let's say, to your Downloads folder.

Then create new folder, e.g. "geo-htseq" and enter this folder

mkdir geo-htseq
cd geo-htseq

Copy downloaded dataset to your working directory and uncompress:

cp ~/Downloads/geo-htseq.tar.gz .
tar -xzvf geo-htseq.tar.gz

Remove tar.gz archive from working directory:

rm geo-htseq.tar.gz

Now you should have dataset in "output" subdirectory ready for analysis.

Workflow graph

rulegraph

About

Code used to generate summaries, models and figures for article "A field-wide assessment of differential high throughput sequencing reveals widespread bias".

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • R 79.2%
  • TeX 11.8%
  • Python 8.9%
  • Singularity 0.1%