Skip to content

A comprehensive Radiomics pipeline for extracting, preprocessing, and analyzing oncological imaging biomarkers that can be used for benign-malignancy prediction, survival status prediction, and similar tasks.

Notifications You must be signed in to change notification settings

Astarakee/Radiomics_pipeline

Repository files navigation

Radiomics Pipeline for Oncological Image Analysis

This repository contains the implementation codes of radiomics analysis.
This repo was developed for the study of 'A comparative study of radiomics and deep-learning based methods for pulmonary nodule malignancy prediction in low dose CT images.'
If you use any of our code, please cite:

@article{Astaraki2021,

title = { A comparative study of radiomics and deep-learning based methods for pulmonary nodule malignancy prediction in low dose CT images },
author = {Astaraki, Mehdi and Yang, Guang and Zakko, Yousuf and Toma Dasu, Iuliana and Smedby, Örjan and Wang, Chunliang},
url = {---},
year = {2021}
}

Background

Since 2010, the field of imaging biomarkers has been more formalized under the name of 'radiomics'.
This term emanates from the words 'radio' that adopted from radiology, followed by the suffix 'omics' which refers to the study of biological phenomena.

The goal of the rapidly growing radiomics in the oncology field is to capture valuable information regarding the shape, size, intensity, and texture of tumor phenotype that would distinct or complement other sources of information such as clinical reports, and laboratory examinations.

Considering the growing role of radiomics in oncology image studies, the Image Biomarker Standard Initiative (IBSI) has been established to provide a step by step guideline for radiomics analysis including feature nomenclature and definition, feature calculation, benchmark data set, and reporting regulations

The goal of radiomic feature analysis is to develop a mathematical model or a function to stratify patients based on the predicted outcome using extracted radiomic features. From the machine learning perspective, this task is equivalent to develop a learning algorithm to predict the outcomes of given data points. In other word, the learning algorithm analyzes the training data and learns the underlying representative characteristics of the data to infer a hypothesis in order to predict the outcomes of unseen data.

In current research practices, radiomic pipelines are usually developed manually with a specific dataset of that study.
Therefore, the performance and functionality of different pipeline settings depend on the properties of the employed learning algorithms and training procedures, which in turn would result in deviations of the model performance on the same dataset.
In this repo, we tried to provide a standard radiomic analysis pipeline based on the classical machine learning algorithms.

Folder structure


radiomics_pipeline
│   ├── data_loader 'loading image data of extracted features'
│   │   ├── feature_loader.py
│   │   ├── paths_and_dirs.py
│   │   └── sitk_stuff.py
│   ├── experiments_results ' dir for saving the summary of the experiments'
│   │   └── experiment_210613-1834.json
│   ├── feature_extraction ' extracting the radiomics features'
│   │   ├── extraction_lung_nodule.py
│   │   └── params.yaml
│   ├── features ' dir to save the radiomic features'
│   │   └── radiomic_features.csv
│   ├── feature_selections ' feature selection methods'
│   │   ├── feature_selectors.py
│   ├── learning_algorithms ' learning algorithm models'
│   │   ├── models.py
│   ├── preprocessing ' preprocessing the features or images'
│   │   ├── class_balancing.py
│   ├── readme.ipynb
│   ├── run ' executing the experiments'
│   │   ├── main.py
│   │   └── main_sfs.py
│   ├── training ' trainer functions for learning algorithms'
│   │   └── trainer.py
│   └── utilities ' some utility functions '
│   ├── feature_tools.py
│   ├── json_stuff.py
│   ├── metircs_evaluation.py
│   └── ...

Usage

The original study was conducted on the Kaggle Data Science Bowl 2017 dataset.
To get access to the segmentation mask files, please contact me via "[email protected]"

Requirements


- Python >= 3.6

All packages used in this repo are listed in the 'requirements.txt'.
Run 'pip3 install -r requirements.txt' to install the required packages.

About

A comprehensive Radiomics pipeline for extracting, preprocessing, and analyzing oncological imaging biomarkers that can be used for benign-malignancy prediction, survival status prediction, and similar tasks.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published