GitHub - erogol/resnet.torch: an updated version of fb.resnet.torch with many changes.

resnet.torch

This is a fork of https://github.com/facebook/fb.resnet.torch. Refer to the original repository for the details of the underlying library.

This code has been heavily modified, with many additions made over the course of my research. Many of the changes are optional and defined in "opts.lua". Here is a list of the additions, by no means complete.

Class weighting to tackle class imbalance (-classWeighting)

It counts the number of instances in each category and uses the normalized inverse frequency to scale the learning rate per category.
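As a rough illustration of the weighting idea (not the repository's Lua code), the Python sketch below counts instances per class and normalizes the inverse frequencies. Normalizing so that the weights sum to 1 is an assumption, and `class_weights` is a hypothetical helper name.

```python
from collections import Counter


def class_weights(labels):
    """Return {class: weight} proportional to 1 / count, normalized to sum to 1."""
    counts = Counter(labels)
    # Inverse frequency: rare classes get a larger raw weight.
    inverse = {cls: 1.0 / n for cls, n in counts.items()}
    total = sum(inverse.values())
    # Assumed normalization: divide by the sum of all inverse frequencies.
    return {cls: w / total for cls, w in inverse.items()}


weights = class_weights(["cat", "cat", "cat", "dog"])
```

With three "cat" instances and one "dog", the rarer class ends up with the larger weight.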

Empirically verified way to freeze the convolutional layers of the network.

I tried every suggested way of freezing a pretrained network, but each of them still updated the model. In the end, I modified nnlr to freeze the network without any such leak. nnlr is a library that lets you scale learning rates per layer. I changed the code to set an exact value per layer instead of scaling the base learning rate. The idea is to give a learning rate and weight decay of 0 to each of the feature layers, which prevents the model from updating their parameters.
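The effect of exact per-layer values can be sketched in a few lines of Python. This is not the modified nnlr code. It is a plain SGD step without momentum, and the layer names `conv1` and `fc` and the function `sgd_step` are hypothetical.

```python
def sgd_step(params, grads, learning_rates, weight_decays):
    """One plain SGD step using an exact learning rate and weight decay per layer."""
    updated = {}
    for name, weights in params.items():
        lr = learning_rates[name]
        wd = weight_decays[name]
        # Weight decay is added to the gradient, then scaled by the layer's rate.
        updated[name] = [w - lr * (g + wd * w) for w, g in zip(weights, grads[name])]
    return updated


params = {"conv1": [0.5, -0.2], "fc": [1.0, 2.0]}
grads = {"conv1": [0.1, 0.3], "fc": [0.4, -0.6]}
# Feature layer frozen: learning rate and weight decay are both exactly 0.
learning_rates = {"conv1": 0.0, "fc": 0.1}
weight_decays = {"conv1": 0.0, "fc": 0.0}

new_params = sgd_step(params, grads, learning_rates, weight_decays)
```

In this sketch the `conv1` values come back unchanged even though their gradients are non-zero, while `fc` is updated as usual.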

Better bookkeeping of the trained models.

Every trained model is stored in a folder named after its important model parameters, with a subfolder named by the date of the execution.
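A layout of this kind could be built as in the following Python sketch. The naming pattern, the date format, the `checkpoints` root, and the parameter names are all assumptions made for illustration, and the repository's actual scheme may differ.

```python
import os
from datetime import datetime


def run_directory(root, params, when):
    """Build <root>/<key-value pairs>/<timestamp> for one training run."""
    # Sort the keys so the same parameters always map to the same folder name.
    model_name = "_".join(f"{key}-{params[key]}" for key in sorted(params))
    return os.path.join(root, model_name, when.strftime("%Y-%m-%d_%H-%M-%S"))


path = run_directory(
    "checkpoints",
    {"depth": 34, "netType": "resnet", "LR": 0.1},  # hypothetical parameters
    datetime(2016, 5, 1, 12, 0, 0),                 # arbitrary example date
)
```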

Plotting accuracy and loss values

The folder created for a training run contains loss and accuracy plots, drawn with gnuplot from the per-epoch values.

New models:

GoogleNet
ResNet with Stochastic Depth
SimpleNet (a small architecture which is a good baseline)
And some others

Model initialization with a different learning rate (-model_init_LR)

It helps to stabilize a model before setting the learning rate to its base value. The given value is used for the initial 5 epochs.
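The schedule amounts to a simple switch, sketched below in Python. Epochs are assumed to be 1-indexed, any later decay of the base rate is left out, and `learning_rate` is a hypothetical helper rather than a function from this repository.

```python
def learning_rate(epoch, base_lr, init_lr, warmup_epochs=5):
    """Return init_lr for the first warmup_epochs epochs (1-indexed), then base_lr."""
    return init_lr if epoch <= warmup_epochs else base_lr


# Example values only: 0.01 plays the role of the -model_init_LR value.
schedule = [learning_rate(epoch, base_lr=0.1, init_lr=0.01) for epoch in range(1, 8)]
```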

Saving the model optimState, so that you can continue training from any checkpoint with all history recovered.
dataset/balanced.lua for balanced instance selection on imbalanced datasets.
Optimizer selection, adam or sgd (-optimizer (sgd)).
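One common way to balance instance selection is to pick a class uniformly at random and then pick an instance from that class. The Python sketch below shows that idea only. How dataset/balanced.lua actually selects instances is not described here, and `balanced_sample` is a hypothetical name.

```python
import random
from collections import defaultdict


def balanced_sample(labels, num_samples, rng):
    """Draw instance indices so that every class is equally likely to be picked."""
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    classes = sorted(by_class)
    # First choose a class uniformly, then choose one of its instances uniformly.
    return [rng.choice(by_class[rng.choice(classes)]) for _ in range(num_samples)]


labels = ["a"] * 95 + ["b"] * 5  # heavily imbalanced toy dataset
picked = balanced_sample(labels, 10000, random.Random(0))
share_b = sum(labels[i] == "b" for i in picked) / len(picked)
```

Although class "b" makes up only 5% of the toy dataset, it accounts for roughly half of the drawn samples.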

NOTE: Check the other branches of the project. Each includes a particular model architecture.

Siamese: Learning embeddings of data based on instance similarity. http://yann.lecun.com/exdb/publis/pdf/chopra-05.pdf
TripletNet: Learning embeddings of data based on instance similarity. https://arxiv.org/pdf/1412.6622.pdf
Regression: The same network structure, but the code is tuned for regression.
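To give a feel for similarity-based embedding learning, here is a Python sketch of a widely used margin-based contrastive loss on a pair of embeddings. It is a generic formulation, not necessarily the loss used in the Siamese branch or in the linked papers, and the function names and margin value are assumptions.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def contrastive_loss(emb_a, emb_b, similar, margin=1.0):
    """Pull similar pairs together; push dissimilar pairs at least `margin` apart."""
    distance = euclidean(emb_a, emb_b)
    if similar:
        return distance ** 2
    # Dissimilar pairs already farther apart than the margin contribute no loss.
    return max(0.0, margin - distance) ** 2


far_similar = contrastive_loss([0.0, 0.0], [3.0, 4.0], similar=True)
far_dissimilar = contrastive_loss([0.0, 0.0], [3.0, 4.0], similar=False)
near_dissimilar = contrastive_loss([0.0, 0.0], [0.0, 1.0], similar=False, margin=2.0)
```

A similar pair is penalized by its squared distance, while a dissimilar pair is penalized only when it sits inside the margin.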

WARNING: I suggest using this repo with caution, since the code has only been used for research purposes and there might be buggy details.

Folders and files (27 commits):

.ipynb_checkpoints
datasets
models
nnlr
pretrained
train_scripts
utils
.gitignore
CONTRIBUTING.md
ClassifyImage.ipynb
INSTALL.md
PATENTS
PredictWithPretrainedModel.lua
README.md
checkpoints.lua
dataloader.lua
main.lua
opts.lua
plotting.lua
train.lua
utils.lua
