InferSent: Learning universal sentence representations

This repository contains a PyTorch implementation and an experiment interface for the supervised NLI task, with different models for learning universal sentence representations.

Results

A baseline MeanEmbedding model and three LSTM-based models (LSTM, BiLSTM and BiLSTM-maxpool) are trained on the NLI task using SNLI data. The resulting sentence embeddings are evaluated on 8 transfer tasks using the SentEval framework.

The micro and macro metrics for the SentEval tasks are computed as defined in Section 5 of the InferSent paper [1]. The results are tabulated below:

| Model          | snli-dev | snli-test | senteval-micro | senteval-macro |
|----------------|----------|-----------|----------------|----------------|
| MeanEmbedding  | 69.5     | 69.1      | 77.31          | 77.92          |
| LSTM           | 80.5     | 80.2      | 70.467         | 70.282         |
| BiLSTM         | 80.00    | 80.08     | 71.997         | 71.531         |
| BiLSTM-maxpool | 86.50    | 85.87     | 79.075         | 78.831         |
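
As described in Section 5 of the paper [1], macro is the unweighted mean of dev accuracies across the transfer tasks, while micro weights each task by its number of dev samples. A minimal sketch of that aggregation (function names are illustrative, not taken from this repository):

```python
def macro_metric(dev_accuracies):
    # Macro: unweighted mean of dev accuracies across transfer tasks.
    return sum(dev_accuracies) / len(dev_accuracies)

def micro_metric(dev_accuracies, dev_sizes):
    # Micro: mean of dev accuracies weighted by each task's dev-set size.
    total = sum(dev_sizes)
    return sum(a * n for a, n in zip(dev_accuracies, dev_sizes)) / total
```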

Organization

This repository is organized into the following major components:

  • models.py - PyTorch modules for the encoder and classifier models (the core pattern is sketched after this list).
  • data.py - SNLIData class for preparing data for training and evaluation.
  • train.py - Pytorch Lightning model and training CLI for training with different encoders.
  • eval.py - CLI that takes model checkpoint and runs evaluation on SNLI and SentEval tasks.
  • demo.ipynb - Jupyter notebook for testing model inference and analyzing the results.
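
The strongest encoder in the paper is a bidirectional LSTM whose hidden states are max-pooled over time, with the NLI classifier consuming the concatenation of the premise and hypothesis embeddings, their absolute difference, and their element-wise product. A minimal sketch of that pattern, assuming illustrative class and function names rather than the exact ones in models.py:

```python
import torch
import torch.nn as nn

class BiLSTMMaxPoolEncoder(nn.Module):
    """Bidirectional LSTM encoder with max pooling over time steps."""

    def __init__(self, embed_dim=300, hidden_dim=2048):
        super().__init__()
        self.lstm = nn.LSTM(embed_dim, hidden_dim,
                            bidirectional=True, batch_first=True)

    def forward(self, embedded):
        # embedded: (batch, seq_len, embed_dim) pre-computed word embeddings
        hidden, _ = self.lstm(embedded)   # (batch, seq_len, 2 * hidden_dim)
        return hidden.max(dim=1).values   # element-wise max over time

def nli_features(u, v):
    # Classifier input from the paper: [u; v; |u - v|; u * v]
    return torch.cat([u, v, (u - v).abs(), u * v], dim=1)
```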

Setup

```bash
# Using pip
pip install -r requirements.txt
# Using conda
conda env create -f environment.yml
# Download the English model for the spaCy tokenizer
python -m spacy download en_core_web_sm
```

To run evaluation with SentEval, install SentEval and download its datasets as follows:

```bash
git clone https://github.com/facebookresearch/SentEval.git
cd SentEval/ && python setup.py install
# Download the transfer task datasets
cd data/downstream/ && ./get_transfer_data.bash
```

Training

Run train.py with one of the following encoder types: MeanEmbedding, LSTM, BiLSTM or BiLSTM-maxpool. Training writes model checkpoints, TensorBoard logs and the hyperparameter file hparams.yaml to the ./logs directory.

```bash
python train.py --encoder_type='BiLSTM'
```
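
Under the hood, train.py presumably wraps the chosen encoder in a LightningModule and hands it to a PyTorch Lightning Trainer; a rough sketch of that wiring, where `model` and `datamodule` are placeholders rather than this repository's actual classes:

```python
import pytorch_lightning as pl
from pytorch_lightning.loggers import TensorBoardLogger

# `model` and `datamodule` stand in for the LightningModule built around the
# chosen encoder and the SNLIData wrapper from data.py (names assumed).
logger = TensorBoardLogger(save_dir="logs", name="BiLSTM")  # TensorBoard logs + hparams.yaml
trainer = pl.Trainer(max_epochs=20, logger=logger)          # max_epochs is assumed
trainer.fit(model, datamodule=datamodule)
# Checkpoints land under ./logs/BiLSTM/version_*/checkpoints/
```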

Evaluation

Run eval.py with the --checkpoint_path flag pointing at a trained model to run the evaluation tasks on SNLI and SentEval.

```bash
python eval.py --checkpoint_path='./logs/MeanEmbedding/version_0/checkpoints/epoch=2-step=12875.ckpt'
```
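
SentEval drives evaluation through two callbacks, `prepare` and `batcher`. A sketch of how a restored encoder might be hooked into `senteval.engine.SE`; the `encoder.encode` method and the exact task list here are assumptions for illustration, not this repository's API:

```python
import senteval

def prepare(params, samples):
    # Optional hook to build task-specific state (e.g. a vocabulary).
    return

def batcher(params, batch):
    # SentEval passes batches of tokenized sentences; return one embedding
    # per sentence. `encoder.encode` is a placeholder for the restored model.
    sentences = [" ".join(tokens) for tokens in batch]
    return encoder.encode(sentences)  # numpy array of shape (batch, dim)

params = {"task_path": "SentEval/data", "usepytorch": True, "kfold": 10}
se = senteval.engine.SE(params, batcher, prepare)
results = se.eval(["MR", "CR", "SUBJ", "MPQA", "SST2", "TREC",
                   "MRPC", "SICKEntailment"])  # illustrative 8-task list
```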

Pre-trained models

The model checkpoints and TensorBoard logs are public and can be found here: https://drive.google.com/drive/folders/1Ebjyf0wj31EZMPEBiG1nHW-1JOMMl1IY?usp=sharing

References

[1] A. Conneau, D. Kiela, H. Schwenk, L. Barrault, A. Bordes, "Supervised Learning of Universal Sentence Representations from Natural Language Inference Data", EMNLP 2017.