Sentiment Classification using Self-Attention Model and POS Embeddings

This is an extension repo of the paper: ``A Structured Self-Attentive Sentence Embedding'' published by IBM and MILA. https://arxiv.org/abs/1703.03130

The repo is forked from https://github.com/ExplorerFreda/Structured-Self-Attentive-Sentence-Embedding

Usage

get_data.py

Split the official Yelp dataset review.json to training, dev, and testing. Tokenize sentences. Generate the vocabulary.

get_tensors.py

Transform tokens/POS tags to indices. Train POS2vec using word2vec Python library. Zero-pad word and POS sequences.

feature_generator.py

Use PyTorch Dataloader class to generate a batch of features and labels. This will speed up training process.

model.py

Model for using word2vec feature only.

model_pos.py

Model for using word2vec and POS2vec featurs.

model_pos_attention.py

Model for separate attention layers for the two features.

train*.py

Training codes for all combinations of parameters. Need to refactorize them to be one file and accept arguments.

The best accuracy is 73.05% on the testing data.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
feature_generator.py		feature_generator.py
get_data.py		get_data.py
get_tensors.py		get_tensors.py
model.py		model.py
model_pos_attention.py		model_pos_attention.py
model_word.py		model_word.py
train.ipynb		train.ipynb
train_pos.py		train_pos.py
train_pos_attention.py		train_pos_attention.py
train_pos_one_hot.py		train_pos_one_hot.py
train_pos_one_hot_attention.py		train_pos_one_hot_attention.py
train_with_embedding.py		train_with_embedding.py
train_with_embedding_double.py		train_with_embedding_double.py
train_word.py		train_word.py
train_word_only_with_embedding.py		train_word_only_with_embedding.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment Classification using Self-Attention Model and POS Embeddings

Usage

About

Releases

Packages

Languages

License

nateanl/Structured-Self-Attentive-Sentence-Embedding

Folders and files

Latest commit

History

Repository files navigation

Sentiment Classification using Self-Attention Model and POS Embeddings

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages