This repository contains an end-to-end implementation of the Neural Projection Skip-Gram (NP-SG) and Deep Averaging Network (DAN) models. We first train an NP-SG model and then use the trained projection embeddings to train a DAN for the SST-fine classification task. Our goal is to compare the performance of projection-based, on-the-fly embeddings generated with locality-sensitive hashing against static and non-static embeddings.
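As a rough illustration of the core idea (this is not the repository's code; the hashing scheme, bit count, and seed below are arbitrary assumptions), an on-the-fly LSH projection maps any token to a fixed-length binary feature vector without storing an embedding table:

```python
import hashlib

import numpy as np

def lsh_projection(token: str, n_bits: int = 16, seed: int = 0) -> np.ndarray:
    """Project a token to a binary vector via random hyperplanes.

    The MD5 digest stands in for the character n-gram features a real
    NP-SG model would use; the principle (sign of dot products with
    fixed random hyperplanes) is the same.
    """
    # Hash the token into a small dense pseudo-random feature vector.
    digest = hashlib.md5(token.encode("utf-8")).digest()
    feats = np.frombuffer(digest, dtype=np.uint8).astype(np.float32)
    feats = feats / 255.0 - 0.5
    # Fixed random hyperplanes; the sign of each dot product gives one bit.
    rng = np.random.default_rng(seed)
    planes = rng.standard_normal((n_bits, feats.shape[0]))
    return (planes @ feats > 0).astype(np.int8)

bits = lsh_projection("hello")
```

Because the projection is deterministic, the same token always maps to the same bits, so no vocabulary or embedding matrix needs to be shipped with the model.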
- Clone the repository.
- Run `pip install -r requirements.txt`. This will download and install all the required packages.
- Run `python setup.py`. This will set up the directory structure and download the required corpora for the experiments.
- Request the enWiki9 dataset at [email protected].
- Set up the `config.py` script before running experiments. The experiments span two steps:
  a. Training an NP-SG model on some corpus (we use a chunk of wiki9); the larger this corpus, the better.
  b. Using the embeddings from step 1 for any downstream task; e.g., we train a DAN model on the SST-fine data.
- To test the pipeline, set `n=1000` and `test=True`.
- To run a complete experiment, run the following three scripts:
  a. `python3 data_prep.py`
  b. `python3 train_projection.py`
  c. `python3 train_dan.py`

  Alternatively, run the bash script `run.sh`.
- Set `n > 10,000` to train the NP-SG model on a larger corpus.
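For reference, a quick test configuration in `config.py` might look like the following. Only `n` and `test` are named in this README; any other fields in the repository's `config.py` are not shown here:

```python
# config.py -- minimal test settings (sketch; see the repository's
# config.py for the full set of options)
n = 1000     # corpus chunk size; set n > 10,000 for a real training run
test = True  # run the fast end-to-end test pipeline
```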
We have developed the pipeline around wiki9, SST-Fine, and the Bible corpus for training the NP-SG model, along with a DAN model on the SST-Fine dataset for the five-class classification task.
- enwiki9 (to train the NP-SG model)
- SST-Fine (for the text classification task)
- Bible corpus (from nltk, for small-scale test experiments)
Trainable Embedding? | NP-SG Train Dataset | Skip-Gram Train Size | Test Acc. (SST-Fine)
---|---|---|---
No | SST-Fine | 7,000 | 30.9% |
Yes | SST-Fine | 7,000 | 37.68% |
No | enWiki9 | 1,000 | 27.88% |
Yes | enWiki9 | 1,000 | 37.51% |
No | enWiki9 | 5,000 | 29.7% |
Yes | enWiki9 | 5,000 | 38.1% |
No | enWiki9 | 30,000 | 30.43% |
Yes | enWiki9 | 30,000 | 38.42% |
No | enWiki9 | 60,000 | 30.97% |
Yes | enWiki9 | 60,000 | 40.33% |
A few recent papers that have shown state-of-the-art results with neural projections:
- Neural Projection Skip-Gram (https://arxiv.org/pdf/1906.01605.pdf)
- PRADO by Google Research (https://www.aclweb.org/anthology/D19-1506.pdf)
- Self-governing Neural Networks (https://www.aclweb.org/anthology/D18-1105.pdf)
- Deep Averaging Network, the classification model implemented in this work (https://people.cs.umass.edu/~miyyer/pubs/2015_acl_dan.pdf)
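For context, the forward pass of a Deep Averaging Network is simple enough to sketch in a few lines of NumPy. The dimensions and layer sizes below are illustrative assumptions, not the ones used in this repository:

```python
import numpy as np

rng = np.random.default_rng(0)

def dan_forward(word_vectors, W1, b1, W2, b2):
    """DAN forward pass: average the word embeddings, then apply a
    feed-forward classifier with a softmax output."""
    avg = word_vectors.mean(axis=0)          # (d,) average of token embeddings
    hidden = np.maximum(0.0, W1 @ avg + b1)  # ReLU hidden layer
    logits = W2 @ hidden + b2                # one logit per class
    exp = np.exp(logits - logits.max())
    return exp / exp.sum()                   # class probabilities

# Illustrative sizes: 16-dim embeddings, 32 hidden units, 5 SST-fine classes.
d, h, classes = 16, 32, 5
W1, b1 = rng.standard_normal((h, d)), np.zeros(h)
W2, b2 = rng.standard_normal((classes, h)), np.zeros(classes)
sentence = rng.standard_normal((7, d))  # 7 tokens, each a d-dim (projected) embedding
probs = dan_forward(sentence, W1, b1, W2, b2)
```

In our setting, the rows of `sentence` would come from the trained NP-SG projection embeddings rather than random vectors.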