Image Captioning using Multi-modal RNN i.e. using both Word Embeddings and CNN features as input

hashbangCoder/MultiModal-Image-Captioning

MultiModal-Image-Captioning

This is a Torch implementation of image captioning using a multi-modal RNN that takes both word embeddings and CNN features as input, as described in Mao et al.

The implementation is incomplete and a work in progress.

Meanwhile, check out the TensorFlow implementation by J. Mao.
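
For orientation, here is a minimal sketch of how a multi-modal RNN step can combine a word embedding with CNN image features. It is illustrative Python, not this repository's Torch code. It assumes the common m-RNN-style formulation: a ReLU recurrent layer, and a multimodal layer that sums linear projections of the word embedding, the recurrent state, and the image feature before a scaled tanh. All names (`multimodal_step`, `U_r`, `V_w`, `V_r`, `V_I`), the dimensions, and the random weights are hypothetical.

```python
import math
import random


def matvec(matrix, vec):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, vec)) for row in matrix]


def add(*vecs):
    """Element-wise sum of equally sized vectors."""
    return [sum(values) for values in zip(*vecs)]


def relu(vec):
    return [max(0.0, x) for x in vec]


def scaled_tanh(vec):
    # Scaled hyperbolic tangent; the constants are an assumption of this sketch.
    return [1.7159 * math.tanh(2.0 / 3.0 * x) for x in vec]


def rand_matrix(rows, cols, rng):
    """Small random weights, for demonstration only."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def multimodal_step(w_t, r_prev, image_feat, params):
    """One time step: update the recurrent state, then fuse the three inputs.

    w_t        : word embedding for the current word
    r_prev     : recurrent state from the previous step (same size as w_t here)
    image_feat : CNN feature vector for the image (fixed across time steps)
    """
    # Recurrent layer: previous state is projected and added to the word embedding.
    r_t = relu(add(matvec(params["U_r"], r_prev), w_t))
    # Multimodal layer: project each input to a shared space and sum them.
    m_t = scaled_tanh(add(
        matvec(params["V_w"], w_t),
        matvec(params["V_r"], r_t),
        matvec(params["V_I"], image_feat),
    ))
    return r_t, m_t


# Hypothetical sizes: embedding/recurrent = 4, image feature = 6, multimodal = 5.
rng = random.Random(0)
EMBED_DIM, IMAGE_DIM, MULTI_DIM = 4, 6, 5
params = {
    "U_r": rand_matrix(EMBED_DIM, EMBED_DIM, rng),
    "V_w": rand_matrix(MULTI_DIM, EMBED_DIM, rng),
    "V_r": rand_matrix(MULTI_DIM, EMBED_DIM, rng),
    "V_I": rand_matrix(MULTI_DIM, IMAGE_DIM, rng),
}
w_t = [rng.uniform(-1, 1) for _ in range(EMBED_DIM)]
image_feat = [rng.uniform(0, 1) for _ in range(IMAGE_DIM)]
r_t, m_t = multimodal_step(w_t, [0.0] * EMBED_DIM, image_feat, params)
```

In a full captioning model, `m_t` would feed a softmax over the vocabulary to predict the next word, and `r_t` would be carried forward to the next time step. Training, the CNN itself, and the embedding lookup are omitted here.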
