Deep Character CNN LSTM Encoder with Classification and Similarity Models

chiragjn/deep-char-cnn-lstm

Deep Character CNN LSTM Encoder with Classification and Similarity Models

In Keras

Overall Idea:

  • Convolve over character embeddings with different kernel sizes
  • Concat the pooled outputs to get the char-word embedding
  • Pass it through a Dense layer with a residual connection
  • Optionally concat it with a separate word embedding
  • Pass the sequence of obtained word embeddings through an LSTM encoder
  • Train with a contrastive loss function (see References)
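The steps above can be sketched in Keras. This is a minimal sketch assuming TensorFlow 2's Keras API; the layer sizes, kernel sizes, and the `word_encoder`/`encoder` names are illustrative and not taken from this repository's model code:

```python
import tensorflow as tf
from tensorflow.keras import layers, Model

CHARSET_SIZE = 100   # number of distinct characters
MAX_WORD_LEN = 16    # characters per word (padded)
MAX_SEQ_LEN = 20     # words per sequence (padded)
CHAR_EMB_DIM = 24
KERNEL_SIZES = (2, 3, 4)
FILTERS = 32

# Per-word encoder: convolve char embeddings with several kernel sizes,
# max-pool each feature map, and concat into one char-word embedding.
char_in = layers.Input(shape=(MAX_WORD_LEN,), dtype="int32")
char_emb = layers.Embedding(CHARSET_SIZE, CHAR_EMB_DIM)(char_in)
pooled = [
    layers.GlobalMaxPooling1D()(
        layers.Conv1D(FILTERS, k, activation="tanh", padding="same")(char_emb)
    )
    for k in KERNEL_SIZES
]
concat = layers.Concatenate()(pooled)  # (batch, FILTERS * len(KERNEL_SIZES))

# Dense layer with a residual connection (a separate word embedding could
# optionally be concatenated here as well).
dense = layers.Dense(FILTERS * len(KERNEL_SIZES), activation="relu")(concat)
word_vec = layers.Add()([concat, dense])
word_encoder = Model(char_in, word_vec)

# Apply the word encoder to every word, then run an LSTM over the sequence.
seq_in = layers.Input(shape=(MAX_SEQ_LEN, MAX_WORD_LEN), dtype="int32")
word_vecs = layers.TimeDistributed(word_encoder)(seq_in)
sentence_vec = layers.LSTM(128)(word_vecs)
encoder = Model(seq_in, sentence_vec)
```

The resulting `encoder` maps a batch of character-id matrices of shape (words, chars) to fixed-size sentence vectors, which a classification head or a similarity/contrastive loss can then consume.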

Work in Progress

  • TODO: Add loading utils
  • TODO: Add preprocessing and padding utils
  • TODO: Add batching utils
  • TODO: Add model training code
  • TODO: Add model continue-training code
  • TODO: Test Similarity implementation on Quora similar pair dataset
  • TODO: Test Classification implementation on Kaggle Toxic internet comments dataset
  • TODO: Tune Hyperparameters and try different modifications to architectures
  • TODO: Take Hyperparameters using argparse
  • TODO: Add tensorboard and tfdbg support

Example Usage:

from model import ClassifierModel, SimilarityModel

classifier = ClassifierModel(vocab_size=10000,
                             charset_size=100,
                             num_classes=5,
                             mode=ClassifierModel.MULTILABEL,
                             char_kernel_sizes=(3,),
                             encoder_hidden_units=128,
                             bidirectional=False)
classifier.compile_model()

similarity_model = SimilarityModel(vocab_size=10000,
                                   charset_size=100,
                                   num_negative_samples=1)
similarity_model.compile_model()

References:

Overall Idea

  1. Siamese Recurrent Architectures for Learning Sentence Similarity (2016)

Encoder architecture heavily inspired by

  1. Character-Aware Neural Language Models (2015), Kim et al.
  2. dpressel/baseline

Loss function taken from

  1. A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval (2014)
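The loss in that paper is a softmax over smoothed cosine similarities, where the positive pair competes against sampled negative pairs. A minimal NumPy sketch; `gamma` is the smoothing factor from the paper, while the function name and array layout are illustrative:

```python
import numpy as np

def contrastive_softmax_loss(query, positive, negatives, gamma=10.0):
    """CDSSM-style loss: negative log-probability of the positive pair
    under a softmax over gamma-scaled cosine similarities.

    query, positive: (batch, dim); negatives: list of (batch, dim) arrays.
    """
    def cos(a, b):
        return np.sum(a * b, axis=-1) / (
            np.linalg.norm(a, axis=-1) * np.linalg.norm(b, axis=-1)
        )

    # Similarity of the query to the positive first, then to each negative.
    sims = np.stack([cos(query, positive)] + [cos(query, n) for n in negatives],
                    axis=-1)
    logits = gamma * sims
    # log P(positive | query) via log-softmax over the candidates.
    log_prob = logits[..., 0] - np.log(np.sum(np.exp(logits), axis=-1))
    return -log_prob.mean()
```

Minimizing this pushes the query embedding toward the positive and away from the sampled negatives, which matches the `num_negative_samples` parameter of `SimilarityModel` above.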

Other Contrastive Loss functions to try

  1. StarSpace: Embed All The Things! (2017), Wu et al.
  2. Comparison of loss functions for deep embedding
