This is an ML/Deep Learning and NLP project that trains recurrent neural networks to generate Trump tweets (text).
- A Recurrent Neural Network with LSTM cells for text generation, trained on a dataset of Donald Trump tweets with contextual labels; it can generate realistic tweets from random noise.
- Support for bidirectional RNNs and for techniques such as attention weighting and skip embeddings.
- cuDNN implementation for training the RNNs on an NVIDIA GPU (a minimal architecture sketch follows the references below).
- The project has been specifically optimised for the Trump tweet dataset and for generating Trump-style tweets, but it can be used on any text dataset.
- Schuster, Mike, and Kuldip K. Paliwal. "Bidirectional recurrent neural networks." IEEE Transactions on Signal Processing 45.11 (1997): 2673-2681.
- Sundermeyer, Martin, Ralf Schlüter, and Hermann Ney. "LSTM neural networks for language modeling." Thirteenth Annual Conference of the International Speech Communication Association (2012).
- Tang, Jian, et al. "Context-aware natural language generation with recurrent neural networks." arXiv preprint arXiv:1611.09900 (2016).
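Below is a minimal sketch (not this project's actual model code) of a bidirectional LSTM stack for next-word prediction, written with TensorFlow/Keras; the vocabulary size, embedding dimension, and layer sizes mirror the training configuration shown later, but the use of tf.keras itself is an assumption. On an NVIDIA GPU, Keras' standard LSTM layer dispatches to the cuDNN kernel as long as the default activations and zero recurrent dropout are kept.

# Illustrative architecture sketch, not taken from this repository
import tensorflow as tf

VOCAB_SIZE = 20000   # matches 'max_words' in the training config below
MAX_LENGTH = 40      # matches 'max_length' in the training config below

sketch = tf.keras.Sequential([
    tf.keras.layers.Embedding(VOCAB_SIZE, 100),
    # Default activations and recurrent_dropout=0 let Keras use the cuDNN LSTM kernel on GPU.
    tf.keras.layers.Bidirectional(tf.keras.layers.LSTM(512, return_sequences=True)),
    tf.keras.layers.Bidirectional(tf.keras.layers.LSTM(512)),
    tf.keras.layers.Dense(VOCAB_SIZE, activation='softmax'),  # probability of each dictionary word
])
sketch.compile(optimizer='adam', loss='sparse_categorical_crossentropy')
sketch.build(input_shape=(None, MAX_LENGTH))
sketch.summary()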
- Clone this repository.
- Download the dataset (from Resources).
- Install the dependencies and CUDA.
- Train the model with train.py, as in the example below.
# Simple Model Training (train.py)
from model.model import TextGenModel

model_config = {
    'name': 'trump_tweet_model',
    'meta_token': "<s>",        # special token used to delimit tweets
    'word_level': True,         # word-level model (set False for character-level)
    'rnn_layers': 2,
    'rnn_size': 512,
    'rnn_bidirectional': False,
    'max_length': 40,           # length of the input sequences
    'max_words': 20000,         # maximum vocabulary size
    'dim_embeddings': 100,
    'single_text': False        # the dataset is one tweet per line, not one continuous text
}

trump_tweet_model = TextGenModel(model_config=model_config)
trump_tweet_model.train('trump_tweet_dataset.txt', header=False,
                        num_epochs=4, new_model=True)
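The generation call itself is not shown here; assuming TextGenModel exposes a generate() method that takes a sample count and a softmax temperature (the method name and both arguments are assumptions, not part of this README), sampling tweets from the trained model might look like this:

# Hypothetical generation call; method name and arguments are assumptions
trump_tweet_model.generate(n=5, temperature=0.7)   # sample 5 tweets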
- The recurrent neural network takes a sequence of words as input and outputs, for every word in the dictionary, the probability that it is the next word of the given sequence.
- The model also learns how similar words (or characters) are to one another and uses this when computing those probabilities.
- Using those probabilities, it predicts, i.e. generates, the next word or character of the sequence (a short sampling sketch follows this list).
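As an illustration of that last step, here is a short sketch of sampling the next token from such a probability distribution with a softmax temperature; the function, variable names, and the temperature value are illustrative and not taken from this project:

# Illustrative temperature sampling from a next-word probability distribution
import numpy as np

def sample_next_index(probs, temperature=0.7):
    """Sample a dictionary index from `probs`, reweighted by a softmax temperature."""
    probs = np.asarray(probs, dtype=np.float64)
    logits = np.log(probs + 1e-12) / temperature    # lower temperature -> more conservative choices
    weights = np.exp(logits)
    weights /= weights.sum()
    return np.random.choice(len(weights), p=weights)

# Example: suppose the model assigns these probabilities to a 4-word dictionary.
next_word_probs = [0.1, 0.6, 0.2, 0.1]
print(sample_next_index(next_word_probs))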
Representative image of the model architecture for the bidirectional LSTM network