This repository contains a minimal character-level recurrent neural network (RNN) language model trained on the "Tiny Shakespeare" corpus popularized by Andrej Karpathy. The code was originally authored in Google Colab and kept here both as a Jupyter notebook and as the exported Python script that mirrors the notebook cells.
- `RNN Language Model/RNN_Language_Model.ipynb` – the original Colab notebook. Open this if you want the exact interactive environment the model was built in.
- `rnn_language_model.py` – the notebook exported to a Python script. The script still contains notebook-style commands (for example `!wget`), so it is best treated as reference code or executed inside an interactive environment that understands shell magics (such as Jupyter).
- `LICENSE` – licensing information for the project.
The notebook/script expects the following software stack:
- Python 3.8+
- PyTorch (tested with version 2.x)
- Jupyter (optional, but recommended for running the notebook)
If you plan to run the code locally, install the dependencies inside a virtual environment:
```bash
python -m venv .venv
source .venv/bin/activate
pip install torch jupyter
```

Training uses the Tiny Shakespeare dataset downloaded from Karpathy's char-rnn repository. The notebook/script automatically fetches the data with:
```bash
!wget https://raw.githubusercontent.com/karpathy/char-rnn/master/data/tinyshakespeare/input.txt -O tiny_shakespeare.txt
```

You can also download the file manually and place it in the project directory if you prefer not to use the shell command inside Jupyter.
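If you would rather fetch the file from plain Python (outside Jupyter), a minimal sketch using only the standard library might look like this; the target filename `tiny_shakespeare.txt` matches what the notebook expects:

```python
# Download Tiny Shakespeare without relying on the !wget shell magic.
from pathlib import Path
from urllib.request import urlretrieve

URL = "https://raw.githubusercontent.com/karpathy/char-rnn/master/data/tinyshakespeare/input.txt"
target = Path("tiny_shakespeare.txt")

if not target.exists():  # skip the download on repeated runs
    urlretrieve(URL, target)
print(f"{target} ({target.stat().st_size} bytes)")
```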
- Launch Jupyter (recommended):

  ```bash
  jupyter notebook
  ```

  Then open `RNN Language Model/RNN_Language_Model.ipynb` and run the cells sequentially.
- Or execute the exported script inside an environment that supports notebook magics (e.g., `ipython`):

  ```bash
  ipython RNN\ Language\ Model/rnn_language_model.py
  ```
During training, the script samples random contiguous 64-character chunks from the corpus and trains for 2,000 steps using an `nn.RNN` layer with a hidden size of 128. Progress is printed every 200 steps. After training, the model generates ~300 characters of text conditioned on the prompt `"KING: "`.
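For readers who want a sense of the moving parts without opening the notebook, here is a condensed, self-contained sketch of the same recipe. It is not the notebook's exact code: the class name `RNNLanguageModel` and the names `block_size`, `batch_size`, and `hidden_size` follow the ones referenced in this README, but the batch size, embedding layer, optimizer, learning rate, and sampling strategy are assumptions.

```python
# Condensed character-level RNN language model (a sketch, not the notebook's exact code).
import torch
import torch.nn as nn

block_size = 64    # context length per training chunk (as described above)
batch_size = 32    # assumed; the notebook may use a different value
hidden_size = 128  # hidden size mentioned above

text = open("tiny_shakespeare.txt", encoding="utf-8").read()
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}
itos = {i: ch for ch, i in stoi.items()}
data = torch.tensor([stoi[ch] for ch in text], dtype=torch.long)

def get_batch():
    # Sample batch_size random contiguous chunks of block_size characters,
    # with targets shifted one position to the right (next-character prediction).
    ix = torch.randint(len(data) - block_size - 1, (batch_size,))
    x = torch.stack([data[i : i + block_size] for i in ix])
    y = torch.stack([data[i + 1 : i + block_size + 1] for i in ix])
    return x, y

class RNNLanguageModel(nn.Module):
    def __init__(self, vocab_size, hidden_size):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden_size)
        self.rnn = nn.RNN(hidden_size, hidden_size, batch_first=True)
        self.head = nn.Linear(hidden_size, vocab_size)

    def forward(self, idx, h=None):
        out, h = self.rnn(self.embed(idx), h)
        return self.head(out), h

model = RNNLanguageModel(len(chars), hidden_size)
opt = torch.optim.Adam(model.parameters(), lr=3e-3)  # assumed optimizer/learning rate

for step in range(2000):
    x, y = get_batch()
    logits, _ = model(x)
    loss = nn.functional.cross_entropy(logits.reshape(-1, len(chars)), y.reshape(-1))
    opt.zero_grad()
    loss.backward()
    opt.step()
    if step % 200 == 0:
        print(f"step {step}: loss {loss.item():.3f}")

# Generate ~300 characters conditioned on the prompt "KING: ".
idx = torch.tensor([[stoi[ch] for ch in "KING: "]])
out = "KING: "
logits, h = model(idx)
for _ in range(300):
    probs = torch.softmax(logits[0, -1], dim=-1)
    nxt = torch.multinomial(probs, 1)          # sample the next character
    out += itos[nxt.item()]
    logits, h = model(nxt.view(1, 1), h)       # feed it back in with the hidden state
print(out)
```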
- Adjust `block_size`, `batch_size`, and the training loop (`range(2000)`) to change the context length, batch size, or the number of optimization steps.
- Modify the `hidden_size` parameter in `RNNLanguageModel` to increase or decrease the capacity of the network.
- Replace the dataset download URL with your own text corpus to experiment with different domains (see the sketch below).
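For example, to train on a different corpus, point the data-loading step at a local file instead of the download; a hypothetical swap might look like:

```python
# Hypothetical example: train on a local corpus instead of Tiny Shakespeare.
# "my_corpus.txt" is a placeholder path, not a file shipped with this repo.
text = open("my_corpus.txt", encoding="utf-8").read()
chars = sorted(set(text))  # the vocabulary is rebuilt from the new text
```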
This project is distributed under the terms of the MIT License. See the LICENSE file for full details.
