Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and vision-language capabilities
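For context, the cross-architecture knowledge distillation named above typically blends a temperature-softened KL term against the teacher's logits with the usual hard-label loss. A minimal PyTorch-style sketch; the function name, default temperature, and mixing weight are illustrative, not taken from the repository:

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Blend a soft-target KL loss (teacher -> student) with hard-label CE.

    Assumes logits of shape (N, vocab) and integer labels of shape (N,).
    """
    # Soften both distributions with temperature T; scale by T^2 so the
    # gradient magnitude stays comparable to the hard-label term.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```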
Time-series prediction using a decoder-only Transformer, including SwiGLU and RoPE (Rotary Positional Embedding)
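SwiGLU and RoPE, mentioned above, are standard building blocks. A minimal PyTorch sketch of both, assuming the rotate-half RoPE convention and a (batch, seq, dim) tensor layout; this is not code from the repository:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def apply_rope(x):
    """Rotary positional embedding on a (batch, seq, dim) tensor; dim must be even."""
    _, seq_len, dim = x.shape
    half = dim // 2
    # Frequencies as in the RoPE paper: theta_i = 10000^(-2i/dim)
    freqs = 1.0 / (10000 ** (torch.arange(half, dtype=torch.float32) / half))
    angles = torch.arange(seq_len, dtype=torch.float32)[:, None] * freqs[None, :]
    cos, sin = angles.cos(), angles.sin()      # (seq, half), broadcast over batch
    x1, x2 = x[..., :half], x[..., half:]
    # Rotate each (x1, x2) channel pair by a position-dependent angle.
    return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)

class SwiGLU(nn.Module):
    """Gated feed-forward block: down(silu(gate(x)) * up(x)), LLaMA-style."""
    def __init__(self, dim, hidden):
        super().__init__()
        self.gate = nn.Linear(dim, hidden, bias=False)
        self.up = nn.Linear(dim, hidden, bias=False)
        self.down = nn.Linear(hidden, dim, bias=False)

    def forward(self, x):
        return self.down(F.silu(self.gate(x)) * self.up(x))
```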
🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment
Code for paper "Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI"
This repository contains the implementation and experiments for comparing gradual-growth methods, specifically the G_stack approach, against naive models trained from scratch. The project focuses on mitigating catastrophic forgetting and improving model performance in continual learning scenarios.
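A hedged sketch of the general depth-stacking idea behind such growth operators, assuming G_stack grows a trained model by duplicating its layer stack before continued training; the actual method and its details are specified in the repository and its paper:

```python
import copy
import torch.nn as nn

def grow_by_stacking(layers: nn.ModuleList, growth_factor: int = 2) -> nn.ModuleList:
    """Grow a transformer depth-wise by duplicating its trained layers.

    The grown model starts from copies of the small model's layers rather
    than random initialization, then training continues on the larger model.
    """
    grown = []
    for _ in range(growth_factor):
        for layer in layers:
            grown.append(copy.deepcopy(layer))
    return nn.ModuleList(grown)
```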
A decoder-only transformer with the simplest character-level tokenization, plus training and text-generation code.
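The simplest character-level tokenizer maps each unique character to an integer id. A self-contained Python sketch; the helper names are illustrative:

```python
def build_char_tokenizer(text: str):
    """Character-level tokenizer: one integer id per unique character."""
    vocab = sorted(set(text))
    stoi = {ch: i for i, ch in enumerate(vocab)}
    itos = {i: ch for ch, i in stoi.items()}
    encode = lambda s: [stoi[c] for c in s]
    decode = lambda ids: "".join(itos[i] for i in ids)
    return encode, decode, len(vocab)

encode, decode, vocab_size = build_char_tokenizer("hello world")
assert decode(encode("hello")) == "hello"
```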
Autoregressive text-generation application using a decoder-only transformer.
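Autoregressive generation samples one token at a time and feeds each prediction back into the model. A PyTorch-style sketch assuming a model that returns (batch, seq, vocab) logits; not the application's actual code:

```python
import torch

@torch.no_grad()
def generate(model, idx, max_new_tokens, temperature=1.0):
    """Sample tokens one at a time, appending each to the context."""
    for _ in range(max_new_tokens):
        logits = model(idx)                      # (batch, seq, vocab)
        logits = logits[:, -1, :] / temperature  # only the last position matters
        probs = torch.softmax(logits, dim=-1)
        next_id = torch.multinomial(probs, num_samples=1)
        idx = torch.cat([idx, next_id], dim=1)   # append and repeat
    return idx
```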
A decoder-only approach to image reconstruction, inspired by adversarial machine learning, implemented in Keras/TensorFlow 2.
Decoder-only transformer model for answering short questions using causal self-attention.
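Causal self-attention masks out future positions so each token attends only to itself and earlier tokens. A minimal PyTorch sketch of the masking, not the repository's code:

```python
import math
import torch

def causal_self_attention(q, k, v):
    """Scaled dot-product attention with a causal (lower-triangular) mask."""
    seq_len, d = q.shape[-2], q.shape[-1]
    scores = q @ k.transpose(-2, -1) / math.sqrt(d)
    # Strictly-upper-triangular entries correspond to future positions.
    mask = torch.triu(torch.ones(seq_len, seq_len, dtype=torch.bool), diagonal=1)
    scores = scores.masked_fill(mask, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v
```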