Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper
[ICML2025] SpargeAttention: A training-free sparse attention method that accelerates inference for any model.
LongLive: Real-time Interactive Long Video Generation
Fast Multi-dimensional Sparse Attention
[NeurIPS 2025] Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation
[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models
[ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
Efficient Triton implementation of Native Sparse Attention.
[CoLM'25] The official implementation of the paper "MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression"
Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention
[arXiv 2025] SparseD: Sparse Attention for Diffusion Language Models
[TIP-2025] Official Pytorch implementation of "Structural Similarity-Inspired Unfolding for Lightweight Image Super-Resolution"
Demo code for CVPR2023 paper "Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers"
Dynamic Attention Mask (DAM) generates adaptive sparse attention masks per layer and head for Transformer models, enabling long-context inference with lower compute and memory overhead and no fine-tuning (see the sketch after this list).
The code implementation of paper "VORTA: Efficient Video Diffusion via Routing Sparse Attention"
Building Native Sparse Attention
Toy Hydra prototypes: SSM + sparse attention + MoE + memory; synthetic benchmarks. Paper: https://arxiv.org/abs/2508.15099
Binary classification with a Sparse Attention architecture for tabular data. Automatic hyperparameter optimization via Optuna. Tested on telecom and banking churn datasets.
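
Several entries above (the Native Sparse Attention implementations, FlexPrefill, DAM) share one core idea: pick a small, query-dependent subset of key blocks and attend only there. Below is a minimal sketch of that idea, assuming PyTorch; the function name, the mean-pooled block summaries, and the shapes are illustrative assumptions, not code from any listed repository.

# Illustrative sketch only (assumed PyTorch usage; not code from any repo above).
import torch
import torch.nn.functional as F

def block_topk_attention(q, k, v, block_size=64, topk=4):
    # q, k, v: [batch, heads, seq_len, head_dim]; seq_len must be divisible by block_size.
    b, h, n, d = q.shape
    nb = n // block_size
    # Coarse scores: each query against a mean-pooled summary of every key block.
    k_blocks = k.view(b, h, nb, block_size, d).mean(dim=3)           # [b, h, nb, d]
    coarse = torch.einsum("bhqd,bhkd->bhqk", q, k_blocks)            # [b, h, n, nb]
    # Keep the top-k key blocks per query; the choice differs per head, giving an adaptive mask.
    keep = coarse.topk(topk, dim=-1).indices
    block_mask = torch.zeros_like(coarse, dtype=torch.bool).scatter_(-1, keep, True)
    # Expand the block mask to token level and run masked dense attention (causality omitted).
    token_mask = block_mask.repeat_interleave(block_size, dim=-1)    # [b, h, n, n]
    scores = torch.einsum("bhqd,bhkd->bhqk", q, k) / d ** 0.5
    scores = scores.masked_fill(~token_mask, float("-inf"))
    return torch.einsum("bhqk,bhkd->bhqd", F.softmax(scores, dim=-1), v)

q = k = v = torch.randn(1, 2, 512, 32)        # tiny shapes, just to show the call
print(block_topk_attention(q, k, v).shape)    # torch.Size([1, 2, 512, 32])

Note that this sketch still materializes the dense score matrix and only masks it, so it demonstrates the selection pattern rather than the speedup; the repositories above get real savings by skipping non-selected blocks inside fused Triton/CUDA kernels.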