FrostNet: Towards Quantization-Aware Network Architecture Search
-
Updated
May 3, 2024 - Python
FrostNet: Towards Quantization-Aware Network Architecture Search
A resource-conscious neural network implementation for MCUs
🚀 Leveraging advanced RNN with LSTM for efficient, real-time anomaly detection in IoT networks, optimized for performance in resource-constrained environments.
eve-mli: making learning interesting
Clean C language version of quantizing llama2 model and running quantized llama2 model
This project demonstrates the impact of model design choices on both energy consumption and economic cost. It analyzes the weight importance within a neural network, estimates the total FLOPs required for inference, and explores how quantization and pruning affect resource efficiency.
Code for ICCV2025 paper 'Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers'
A Tutorial Notebook to Quantization in Machine Learning
Add a description, image, and links to the quantization-efficient-network topic page so that developers can more easily learn about it.
To associate your repository with the quantization-efficient-network topic, visit your repo's landing page and select "manage topics."