SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Updated Jul 3, 2024 - Python
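The low-bit quantization these toolkits implement reduces weights from float32 to formats like INT8. As a minimal illustration of the core idea (symmetric per-tensor INT8 quantization, not any specific library's API — the function names here are hypothetical):

```python
import numpy as np

def quantize_int8(w):
    """Symmetric per-tensor INT8 quantization: scale = max|w| / 127."""
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Map INT8 codes back to approximate float32 values."""
    return q.astype(np.float32) * scale

w = np.array([0.5, -1.27, 0.01, 1.0], dtype=np.float32)
q, s = quantize_int8(w)
w_hat = dequantize(q, s)  # close to w, stored in 8 bits per weight
```

Production toolkits add per-channel scales, calibration, and quantization-aware training on top of this basic scheme.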
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Neural Network Compression Framework for enhanced OpenVINO™ inference
LLMC is an elegant tool for LLM compression.
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs
Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
[ECCV 2024] 3D Small Object Detection with Dynamic Spatial Pruning
Config driven, easy backup cli for restic.
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
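Structural pruning, as in LLM-Pruner and Torch-Pruning above, removes whole units (channels, heads, layers) rather than scattered individual weights, so the pruned model is genuinely smaller and faster without sparse kernels. A minimal sketch of the idea, keeping the output channels with the largest L2 norm (illustrative only, not the paper's importance criterion):

```python
import numpy as np

def prune_channels(w, n_keep):
    """Keep the n_keep output channels (rows) with largest L2 norm; drop the rest."""
    norms = np.linalg.norm(w, axis=1)
    keep = np.sort(np.argsort(norms)[::-1][:n_keep])  # strongest rows, original order
    return w[keep], keep

w = np.array([[1.0, 0.0], [0.1, 0.1], [3.0, 4.0]], dtype=np.float32)
pruned, kept = prune_channels(w, 2)  # drops the weakest row (index 1)
```

Real structural pruners must also propagate the removed channels through dependent layers (the following layer's input dimension shrinks to match), which is the hard part these libraries automate.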
[CVPR'24] Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression
Sparsity-aware deep learning inference runtime for CPUs
Efficient computing methods developed by Huawei Noah's Ark Lab
Your local Flux surgeon
FasterAI: Prune and Distill your models with FastAI and PyTorch
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
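Unstructured sparsity methods like OWL build on simple magnitude pruning: zero out the smallest-magnitude fraction of weights. A minimal NumPy sketch of that baseline (illustrative; OWL's contribution is choosing a *different* sparsity ratio per layer based on outliers, which this sketch does not do):

```python
import numpy as np

def magnitude_prune(w, sparsity):
    """Zero out the smallest-magnitude `sparsity` fraction of weights."""
    k = int(round(sparsity * w.size))
    if k == 0:
        return w.copy()
    thresh = np.sort(np.abs(w).ravel())[k - 1]  # k-th smallest magnitude
    mask = np.abs(w) > thresh
    return w * mask

w = np.array([[0.1, -2.0], [0.05, 1.5]], dtype=np.float32)
pruned = magnitude_prune(w, 0.5)  # half the entries become zero
```

The zeroed weights only translate into speedups on sparsity-aware runtimes (such as the CPU inference runtime listed above) or hardware with sparse-kernel support.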
Chess engine
Prune is a simple tool that lets you remove archives in a folder, deleting any archives not matching the specified retention options.
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.