Skip to content
@IST-DASLab

IST Austria Distributed Algorithms and Systems Lab

Popular repositories Loading

  1. gptq gptq Public

    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

    Python 2k 164

  2. sparsegpt sparsegpt Public

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    Python 763 99

  3. marlin marlin Public

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

    Python 705 57

  4. PanzaMail PanzaMail Public

    Python 278 15

  5. qmoe qmoe Public

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python 265 22

  6. QUIK QUIK Public

    Repository for the QUIK project, enabling the use of 4bit kernels for generative inference - EMNLP 2024

    C++ 175 14

Repositories

Showing 10 of 54 repositories
  • EvoPress Public
    IST-DASLab/EvoPress’s past year of commit activity
    Python 17 1 0 0 Updated Feb 13, 2025
  • QuEST Public

    Work in progress.

    IST-DASLab/QuEST’s past year of commit activity
    Jupyter Notebook 31 MIT 1 1 0 Updated Feb 13, 2025
  • PanzaMail Public
    IST-DASLab/PanzaMail’s past year of commit activity
    Python 278 Apache-2.0 15 5 5 Updated Feb 10, 2025
  • ScalableMNN Public

    Official Repository for "Scalable Mechanistic Neural Networks" (ICLR 2025)

    IST-DASLab/ScalableMNN’s past year of commit activity
    0 MIT 0 0 0 Updated Feb 6, 2025
  • gemm-fp8 Public

    High Performance FP8 GEMM Kernels for SM89 and later GPUs.

    IST-DASLab/gemm-fp8’s past year of commit activity
    Cuda 3 MIT 0 0 0 Updated Jan 24, 2025
  • GridSearcher Public

    GridSearcher simplifies running grid searches for machine learning projects in Python, emphasizing parallel execution and GPU scheduling without dependencies on SLURM or other workload managers.

    IST-DASLab/GridSearcher’s past year of commit activity
    Python 2 Apache-2.0 0 0 0 Updated Jan 23, 2025
  • gemm-int8 Public

    High Performance Int8 GEMM Kernels for SM80 and later GPUs.

    IST-DASLab/gemm-int8’s past year of commit activity
    Python 3 MIT 0 0 0 Updated Jan 15, 2025
  • MicroAdam Public

    This repository contains code for the MicroAdam paper.

    IST-DASLab/MicroAdam’s past year of commit activity
    Python 16 Apache-2.0 4 1 0 Updated Dec 14, 2024
  • llm-foundry Public Forked from mosaicml/llm-foundry

    LLM training code for Databricks foundation models

    IST-DASLab/llm-foundry’s past year of commit activity
    Python 0 Apache-2.0 549 0 1 Updated Nov 27, 2024
  • IST-DASLab/marlin_artifact’s past year of commit activity
    Python 0 0 0 0 Updated Nov 25, 2024

Top languages

Loading…

Most used topics

Loading…