I have used the PEFT LoRA technique to fine-tune a Mistral model, with bitsandbytes for quantization. I wanted to know where my quantized model gets saved, and how I can use the quantized model together with my adapter layers. If you have any reference articles, please share.