INT4 LoRA high-quality-tuning vs QLoRA: A user inquired about the discrepancies amongst INT4 LoRA high-quality-tuning and QLoRA in terms of accuracy and speed. Another member explained that QLoRA with HQQ consists of frozen quantized weights, isn't going to use tinnygemm, and utilizes dequantizing together with torch.matmulUpdate vision model to gp