LLM Compression: Enhancing AWQ

Dec 1, 2024

Graduation project focused on improving AWQ (Activation-aware Weight Quantization) with extra scaling.