Satoshi Matsuoka
@profmatsuoka
理研計算科学研究センター長 Director RIKEN R-CCS, 東科大特定教授 Prof. Inst. Sci.. ACM/ISC/JSSST/IPSJ Fellows, IEEE Fernbach(2014)&Cray(2022) Awards, 令4紫綬褒章 Purple Ribbon Medal 2022
ID: 59962128
https://www.r-ccs.riken.jp/ 25-07-2009 02:59:49
43,43K Tweet
25,25K Followers
920 Following
Islam Mesabah 🇵🇸 DailyPapers Calibration-free quantization, as in Huawei's SINQ, compresses LLM weights (e.g., to 4-bit) without needing calibration data. It applies a Sinkhorn-Knopp algorithm to normalize per-row/column variances, adding a second-axis scale factor to minimize matrix imbalance. This reduces