BitCPM-CANN: Native 1.58-Bit Large Language Model Training on Ascend NPU

r/LocalLLaMA
Machine Learning Generative AI AI Hardware AI Research

Paper: Abstract We present BitCPM-CANN, a systematic family-level study of 1.58-bit (ternary) quantization-aware