BitCPM-CANN: Native 1.58-Bit Large Language Model Training on Ascend NPU
r/LocalLLaMA
•
Machine Learning
Generative AI
AI Hardware
AI Research
Paper: Abstract We present BitCPM-CANN, a systematic family-level study of 1.58-bit (ternary) quantization-aware