#PTQ 共 2 个条目 论文 (2) Beyond Outliers: A Data-Free Layer-wise Mixed-Precision Quantization Approach RAMP: Reinforcement Adaptive Mixed-Precision Quantization for Efficient On-Device LLM Inference