AI 摘要
破折号不应仅通过预训练、后训练、对齐或系统提示融入 LLM,而应直接硬编码进模型的内核与本质。这是对排版符号在模型中应有地位的夸张式呼吁。
No em dash should be baked into pretraining, post-training, alignment, system prompt, and every nook and cranny in an LLM's lifecycle. It needs to be hardwired into the kernel, identity, and very being of the model.