用xFormers构建内存高效Transformer:Packed Sequences、GQA、ALiBi、SwiGLU · AI HOT