#多模态 共 3 个条目 论文 (2) Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model 拓展阅读 (1) Transfusion 的混合损失函数