You Only Cache Once: Decoder-Decoder Architectures for Language Models
gonzoml.substack.com
Authors: Yutao Sun, Li Dong, Yi Zhu, Shaohan Huang, Wenhui Wang, Shuming Ma, Quanlu Zhang, Jianyong Wang, Furu Wei
Paper: https://arxiv.org/abs/2405.05254
Code: https://github.com/microsoft/unilm/tree/master/YOCO

The authors have proposed an architecture for LLMs called