Multimodal Token Fusion for Vision Transformers

  • Yikai Wang
    Tsinghua University,Beijing National Research Center for Information Science and Technology (BNRist), State Key Lab on Intelligent Technology and Systems,Department of Computer Science and Technology
  • Xinghao Chen
    Huawei Noah's Ark Lab
  • Lele Cao
    Tsinghua University,Beijing National Research Center for Information Science and Technology (BNRist), State Key Lab on Intelligent Technology and Systems,Department of Computer Science and Technology
  • Wenbing Huang
    Institute for AI Industry Research (AIR), Tsinghua University
  • Fuchun Sun
    Tsinghua University,Beijing National Research Center for Information Science and Technology (BNRist), State Key Lab on Intelligent Technology and Systems,Department of Computer Science and Technology
  • Yunhe Wang
    Huawei Noah's Ark Lab

収録刊行物

被引用文献 (1)*注記

もっと見る

詳細情報 詳細情報について

問題の指摘

ページトップへ