publications

(*) denotes equal contribution.

2023

  1. arXiv
    UniIR: Training and Benchmarking Universal Multimodal Information Retrievers
    Cong Wei, Yang Chen, Haonan Chen, and 5 more authors
    arXiv:2311.17136, 2023
  2. arXiv
    MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
    Xiang Yue, Yuansheng Ni, Kai Zhang, and 9 more authors
    arXiv:2311.16502, 2023
  3. TMLR
    DreamEdit: Subject-driven Image Editing
    Tianle Li, Max Ku*, Cong Wei*, and 1 more author
    Transactions on Machine Learning Research (TMLR), 2023
  4. CVPR
    Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers
    Cong Wei*, Brendan Duke*, Ruowei Jiang, and 3 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023