XMAI at ACL'23
Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning
General GPT
WIP: Multimodal Large Language Model