2024.10.15

Cascaded Transformer-based Networks for Wikipedia Large-scale Image-Caption Matching