Author,Title,Journal,ISSN,Publisher,Date,Volume,Number,Page,URL,URL(DOI)
Li,BLIP-2: Bootstrapping language-image pre-training with frozen image encoders and large language models,arXiv:2301.12597,,,2023,,,,https://cir.nii.ac.jp/crid/1370584341837295903,