BLIP¶ 约 23 个字 预计阅读时间不到 1 分钟 BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models