PaliGemma is an open-source Vision-Language Model (VLM) designed to address a broad range of vision-language tasks. It combines the SigLIP-So400m vision encoder and the Gemma-2B language model, ...
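One way to picture how a vision encoder and a language model can be combined is sketched below. It is only an illustration, not PaliGemma's actual implementation. It assumes the image patch features are linearly projected to the language model's embedding width and placed in front of the text token embeddings. The function names, the toy dimensions, and the random values are all hypothetical.

```python
import random


def linear_project(features, weight):
    """Map each row of `features` (n x d_in) to d_out using `weight` (d_in x d_out)."""
    d_out = len(weight[0])
    return [
        [sum(row[i] * weight[i][j] for i in range(len(row))) for j in range(d_out)]
        for row in features
    ]


def build_input_sequence(image_features, projection, text_embeddings):
    """Project image tokens to the language model width, then prepend them to the text tokens."""
    image_tokens = linear_project(image_features, projection)
    return image_tokens + text_embeddings


def rand_matrix(rows, cols, rng):
    """Random stand-in for encoder outputs or learned weights (illustration only)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


# Toy sizes chosen for readability (assumption: far smaller than any real model).
NUM_PATCHES, VISION_DIM, LM_DIM, NUM_TEXT = 4, 6, 8, 3

rng = random.Random(0)
image_features = rand_matrix(NUM_PATCHES, VISION_DIM, rng)  # stand-in for vision encoder output
projection = rand_matrix(VISION_DIM, LM_DIM, rng)           # stand-in for a learned projection
text_embeddings = rand_matrix(NUM_TEXT, LM_DIM, rng)        # stand-in for embedded prompt tokens

sequence = build_input_sequence(image_features, projection, text_embeddings)
print(len(sequence), len(sequence[0]))  # 7 8
```

In this sketch the language model would consume `sequence` as a single stream of embeddings, so the image and the text share one embedding width after projection.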