Open-source multimodal model for text and image understanding.
Advanced AI model for multimodal applications on mobile and web.