Sale!

Paligemma-3B-PT-896

Original price was: $0.05.Current price is: $0.01.

Pricing is for per inference request. Get 500 requests for $5!

The Paligemma-3b-pt-896 is a versatile and lightweight vision-language model (VLM) inspired by PaLI-3 and based on open components such as the SigLIP vision model and the Gemma language model. It takes both image and text as input and generates text as output, supporting multiple languages. It is designed for class-leading fine-tune performance on a wide range of vision-language tasks such as image and short video caption, visual question answering, text reading, object detection and object segmentation.

Paligemma-3b-pt-896 on Qubrid AI Model Studio

We have simplified how you can use or fine-tune Paligemma-3b-pt-896 on our AI Model Studio running on Qubrid’s AI Cloud. Powered by NVIDIA GPUs, we offer you performance with simplicity so you can build your Paligemma-3b-pt-896 applications quickly without the need to setup or install anything. Login now to inference or fine tune this model – no programming needed.

Learn how to Fine-tune AI Models on Qubrid AI Platform

AI Model Author: Google

This model is not owned or developed by Qubrid AI. This model has been developed and built to a third-party’s requirements for this application and use case.

SKU: PALIGEMMA-3B-PT-896 Category:

Request Quote / Product Info

  • Easy Deployment
  • Simple UI
  • Powerful AI Models and GPUs

Brand

AI Model Type

Image to Text

Shopping Cart
Scroll to Top