The Phi-3-Vision-128K-Instruct is a lightweight, state-of-the-art open multimodal model built upon datasets which include synthetic data and filtered publicly available websites, with a focus on very high-quality, reasoning dense data both on text and vision. The model belongs to the Phi-3 model family, and the multimodal version comes with 128K context length (in tokens) it can support. The model underwent a rigorous enhancement process, incorporating both supervised fine-tuning and direct preference optimization to ensure precise instruction adherence and robust safety measures.
Phi-3-Vision-128K-Instruct on Qubrid AI Model Studio
We have simplified how you can use or fine-tune Phi-3-Vision-128K-Instruct on our AI Model Studio running on Qubrid’s AI Cloud. Powered by NVIDIA GPUs, we offer you performance with simplicity so you can build your Phi-3-Vision-128K-Instruct applications quickly without the need to setup or install anything. Login to try for free.
Learn how to Fine-tune AI Models on Qubrid AI Platform
AI Model Author: Microsoft
This model is not owned or developed by Qubrid AI. This model has been developed and built to a third-party’s requirements for this application and use case.