VOOZH about

URL: https://huggingface.co/stabilityai/japanese-stable-vlm

⇱ stabilityai/japanese-stable-vlm · Hugging Face


You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

By clicking "Agree", you agree to the License Agreement and acknowledge Stability AI's Privacy Policy.

Log in or Sign Up to review the conditions and access this model content.

Japanese Stable VLM

Please note: for commercial usage of this model, please see https://stability.ai/license

商用利用に関する日本語での問い合わせは partners-jp@stability.ai までお願い致します。

Model Details

Japanese Stable VLM is a vision-language instruction-following model that enables to generate Japanese descriptions for input images and optionally input texts such as questions.

Usage

Model Details

Training

This model is a vision-language instruction-following model with the LLaVA 1.5 architecture. It uses stabilityai/japanese-stablelm-instruct-gamma-7b as a language model and openai/clip-vit-large-patch14 as an image encoder. During training, the MLP projection was trained from scratch at the first stage and the language model and the MLP projection were further trained at the second stage.

Training Dataset

The training dataset includes the following public datasets:

Use and Limitations

Intended Use

This model is intended to be used by the open-source community in vision-language applications.

Limitations and bias

The training dataset may have contained offensive or inappropriate content even though we applied data filters. We recommend users exercise reasonable caution when using these models in production systems. Do not use the model for any applications that may cause harm or distress to individuals or groups.

How to cite

@misc{JapaneseStableVLM, 
 url = {[https://huggingface.co/stabilityai/japanese-stable-vlm](https://huggingface.co/stabilityai/japanese-stable-vlm)}, 
 title = {Japanese Stable VLM}, 
 author = {Shing, Makoto and Akiba, Takuya}
}

Contact

Downloads last month
8
Safetensors
Model size
8B params
Tensor type
F32
·

Spaces using stabilityai/japanese-stable-vlm 2

Collection including stabilityai/japanese-stable-vlm

Paper for stabilityai/japanese-stable-vlm