Cloudflare 中文文档
Workers AI
编辑这个页面
跳转官方原文档
Set theme to dark (⇧+D)

uform-gen2-qwen-500m

Beta

Model ID: @cf/unum/uform-gen2-qwen-500m

UForm-Gen is a small generative vision-language model primarily designed for Image Captioning and Visual Question Answering. The model was pre-trained on the internal image captioning dataset and fine-tuned on public instructions datasets: SVIT, LVIS, VQAs datasets.

More Information  

​​ Properties

Task Type: Image-to-Text

​​ Code Examples

Workers - TypeScript

​​ Response

​​ API Schema

The following schema is based on JSON Schema

Input JSON Schema
Output JSON Schema