baidu/Qianfan-OCR
Image-Text-to-Text • 5B • Updated • 19.1k • 806
Qianfan-vl model series. The models are mainly domain enhanced vision language model, targeting enterprise level multi modal understanding scenarios.
Domain-Enhanced Universal Vision-Language Models