Alibaba, one of China’s biggest tech companies, announced the release of two new A.I. models on Friday that can interpret images as well as text.
The open-source models, called Qwen-VL and Qwen-VL-Chat, are vision language models, meaning they can “read” images in addition to text, which Alibaba pitches as a point of difference from competitors such as ChatGPT and Google Bard. Qwen-VL-Chat promises complex features like providing directions by scanning street signs, solving math equations based on a photo, and weaving together a narrative based on multiple pictures. For example, it can scan an image of a hospital sign written in Mandarin and translate it into English, or help a news organization write a caption for a photo, the company says.
Qwen-VL, the other release on Friday, is an updated version of Alibaba’s existing image-reading chatbot that can now read pictures in higher resolution.
Alibaba declined to comment to Fortune beyond its public announcement.
These new iterations of A.I. are the latest shots fired in the arms race among developers to create increasingly sophisticated tools, as the technology graduates from gimmick to genuine game-changer. Alibaba says its new image-scanning technology could help visually impaired people with shopping, for instance by letting them scan an item and have the chatbot recite the label back to them.
Both models will be made available on Alibaba Cloud’s proprietary model-as-a-service platform ModelScope and on Hugging Face, the popular startup that hosts a library of A.I. models.
Alibaba’s release comes just a day after Meta launched an A.I. model fine-tuned for writing code, built on the open-source Llama 2 model released in July. Alibaba has been trying to keep up with Meta’s A.I. rollouts for the last few months. Earlier this month, Alibaba unveiled its first two open-source large language models, Qwen-7B and Qwen-7B-Chat—the same ones that form the basis for Friday’s releases. In July, the two companies struck an agreement to make Meta’s Llama 2 model available to the Chinese market via Alibaba’s cloud division.
By making these new models open-source, Alibaba is letting users tweak the tools to develop their own apps or conduct research. Most A.I. companies hope that users will adapt open-source models into tools for highly specific use cases, without having to undertake the onerous task of building a large language model from scratch. Alongside the open-source offerings, the companies offer their proprietary models as a service, hoping to capture market share in the burgeoning industry.