Alibaba launches AI models that understand images and have more complex conversations

The artificial intelligence (AI) space is heating up. Just yesterday, South Korea’s Naver announced the launch of HyperClova X, a new generative AI service to compete with ChatGPT. Now, China’s internet giant Alibaba is unveiling two open-source AI models that can understand images and hold more complex conversations.

On Friday, Alibaba unveiled the new AI models, designed to comprehend images and engage in more intricate conversations than its previous offerings. The release comes at a time of intense global competition for technological leadership.

The Chinese tech powerhouse said that its two new models, Qwen-VL and Qwen-VL-Chat, will be made available as open-source tools, meaning that researchers, educators, and businesses around the world can use them to build their own AI applications without having to train systems from scratch, saving both time and cost.
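
To give a sense of what that open-source release means in practice, here is a minimal sketch of how such a checkpoint might be loaded and queried through Hugging Face transformers. The hub ID "Qwen/Qwen-VL-Chat", the from_list_format helper, and the model.chat method are assumptions based on how Qwen checkpoints are typically published with trust_remote_code, not details confirmed by the article.

```python
# Hedged sketch: pulling an open Qwen-VL-Chat checkpoint and asking it a question
# about an image. The hub ID, the from_list_format helper, and model.chat are
# assumptions about the released interface; adjust to the official model card.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen-VL-Chat", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen-VL-Chat", device_map="auto", trust_remote_code=True
).eval()

# Mixed image + text prompt: caption a local photo (hypothetical path).
query = tokenizer.from_list_format([
    {"image": "street_scene.jpg"},
    {"text": "Describe this picture in one sentence."},
])
caption, _ = model.chat(tokenizer, query=query, history=None)
print(caption)
```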

The news comes just a month after Alibaba launched Tongyi Wanxiang, an AI image-generation tool that competes with OpenAI’s DALL-E and with Midjourney. Tongyi Wanxiang, launched by Alibaba’s cloud division, lets users input text prompts in either Chinese or English, and the tool generates corresponding images in various styles, such as sketches or 3D cartoons. Currently, it is available for beta testing exclusively to enterprise customers in China.

The two new AI models were also developed by the company’s cloud unit, Alibaba Cloud. According to the company, Qwen-VL is an advanced evolution of its 7-billion-parameter large language model, Tongyi Qianwen, and accepts both images and text prompts.

Alibaba added that Qwen-VL can perform multiple tasks at the same time: it can answer open-ended questions about different images and generate captions for them.

But the real star of the show is Qwen-VL-Chat. It handles more intricate interactions, such as comparing multiple images and following multiple rounds of questioning. Alibaba says it can also write stories, create images based on user-submitted photos, and solve math problems presented in pictures.

One example the company gave involves a hospital sign written in Chinese: Qwen-VL-Chat can read the sign and explain where the different hospital departments are located.
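
A multi-round exchange of the kind described above might look roughly like the following, continuing with the model and tokenizer loaded in the earlier sketch. The sign photo and the questions are hypothetical, and the history-passing pattern is an assumption about the chat interface Qwen models usually expose.

```python
# Hedged continuation of the earlier sketch (same `model` and `tokenizer`):
# a two-round conversation about a single image, with history carried forward.
query = tokenizer.from_list_format([
    {"image": "hospital_sign.jpg"},  # hypothetical photo of a Chinese hospital sign
    {"text": "What departments are listed on this sign?"},
])
answer, history = model.chat(tokenizer, query=query, history=None)
print(answer)

# Follow-up in the same conversation: the image stays in context via `history`.
follow_up, history = model.chat(
    tokenizer,
    query="Which floor is the radiology department on?",
    history=history,
)
print(follow_up)
```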

Until now, much of AI’s “genius” has been about text. But times are changing: Qwen-VL-Chat and the latest version of OpenAI’s ChatGPT can respond to images with text, as if AI were learning to speak a new visual language.

