Get the latest tech news
QVQ-Max: Think with Evidence
QWEN CHAT GITHUB HUGGING FACE MODELSCOPE DISCORD Introduction Last December, we launched QVQ-72B-Preview as an exploratory model, but it had many issues. Today, we are officially releasing the first version of QVQ-Max, our visual reasoning model. This model can not only “understand” the content in images and videos but also analyze and reason with this information to provide solutions. From math problems to everyday questions, from programming code to artistic creation, QVQ-Max has demonstrated impressive capabilities.
Flexible Application: From Problem-Solving to Creation Beyond analysis and reasoning, QVQ-Max can also perform interesting tasks like helping you design illustrations, generate short video scripts, or even create role-playing content based on your requirements. Learning Assistant: For students, QVQ-Max can help solve difficult problems in subjects like math and physics, especially those accompanied by diagrams. Visual Agent: Improve the model’s ability to handle multi-step and more complex tasks, such as operating smartphones or computers, and even playing games.
Or read this on Hacker News