Visual Large Language Models

14d

Z.ai debuts open source GLM-4.6V, a native tool-calling vision model for multimodal reasoning

Chinese AI startup Zhipu AI aka Z.ai has released its GLM-4.6V series, a new generation of open-source vision-language models ...

WinBuzzer

Z.ai Launches GLM-4.6V AI Model to Let AI Agents See Natively

V, a multimodal model that has introduced native visual function calling to bypass text conversion in agentic workflows.

EurekAlert!

New multi-modal AI framework brings human-like reasoning to self-driving vehicles

Autonomous driving systems increasingly rely on data-driven approaches, yet many still struggle with reasoning, handling rare scenarios, and transparently explaining their actions. A new study ...

Hosted on MSN

Large Language Models Get All the Hype, but Small Models Do the Real Work

There’s a paradox at the heart of modern AI: The kinds of sophisticated models that companies are using to get real work done and reduce head count aren’t the ones getting all the attention. Ever-more ...

Science Daily

Like human brains, large language models reason about diverse data in a general way

Researchers find large language models process diverse types of data, like different languages, audio inputs, images, etc., similarly to how humans reason about complex problems. Like humans, LLMs ...

InfoQ

LLaVA-CoT Shows How to Achieve Structured, Autonomous Reasoning in Vision Language Models

A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...

Unite.AI

Book Review: Large Language Models by Stephan Raaijmakers

As someone who owns more than fifteen volumes from the MIT Press Essential Knowledge series, I approach each new release with both interest and caution: the series often delivers thoughtful, ...

InfoQ

Meta Open-Sources Large Concept Model, a Language Model That Predicts Entire Sentences

Some results have been hidden because they may be inaccessible to you

Show inaccessible results