Chinese AI startup Zhipu AI aka Z.ai has released its GLM-4.6V series, a new generation of open-source vision-language models ...
V, a multimodal model that has introduced native visual function calling to bypass text conversion in agentic workflows.
Autonomous driving systems increasingly rely on data-driven approaches, yet many still struggle with reasoning, handling rare scenarios, and transparently explaining their actions. A new study ...
There’s a paradox at the heart of modern AI: The kinds of sophisticated models that companies are using to get real work done and reduce head count aren’t the ones getting all the attention. Ever-more ...
Researchers find large language models process diverse types of data, like different languages, audio inputs, images, etc., similarly to how humans reason about complex problems. Like humans, LLMs ...
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...
As someone who owns more than fifteen volumes from the MIT Press Essential Knowledge series, I approach each new release with both interest and caution: the series often delivers thoughtful, ...
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...