Vision-language models (VLMs) are rapidly changing how humans and robots work together, opening a path toward factories where machines can “see,” ...