Elon Musk’s AI chatbot, Grok, just got a serious upgrade—and it’s stepping up its game in the AI arms race. xAI, the Musk-founded company behind Grok, has rolled out two major features: Grok Vision and real-time multilingual voice capabilities.
Grok Vision is exactly what it sounds like—Grok can now “see” through your phone’s camera and respond to visual cues.
Think pointing your camera at a plant and asking, “Is this toxic to pets?” or scanning a street sign in a foreign language.
At launch, this feature is available on iOS devices, with Android support expected soon.
While it doesn’t support screen sharing or screenshot analysis just yet, the fact that Grok is now image-aware makes it more interactive and practical than ever.
On the audio front, Grok’s voice capabilities have gone multilingual. It can now converse in multiple global languages with real-time voice responses.
This puts it toe-to-toe with the likes of ChatGPT and Gemini, especially for users outside English-dominant regions.
Android users are the first to access this voice rollout, but iOS support is on the horizon.
The upgrades clearly show xAI’s intent to make Grok more than just a clever chatbot—it wants to build a fully integrated assistant for Musk’s social platform X (formerly Twitter).
With vision and voice rolled in, Grok is evolving into something more akin to a virtual companion that understands both what you say and what you see.
It’s still early days, and Grok has a lot of catching up to do with rivals like OpenAI and Google.
But with these sensory and linguistic boosts, Musk’s AI is no longer just talking—it’s watching and listening too.
Read More Related Blogs: