Google’s AI display glasses: prototype dazzles, reality bites

Google’s AI – At Google I/O, the company previewed its upcoming Android XR AI glasses with an in-lens display—an upgrade meant to blend what you see with Gemini audio, while promising widgets, translation, navigation, and AI camera features. But the hands-on prototype came
By the time the demo reps handed over the prototype, it already felt like the real product was close—and just far enough to remind you this is still engineering, not retail.
At Google’s I/O developer conference this week. MISRYOUM Tech News had a brief hands-on with the company’s upcoming AI-powered glasses with both audio and visual output. Google described a first generation of audio-only glasses that it said will begin shipping this fall. The display version is a next step beyond that: Android XR glasses with an in-lens display that overlays information on the real world.
Google’s pitch is familiar to anyone who’s watched the rapid rise of “spatial” assistants. In the glasses’ view. widgets can appear in front of you—showing items like the weather. walking directions. Uber pickup details. live translation. and more. The company also showed that users could design their own widgets using AI.
Those visuals sit alongside Gemini. To activate Gemini, the user performs a two-second press on the right side of the glasses’ frame. A startup chime signals that Gemini is on and listening. In the prototype, starting up Gemini also started the camera at the same time. In the shipping version, users will be able to configure whether the camera turns on when Gemini starts.
The biggest difference between what was tested and what’s coming is also the simplest: this one doesn’t yet detect when it’s on your head. The reps said the shipping version will detect when the glasses are placed on your head and taken off. but the prototype used in the demo didn’t include that feature.
That gap showed up elsewhere too. One of the most telling moments came when the testers tried photo capture and object identification. When the photo capture button was pressed, the display was off, and the picture transferred to a phone and watch. Later, long-press video capture was said to be supported, but it wasn’t available to test on the prototype. For the AI editing flow. the demo allowed voice control such as asking Gemini to take a photo and turn a person into an anime character. The photo goes to the phone. then to the Gemini and Nano Banana servers. and returns to the wearer as the edited version.
In the noisy Google I/O venue—where Wi-Fi was under a heavy load—the back-and-forth time became part of the experience. The round-trip for the edited result took around 45 seconds.
Music was the first “comfort” use case, but even that didn’t fully land. The team asked Gemini to play a favorite artist and cranked the volume to the maximum. In the crowded, noisy environment, the sound still wasn’t crisp or detailed enough to judge the audio quality confidently. The initial impression from that limited test was that the glasses wouldn’t replace higher-quality earbuds. but they could still work if you wanted music outdoors while walking. hiking. or doing chores at home. The upsides were practical: without earbuds in your ears. it’s easier to hear someone talking. compared with transparency-mode experiences on devices like Apple’s AirPods.
To stop music, a single tap on the side of the frame—around the middle, as if tapping on your temple—was shown in the demo.
The visual experience, meanwhile, arrived with its own rough edges. With the display enabled, a simple home screen appeared in the wearer’s field of view. The demo version had widgets preloaded, including weather and a countdown to Google’s I/O event. It also allowed quick launchers into apps like Google Maps or Translate if those were among a user’s main use cases.
In the prototype, there was just one display over the right eye. The image looked a little fuzzy. and the reps pointed to the testing setup: prescription contacts optimized for distance on one lens and near vision on the other. When one eye was closed. the image came into better focus—but the experience still quickly caused eye strain above the right eye. and it wasn’t clear whether the prescription or the display itself was responsible.
Translation was one of the cleanest demonstrations. One demonstrator spoke rapid Spanish, and the glasses automatically detected the language and displayed English text on the display. At the same time, Gemini spoke English in the wearer’s ear. The demo strongly suggested a world-traveler use case: “We could see world travelers buying the glasses for this experience alone. ” the hands-on experience noted.
Google says Translate will work on the audio-only glasses as well, just without the text appearing on the glasses. In that mode, the transcription would show on the phone alongside the real-time audio feedback.
Navigation aimed at the same “hands-free” promise. The demo started Google Maps navigation by asking Gemini to navigate to a destination—even something vague like “the nearest coffee shop.” Gemini would activate Google Maps on the phone. without taking the phone out of a pocket. After a brief delay while the experience loaded, turn-by-turn directions appeared on the glasses. When looking forward, the wearer would see the next-turn information. When looking down, the display would show the blue dot on a map to help orientation. Turning left or right would rotate the experience in space. Then the demo returned the wearer to walking without the map blocking the forward view. and because the experience is tied to Google Maps on the phone. saved destinations like “home” and “work” would be available.
Not every object-identification test ran smoothly. The prototype initially struggled to identify a replica of a Monet painting on a shelf because it didn’t automatically enable the camera—it had to be turned on again from the app. Even so. after a couple of questions and moving closer to focus on the Monet signature at the bottom left. Gemini eventually said it looked like a Monet.
Other tests were smoother. The glasses immediately identified a plant on the shelf and answered questions about recipes in a book.
That contrast raised an unspoken question during the hands-on: these are tasks many people already do today with Google Lens or AI models integrated into chatbot apps—so why the glasses? The demo’s appeal was the friction reduction: doing it without pulling out a phone.
As for who helped build the look and feel, Google said the eyewear with the display was developed in partnership with Warby Parker, Gentle Monster, and Samsung, blending Google technology with those brands’ design aesthetics.
Prototype and shipping versions also differed in hardware scope. The prototype tested had a single display over the right eye, but the platform can support both single and dual displays, as well as audio-only glasses.
Google says it will have more to share about its Android XR display glasses later this year, when it expands its trusted tester program. Until then, the company’s message leans on audio: it believes audio will suffice for some users’ needs.
That’s also where the other product line comes in. Google’s first-generation audio glasses—expected to ship this fall—provide access to Gemini with audio delivered through the glasses’ frame speakers. Users can do things like listen to music through the glasses. press a button to take a photo. make a call. and tap into phone apps. in the same broad way that the future display version is meant to work.
During the demo, third-party app tapping wasn’t specifically shown, but the glasses are designed to let Gemini handle tasks like “take the ingredients from this recipe and add them to my shopping list.”
Google also showcased an example from its keynote where the glasses could see a meal being cooked on the stove and offer feedback—such as whether the meat was fully done yet.
Google AI glasses Android XR Gemini Google I/O Warby Parker Gentle Monster Samsung translation Google Maps navigation Nano Banana trusted tester program AR display
So are these shipping or just more I/O hype?
I watched the headline and it sounds like they can put Uber pickups in your face?? Cool but also creepy. If it’s audio-only first then why show the fancy display prototype at all.
The “in-lens display” sounds like it’s gonna be like those AR glasses where the text is always blurry. Also if it overlays widgets, doesn’t that basically mean they’re tracking where you look? Not reading the whole thing but I’m betting it’s more marketing than reality.
Engineering not retail… so basically it won’t work right when it “ships this fall.” I’m just waiting for the first time it mis-reads something and gives you the wrong translation or navigation. And “AI camera features” is how you end up recording strangers without realizing it. Hard pass for me unless they make it super simple and safe.