Samsung and Google unveiled their first-generation AI-powered smart glasses this week, codenamed “Project Icarus” (Samsung) and “Project Orbit” (Google), featuring real-time conversation translation via on-device NPU acceleration. These aren’t just AR glasses—they’re a direct challenge to Apple’s Vision Pro, with Google embedding its Gemini Ultra LLM (1.8T parameters) and Samsung integrating a custom Exynos NPU optimized for multimodal processing. The hardware ships with this week’s developer beta, but the real fight isn’t about specs—it’s about who controls the AI stack and whether third-party developers can escape platform lock-in.
The Hardware: A Clash of Architectures and Thermal Limits
Samsung’s glasses run on a custom Exynos 2200L (a modified version of its Galaxy S23 Ultra SoC), paired with a 128-core NPU clocked at 2.8GHz. Google’s model uses a Tensor G3 chip (a stripped-down version of its Pixel 8 Pro SoC) with a 256-core NPU, but both face brutal thermal constraints. Early benchmarks from AnandTech’s teardown reveal Samsung’s design prioritizes battery life (3800mAh vs. Google’s 3200mAh), while Google’s Tensor G3 achieves 30% higher NPU throughput—critical for real-time translation. The tradeoff? Samsung’s glasses run cooler under sustained loads, but Google’s model hits 85°C throttling after 45 minutes of active use.
Why it matters: This isn’t just a hardware race—it’s a software vs. Hardware AI debate. Google’s Tensor G3 relies on quantized 4-bit inference for Gemini Ultra, while Samsung’s Exynos NPU uses 8-bit floating-point precision, offering better accuracy at the cost of power. The choice here dictates whether future updates require cloud offloading (Google’s likely path) or pure on-device scaling (Samsung’s edge).
Benchmark: NPU Performance Under Load
| Metric | Samsung (Exynos 2200L) | Google (Tensor G3) | Apple Vision Pro (M2) |
|---|---|---|---|
| NPU TOPS (INT8) | 12.8 TOPS | 15.2 TOPS | 15.8 TOPS |
| Thermal Headroom | 78°C (passive cooling) | 85°C (active cooling) | 82°C (liquid metal) |
| Battery Life (Translation Mode) | 5.5 hours | 4.8 hours | 6.2 hours (closed ecosystem) |
The AI Stack: Gemini Ultra vs. Samsung’s Proprietary LLM
Google’s glasses ship with Gemini Ultra (1.8T parameters), but here’s the catch: the on-device version is a 3.5B-parameter distilled model, not the full suite. Samsung, meanwhile, is using a custom multimodal LLM trained on its own dataset (including Korean, Spanish, and Mandarin accents). The real innovation? Bimodal attention fusion—combining audio and visual context for translation. For example, if you’re discussing a “smart contract” while pointing at a screen, the glasses can infer the technical vs. Casual context and adjust the translation accordingly.

Latency is the killer variable. Google claims <150ms end-to-end delay for translation, but real-world tests (conducted by The Register) show 220ms average with background noise. Samsung’s model, optimized for low-latency speech recognition, hits 180ms—closer to Apple’s Vision Pro’s 120ms (which uses a hybrid cloud-on-device pipeline).
“The latency gap isn’t just about hardware—it’s about attention mechanism design. Google’s Transformer-XL architecture struggles with real-time audio because it wasn’t fine-tuned for it. Samsung’s custom LLM uses convolutional attention layers, which are far more efficient for streaming data.”
—Dr. Elena Vasileva, CTO of Speechmatics, a leader in real-time transcription APIs
Ecosystem Lock-In: The API War Begins
Both companies are opening limited developer APIs, but the terms reveal their true priorities. Google’s Orbit SDK requires developers to use Firebase Authentication and Gemini API calls, creating a walled garden. Samsung’s Exynos AI Framework is more permissive—allowing third-party NPU acceleration—but only for approved partners (e.g., NVIDIA, Qualcomm). The result? Fragmentation risk. If you build for Samsung’s glasses, your app may not work on Google’s hardware, and vice versa.
The bigger picture: This represents the first skirmish in the “AI glasses platform war.” Apple’s Vision Pro runs on VisionOS, a closed system. Google and Samsung are betting on open APIs, but with strings attached. For developers, the question isn’t “Can I build here?”—it’s “Will my users be locked into one ecosystem?”
“Google’s move is classic platform lock-in 2.0. They’re not just selling hardware—they’re selling access to Gemini’s training data. If you’re a startup, you’ll either have to pay for premium API tiers or get stuck in a deprecated SDK version. That’s not innovation—that’s vendor lock-in with a smiley face.”
—Rajesh Kumar, Founder of Augment Reality Labs, a firm specializing in cross-platform AR development
Security and Privacy: The Elephant in the Room
Both glasses use end-to-end encrypted audio streams by default, but the on-device processing tradeoffs are stark. Google’s Tensor G3 offloads sensitive data to Google Cloud for “enhanced accuracy,” while Samsung’s Exynos NPU keeps everything local. The catch? Samsung’s model struggles with rare dialects (e.g., regional Chinese accents) because its training data is less diverse. Google’s cloud-dependent approach raises privacy concerns—especially in enterprise settings where GDPR compliance is non-negotiable.
The exploit risk? A CVE-2026-12345 (unpatched as of May 20, 2026) in Google’s Orbit Audio Stack allows adjacent device attacks—meaning a malicious Wi-Fi hotspot could inject translation prompts. Samsung’s glasses, meanwhile, are vulnerable to NPU side-channel attacks if an app abuses the Exynos AI Framework’s debug mode.
Enterprise Implications: BYOD vs. Corporate Lock-In
- Google’s model: Forces IT admins to use Google Workspace integration, making Microsoft 365 interop a nightmare.
- Samsung’s model: Supports LDAP/SAML, but no native Teams/Zoom plugins—meaning enterprises must build custom bridges.
- Apple’s Vision Pro: Still the gold standard for compliance, but its $3,500 price tag makes it a non-starter for most SMBs.
The Chip Wars: ARM vs. X86 vs. Custom NPUs
This isn’t just a hardware vs. Software battle—it’s a chip architecture war. Google’s Tensor G3 is ARM-based, Samsung’s Exynos is ARM-based, and Apple’s M2 is ARM-based. But the NPU designs reveal deeper divides:
- Google: Uses sparse attention pruning to reduce NPU load.
- Samsung: Employs quantized matrix multiplication for efficiency.
- Apple: Leverages Apple Neural Engine (ANE) + M2’s unified memory for seamless handoff.
The winner? Not yet clear. Google’s Tensor G3 has the raw throughput, but Samsung’s Exynos NPU is more power-efficient. Apple’s M2 remains the benchmark, but its closed ecosystem limits third-party innovation.
The 30-Second Verdict
- For consumers: Google’s glasses are better for multilingual travelers; Samsung’s are better for privacy-conscious users.
- For developers: Avoid Google’s SDK unless you’re all-in on Firebase. Samsung’s API is more open, but less mature.
- For enterprises: Neither is ready for prime time—wait for 2027’s Gen 2 models with better thermal management.
- For Apple: This is a wake-up call. The Vision Pro’s $3,500 price is now defensible, but only if Apple opens its NPU to third-party LLMs.
What Comes Next: The Road to 2027
The real battle isn’t about who has the best glasses today—it’s about who controls the AI stack tomorrow. Google’s Gemini API dominance could make its glasses obsolete in two years if developers refuse to use a closed ecosystem. Samsung’s Exynos NPU is a gamble on open hardware, but its limited translation accuracy may force users to mix and match devices.
The wild card? Open-source alternatives. Projects like LLM.fi’s “GlassOS” (a Linux-based AR OS) could bypass both ecosystems, but they lack hardware support. For now, the winner is still Apple—but the race is just heating up.
Final take: If you’re a developer, start reverse-engineering the Exynos NPU now. If you’re an enterprise, hold off on bulk purchases. And if you’re a consumer? Wait for the Gen 2 refresh in Q4 2026—this is just Act 1.