OpenAI bets big on audio as Silicon Valley declares war on screens
The form factors may differ, but the thesis is the same: audio is the interface of the future. Every space -- your home, your car, even your face -- is becoming an interface.
OpenAI is betting big on audio AI, and it’s not just about making ChatGPT sound better. According to new reporting from The Information, the company has unified several engineering, product, and research teams over the past two months to overhaul its audio models, all in preparation for an audio-first personal device expected to launch in about a year.
The move reflects where the entire tech industry is headed — toward a future where screens become background noise and audio takes center stage. Smart speakers have already made voice assistants a fixture in more than a third of U.S. homes. Meta just rolled out a feature for its Ray-Ban smart glasses that uses a five-microphone array to help you hear conversations in noisy rooms — essentially turning your face into a directional listening device. Google, meanwhile, began experimenting in June with “Audio Overviews” that transform search results into conversational summaries, and Tesla is integrating xAI’s chatbot Grok into its vehicles to create a conversational voice assistant that handles everything from navigation to climate control through natural dialogue.
It’s not just the tech giants placing this bet. A motley crew of startups has emerged with the same conviction, albeit with varying degrees of success. The makers of the Humane AI Pin burned through hundreds of millions before their screenless wearable became a cautionary tale. The Friend AI pendant, a necklace that claims it will record your life and offer companionship, has sparked privacy concerns and existential dread in equal measure. And now at least two companies, including Sandbar and one helmed by Pebble founder Eric Migicovsky, are building AI rings expected to debut in 2026, allowing wearers to literally talk to the hand.
The form factors may differ, but the thesis is the same: audio is the interface of the future. Every space — your home, your car, even your face — is becoming a control surface.