AI audio editing is reshaping how creators work, and Andrew Mason—CEO of Descript and founder of Groupon—is at the forefront of that shift. In this episode of Innovating Music, Mason explains how Descript’s text-based editing model removes technical friction, speeds up collaboration, and opens new creative possibilities for podcasters, video producers, and teams working remotely. From live transcription to AI-generated voice corrections, Descript’s features are designed to make editing feel as natural as writing.
Editing Audio Like Text
Descript transforms media editing by letting users cut, copy, and paste audio and video as easily as editing a document. Creators can import multi-track recordings, make changes directly in the transcript, and see those edits reflected in the underlying media. This approach keeps editors in their “creative brain” rather than toggling between waveform manipulation and narrative decision-making.
Collaboration in the Cloud
Built for remote production, Descript is fully cloud-first—offering real-time multi-track transcription, speaker labeling, and version history. Multiple users can edit the same project simultaneously, making it a natural fit for distributed teams, podcast networks, and production studios working across geographies.
AI Features That Enhance, Not Replace
One of Descript’s most talked-about tools is Overdub, which learns a user’s voice from a short sample and can insert new words into recordings seamlessly. Mason emphasizes that the goal isn’t to replace creativity, but to automate repetitive or technical tasks so creators can focus on storytelling and content quality.
Whether you’re a podcaster tightening your episodes, a video producer collaborating across continents, or a creative exploring new tools, Descript offers a faster, more flexible way to work. Listen to the full conversation with Andrew Mason to hear how AI is changing the editing game.
Highlights
“We’re not automating creativity; we’re removing the technical complexity of working with audio and video.”
“With Descript, editing media is as simple as editing a text document.”
“Overdub lets you type in new words, and it generates them in your own voice.”
“Cloud-first design means full collaboration, version history, and remote co-editing.”
Hosted on Acast. See acast.com/privacy for more information.