OpenAI has unveiled an update to its ChatGPT desktop app for macOS, introducing voice control and enhanced coding features. Mac users can now experience a more interactive and hands-free approach to using ChatGPT, further blurring the lines between human-computer interaction.
The update is part of the “12 Days of OpenAI” series of announcements, where the company releases new features and updates for its major products.
Advanced Voice Mode on macOS
The updated ChatGPT Mac app features Advanced Voice Mode, allowing users to speak their requests to ChatGPT instead of typing. This means users can ask ChatGPT to draft emails, brainstorm ideas, or debug code, all through voice commands.
OpenAI CPO Kevin Weil demonstrated this new capability, showing ChatGPT floating on the Mac screen and responding to voice requests even when the user was engaged in other tasks.
The addition of Advanced Voice Mode provides a more natural and intuitive way to interact with ChatGPT. Users can now speak their requests, making the experience more akin to interacting with a human assistant. This is particularly useful for tasks that require multitasking or when hands-free operation is preferred.
ChatGPT as a Coding Companion
A standout feature is improved ability of ChatGPT to interact directly with code within popular editors like Warp and Xcode. Instead of copying and pasting code into the ChatGPT interface, users can simply ask ChatGPT to analyze the code displayed on their screen. The AI can then offer suggestions, explanations, or even write new code snippets directly into the editor.
This improves the ChatGPT integration with coding environments such as VS Code, Xcode, Terminal, and iTerm2 that OpenAI added in November.
Beyond coding, ChatGPT can also integrate with note-taking apps such as Apple Notes, Quip, and Notion. For example, a user planning a trip in Notion could ask ChatGPT to help flesh out their ideas, including citing relevant sources from the internet.
This move towards app integration is a step towards OpenAI’s vision of “agentic” AI, where ChatGPT performs tasks on the user’s behalf, blending the capabilities of multiple apps into a single, convenient package. “This is what OpenAI calls the “agentic” approach to AI, where ChatGPT performs tasks on your behalf,” explained Weil.
The enhanced coding features in the updated Mac app position ChatGPT as a valuable tool for developers. By directly interacting with code within popular editors, ChatGPT can help streamline workflows and provide real-time assistance.
“Say you’re using a code editor like Warp and staring at a long, confusing list of code. Instead of scrolling endlessly, you can ask ChatGPT to analyze what’s on the screen and it will offer suggestions, explanations, or even write new code snippets,” explained Weil during the demonstration.
App Integration: Towards Agentic AI
The integration with note-taking apps like Apple Notes, Quip, and Notion further expands ChatGPT’s capabilities. It allows users to leverage the AI’s assistance for a wider range of tasks, from planning and organizing to writing and research.
“If you’re planning a trip and using Notion to jot down ideas, you can ask ChatGPT to help flesh them out, including citing sources from the internet,” added Weil.
While these new features are currently exclusive to Mac users, OpenAI has announced that an update for the Windows app is in development. No specific timeframe for the Windows update has been given.
Previous Announcements During the “12 Days of OpenAI”
On December 19, OpenAI launched a toll-free number and WhatsApp access for ChatGPT, making the AI chatbot more accessible.
December 18 brought API access for the full version OpenAI’s o1 model, enhancements to the Realtime API for voice interactions, and a new preference fine-tuning method.
On December 16, OpenAI made its ChatGPT live web search feature available to all users, allowing anyone to retrieve up-to-date information directly from the web.
December 14 brought new customization options to ChatGPT, letting users streamline tasks and manage projects effectively. Projects allows users to group chats, files, and custom instructions into dedicated folders, creating an organized workspace for managing tasks and workflows.
As a huge improvement to its advanced voice mode for ChatGPT, OpenAI on December 12 added vision capabilities, enabling users to share live video and screens for real-time analysis and assistance.
On December 11, OpenAI fully released Canvas, a collaborative editing workspace that offers advanced tools for both text and code refinement. Initially launched in beta in October 2024, Canvas replaces ChatGPT’s standard interface with a split-screen design, allowing users to work on text or code while engaging in conversational exchanges with the AI.
The addition of Python execution is a standout feature of Canvas, enabling developers to write, test, and debug scripts directly within the platform. OpenAI demonstrated its utility during a live event by using Python to generate and refine data visualizations. OpenAI described the feature as “reducing friction between idea generation and implementation”.
On December 9, OpenAI officially launched Sora, its advanced AI tool for generating videos from text prompts, signaling a new era for creative AI. Integrated into paid ChatGPT accounts, Sora allows users to animate still images, extend existing videos, and merge scenes into cohesive narratives.
Released on December 7 was Reinforcement Fine-Tuning as a new framework designed to enable the customization of AI models for industry-specific applications. It is OpenAI’s latest approach to improving AI models by training them with developer-supplied datasets and grading systems. Unlike traditional supervised learning, which focuses on replicating desired outputs
On December 5, OpenAI unveiled ChatGPT Pro, a new premium subscription tier priced at $200 per month, aimed at professionals and enterprises seeking advanced AI capabilities for high-demand workflows.