Nvidia is broadening the horizons for its G-Assist AI assistant, introducing plugin support on April 23rd, 2025, that allows the on-device tool to interact with a range of third-party applications and data sources.
Initially launched in late March with a focus on optimizing PC performance and game settings for GeForce RTX users, G-Assist can now be extended via add-ons to control Spotify, check Twitch streams, query Google Gemini, and more, all processed locally on the user’s machine.
The assistant operates using an 8-billion parameter, Llama-based small language model (SLM) that runs directly on Nvidia RTX 30-, 40-, or 50-series desktop GPUs (requiring at least 12GB of VRAM) via the TensorRT-LLM framework.
This on-device processing, which occupies roughly 10GB of disk space, allows G-Assist to function offline and respond quickly by analyzing screen content through video capture and optical character recognition (OCR), without sending data to the cloud. Nvidia does note, however, that invoking G-Assist during GPU-intensive tasks like gaming might cause a temporary dip in performance while the GPU allocates resources for AI processing, though performance returns to normal afterward.
Extending Functionality with Plugins
To facilitate this expansion, Nvidia has released a ChatGPT-based G-Assist plugin builder and opened a GitHub repository containing sample plugins, documentation, and developer resources.
The toolkit enables enthusiasts and developers to create custom functionalities. The architecture involves a manifest.json
file defining the plugin’s capabilities (including functions, parameters, descriptions, and tags) and an executable (which can leverage Python or C++ bindings provided by Nvidia). An optional config.json
file can store settings or credentials. According to Nvidia’s announcement, developers can “Extend G-Assist’s capabilities with custom functionality tailored to specific workflows, games and tools.”
Nvidia showcased several sample plugins available on its GitHub repository:
- Spotify: Control music playback and volume adjustments hands-free.
- Google Gemini: Allows G-Assist to invoke the cloud-based Gemini AI for more complex queries, requiring a free Google AI Studio API key.
- Twitch: Check if specific streamers are live using voice commands like, “Hey Twitch, is [streamer] live?”
- Peripheral Controls: Manage RGB lighting and fan speeds for supported hardware from Logitech G, Corsair, MSI, and Nanoleaf.
- Data Lookups: Example plugins demonstrate fetching real-time stock prices and weather information.
Users can interact with plugins either by letting G-Assist’s AI determine the correct function based on a natural language query (zero-shot function calling prefixed with /fc
) or by directly invoking a plugin by name (e.g., /spotify play my playlist
or “Hey spotify, play my playlist”). The GitHub README explains that direct invocation is generally faster when the user knows which specific plugin function they need.
Local AI and Community Development
The move positions G-Assist, accessed via the Nvidia App overlay using the Alt+G shortcut, as a platform for local AI interaction rather than just a gaming utility. Nvidia emphasizes the ability to “Interact with G-Assist directly from the NVIDIA overlay, without tabbing out of an application or workflow.” The project, licensed under the Apache License 2.0, explicitly encourages community development, with Nvidia inviting plugin submissions via pull requests on GitHub for potential inclusion and sharing.
The system is designed for integration with AI agentic frameworks like Langflow, CrewAI, and Flowise, potentially allowing G-Assist to serve as a component within larger, possibly low-code, AI applications and workflows. G-Assist itself is built using Nvidia’s Avatar Cloud Engine (ACE) technology, the same suite developers use for creating AI-powered NPCs.
This local-first approach differentiates G-Assist from cloud-dependent assistants like Microsoft’s Xbox Copilot, which launched for Insiders in April and relies on remote processing via a mobile app interface for console and mobile users. G-Assist’s focus remains on the PC ecosystem, building upon its initial capabilities for performance analysis and system tuning, which were demonstrated during Computex 2024 and evolved from a concept first floated as an Nvidia April Fools’ joke back in 2017. Support for laptops with compatible GPUs is planned for a future update.