
Perplexity Launches Multimodal AI Assistant for Android, Challenging Google and OpenAI

Perplexity’s Android assistant uses multimodal AI to help with daily tasks, letting users manage schedules, book rides, and more.


Perplexity, the AI-driven search engine startup, has officially launched Perplexity Assistant for Android, integrated into its main app.

This move positions the company to directly compete with tech giants like Google and OpenAI in the fast-evolving AI assistant market.

Described by Perplexity CEO Aravind Srinivas as a shift “from an answer engine to a natively integrated assistant,” the assistant brings multimodal capabilities to Android users, offering tools for task automation, app integration, and contextual understanding.

Available in 15 languages, including English, Spanish, and Hindi, the assistant is free to use via the Perplexity mobile app.

Multimodal Features Focused on Utility

Perplexity Assistant distinguishes itself through its multimodal approach, allowing users to interact with the AI via text, voice commands, and visual input from their smartphone camera.

For instance, the assistant can analyze objects in front of the camera, identify them, and provide relevant information. It can also interpret text or visuals on a user’s screen and respond to queries based on that context.

One example shared by Perplexity demonstrates the assistant’s ability to streamline daily tasks. A user can ask it to recommend nearby restaurants, evaluate options, and then reserve a table via OpenTable.

This context-aware functionality allows users to conduct multi-step interactions without needing to repeat information or switch between apps. However, not all features are equally polished.

For example, email summarization and calendar integration currently rely on workaround solutions involving notification access rather than direct app connections.

Perplexity spokesperson Sara Platnick acknowledged the limitations, stating, “We’re continuing to add support for more apps and more functionality though, so this is just the starting point.”

Competing with Tech Giants

The launch of Perplexity Assistant comes as competition in the AI assistant market intensifies. Google’s Gemini assistant, which debuted on the Pixel 9 series and is now available as a standalone app for iOS, offers advanced cross-app multitasking and voice-driven interactions with files.

Similarly, OpenAI’s Advanced Voice Mode delivers low-latency responses, live video support, and customizable intonations, positioning itself as a leader in conversational AI.


Unlike Google’s Gemini, which is deeply integrated into Android and iOS ecosystems, Perplexity Assistant relies on users setting it as the default assistant on their devices. Despite this hurdle, Srinivas believes the assistant’s multimodal features and ease of use make it a compelling alternative.

Perplexity has shared instructions for setting up the new assistant on Android devices, a process that requires granting the app access to gesture control.

Broader Strategy: Enterprise AI and Sonar API

The release of Perplexity Assistant follows closely on the heels of the company’s Sonar API launch, which underscores its ambitions in the enterprise AI space.

Sonar offers customizable, real-time AI search tools with citation-backed answers, making it a valuable resource for developers and enterprises seeking reliable and adaptable search solutions.

The Sonar API is available in two tiers: the base Sonar model, which emphasizes speed and affordability, and the Sonar Pro model, which offers advanced capabilities such as a larger context window and multi-step query handling.


These features cater to diverse industry needs, from healthcare to e-commerce, and highlight Perplexity’s commitment to addressing both consumer and enterprise markets.

In a blog post announcing the API, the company stated, “To optimize for factuality and authority, APIs require a real-time connection to the Internet, with answers informed by trusted sources.” This focus on real-time data and transparency reflects a broader trend in the AI industry, where accuracy and ethical considerations are increasingly prioritized.

Legal Challenges and Content Controversies

Despite its growing influence, Perplexity has faced legal challenges from major publishers, including Dow Jones and The New York Times, who accuse the company of unauthorized content usage.

These lawsuits allege that Perplexity’s AI system extracts and summarizes articles without proper permission, a practice critics have labeled as a “content kleptocracy.”

Perplexity has responded by introducing a revenue-sharing program for publishers, though many argue that this measure falls short of addressing the ethical concerns.

The company maintains that it respects intellectual property, but the ongoing legal disputes underscore the complexities of balancing innovation with compliance in the rapidly evolving AI space.

The Rise of Agentic AI in a Crowded Market

Perplexity’s new assistant represents a growing trend toward agentic AI systems capable of performing complex, multi-step tasks autonomously. Just today, OpenAI introduced Operator, a browser-based AI agent capable of performing a variety of online tasks, such as booking reservations, filling out forms, and managing grocery orders.

These systems, often derived from large language models (LLMs), are designed to interpret data and execute actions, from scheduling appointments to managing logistics.

Competitors like Anthropic, Nvidia, and Microsoft have also entered this space with their own agentic approaches.

What sets Perplexity apart is its emphasis on live search in combination with user-centric design and multimodal functionality. By offering tools that integrate voice, text, and visual inputs, the assistant aims to provide a seamless user experience tailored to a wide range of tasks. However, ChatGPT already offers a similar feature set, along with more powerful models, better voice interaction, and lower latency.

Challenges and Opportunities

The release of Perplexity Assistant is a bold step in the company’s effort to redefine how users interact with AI on mobile devices. However, its success will depend on several factors, including its ability to address current feature gaps, expand app integration, and navigate ongoing legal challenges.

At the same time, the assistant’s entry into the Android ecosystem signals a broader shift in the AI industry, where smaller players are challenging established tech giants by focusing on innovation and adaptability.

Whether Perplexity can maintain its momentum in this competitive landscape remains to be seen, but its latest offering highlights the potential for AI to transform everyday tasks and workflows.

Markus Kasanmascheff
Markus has been covering the tech industry for more than 15 years. He holds a Master’s degree in International Economics and is the founder and managing editor of Winbuzzer.com.
