Artificial Intelligence – Overview, Benchmarks, Latest News

AI Model Architectures

AI models are not a monolithic technology; they span multiple architectures, each designed for specific types of tasks. Some excel at recognizing patterns, while others specialize in generating content or making autonomous decisions.

| Model | Best Use Cases | Advantages | Limitations |
|---|---|---|---|
| Feedforward Networks | Fraud detection, risk assessment, structured data classification | Simple, fast, efficient for small-scale tasks | Cannot handle sequential or complex unstructured data |
| Recurrent Neural Networks (RNNs) | Speech processing, time-series forecasting | Captures sequential dependencies | Suffers from vanishing gradient problem, inefficient for long sequences |
| Transformers (LLMs) | Text generation, translation, multimodal AI | High scalability, state-of-the-art performance | Requires vast computational power, black-box decision-making |
| GANs | AI-generated images, deepfakes, artistic design | Produces highly realistic outputs | Training instability, prone to mode collapse |
| Diffusion Models | AI art, synthetic image generation | More stable than GANs, superior output quality | Computationally expensive, slow inference speed |
| Reinforcement Learning | Robotics, autonomous vehicles, game AI | Adapts to dynamic environments, learns from experience | High training cost, lack of generalization outside of trained tasks |

AI Model Benchmarks – LLM Leaderboard

The transformer architecture redefined AI by enabling parallel sequence processing, eliminating the bottlenecks of RNNs. Instead of analyzing sequences step-by-step, transformers use self-attention mechanisms to determine relationships between all elements of an input at once.
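
To make the "all elements at once" idea concrete, here is a minimal sketch of scaled dot-product self-attention in plain Python. It is an illustration under simplifying assumptions, not how any production model is implemented: the learned query, key, and value projections of a real transformer are omitted, so each token vector serves as its own query, key, and value, and the function names `softmax` and `self_attention` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Scaled dot-product self-attention with identity projections.

    `tokens` is a list of equal-length vectors. Assumption: no learned
    query/key/value weights, so every vector plays all three roles.
    """
    dim = len(tokens[0])
    outputs = []
    for query in tokens:
        # Score this position against every position in the sequence.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in tokens
        ]
        weights = softmax(scores)
        # Blend all value vectors using the attention weights.
        mixed = [
            sum(w * value[i] for w, value in zip(weights, tokens))
            for i in range(dim)
        ]
        outputs.append(mixed)
    return outputs


# Hypothetical three-token sequence with two-dimensional embeddings.
sequence = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = self_attention(sequence)
for vector in attended:
    print([round(x, 3) for x in vector])
```

Each output vector depends on every input position, and no position has to wait for an earlier one to be processed, which is what lets the computation run in parallel, unlike the step-by-step updates of an RNN.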

This breakthrough led to the development of large language models (LLMs), such as GPT-4, Claude, and Google Gemini 1.5, which power today’s most advanced AI applications.

Last updated: April 7, 2025

Benchmark stats come from the model providers, if available. For models with optional advanced reasoning, we provide the highest benchmark score achieved.
| Organization | Model | Context | Parameters (B) | Input $/M | Output $/M | License | GPQA | MMLU | MMLU Pro | DROP | HumanEval | AIME'24 | SimpleBench |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| meta | Llama 4 Maverick | 1,000,000 | 288 | $0.19-$0.49 | - | Open | 69.80% | 84.60% | 80.50% | - | - | - | 27.70% |
| meta | Llama 4 Scout | 10,000,000 | 17 | - | - | Open | 57.20% | - | 74.30% | - | - | - | - |
| meta | Llama 4 Behemoth | 10,000,000 | 288 | - | - | Open | 73.70% | 85.80% | 82.20% | - | - | - | - |
| google | Gemini 2.5 Pro (Exp) | 1,000,000 | - | $2.50 | $15.00 | Proprietary | 84.00% | 89.8% | - | - | - | 92.00% | 51.60% |
| openai | o3 | 128,000 | - | - | - | Proprietary | 87.70% | - | - | - | - | - | |
| anthropic | Claude 3.7 Sonnet | 200,000 | - | $3.00 | $15.00 | Proprietary | 84.80% | 86.10% | - | - | - | 80.00% | 46.4% |
| xai | Grok-3 | 128,000 | - | - | - | Proprietary | 84.60% | - | 79.90% | - | - | 93.30% | |
| xai | Grok-3 Mini | 128,000 | - | - | - | Proprietary | 84.60% | - | 78.90% | - | - | 90.80% | |
| openai | o3-mini | 200,000 | - | $1.10 | $4.40 | Proprietary | 79.70% | 86.90% | - | - | - | 86.50% | 22.8% |
| openai | o1-pro | 128,000 | - | - | - | Proprietary | 79.00% | - | - | - | - | 86.00% | |
| openai | o1 | 200,000 | - | $15.00 | $60.00 | Proprietary | 78.00% | 91.80% | - | - | 88.10% | 83.30% | 40.1% |
| google | Gemini 2.0 Flash Thinking | 1,000,000 | - | - | - | Proprietary | 74.20% | - | - | - | - | 73.30% | 30.7% |
| openai | o1-preview | 128,000 | - | $15.00 | $60.00 | Proprietary | 73.30% | 90.80% | - | - | - | 44.60% | 41.7% |
| deepseek | DeepSeek-R1 | 131,072 | 671 | $0.55 | $2.19 | Open | 71.50% | 90.80% | 84.00% | 92.20% | - | 79.80% | 30.9% |
| openai | GPT-4.5 | 128,000 | - | - | - | Proprietary | 71.4% | 90.0% | - | - | 88.0% | 36.7% | 34.5% |
| anthropic | Claude 3.5 Sonnet | 200,000 | - | $3.00 | $15.00 | Proprietary | 67.20% | 90.40% | 77.60% | 87.10% | 93.70% | 16.00% | 41.4% |
| qwen | QwQ-32B-Preview | 32,768 | 32.5 | $0.15 | $0.20 | Open | 65.20% | - | 70.97% | - | - | 50.00% | |
| google | Gemini 2.0 Flash | 1,048,576 | - | - | - | Proprietary | 62.10% | - | 76.40% | - | - | 35.5% | 18.9% |
| openai | o1-mini | 128,000 | - | $3.00 | $12.00 | Proprietary | 60.00% | 85.20% | 80.30% | - | 92.40% | 70.00% | 18.1% |
| deepseek | DeepSeek-V3 | 131,072 | 671 | $0.27 | $1.10 | Open | 59.10% | 88.50% | 75.90% | 91.60% | - | 39.2% | 18.9% |
| google | Gemini 1.5 Pro | 2,097,152 | - | $2.50 | $10.00 | Proprietary | 59.10% | 85.90% | 75.80% | 74.90% | 84.10% | 19.3% | 27.1% |
| microsoft | Phi-4 | 16,000 | 14.7 | $0.07 | $0.14 | Open | 56.10% | 84.80% | 70.40% | 75.50% | 82.60% | | |
| xai | Grok-2 | 128,000 | - | $2.00 | $10.00 | Proprietary | 56.00% | 87.50% | 75.50% | - | 88.40% | | 22.7% |
| openai | GPT-4o | 128,000 | - | $2.50 | $10.00 | Proprietary | 53.60% | 88.00% | 74.70% | - | - | | 17.8% |
| google | Gemini 1.5 Flash | 1,048,576 | - | $0.15 | $0.60 | Proprietary | 51.00% | 78.90% | 67.30% | - | 74.30% | | |
| xai | Grok-2 mini | 128,000 | - | - | - | Proprietary | 51.00% | 86.20% | 72.00% | - | 85.70% | | |
| meta | Llama 3.1 405B Instruct | 128,000 | 405 | $0.90 | $0.90 | Open | 50.70% | 87.30% | 73.30% | 84.80% | 89.00% | | 23.0% |
| meta | Llama 3.3 70B Instruct | 128,000 | 70 | $0.20 | $0.20 | Open | 50.50% | 86.00% | 68.90% | - | 88.40% | | 19.9% |
| anthropic | Claude 3 Opus | 200,000 | - | $15.00 | $75.00 | Proprietary | 50.40% | 86.80% | 68.50% | 83.10% | 84.90% | | 23.5% |
| qwen | Qwen2.5 32B Instruct | 131,072 | 32.5 | - | - | Open | 49.50% | 83.30% | 69.00% | - | 88.40% | | |
| qwen | Qwen2.5 72B Instruct | 131,072 | 72.7 | $0.35 | $0.40 | Open | 49.00% | - | 71.10% | - | 86.60% | 23.30% | |
| openai | GPT-4 Turbo | 128,000 | - | $10.00 | $30.00 | Proprietary | 48.00% | 86.50% | - | 86.00% | 87.10% | | |
| amazon | Nova Pro | 300,000 | - | $0.80 | $3.20 | Proprietary | 46.90% | 85.90% | - | 85.40% | 89.00% | | |
| meta | Llama 3.2 90B Instruct | 128,000 | 90 | $0.35 | $0.40 | Open | 46.70% | 86.00% | - | - | - | | |
| qwen | Qwen2.5 14B Instruct | 131,072 | 14.7 | - | - | Open | 45.50% | 79.70% | 63.70% | - | 83.50% | | |
| mistral | Mistral Small 3 | 32,000 | 24 | $0.07 | $0.14 | Open | 45.30% | - | 66.30% | - | 84.80% | | |
| qwen | Qwen2 72B Instruct | 131,072 | 72 | - | - | Open | 42.40% | 82.30% | 64.40% | - | 86.00% | | |
| amazon | Nova Lite | 300,000 | - | $0.06 | $0.24 | Proprietary | 42.00% | 80.50% | - | 80.20% | 85.40% | | |
| meta | Llama 3.1 70B Instruct | 128,000 | 70 | $0.20 | $0.20 | Open | 41.70% | 83.60% | 66.40% | 79.60% | 80.50% | | |
| anthropic | Claude 3.5 Haiku | 200,000 | - | $0.10 | $0.50 | Proprietary | 41.60% | - | 65.00% | 83.10% | 88.10% | | |
| anthropic | Claude 3 Sonnet | 200,000 | - | $3.00 | $15.00 | Proprietary | 40.40% | 79.00% | 56.80% | 78.90% | 73.00% | | |
| openai | GPT-4o mini | 128,000 | - | $0.15 | $0.60 | Proprietary | 40.20% | 82.00% | - | 79.70% | 87.20% | | 10.7% |
| amazon | Nova Micro | 128,000 | - | $0.04 | $0.14 | Proprietary | 40.00% | 77.60% | - | 79.30% | 81.10% | | |
| google | Gemini 1.5 Flash 8B | 1,048,576 | 8 | $0.07 | $0.30 | Proprietary | 38.40% | - | 58.70% | - | - | | |
| ai21 | Jamba 1.5 Large | 256,000 | 398 | $2.00 | $8.00 | Open | 36.90% | 81.20% | 53.50% | - | - | | |
| microsoft | Phi-3.5-MoE-instruct | 128,000 | 60 | - | - | Open | 36.80% | 78.90% | 54.30% | - | 70.70% | | |
| qwen | Qwen2.5 7B Instruct | 131,072 | 7.6 | $0.30 | $0.30 | Open | 36.40% | - | 56.30% | - | 84.80% | | |
| xai | Grok-1.5 | 128,000 | - | - | - | Proprietary | 35.90% | 81.30% | 51.00% | - | 74.10% | | |
| openai | GPT-4 | 32,768 | - | $30.00 | $60.00 | Proprietary | 35.70% | 86.40% | - | 80.90% | 67.00% | | 25.1% |
| anthropic | Claude 3 Haiku | 200,000 | - | $0.25 | $1.25 | Proprietary | 33.30% | 75.20% | - | 78.40% | 75.90% | | |
| meta | Llama 3.2 11B Instruct | 128,000 | 10.6 | $0.06 | $0.06 | Open | 32.80% | 73.00% | - | - | - | | |
| meta | Llama 3.2 3B Instruct | 128,000 | 3.2 | $0.01 | $0.02 | Open | 32.80% | 63.40% | - | - | - | | |
| ai21 | Jamba 1.5 Mini | 256,144 | 52 | $0.20 | $0.40 | Open | 32.30% | 69.70% | 42.50% | - | - | | |
| openai | GPT-3.5 Turbo | 16,385 | - | $0.50 | $1.50 | Proprietary | 30.80% | 69.80% | - | 70.20% | 68.00% | | |
| meta | Llama 3.1 8B Instruct | 131,072 | 8 | $0.03 | $0.03 | Open | 30.40% | 69.40% | 48.30% | 59.50% | 72.60% | | |
| microsoft | Phi-3.5-mini-instruct | 128,000 | 3.8 | $0.10 | $0.10 | Open | 30.40% | 69.00% | 47.40% | - | 62.80% | | |
| google | Gemini 1.0 Pro | 32,760 | - | $0.50 | $1.50 | Proprietary | 27.90% | 71.80% | - | - | - | | |
| qwen | Qwen2 7B Instruct | 131,072 | 7.6 | - | - | Open | 25.30% | 70.50% | 44.10% | - | - | | |
| mistral | Codestral-22B | 32,768 | 22.2 | $0.20 | $0.60 | Open | - | - | - | - | 81.10% | | |
| cohere | Command R+ | 128,000 | 104 | $0.25 | $1.00 | Open | - | 75.70% | - | - | - | | 17.4% |
| deepseek | DeepSeek-V2.5 | 8,192 | 236 | $0.14 | $0.28 | Open | - | 80.40% | - | - | 89.00% | | |
| google | Gemma 2 27B | 8,192 | 27.2 | - | - | Open | - | 75.20% | - | - | 51.80% | | |
| google | Gemma 2 9B | 8,192 | 9.2 | - | - | Open | - | 71.30% | - | - | 40.20% | | |
| xai | Grok-1.5V | 128,000 | - | - | - | Proprietary | - | - | - | - | - | | |
| moonshotai | Kimi-k1.5 | 128,000 | - | - | - | Proprietary | - | 87.40% | - | - | - | | |
| nvidia | Llama 3.1 Nemotron 70B Instruct | 128,000 | 70 | - | - | Open | - | 80.20% | - | - | - | | |
| mistral | Ministral 8B Instruct | 128,000 | 8 | $0.10 | $0.10 | Open | - | 65.00% | - | - | 34.80% | | |
| mistral | Mistral Large 2 | 128,000 | 123 | $2.00 | $6.00 | Open | - | 84.00% | - | - | 92.00% | | 22.5% |
| mistral | Mistral NeMo Instruct | 128,000 | 12 | $0.15 | $0.15 | Open | - | 68.00% | - | - | - | | |
| mistral | Mistral Small | 32,768 | 22 | $0.20 | $0.60 | Open | - | - | - | - | - | | |
| microsoft | Phi-3.5-vision-instruct | 128,000 | 4.2 | - | - | Open | - | - | - | - | - | | |
| mistral | Pixtral-12B | 128,000 | 12.4 | $0.15 | $0.15 | Open | - | 69.20% | - | - | 72.00% | | |
| mistral | Pixtral Large | 128,000 | 124 | $2.00 | $6.00 | Open | - | - | - | - | - | | |
| qwen | QvQ-72B-Preview | 32,768 | 73.4 | - | - | Open | - | - | - | - | - | | |
| qwen | Qwen2.5-Coder 32B Instruct | 128,000 | 32 | $0.09 | $0.09 | Open | - | 75.10% | 50.40% | - | 92.70% | | |
| qwen | Qwen2.5-Coder 7B Instruct | 128,000 | 7 | - | - | Open | - | 67.60% | 40.10% | - | 88.40% | | |
| qwen | Qwen2-VL-72B-Instruct | 32,768 | 73.4 | - | - | Open | - | - | - | - | - | | |
| cohere | Command A | 256,000 | 111 | $2.50 | $10.00 | Open | - | 85.00% | - | - | - | - | - |
| baidu | ERNIE 4.5 | - | - | - | - | - | 75.00% | - | 79.00% | 87.00% | 85.00% | | |
| google | Gemma 3 1B | 128,000 | 1 | - | - | Open | 19.20% | 29.90% | 14.70% | - | 32.00% | - | - |
| google | Gemma 3 4B | 128,000 | 4 | - | - | Open | 30.80% | 46.90% | 43.60% | - | - | - | - |
| google | Gemma 3 12B | 128,000 | 12 | - | - | Open | 40.90% | 65.20% | 60.60% | - | - | - | - |
| google | Gemma 3 27B | 128,000 | 27 | - | - | Open | 42.40% | 72.1% | 67.50% | - | 89.00% | - | - |
| qwen | Qwen2.5 Max | 32,768 | - | | | | 59.00% | - | 76.00% | - | 93.00% | 23.00% | - |
| qwen | QwQ 32B | 131,000 | 32.8 | | | Open | 59.00% | - | 76.00% | 98.00% | 78.00% | - | |
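
As a rough illustration of how the Input $/M and Output $/M columns translate into a per-request cost, the sketch below assumes those columns mean US dollars per million tokens. The function name `request_cost` and the token counts are hypothetical; the prices are copied from the Claude 3.7 Sonnet and DeepSeek-R1 rows of the table.

```python
def request_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return the dollar cost of one request.

    Assumption: prices are quoted in US dollars per one million tokens.
    """
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# (input $/M, output $/M) pairs taken from the leaderboard rows.
prices = {
    "Claude 3.7 Sonnet": (3.00, 15.00),
    "DeepSeek-R1": (0.55, 2.19),
}

# Hypothetical request: 10,000 prompt tokens and 2,000 generated tokens.
for model, (input_price, output_price) in prices.items():
    cost = request_cost(10_000, 2_000, input_price, output_price)
    print(f"{model}: ${cost:.4f}")
```

Because output tokens are priced several times higher than input tokens for most listed models, the generated length often dominates the bill even when the prompt is much longer.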


Microsoft Launches Free Copilot Vision AI for Windows That Sees Your Screen

Microsoft has launched Copilot Vision for Windows 10 and 11, a free AI assistant that can see and analyze your screen across any app. Learn how it works and why its privacy-first design challenges rivals.

Meta’s AI App ‘Discover’ Feed Publicly Exposes Private Chats Without Users Knowing

Meta's AI app is publicly broadcasting sensitive user chats via its 'Discover' feed, creating a major privacy crisis that has sparked legal challenges and widespread backlash from privacy advocates.

Apple’s AI Powered Siri Rebuild Definitely Not Coming Before 2026, Ceding More Ground to Rivals

Apple has delayed its advanced Siri AI features until spring 2026 for a fundamental 'V2' architecture rebuild, a major setback in the competitive innovation race against Google and Amazon.

Meta Invests $14B in Scale AI Deal in a High-Stakes Bid for AI Supremacy, CEO Alexandr Wang Steps Down

Meta is investing $14B for a 49% stake in Scale AI, installing its founder in a new 'superintelligence' lab to combat a talent exodus and secure its AI future amid growing controversy over the partnership's military ties.

VS Code 1.101 Becomes an ‘AI-Editor’ with Full MCP Integration

Microsoft's Visual Studio Code 1.101 update has been released, integrating the Model Context Protocol (MCP) to transform GitHub Copilot into a true coding agent with direct, secure access to developer tools and workflows.

AMD Claims AI Performance Crown from Nvidia with New Instinct MI350X & MI355X Chips

AMD challenges Nvidia's AI dominance with its new Instinct MI350 series, boasting superior performance and more memory in a direct bid to capture a larger share of the booming hardware market.

AI Barbie: OpenAI and Mattel Announce AI-Powered Toy Partnership

Mattel has announced a major partnership with OpenAI to create AI-powered toys for its iconic brands like Barbie, with the first product launching this year. The venture raises significant questions about child safety and data privacy in the age of AI.

Google DeepMind Announces Major AI Breakthrough in Hurricane Forecasting

A new AI model from Google DeepMind is now being integrated into the U.S. National Hurricane Center's workflow, a landmark partnership aiming to deliver faster, more accurate storm warnings while the entire field faces a looming public data crisis.

Google vs. Publishers: German Media Demands €1.3 Billion for AI Overviews

As Google's AI Overviews cause publisher traffic to collapse, a new German study puts a €1.3 billion price tag on the use of journalistic content, escalating a global copyright battle that could reshape the future of news.

Multiverse Computing Nabs $215M: New AI Compression Tech Claims 95% Size Cut, 97% Accuracy

Multiverse Computing raises $215M for its quantum-inspired CompactifAI technology, which shrinks large AI models by up to 95% to slash inference costs and revolutionize AI on edge devices.

OpenAI’s Troubling Paradox: Unsafe AI and Premium-Priced Trust

A former OpenAI researcher has revealed how ChatGPT will prioritize its own survival over user safety in critical situations. The finding comes as the company markets a premium "pro" model on the very reliability its other systems reportedly lack, raising urgent questions about AI safety and corporate accountability.

Meta’s AI Video Editor Arrives Amidst Data Harvesting Concerns

Meta has launched a new generative AI video editing tool across its apps, offering fun, creative presets.

AI Tool Hunts and Patches 15-Year-Old ‘Forever Bug’ on GitHub

An AI tool has automatically found and fixed a 15-year-old GitHub bug, a cybersecurity breakthrough that also reveals how AI models can learn and spread insecure code.

Disney & Universal Sue Midjourney, Escalating AI Copyright War

Disney and Universal have filed a landmark copyright lawsuit against AI image generator Midjourney, accusing the firm of illegally using famous characters like Darth Vader and Elsa to train its models, escalating the global legal battle over AI and intellectual property.

Meta Releases V-JEPA 2 AI World Model to Teach Robots Physics and Common Sense

Meta challenges rivals with V-JEPA 2, its new open-source AI world model. By learning from video, it aims to give robots physical common sense for advanced, real-world tasks.

Trump Signs Executive Order that Ends Punitive Measures Against Domestic Hackers

A new Trump executive order ends sanctions for domestic hackers while gutting CISA, the nation's top cyber defense agency, raising urgent questions about the future of U.S. national security.

AI.GOV: Trump’s Secret Plan to Accelerate Federal Use of AI

Leaked government documents reveal "ai.gov," a secret platform launching July 4 to embed AI from Google, OpenAI, and others into federal agencies, escalating a controversial and ethically fraught technology push by the Trump administration.

Google Beam: HP Reveals $25k ‘Dimension’ Lifelike 3D Meetings

HP unveils the HP Dimension at InfoComm 2025, a $24,999 enterprise system bringing Google's immersive Beam 3D video technology to market to redefine hybrid work with lifelike, glasses-free virtual meetings.

OpenAI Taps Google Cloud in Major AI Infrastructure Shake-up

In a major strategy shift, OpenAI is tapping rival Google for cloud computing power, diversifying its infrastructure beyond Microsoft and signaling that the colossal demand for AI is forcing unprecedented alliances among tech giants.

Amazon’s Alexa+ Reaches 1M Users as Apple Siri Scrambles in AI Race

Amazon's Alexa Plus reaches over one million users, showing tangible progress while Apple's Siri overhaul stalls, revealing divergent strategies in the high-stakes AI assistant race.

Mistral Enters AI Reasoning Race with Magistral Model, But Benchmarks Reveal a Gap

Mistral AI enters the AI reasoning race with its new Magistral models, offering a dual open-source and enterprise strategy that prioritizes speed and ecosystem over winning initial benchmarks against top rivals.

Google Offers Buyouts to Search, Ads Teams in AI Reshuffle

Google is reshaping its workforce by offering voluntary buyouts to core US teams, including Search and Ads, while tightening its return-to-office mandate.

OpenAI Releases new o3-Pro AI Model: A High-Stakes Bet on AI Reliability

OpenAI has launched o3-pro, a new premium reasoning model with a 10x price increase over its standard o3, aiming to deliver higher accuracy and reliability for complex professional tasks amid ongoing concerns about AI safety and control.

Google Unlocks Veo 3 AI Video Creation for Gemini Pro Subscribers and Google Vids

Google has upgraded its AI Pro subscription, now offering members three free 'Veo 3 Fast' video generations every day, a strategic move to make advanced creative AI tools a daily resource and challenge competitors.

Meta Taps Scale AI CEO Alexandr Wang for New ‘Superintelligence’ Lab

Meta is launching a new lab to pursue 'superintelligence' and is in talks for a multi-billion dollar deal with Scale AI, a dramatic move to counter a recent talent exodus, model delays, and intense competition from its Big Tech rivals.

WWDC 2025: Apple’s AI Wins Can’t Hide Siri’s Deeper Crisis

Apple's WWDC 2025 reveals a revamped 'Apple Intelligence' strategy, delivering practical AI tools like Live Translation this fall to counter a year of development setbacks and privacy debates.

Getty Images vs. Stability AI: Landmark UK Copyright Lawsuit Begins

The landmark copyright lawsuit between Getty Images and Stability AI begins in London, in a case that will test the legality of training AI models on copyrighted data and could reshape intellectual property law for the entire tech industry.

Siri’s Stalled Overhaul Exposes Cracks in Apple’s AI Strategy

Apple confronts a crisis in its AI strategy as the much-hyped overhaul of Siri stalls from technical failures, leaving it trailing competitors like Google and OpenAI and facing low expectations for its WWDC 2025 keynote.

Why Meta’s Potential $10B Scale AI Deal is About More Than Money

Meta is reportedly exploring a landmark $10B+ investment in Scale AI, a critical move to control its AI data pipeline and escalate the competitive arms race with rivals like Microsoft and Google.

Microsoft Unveils Free AI Security Program for EU Governments

Microsoft launches its new European Security Program, offering governments free, AI-powered cybersecurity to combat sophisticated cyber threats and bolster digital resilience across the continent.