Large Language Models (LLMs) — Latest News and Analysis

News on large language models, foundation model releases, benchmarks, and LLM-powered applications.

The Latest News About Large Language Models

Zhipu AI

GLM-5.2 Tops Open-Weights AI Ranking as Coding Race Tightens

Z.ai's GLM-5.2 models takes the lead among open-weight models on Artificial Analysis' index, with public weights, a 1M-token window, and deployment caveats for coding teams.

Google DiffusionGemma Trades Quality for Local AI Speed

Google has introduced DiffusionGemma to speed local AI output through parallel text diffusion, but lower quality than Gemma 4 keeps trade-offs visible.
Anthropic Claude

Anthropic Makes Claude Fable Guardrails Visible After Apology

Anthropic has apologized for invisible Claude Fable 5 safeguards and will show fallback notices after hidden output changes threatened AI model evaluations.
Anthropic Claude

Anthropic Unveils Claude Fable 5 as a Mythos Class AI Model With Safety Routing

Anthropic has launched Claude Fable 5, bringing Mythos-class AI to regular Claude users with safety routing, a discounted June 22 access window, and usage-credit pricing.
OpenAI profit money

OpenAI Confidentially Files For IPO

OpenAI has filed a confidential draft S-1 for a possible IPO, leaving timing open as employee share-sale and Anthropic competition pressure starts to build.
Meta AI support assistant

Meta AI Support Flaw Exposes 20,225 Instagram Accounts

A flaw in Meta's AI-assisted Instagram recovery exposed 20,225 accounts, letting attackers redirect password resets and forcing June 19 security notices to users.
OpenAI Codex Windows Skills

ChatGPT Expected To Relaunch as Integrated Super App

OpenAI's planned ChatGPT super app is expected to put agents and Codex inside one hub, turning free prompts into paid tool paths as rivals push competing agent platforms.
DeepSeek

DeepSeek Leads Ramp Trending Software List as Firms Weigh Cheaper AI

DeepSeek has topped Ramp's June AI vendor list as US firms are increasingly betting on cheaper models.
xAI Grok official

xAI Reportedly Used Workaround to Train Grok With Claude Output After January Cutoff

xAI appears to have used a workaround to train its Grok AI with outputs of Anthropic's Claude model after an Anthropic access cutoff in January.
AI Research - ai generated

AI Leaders From Google, OpenAI, Anthropic, Microsoft Back DNA Screening as Biosecurity Chokepoint

AI leaders have backed DNA and RNA screening rules that would make gene synthesis sellers verify customers and orders before risky designs reach labs.

Sakana AI Opens Lab For Recursive Self-Improvement

Sakana AI has opened a Recursive Self-Improvement Lab to test whether AI can cut compute dependence.
Google-DeepMind-Math-AI

Leiden Declaration: Mathematicians Warn AI Could Strain Math Proof

Mathematicians warn in the Leiden Declaration AI proof tools could strain peer review, credit and verification.
AI Research - ai generated

GPT-Rosalind: OpenAI Pushes Its Genomics and Drug Discovery Model Into Controlled Research

OpenAI has expanded its genomics and drug discovery model GPT-Rosalind with life-sciences plugins and controlled access.

Training Details For Microsoft New In-House AI Models Put Clean-Data Promise in Doubt

Microsoft’s in-house MAI-Thinking-1 faces scrutiny over Common Crawl and public-web training data despite its pitch about clean, commercially licensed data.
Claude Code

Anthropic Says Claude Now Writes 80% of Its Production Code, Targets Full Self-Improving AI

Anthropic says Claude now authors over 80% of Anthropic production code, shifting risk from writing software to reviewing AI-made changes before they ship inside live systems.
ChatGPT saved memories

OpenAI Expands ChatGPT Memory With Editable Summaries

OpenAI has expanded ChatGPT memory, giving Plus and Pro users in the US editable summaries as Free and Go accounts wait for a global rollout in the coming weeks.

Tencent Is Developing a WeChat AI Agent for In-App Tasks

Tencent is reportedly developing a WeChat AI agent that would use mini programs to complete in-app tasks, with review and external tests still ahead.

Google’s New Gemma 4 12B Model Targets Local AI Agents on Laptops

Google has released Gemma 4 12B, a local multimodal AI model for laptops that tests whether audio, images, code, and tool calls fit in 16GB memory locally.
Spyware keylogger

AI Powered Malware Worm Prototype Adapts Attacks Across Hosts

Researchers built a contained AI powered malware worm that adapts attacks across lab hosts, exposing how local open-weight models complicate malware containment.
Meta AI Muse Spark

Meta AI Support Flaw Lets Hacker Hijack Instagram Accounts

Meta AI Support abuse exposed an Instagram recovery gap that let hackers change emails, reset passwords, and briefly seize high-profile accounts before a patch.
Anthropic Claude

Anthropic Reveals 31.5% Browser Agent Hijack Rate

Anthropic has disclosed a 31.5% prompt-injection success rate for Claude's browser agent before safeguards, showing how hostile web instructions can reach live tools.

Mistral Replaces Le Chat Assistant with New Vibe AI Agent for Work and Coding

Mistral has rebranded its Le Chat AI assistant as Vibe, folding work automation and remote coding into one AI agent with cloud sandboxes, connectors and tiered pricing.
ChatGPT Search

Benchmark Reveals AI Search Agents Guess Before They Search

LiveBrowseComp benchmark results suggest AI search agents often verify hunches instead of fresh web evidence, raising new doubts about benchmark scores for browsing skill.

Minimax Launches M3 AI Model with 1M Context, Multimodal Push

MiniMax is pushing M3 into the long-context model race with multimodal input and a claimed 1 million-token window.
Enterprise AI budget review scene with a dashboard of model tiers, token counters, approval badges, and spend caps

How Exploding AI Bills Are Pushing Tech Companies to Cheaper Alternatives

Rising token-driven AI bills are pushing more and more companies to ration access, track usage, and steer workers toward cheaper tools.
's Accelerating Math and Theoretical Physics with AI Workshop

Math Prodigy Terence Tao Sees AI Changing Proof Research

Terence Tao argues AI could split math research into specialized roles if verification keeps pace and human reviewers filter weak ideas before they spread.
Gemini Omni

Google Revises Gemini Quotas After AI Pro Subscriber Complaints

Google has tweaked Gemini quota rules after paid users hit five-hour walls after a just a few minutes, capping single-request usage and excluding failed jobs.

WorkBuddy: Tencent Expands OpenClaw-Compatible Desktop AI Agent Globally

Tencent has expanded WorkBuddy globally while betting smaller AI models can win more users against Alibaba and ByteDance in China's intensifying AI race.
Anthropic Claude

Anthropic SpaceX Deal: Musk Leaves Capacity For Claude in Doubt

Anthropic's SpaceX compute deal is real, but Musk's 180-day lease claim conflicts with payment terms through May 2029, raising Claude planning questions.
Apple Siri Mockup

Apple Mockup Leak Shows Siri’s New Chatbot App Plans

Apple's leaked Siri redesign points to a chatbot-style iPhone app, Dynamic Island replies, and Gemini-backed AI features that could debut at WWDC in June.