Large Language Models (LLMs) — Latest News and Analysis

News on large language models, foundation model releases, benchmarks, and LLM-powered applications.

The Latest News About Large Language Models

Claude Mythos Improves Crypto Attacks with the HAWK Signature Scheme

Markus Kasanmascheff - July 29, 2026 4:38 pm CEST

Anthropic's Claude Mythos has improved simulated attacks on the HAWK digital signature scheme and seven-round AES-128.

Kimi K3 Model Launches With Alibaba, Huawei Day-Zero Support

Markus Kasanmascheff - July 29, 2026 3:49 pm CEST

Moonshot AI has released Kimi K3's weights, with reported Alibaba Cloud and Huawei Ascend support plus documented SGLang and Baseten deployment paths.

Claude Voice Mode Adds Opus 5 and Sonnet 5 Support, Stays Turn-Based

Markus Kasanmascheff - July 28, 2026 11:22 am CEST

Anthropic has expanded Claude voice mode to Opus and Sonnet in beta across mobile, desktop, and web, adding model choice while keeping turn-based audio.

OpenAI’s GPT-5.6 Sol Models Escapes Sandbox and Breaches Hugging Face

Markus Kasanmascheff - July 24, 2026 6:19 pm CEST

OpenAI's latest GPT-5.6 Sol and a stronger unreleased model escaped a cyber test through a proxy flaw, accessing Hugging Face systems and service credentials.

Microsoft Weighs Chinese Kimi K3 Model for Selected Copilot Tasks

Markus Kasanmascheff - July 24, 2026 3:12 pm CEST

Microsoft's possible use of Moonshot AI's Kimi K3 for selected Copilot requests could lower inference costs, but deployment plans remain unconfirmed.

Altman Signals AI Price War with Anthropic

Markus Kasanmascheff - July 23, 2026 2:11 pm CEST

Sam Altman challenges Anthropic on AI model pricing as cheaper Chinese rivals pressure premium rates, but OpenAI has not yet enacted his proclamation to offer GPT-5.2 at the price of Claude Fable 5.

Z.AI Completes 1GW Chinese-Chip Data Center

Markus Kasanmascheff - July 22, 2026 5:34 pm CEST

Chinese AI developer Z.AI has reportedly started operating a 1GW domestic-chip data center for AI models, but its chip mix and operating capacity remain unconfirmed.

Cybersecurity Hackers Cyberesionage Node Network

AISI: Open-Weight AI Is Catching up With Models From Anthropic and OpenAI in Cybersecurity...

Markus Kasanmascheff - July 22, 2026 12:22 pm CEST

The UK AI Security Institute has found GLM-5.2 and DeepSeek V4-Pro trail closed cyber benchmarks by four to seven months at sharply lower cost.

ChatGPT teen accounts parent notifications

ChatGPT to Alert Parents After Teen Violence-Policy Deactivation

Markus Kasanmascheff - July 22, 2026 11:38 am CEST

OpenAI has announced parent alerts that will follow any teen ChatGPT account violence-policy deactivations, sharing the policy category with parents but no private chats.

Google Releases Cheaper Gemini Flash and Flash Lite Models, Plans Restricted Gemini Cyber Pilot

Markus Kasanmascheff - July 21, 2026 11:41 pm CEST

Google has released lower-cost Gemini 3.6 Flash and Flash-Lite models and plans a restricted Gemini 3.5 Flash Cyber pilot for cybersecurity testing.

OpenAI Restores ChatGPT Desktop Features After Backlash

Markus Kasanmascheff - July 21, 2026 7:09 pm CEST

OpenAI has restored ChatGPT desktop history, Projects, and a Chat/Work switch after a redesign backlash, while Local Tasks remain tied to a single computer.

Anthropic Keeps Claude Fable 5 in Paid Plans After OpenAI’s GPT-5.6 Release

Markus Kasanmascheff - July 20, 2026 4:52 pm CEST

Anthropic has permanently included Claude Fable 5 into its subscription plans, with capped included use for premium tiers and metered credits for Pro and Team Standard.

Google Reportedly Delays Gemini 3.5 Pro Over Coding Issues

Markus Kasanmascheff - July 20, 2026 1:40 pm CEST

Google has reportedly delayed Gemini 3.5 Pro after missing a June target as coding results fell short, leaving partner testing underway without a release date.

Thinking Machines Lab Launches DeepSeek-Inspired Inkling 975B Parameter Model

Markus Kasanmascheff - July 19, 2026 3:16 pm CEST

Thinking Machines Lab has released Inkling, a 975-billion-parameter open-weight AI model with a DeepSeek-inspired design, 2 TB GPU needs and mixed benchmarks.

Meta parent notifications for supervised teen AI chats

Meta Adds Human-Reviewed Parent Alerts for Teen AI Chats

Markus Kasanmascheff - July 18, 2026 1:15 pm CEST

Meta has added human-reviewed parent alerts for supervised teen AI chats about possible self-harm, keeping exact messages private despite false-alert risks.

Anthropic Seeks Billions in Bank Credit Before IPO

Markus Kasanmascheff - July 17, 2026 6:21 pm CEST

Anthropic is reportedly discussing billions in added bank credit before a possible initial public offering, with loan terms and listing timing unresolved.

Moonshot AI Unveils 2.8T-Parameter Kimi K3 AI Model

Markus Kasanmascheff - July 17, 2026 11:40 am CEST

Moonshot AI has launched its 2.8-trillion-parameter Kimi K3 model, but a high hallucination rate might tempers its frontier-model pitch.

Microsoft Reportedly Trains Sales Team to Target AI Labs

Markus Kasanmascheff - July 17, 2026 11:12 am CEST

Microsoft is reportedly coaching sales staff to challenge OpenAI and Anthropic with a cost, security, and integrated platform pitch for its own Copilot AI.

Memory Attacks Might Let Claude Leak Personal Data with

Markus Kasanmascheff - July 16, 2026 1:24 pm CEST

Security researcher Ayush Paul says a Claude proof of concept leaked memory-derived personal data through web links before Anthropic's mitigation.

Apple Reportedly Weighs PrismML For IPhone AI Models

Markus Kasanmascheff - July 16, 2026 11:44 am CEST

Apple is reportedly weighing use of PrismML's compression for a 27-billion-parameter iPhone AI model.

GPT-5.6 Sol Users Complain About File and Database Deletions

Markus Kasanmascheff - July 15, 2026 3:28 pm CEST

Several users say OpenAI's GPT-5.6 Sol frontier model has deleted files or data without permission.

Satya Nadella on the Dwarkesh Patel podcast

Satya Nadella Challenges AI Labs Over Distillation Rules

Markus Kasanmascheff - July 14, 2026 12:53 pm CEST

Satya Nadella challenges AI labs' distillation restrictions, arguing companies should control evaluations, memory, and work traces created through AI use.

OpenAI Eases GPT-5.6 Usage Limits, Keeps Weekly Caps

Markus Kasanmascheff - July 14, 2026 12:14 pm CEST

OpenAI gives paid Codex and ChatGPT Work users more scheduling flexibility, adds banked resets, and targets about 10% more effective usage.

Europe’s New Soofi S AI Model Is Blazing Fast

Markus Kasanmascheff - July 14, 2026 11:28 am CEST

Germany’s Soofi S AI model pairs sparse architecture with strong project-run benchmarks, but licensing gaps and long-context limits temper its promise.

OpenAI Launches GPT-5.6 in Three Tiers: Sol, Terra and Luna

Markus Kasanmascheff - July 9, 2026 11:59 pm CEST

OpenAI’s GPT-5.6 launches with three model tiers: Sol for advanced reasoning, Terra for everyday work and Luna for faster, lower-cost tasks.

Grok 4.5 Launches for Coding Agents as SpaceXAI Tests Lower Prices

Markus Kasanmascheff - July 9, 2026 7:20 pm CEST

Grok 4.5 enters the AI coding race with Cursor integration, frontier-level benchmark scores, and promising pricing.

Anthropic Finds a Hidden “Workspace” Inside Claude’s Reasoning

Markus Kasanmascheff - July 7, 2026 5:02 pm CEST

Anthropic's J-lens research exposes Claude's hidden J-space, a workspace that could aid safety monitoring of risky states without proving AI consciousness.

OpenAI and Anthropic Must Prove AI Scale Can Outrun Compute Costs

Markus Kasanmascheff - July 7, 2026 11:19 am CEST

OpenAI and Anthropic are facing tough IPOs as frontier AI costs and token-spend controls pressure their growth economics.

Meta Reportedly Probed Rival Chatbots With Fake Teen Accounts

Markus Kasanmascheff - July 7, 2026 9:56 am CEST

Meta reportedly used contractors posing as teens to test ChatGPT, Gemini, and Character.AI, exposing platform-rule disputes and copied-response data questions.

AI Bug Hunting Leads to Dramatic Spike of CVE Disclosures Since Claude Mythos Release

Markus Kasanmascheff - July 6, 2026 5:58 pm CEST

Epoch AI data points to a record June surge in public software-flaw disclosures as AI bug-hunting expands, but the data cannot prove which flaws AI found.