Apple researchers build a context-understanding benchmark and find quantized models degrade unevenly
A new benchmark from Apple ML Research probes LLMs on four distinct context-understanding tasks across nine datasets, finding that pretrained dense models struggle with nuanced contextual features and that 3-bit quantization causes performance drops that vary by task.