A closed-form theory of word2vec shows it is equivalent to running PCA on word co-occurrence statistics
Berkeley researchers prove that word2vec learns in discrete, sequential rank-incrementing steps, and that the final representations are exactly the top eigenvectors of a matrix defined by corpus co-occurrence probabilities.
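The core claim — that word2vec's embeddings coincide with the top eigenvectors of a matrix built from corpus co-occurrence probabilities — can be sketched on a toy corpus. The exact matrix the researchers analyze is not given here, so this sketch uses a hypothetical PMI-like stand-in, `M_ij = P_ij / (p_i p_j) - 1`, as an illustration of the "PCA on co-occurrence statistics" idea rather than the paper's precise construction.

```python
import numpy as np

# Toy corpus and vocabulary.
corpus = "the cat sat on the mat the dog sat on the rug".split()
vocab = sorted(set(corpus))
idx = {w: i for i, w in enumerate(vocab)}
V = len(vocab)

# Count co-occurrences within a symmetric window of size 2.
window = 2
C = np.zeros((V, V))
for i, w in enumerate(corpus):
    for j in range(max(0, i - window), min(len(corpus), i + window + 1)):
        if i != j:
            C[idx[w], idx[corpus[j]]] += 1

# Joint co-occurrence probabilities and their marginals.
P = C / C.sum()
p = P.sum(axis=1)

# Hypothetical PMI-like symmetric matrix (stand-in for the paper's matrix).
M = P / np.outer(p, p) - 1.0

# "Embeddings" = top-d eigenvectors; eigh returns eigenvalues in
# ascending order, so the last d columns are the top eigenvectors.
d = 2
eigvals, eigvecs = np.linalg.eigh(M)
embeddings = eigvecs[:, -d:]  # one d-dimensional vector per word
print({w: embeddings[idx[w]].round(3) for w in vocab})
```

Each word's row in `embeddings` plays the role of its learned word2vec vector under the closed-form theory; on a real corpus one would use sparse counts and a truncated eigensolver instead of a dense `eigh`.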