china

6 stories

GSPO replaces token-level RL clipping with sequence-level optimization, fixing MoE training collapse

Qwen introduces Group Sequence Policy Optimization, a new RL algorithm that eliminates the instability and infrastructure overhead that blocked GRPO from scaling to large Mixture-of-Experts language models.

Apr 26, 2026

1 source · primary

china Official

Qwen-MT covers 92 languages at $0.5 per million tokens using a lightweight MoE architecture

Alibaba's qwen-mt-turbo update builds translation-specific reinforcement learning on top of Qwen3, adding terminology injection, domain prompts, and translation memory at prices well below general-purpose frontier models.

Apr 26, 2026

1 source · primary

china Official

Qwen3Guard brings streaming safety detection to open-source guardrail models

Alibaba's Qwen team releases Qwen3Guard, a safety guardrail model in two variants — one generative, one streaming — with three-tier severity classification and support for 119 languages, available in 0.6B, 4B, and 8B sizes.

Apr 26, 2026

1 source · primary

china Official

Qwen-Image is a 20B image model that renders Chinese calligraphy and multi-column English text accurately

Alibaba's Qwen team releases a 20B MMDiT image foundation model aimed at complex text rendering and precise image editing, claiming top results across six public benchmarks.

Apr 26, 2026

1 source · primary

china Analysis

DeepSeek V4 arrives: near-frontier performance at lower cost

DeepSeek releases two MIT-licensed preview models — V4-Pro at 1.6 trillion parameters and V4-Flash at 284 billion — priced below comparable frontier offerings and engineered for long-context efficiency.

Apr 26, 2026

1 source · independent

china Official

Qwen-Image-Edit: a 20B model that separates semantic and appearance editing with bilingual text support

Built on the 20B Qwen-Image model, Qwen-Image-Edit handles both high-level semantic and low-level appearance editing while adding precise bilingual text editing, extending the model's text rendering expertise to image modification tasks.

Apr 26, 2026

1 source · primary