
OpenAI Introduces GPT-OSS-Safeguard Models for Online Harm Classification
OpenAI released open-weight reasoning models (120B, 20B) that apply developer-written policies at inference to classify online harms, extending the internal Safety Reasoner to the public under Apache 2.0.








