I created a training pipeline to remove propaganda and gaslighting from Chinese models!
I'm thrilled to announce LazarusAI's ReAligned-Qwen3.5 series of models, finetuned to reduce Chinese ideological bias and censorship, refusal behavior, and state-narrative framing
I use SFT + GRPO pipeline with a dataset crafted to target the taxonomy of chinese censorship and bias, along with my ReAligned classifier model as a GRPO reward signal.