In one of upcoming request, we will be interested in deploying LLM based content filtering and tagging to reduce the burden on moderators, by reducing the amount of noise entering the wikis, and providing more focused signals for moderators to quickly handle abuse.
As preliminary work before taking up this initiative, we want to explore the performance of gpt-oss-safeguard-20b in this context. This will enable us to have more information to scope this initiative better.

