- **What use case is the model going to support/resolve?**
It was made to detect content that we do not want to expose kids to in Roblox games (based on the Roblox TOS).
- Do you have a '''model card'''?
No. See https://huggingface.co/derenrich/enwiki-kid-friendly-classifier
- **What team created/trained/etc. the model? What tools and frameworks have you used?**
Future Audiences. It was trained using Hugging Face Transformers, for the purpose of filtering out articles surfaced on Roblox.
- **What kind of data was the model trained with, and what kind of data is the model going to need in production (for example, calls to internal/external services, special data sources for features, etc.)?**
input: article title and short description
output: categorical variable (e.g. none/crime/political/...)
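The input/output contract above could be exercised roughly as follows. This is only a sketch under stated assumptions: the model ID is taken from the Hugging Face link above, it is assumed to load as a standard `text-classification` pipeline, and the way the title and short description are joined (`build_input`) is hypothetical, since the actual training format is not stated here.

```python
MODEL_ID = "derenrich/enwiki-kid-friendly-classifier"  # from the Hugging Face link above


def build_input(title: str, short_description: str) -> str:
    # ASSUMPTION: title and short description are joined with ": ".
    # The format used at training time may differ; check the model page.
    return f"{title.strip()}: {short_description.strip()}"


def load_classifier():
    # Imported lazily so the helpers work without transformers installed.
    # ASSUMPTION: the model loads as a text-classification pipeline.
    from transformers import pipeline
    return pipeline("text-classification", model=MODEL_ID)


def classify(title: str, short_description: str, classifier) -> str:
    # A text-classification pipeline returns a list like
    # [{"label": ..., "score": ...}] for a single input string.
    results = classifier(build_input(title, short_description))
    return results[0]["label"]


# Example usage (downloads the model on first call):
#   clf = load_classifier()
#   print(classify("Apple", "fruit that grows on a tree", clf))
```

Passing the classifier in as an argument keeps `classify` easy to test with a stub and makes it straightforward to swap in a hosted endpoint later.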
- **If you have a minimal codebase that you used to run the first tests with the model, could you please share it?**
- **State what team will own the model and please share some main points of contact (see more info in '''Ownership of a model''').**
Future Audiences / me
- **What is the current latency and throughput of the model, if you have tested it?** We don't need anything precise at this stage, just some ballpark numbers to figure out how the model performs with the expected inputs. For example, does the model take ms/seconds/etc. to respond to queries? How does it react when 1/10/20/etc. requests are made in parallel? If you don't have these numbers don't worry, open the task and we'll figure something out while we discuss next steps!
Don't know; it's only experimental. It is ModernBERT-based, so it should be very easy to host.
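Ballpark numbers for the 1/10/20 parallel-request question could be gathered with a small harness like the one below. It is a sketch, not a measurement of this model: `stub_classify` is a hypothetical stand-in that just sleeps, and it would need to be replaced with a real call to the classifier (local pipeline or HTTP endpoint).

```python
import statistics
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List


def timed_call(classify: Callable[[str], object], text: str) -> float:
    # Return the wall-clock seconds taken by a single classify() call.
    start = time.perf_counter()
    classify(text)
    return time.perf_counter() - start


def measure(classify: Callable[[str], object], texts: List[str],
            parallelism: int) -> Dict[str, float]:
    # Send all texts through a thread pool of the given size and
    # summarise per-request latency and overall throughput.
    wall_start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=parallelism) as pool:
        latencies = list(pool.map(lambda t: timed_call(classify, t), texts))
    wall = time.perf_counter() - wall_start
    return {
        "parallelism": parallelism,
        "median_latency_s": statistics.median(latencies),
        "max_latency_s": max(latencies),
        "throughput_rps": len(texts) / wall,
    }


def stub_classify(text: str) -> str:
    # HYPOTHETICAL stand-in for the real model call; sleeps ~10 ms.
    time.sleep(0.01)
    return "none"


# Example usage:
#   for n in (1, 10, 20):
#       print(measure(stub_classify, ["Apple: a fruit"] * 40, n))
```

Threads are enough here because the interesting time is spent waiting on the model or the network; for a local CPU-bound pipeline the parallel numbers mostly show queueing rather than true concurrency.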
- **Is there an expected frequency in which the model will have to be retrained with new data?** What are the resources required to train the model and what was the dataset size?
Very infrequently.
- **Have you checked if the output of your model is safe from a human rights point of view?** Is there any risk of it being offensive for somebody? Even if you have any slight worry or corner case, please tell us!
Have not checked. Biases are possible. It is only experimental for now.
- **Everything else that is relevant in your opinion.**