For example, get the property data types from another datasource (an API call) and inject it to the output.
Timebox: 5-8 hours
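A minimal sketch of the idea, not the ORES implementation: it assumes the revision document is a Wikibase item JSON dict whose statements lack `mainsnak["datatype"]`, and it asks the `wbgetentities` API module for `props=datatype`. The function names `fetch_datatypes` and `inject_datatypes`, the user agent string, and the use of plain `urllib` (rather than mwapi) are illustrative assumptions.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

API_URL = "https://www.wikidata.org/w/api.php"


def fetch_datatypes(property_ids, api_url=API_URL):
    """Ask wbgetentities for the datatype of each property ID (50 IDs per call)."""
    ids = sorted(property_ids)
    datatypes = {}
    for start in range(0, len(ids), 50):
        query = urlencode({
            "action": "wbgetentities",
            "ids": "|".join(ids[start:start + 50]),
            "props": "datatype",
            "format": "json",
        })
        # Hypothetical user agent, chosen only for this sketch.
        request = Request(api_url + "?" + query,
                          headers={"User-Agent": "datatype-injection-sketch/0.1"})
        with urlopen(request) as response:
            entities = json.load(response).get("entities", {})
        for pid, entity in entities.items():
            if "datatype" in entity:
                datatypes[pid] = entity["datatype"]
    return datatypes


def inject_datatypes(item_doc, fetch=fetch_datatypes):
    """Fill in mainsnak["datatype"] for every statement of an item document."""
    # An item without statements may serialize "claims" as an empty list.
    claims = item_doc.get("claims") or {}
    datatypes = fetch(set(claims)) if claims else {}
    for pid, statements in claims.items():
        for statement in statements:
            if pid in datatypes:
                statement["mainsnak"].setdefault("datatype", datatypes[pid])
    return item_doc
```

Passing `fetch` as a parameter keeps the lookup swappable, so the same injection step could in principle be fed from the API in live scoring and from a precomputed mapping in dump analysis.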
Status | Assigned | Task
---|---|---
Declined | None | T259351 property datatype information is missing in articlequality/ORES
Resolved | Ladsgroup | T260778 [Investigation] Check if we can use RevDoc but inject property data type from another datasource
This has the benefit of making dump analysis much simpler, but since datasources change for each request in live scoring, it won't improve performance there (it slightly decreases it). We can hard-code property data types into ORES, but that would bloat the model file (and maintaining the list is another headache). A great solution would be to have a local server cache and keep it there, but that's a long shot :(
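To make the cache idea concrete, here is a rough in-process sketch. It is not something ORES provides: the class name `DatatypeCache`, the one-day default TTL, and the `fetch` callable (anything that maps a set of property IDs to a dict of datatypes) are all assumptions made for illustration.

```python
import time


class DatatypeCache:
    """Remember property datatypes for a while so repeated lookups skip the API."""

    def __init__(self, fetch, ttl_seconds=86400, clock=time.monotonic):
        self._fetch = fetch        # callable: set of property IDs -> {pid: datatype}
        self._ttl = ttl_seconds
        self._clock = clock
        self._entries = {}         # property ID -> (datatype, expiry time)

    def get_many(self, property_ids):
        now = self._clock()
        # Only ask the backing datasource for IDs that are unknown or expired.
        missing = {pid for pid in property_ids
                   if pid not in self._entries or self._entries[pid][1] <= now}
        if missing:
            for pid, datatype in self._fetch(missing).items():
                self._entries[pid] = (datatype, now + self._ttl)
        return {pid: self._entries[pid][0] for pid in property_ids
                if pid in self._entries and self._entries[pid][1] > now}
```

Property datatypes rarely change once a property exists, which is why a long TTL seems reasonable here; a shared cache across workers would need an external store, which this sketch does not attempt.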
Our options and the downsides of each option:
Honestly, the last option sounds least hard to achieve. I think we should go that way.
Would the last option mean that we'd still get the same quality score for a given Item no matter if it was scored live or from the dumps?
They already differ, because the ORES dump analyzer can't hit the API for every entity to get property suggester output, but the difference is small. We won't increase the gap, but we will probably need to implement some features twice.
I also want to add that using wbgetentities is not possible in ORES because it doesn't support sending revids, and Special:EntityData wouldn't work in ORES either, because ORES depends heavily on the mwapi library, which doesn't support such requests (we would need to inject a new type of session, which seems like a big overhead).