- Queries
- All Stories
- Search
- Advanced Search
- Transactions
- Transaction Logs
Advanced Search
Jan 24 2019
In T200297#4903879, @Milimetric wrote:This wikitext-in-JSON thing seems really complicated.
Jan 23 2019
Important change of plans—We're discussing backfilling, and it might be best to allow mismatched model versions in the dumps for now. In other words, go ahead and backfill with whatever the current model version is. Normalized data will continue to be segregated by model version, but the monthly "current" and "historical" dumps will patch together whatever scores are available, simply taking the newest model version used to score each revision.
Jan 22 2019
@Nikerabbit Hi, are we unblocked now that MLEB 2019.01 is released?
In T213943#4888027, @Nikerabbit wrote:All we have to do is rely on a namespace-based message group, and configure the group with:
Just to clarify, you have a code that gets run when the state of a language of message group's translations is changed to published?
Jan 21 2019
Jan 18 2019
In T200297#4887041, @Krinkle wrote:
- The feature proposes to store arbitrary text (specifically, wikitext) inside JSON blobs.
Jan 17 2019
My lazy update: I think we can do this without implementing anything new. All we have to do is rely on a namespace-based message group, and configure the group with:
$states['published']['right'] = 'translate-manage';
I've dropped this field set from https://gerrit.wikimedia.org/r/#/c/mediawiki/extensions/CentralNotice/+/484784
Jan 16 2019
We should probably decline this task in favor of T182331: [Epic] Deploy ORES in kubernetes cluster.
How many rows are in the tables? There should be a timestamp column, often ts, to query. It's probably worth keeping an archival dump if it might be real data.
Jan 15 2019
In T213816#4881623, @Harej wrote:Do we, as a general rule, want to prevent the creation of Jade pages that do not correspond to legal revision IDs?
Good find, thanks!
In T209732#4881126, @Ottomata wrote:We could emit a single test event per hour into the topic in each dc... :)
Something tricky I ran into: success files aren't written for hours where there are zero changeprop events through codfw. Maybe we have to change that job to write success files even for empty hours?
hdfs dfs -ls hdfs://analytics-hadoop/wmf/data/event/mediawiki_revision_score/datacenter=codfw/year=2019/month=1/day=12 Picked up JAVA_TOOL_OPTIONS: -Dfile.encoding=UTF-8 Found 9 items drwxr-xr-x - hdfs hadoop 0 2019-01-12 04:21 hdfs://analytics-hadoop/wmf/data/event/mediawiki_revision_score/datacenter=codfw/year=2019/month=1/day=12/hour=1 drwxr-xr-x - hdfs hadoop 0 2019-01-12 19:21 hdfs://analytics-hadoop/wmf/data/event/mediawiki_revision_score/datacenter=codfw/year=2019/month=1/day=12/hour=16 drwxr-xr-x - hdfs hadoop 0 2019-01-12 20:20 hdfs://analytics-hadoop/wmf/data/event/mediawiki_revision_score/datacenter=codfw/year=2019/month=1/day=12/hour=17 drwxr-xr-x - hdfs hadoop 0 2019-01-12 21:20 hdfs://analytics-hadoop/wmf/data/event/mediawiki_revision_score/datacenter=codfw/year=2019/month=1/day=12/hour=18 drwxr-xr-x - hdfs hadoop 0 2019-01-12 22:20 hdfs://analytics-hadoop/wmf/data/event/mediawiki_revision_score/datacenter=codfw/year=2019/month=1/day=12/hour=19 drwxr-xr-x - hdfs hadoop 0 2019-01-12 05:20 hdfs://analytics-hadoop/wmf/data/event/mediawiki_revision_score/datacenter=codfw/year=2019/month=1/day=12/hour=2 drwxr-xr-x - hdfs hadoop 0 2019-01-12 23:20 hdfs://analytics-hadoop/wmf/data/event/mediawiki_revision_score/datacenter=codfw/year=2019/month=1/day=12/hour=20 drwxr-xr-x - hdfs hadoop 0 2019-01-13 00:21 hdfs://analytics-hadoop/wmf/data/event/mediawiki_revision_score/datacenter=codfw/year=2019/month=1/day=12/hour=21 drwxr-xr-x - hdfs hadoop 0 2019-01-12 06:20 hdfs://analytics-hadoop/wmf/data/event/mediawiki_revision_score/datacenter=codfw/year=2019/month=1/day=12/hour=3
Jan 14 2019
Jan 12 2019
There were some cleanups to make on our side, and a synchronization issue with the research client. I've replaced the ORES consumer part of their pipeline with our standard score_revisions utility and will wait for feedback.
Jan 11 2019
I think my last comment is a different bug, so splitting into T213582