User Details
- User Since
- Mar 14 2023, 12:16 PM (82 w, 3 d)
- Availability
- Available
- LDAP User
- Unknown
- MediaWiki User
- ROdonnell-WMF [ Global Accounts ]
Yesterday
Seemingly, I clicked the Merge button! ๐
QA is good in dev.
Wed, Oct 9
Tue, Oct 8
E2E integration tests pass in dv
Mon, Oct 7
Thu, Oct 3
In Snapshot, NDJSON files
Version 4.0 is in Kafka compacted topics
Working in dev - on-demand
I will pick up 7-9 Oct, after bulk ingestion has started. We'll see if it can be ready for the next SC ingestion
Wed, Oct 2
Tue, Oct 1
Closing this ticket, the root cause was due to old invalid Kafka messages in our dev backend. Thank you @cscott for helping me.
de.wikinews has 3 mismatches after a S3 cleanup.
anpwiki, these 5 have mismatches. I don't see any redirects, only edits that occurred within 5 minute period
I've found one issue with our setup, old s3 files are not getting cleaned in our articles folder. However, there is still some mismatch in older articles (from July and August) that get redirected to newer article names.
Mon, Sep 30
I reran our ingestion today and got these mismatches in https://da.wiktionary.org
Thu, Sep 26
We are seeing a mismatch in the w/rest.php/v1/revision/%s/html API calls
huwiki errors to flag to WMF:
12 rows (out of 661601) in huwiki snapshot have mismatching revision id in the HTML field.
Checked dewikinews snapshot after dev ingestion. There were 35+ mismatches before this fix in dewikinews. After the fix, only one document has a mismatching HTML rev id, but this seems to be from the snapshot service, see JSON duplicate below. Checking a larger project huwiki`
Wed, Sep 25
Latency for articleUpdate service. Mostly the GetRevisionHTML is causing the delay, but not very often
I'm going to test on a different set of projects: ['dawiktionary', 'dewikinews', 'huwiki', 'cswikibooks']
One error related to log message: "wmf api parsoid error", caused by RequestTimeout for this ZH page: https://zh.wikipedia.org/wiki/Template:Recent_changes_article_requests/list. (could be because it's a Template page?)
OK, will do
Probably best to move the last sub-task to the next ticket where we do the full bulk ingestion in PROD
Airflow logs in dev:
Tue, Sep 24
Mon, Sep 23
Tue, Sep 17
Sep 11 2024
See screenshot of DLQ with CS Voyage message: