My attempt at a logico-historical approach:
The result is based on the dump of 2019-10-01; any edit after that date is not counted in the system. This is by design, so that we have snapshots of a wiki at given points in time.
I am trying to make sense of the WikiConv data set based on the example you shared...
I will try to help explain the structure of this data set in this comment (see below).
Mon, Nov 11
@Dydimus The selected page is https://en.wikipedia.org/wiki/Hurricane_Hazel (your email today, 5:29 PM CET).
Please let me know what you think about the WikiConv data set. Thanks.
@Ladsgroup We might have a problem with your most recent version of ORES quality score predictions for Wikidata.
@Christine_Domgoergen_WMDE Interim report is updated to include Stuttgart and Basel.
Sun, Nov 10
2019/11/10: shared an excerpt of the WikiConv data set with Dydimus to assess it and decide if it can be used in his research project.
2019/11/10: consider the WikiConv data set for this project.
Here is an interim campaign report:
Fri, Nov 8
Thu, Nov 7
A Google Spreadsheet summarizing the daily developments has been shared with you a few moments ago. I will continue to report the daily campaign numbers there.
Wed, Nov 6
@awight Thank you for helping to sort this out. Then we wait until the new eventlogging is ready and use the beacon/impression path in webrequest. We've learned something.
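A minimal sketch of what counting campaign impressions from the beacon/impression path could look like once the webrequest rows are pulled. It assumes each row provides uri_path and uri_query as in wmf.webrequest, and that the query string carries a "campaign" parameter; the sample rows and the campaign name below are hypothetical, not real data.

```python
from urllib.parse import parse_qs

BEACON_PATH = "/beacon/impression"


def parse_impression(uri_path, uri_query):
    """Return the query parameters of a beacon impression request, or None."""
    if uri_path != BEACON_PATH:
        return None
    # Accept the query string with or without its leading "?".
    params = parse_qs(uri_query.lstrip("?"))
    # Keep the first value of each parameter.
    return {key: values[0] for key, values in params.items()}


def count_impressions(rows, campaign):
    """Count beacon impressions whose 'campaign' parameter matches."""
    n = 0
    for uri_path, uri_query in rows:
        parsed = parse_impression(uri_path, uri_query)
        if parsed is not None and parsed.get("campaign") == campaign:
            n += 1
    return n


# Hypothetical rows: (uri_path, uri_query). Parameter names are assumptions.
rows = [
    ("/beacon/impression", "?campaign=example_campaign&banner=b1&db=aawiki"),
    ("/beacon/impression", "?campaign=other_campaign&banner=b2&db=dewiki"),
    ("/wiki/Main_Page", ""),
    ("/beacon/impression", "?campaign=example_campaign&banner=b1&db=dewiki"),
]
print(count_impressions(rows, "example_campaign"))
```

In the cluster the same filter would more likely be written directly in the query against webrequest; the Python version is only meant to make the logic explicit.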
@Christine_Domgoergen_WMDE Of course. Both the spreadsheets and an interim report will be prepared.
Tue, Nov 5
@Christine_Domgoergen_WMDE Reporting aggregated data until 2019/11/04:
@Christine_Domgoergen_WMDE Reporting soon for 2019/11/04 and some aggregated data as well.
Mon, Nov 4
- the ticket is re-opened,
- next update is on November 13, 2019 (two weeks after October 30 when the last report was delivered);
- user edits will be tracked too.
@Christine_Domgoergen_WMDE Working on it right now and reporting back in minutes.
@awight Thank you!
@Christine_Domgoergen_WMDE No problem:
Sun, Nov 3
@awight Thank you for your assessment of the problem.
I've tested the campaign banner impressions today, and they are found on the beacon/impression path, test day: November 01, 2019:
Fri, Nov 1
- incorrect data.frame produced;
- fixing now.
- produce the final analytics dataset with PySpark: DONE.
Thu, Oct 31
- ORES score predictions moved to hdfs, loaded to Spark;
- all join operations will be performed in the cluster.
@awight Thanks for pointing this out.
@Addshore Many thanks.
@Addshore Could you please share the beacon path in wmf.webrequest where you saw the test banner impression previously? Thanks.
@Boris_Brunst_ext_WMDE Oct 30 user registrations: one (1) from WMDE_2019_emailc. Do we need anything else here?
The test banner impression produced yesterday (October 30, 2019) on aawiki is not found in our databases; from stat1004, use event;, then:
Wed, Oct 30
Testing will commence tomorrow morning from the centralnoticeimpression schema in the event Hadoop database:
- the test banner has been successfully tracked by @Addshore in the Central Notice processing pipeline;
- the test was conducted on the aawiki wiki;
- the data are still not found in the centralnoticeimpression table; moreover, there are no data in that table for anything on Oct 30 2019.
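The check for whether anything at all landed in the table on a given day can be sketched as a small query builder. This is only an illustration: it assumes the table is exposed as event.centralnoticeimpression and is partitioned by year, month and day, and the helper name daily_count_query is hypothetical.

```python
from datetime import date


def daily_count_query(table, day):
    """Build a HiveQL query that counts the rows in one daily partition."""
    return (
        f"SELECT COUNT(*) AS n_rows FROM {table} "
        f"WHERE year = {day.year} AND month = {day.month} AND day = {day.day}"
    )


# Assumed table name and partition columns; adjust to the actual schema.
query = daily_count_query("event.centralnoticeimpression", date(2019, 10, 30))
print(query)
```

A count of zero for the test day would confirm that the problem is upstream of any filtering by banner or wiki.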
@Christine_Domgoergen_WMDE In the meantime, you will most probably be glad to learn that I've found all the test user registrations in the respective database:
Once again, I can only read what is already in our databases, and cannot write a single thing there. So I am afraid that I cannot tell why the data are not present when they are not present.
However, I will double check the user registrations data.
@Boris_Brunst_ext_WMDE Oct 29 user registrations: None.
Tue, Oct 29
Could you please let us know exactly which tag you used for your test user registration from the following:
... but if I understand this correctly, it doesn't say anything about the rate of resolving these conflicts.
Update, 14:08 CET: still no data on banner impressions.
I have tested the following:
@Boris_Brunst_ext_WMDE Oct 28 user registrations:
Mon, Oct 28
@Boris_Brunst_ext_WMDE No worries, will do.
@Lydia_Pintscher I guess this task is completed now.
@Boris_Brunst_ext_WMDE Shall we close this ticket then?
@Boris_Brunst_ext_WMDE The latest numbers are here.
@Boris_Brunst_ext_WMDE Here are the latest numbers. Please let me know what the next steps in analytics for this campaign are. Thanks.
- This will be dealt with by ShEx in the near future;
- closing the task as invalid.
Thu, Oct 24
Tue, Oct 22
- Dashboard online.
Mon, Oct 21
@Boris_Brunst_ext_WMDE I am on it.
@Boris_Brunst_ext_WMDE Got it. Here we go (Oct 19 and Oct 20):
Sat, Oct 19