
mschwarzer (Malte)
User

User Details

User Since
Aug 9 2016, 1:58 PM (509 w, 1 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
Muwnd [ Global Accounts ]

Recent Activity

Mar 21 2020

mschwarzer closed T149682: Define success criteria for performing A/B tests, a subtask of T142477: Improve mobile recommendations in Android app, as Declined.
Mar 21 2020, 7:09 PM · Wikipedia-Android-App-Backlog, Discovery-ARCHIVED, Goal
mschwarzer closed T149682: Define success criteria for performing A/B tests as Declined.
Mar 21 2020, 7:09 PM · Discovery-ARCHIVED, Goal
mschwarzer closed T155101: define languages for citolytics A/B test as Declined.
Mar 21 2020, 7:08 PM · Article-Recommendation

Jun 6 2017

mschwarzer added a comment to T142009: Related Pages recommendations user study design.

@Capt_Swing I'm very interested to talk to you about it. But I guess this ticket is the wrong place for it. I already sent an email to you a few weeks ago: ms (a) mieo.de.

Jun 6 2017, 6:53 AM · Reading Epics (AI Based Reading Recommendations), Design-Research, Reading-UX-Research

May 26 2017

mschwarzer added a comment to T92826: Ready-to-use Docker package for MediaWiki.

...

  1. adding containers for CirrusSearch/Elasticsearch, Graphoid, Mathoid
May 26 2017, 3:43 AM · Wikimania-Hackathon-2017, Services (doing), User-mobrovac, MediaWiki-Containers, MediaWiki-Releasing, Wikimedia-Hackathon-2015

May 12 2017

mschwarzer added a comment to T142009: Related Pages recommendations user study design.

@Capt_Swing Will you continue the research on this topic? I'm asking because I plan to do a similar evaluation for a link-based recommendation algorithm, but instead of a user study I want to use the Android app to conduct an online evaluation. (See: T142477)

May 12 2017, 8:28 AM · Reading Epics (AI Based Reading Recommendations), Design-Research, Reading-UX-Research

Apr 20 2017

mschwarzer updated the task description for T142477: Improve mobile recommendations in Android app.
Apr 20 2017, 1:57 AM · Wikipedia-Android-App-Backlog, Discovery-ARCHIVED, Goal

Apr 13 2017

mschwarzer updated the task description for T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.
Apr 13 2017, 6:12 AM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal

Mar 29 2017

mschwarzer closed T142555: Recommendations interface, a subtask of T142477: Improve mobile recommendations in Android app, as Resolved.
Mar 29 2017, 3:50 AM · Wikipedia-Android-App-Backlog, Discovery-ARCHIVED, Goal
mschwarzer closed T142555: Recommendations interface as Resolved.

CirrusSearch API is used. See T143197

Mar 29 2017, 3:50 AM · Discovery-ARCHIVED
mschwarzer updated the task description for T149682: Define success criteria for performing A/B tests.
Mar 29 2017, 3:34 AM · Discovery-ARCHIVED, Goal

Mar 27 2017

mschwarzer added a comment to T159521: Lost Wikitech 2FA details, recovery needed.

@bd808 Thanks. I added the file:

Mar 27 2017, 1:48 AM · User-bd808, Cloud-Services

Mar 11 2017

mschwarzer added a comment to T159521: Lost Wikitech 2FA details, recovery needed.

@Aklapper Is there any way to proceed with this?

Mar 11 2017, 1:44 PM · User-bd808, Cloud-Services

Mar 3 2017

mschwarzer added a comment to T159521: Lost Wikitech 2FA details, recovery needed.

I mixed something up. The phab account is linked to my MediaWiki profile. But yes, the wikitech user page refers to a ticket of this phab account. Sorry for the confusion.

Mar 3 2017, 3:31 PM · User-bd808, Cloud-Services
mschwarzer created T159521: Lost Wikitech 2FA details, recovery needed.
Mar 3 2017, 11:26 AM · User-bd808, Cloud-Services

Mar 1 2017

mschwarzer updated the task description for T154592: Project Progress (internal).
Mar 1 2017, 9:10 AM · Discovery-ARCHIVED

Feb 4 2017

mschwarzer added a comment to T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.

@dcausse Thank you very much! In MediaWiki everything seems to work correctly. However, in the Android app it does not work. I cannot use citolytics-en.wmflabs.org as mediaWikiBaseUri / API endpoint. I keep getting these error messages when opening a Wiki article from within the Android app:

Feb 4 2017, 1:32 PM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal

Jan 29 2017

mschwarzer added a comment to T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.

Now you should have SSH access to hadoop000.math.eqiad.wmflabs. The ES dumps are located in /srv/wikisim/data/results/:

Jan 29 2017, 10:11 PM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal
mschwarzer added a comment to T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.

I'll prepare the ES bulk dumps for enwiki, simplewiki and ndswiki and upload them to hadoop000.math.eqiad:/srv/wikisim/data/results/.

Jan 29 2017, 10:09 AM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal

Jan 27 2017

mschwarzer updated the task description for T154592: Project Progress (internal).
Jan 27 2017, 4:37 PM · Discovery-ARCHIVED
mschwarzer added a comment to T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.
  • I uploaded a result file to one of our lab instances (hadoop000.math.eqiad). What would be the best approach so you can access it for testing?
  • For the Oozie workflow integration I already prepared a PySpark script that reads the data from HDFS and sends bulk updates to ES ( https://gerrit.wikimedia.org/r/#/c/334130/4/oozie/citolytics/transferCitolyticsToES.py - it mainly reuses the code from the popularity_score script). If this script is not suitable for testing, I can also prepare data in the elastic bulk format.
  • Regarding languages, it depends on what would be the simplest way for testing. I can generate recommendations for only a single language but also for more or all that are available as XML dumps.
Jan 27 2017, 12:23 PM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal
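
For readers unfamiliar with the "elastic bulk format" mentioned in the comment above, here is a minimal sketch of how recommendations could be serialized as Elasticsearch bulk update lines. It is not the transferCitolyticsToES.py script. The function name `to_bulk_updates`, the field name `citolytics`, and the shape of the input dictionary are assumptions made for illustration only.

```python
import json


def to_bulk_updates(recommendations, field="citolytics"):
    """Serialize {page_id: [related pages]} as an Elasticsearch bulk body.

    The bulk API expects newline-delimited JSON: one action line followed by
    one payload line per document, and a trailing newline at the end.
    The field name and input shape are hypothetical.
    """
    lines = []
    for page_id, related in recommendations.items():
        # Action line: partial update of an existing document by its ID.
        lines.append(json.dumps({"update": {"_id": str(page_id)}}))
        # Payload line: the fields to merge into that document.
        lines.append(json.dumps({"doc": {field: related}}))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    sample = {"12": [{"title": "Example page", "score": 0.5}]}
    print(to_bulk_updates(sample), end="")
```

A file in this shape can be sent to the `_bulk` endpoint of an index. The index name and document IDs used by CirrusSearch are not given in this feed, so they are left out here.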

Jan 26 2017

mschwarzer added a comment to T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.

@dcausse What needs to be done after the code review? Or what are the next steps to get the code deployed?

Jan 26 2017, 10:58 AM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal

Jan 25 2017

mschwarzer added a comment to T155101: define languages for citolytics A/B test.

@Physikerwelt The uncompressed results (enwiki) are around 50 GB in size (~10 GB compressed). Other languages will be smaller. So they won't fit on GitHub (max. 2 GB per file) or OneDrive. For now, I'll start uploading them to a lab instance.

Jan 25 2017, 5:48 PM · Article-Recommendation
mschwarzer added a comment to T111925: Can not use git-deploy from tin.eqiad.wmnet to labnodepool1001.eqiad.wmnet.

Hi. I'm having the same problem, when using Puppet to install role::elasticsearch::cirrus on a labs instance (hadoop000.math.eqiad.wmflabs). What should I git clone? Or is there any other work-around?

Jan 25 2017, 3:17 PM · RelEng-Archive-FY201718-Q1, OKR-Work, Patch-For-Review, Cloud-VPS, Continuous-Integration-Scaling, Salt, Deployments
mschwarzer removed a watcher for Tracking-Neverending: mschwarzer.
Jan 25 2017, 10:54 AM
mschwarzer removed a watcher for Discovery-ARCHIVED: mschwarzer.
Jan 25 2017, 10:52 AM

Jan 24 2017

mschwarzer added a comment to T155101: define languages for citolytics A/B test.

Where can I upload the data? The data from all requested wikis won't fit on our labs instances.

Jan 24 2017, 2:56 PM · Article-Recommendation
mschwarzer updated the task description for T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.
Jan 24 2017, 2:03 PM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal

Jan 12 2017

mschwarzer updated the task description for T154592: Project Progress (internal).
Jan 12 2017, 2:22 PM · Discovery-ARCHIVED

Jan 5 2017

mschwarzer added a project to T154592: Project Progress (internal): Discovery-ARCHIVED.
Jan 5 2017, 10:58 AM · Discovery-ARCHIVED

Jan 4 2017

mschwarzer created T154592: Project Progress (internal).
Jan 4 2017, 4:34 PM · Discovery-ARCHIVED

Dec 30 2016

mschwarzer added a comment to T151861: Enable 'analytics_cluster' role on Labs instance.

Thanks! Can you recommend any Oozie starting point? Is there already a workflow that uses Wikipedia XML dumps? Or one that writes to ES?

Dec 30 2016, 5:47 PM · Analytics-Kanban, MediaWiki-Vagrant

Dec 20 2016

mschwarzer added a comment to T125393: Implement A/B test to measure CirrusSearch "opening_text" performance..

Can you provide the database schema where the data is stored? Then, I can create a query for the aggregation.

Dec 20 2016, 2:45 PM · Patch-For-Review, Mobile-App-Android-Sprint-75-Rhenium, Wikipedia-Android-App-Backlog
mschwarzer added a comment to T151861: Enable 'analytics_cluster' role on Labs instance.

Thanks for setting it up.

Dec 20 2016, 10:18 AM · Analytics-Kanban, MediaWiki-Vagrant

Dec 19 2016

mschwarzer created T153641: How to use Wikipedia EventLogging schemas in Vagrant setup?.
Dec 19 2016, 11:05 AM · Analytics-Kanban, Analytics, MediaWiki-extensions-EventLogging, MediaWiki-Vagrant

Dec 17 2016

mschwarzer added a comment to T151861: Enable 'analytics_cluster' role on Labs instance.

@Physikerwelt @Ottomata I was able to run Flink jobs on YARN (see https://wikitech.wikimedia.org/wiki/Flink ). However, I could not enable Oozie / Hive using these instructions:

Dec 17 2016, 5:20 PM · Analytics-Kanban, MediaWiki-Vagrant

Dec 13 2016

mschwarzer added a comment to T151861: Enable 'analytics_cluster' role on Labs instance.

Without having HDFS mounted Oozie fails, because it cannot access HDFS:

Dec 13 2016, 3:53 PM · Analytics-Kanban, MediaWiki-Vagrant

Dec 12 2016

mschwarzer added a comment to T151861: Enable 'analytics_cluster' role on Labs instance.

Same problem when only enabling the hadoop role :/

Dec 12 2016, 2:22 PM · Analytics-Kanban, MediaWiki-Vagrant
mschwarzer added a comment to T151861: Enable 'analytics_cluster' role on Labs instance.

Vagrant seems to call the lxc-attach help function:

Dec 12 2016, 1:40 PM · Analytics-Kanban, MediaWiki-Vagrant
mschwarzer added a comment to T151861: Enable 'analytics_cluster' role on Labs instance.

Due to the error with --provision the Hadoop ports weren't set up correctly:

Dec 12 2016, 1:38 PM · Analytics-Kanban, MediaWiki-Vagrant
mschwarzer added a comment to T151861: Enable 'analytics_cluster' role on Labs instance.

Not rebooting is not really a suitable solution when using the VM for development, since I also need to enable other roles or change port-forwarding.

Dec 12 2016, 1:08 PM · Analytics-Kanban, MediaWiki-Vagrant

Dec 1 2016

mschwarzer added a comment to T151861: Enable 'analytics_cluster' role on Labs instance.
mschwarzer@mlp:/srv/mediawiki-vagrant$ vagrant --version
Vagrant 1.7.4
Dec 1 2016, 5:39 PM · Analytics-Kanban, MediaWiki-Vagrant

Nov 29 2016

mschwarzer created T151861: Enable 'analytics_cluster' role on Labs instance.
Nov 29 2016, 10:35 AM · Analytics-Kanban, MediaWiki-Vagrant

Nov 21 2016

mschwarzer added a comment to T148833: Use Android Analytics framework for collecting click through-data.

@Physikerwelt The data is currently not available, but I already requested the release (See https://phabricator.wikimedia.org/T125393 ). As soon as the data becomes public I'll do the analysis.

Nov 21 2016, 2:40 PM · Wikipedia-Android-App-Backlog, Discovery-ARCHIVED, Goal

Nov 15 2016

mschwarzer added a comment to T125393: Implement A/B test to measure CirrusSearch "opening_text" performance..

@EBernhardson @mpopov Any news on releasing the data?

Nov 15 2016, 11:04 AM · Patch-For-Review, Mobile-App-Android-Sprint-75-Rhenium, Wikipedia-Android-App-Backlog

Nov 3 2016

mschwarzer added a comment to T149805: Clarify WMF involvement in "Improve mobile recommendations in Android app".

@Deskana Most of the work regarding CirrusSearch and Android is already done. Thus, I think we can keep the work load of your team at a minimum. In other words, I would be happy to do as much work as possible.

Nov 3 2016, 8:15 AM · Discovery-ARCHIVED, Goal

Nov 2 2016

mschwarzer updated subscribers of T149805: Clarify WMF involvement in "Improve mobile recommendations in Android app".

As @leila said in T143197#2752764, this experiment requires someone in the WMF to "own" it. In order to make that happen, I would like to know how we can support that or who to contact.

Nov 2 2016, 2:29 PM · Discovery-ARCHIVED, Goal
mschwarzer created T149805: Clarify WMF involvement in "Improve mobile recommendations in Android app".
Nov 2 2016, 2:29 PM · Discovery-ARCHIVED, Goal

Nov 1 2016

mschwarzer renamed T142477: Improve mobile recommendations in Android app from Use citolytics for mobile recommendations to Improve mobile recommendations in Android app.
Nov 1 2016, 12:58 PM · Wikipedia-Android-App-Backlog, Discovery-ARCHIVED, Goal
mschwarzer created T149682: Define success criteria for performing A/B tests.
Nov 1 2016, 12:51 PM · Discovery-ARCHIVED, Goal
mschwarzer added a comment to T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.

@Nuria Thanks for the clarification. I'll review the project and update the corresponding tickets.

Nov 1 2016, 11:21 AM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal

Oct 28 2016

mschwarzer added a comment to T125393: Implement A/B test to measure CirrusSearch "opening_text" performance..

@EBernhardson Thanks for pointing to the spreadsheet. It would be really great if you could make the (anonymized) raw data available so that we can prepare our study.

Oct 28 2016, 8:47 AM · Patch-For-Review, Mobile-App-Android-Sprint-75-Rhenium, Wikipedia-Android-App-Backlog
mschwarzer added a comment to T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.

@leila Thanks for your questions!

Oct 28 2016, 8:44 AM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal

Oct 27 2016

mschwarzer added a comment to T125393: Implement A/B test to measure CirrusSearch "opening_text" performance..

Is the outcome (raw data/evaluation) of the A/B test still available? We would like to use it as reference for our Citolytics A/B test.

Oct 27 2016, 7:07 AM · Patch-For-Review, Mobile-App-Android-Sprint-75-Rhenium, Wikipedia-Android-App-Backlog

Oct 26 2016

mschwarzer added a comment to T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.

@Physikerwelt do you have access to WMF resources where we can store the recommendations?

Oct 26 2016, 7:00 AM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal

Oct 25 2016

mschwarzer added a comment to T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.

@dcausse I currently do not have access to the analytics cluster. Is it possible to upload it somewhere else?

Oct 25 2016, 5:29 PM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal
mschwarzer added a comment to T148833: Use Android Analytics framework for collecting click through-data.

@Dbrant Yes, that's correct!

Oct 25 2016, 5:08 PM · Wikipedia-Android-App-Backlog, Discovery-ARCHIVED, Goal
mschwarzer added a comment to T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.

@Physikerwelt The JSON output for top-10 recommendations (including scores) is around 2GB in size (without scores 1.3 GB).

Oct 25 2016, 3:57 PM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal
mschwarzer updated the task description for T142477: Improve mobile recommendations in Android app.
Oct 25 2016, 9:37 AM · Wikipedia-Android-App-Backlog, Discovery-ARCHIVED, Goal
mschwarzer added a comment to T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.

@dcausse The transferToES.py script should work to write the JSON data from HDFS to ES. But what would be the best general approach to get the Citolytics recommendations into the CirrusSearch ES instance?

Oct 25 2016, 9:34 AM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal

Oct 22 2016

mschwarzer renamed T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index from Develop a Citolytics MediaWiki extension to Integrate Citolytics to CirrusSearch MediaWiki extension.
Oct 22 2016, 11:12 AM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal
mschwarzer renamed T148833: Use Android Analytics framework for collecting click through-data from Use Analytics framework for collecting click through-data to Use Android Analytics framework for collecting click through-data.
Oct 22 2016, 11:00 AM · Wikipedia-Android-App-Backlog, Discovery-ARCHIVED, Goal
mschwarzer added a comment to T148833: Use Android Analytics framework for collecting click through-data.

@Dbrant How did you evaluate your morelike A/B testing? ( https://phabricator.wikimedia.org/T125393 ) Is it possible to re-use your system for Citolytics?

Oct 22 2016, 12:18 AM · Wikipedia-Android-App-Backlog, Discovery-ARCHIVED, Goal
mschwarzer updated the task description for T148833: Use Android Analytics framework for collecting click through-data.
Oct 22 2016, 12:06 AM · Wikipedia-Android-App-Backlog, Discovery-ARCHIVED, Goal

Oct 21 2016

mschwarzer updated the task description for T148833: Use Android Analytics framework for collecting click through-data.
Oct 21 2016, 1:54 PM · Wikipedia-Android-App-Backlog, Discovery-ARCHIVED, Goal
mschwarzer created T148833: Use Android Analytics framework for collecting click through-data.
Oct 21 2016, 1:27 PM · Wikipedia-Android-App-Backlog, Discovery-ARCHIVED, Goal

Sep 27 2016

mschwarzer added a comment to T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.

@dcausse, can you point me to the CirrusSearch process you mentioned for writing data from Hadoop/HDFS to Elasticsearch? I could only find a class for writing to Elasticsearch without HDFS.

Sep 27 2016, 3:02 PM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal

Aug 18 2016

mschwarzer added a comment to T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.

Thanks! This way sounds more feasible. I'll add an extra query prefix (citolytics:) to CirrusSearch.

Aug 18 2016, 3:29 PM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal

Aug 17 2016

mschwarzer updated the task description for T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.
Aug 17 2016, 12:08 PM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal
mschwarzer created T143197: Add a citolytics query prefix that exposes the relatedness information from the elasticsearch index.
Aug 17 2016, 10:22 AM · Discovery-Search (Current work), CirrusSearch, Discovery-ARCHIVED, Goal

Aug 16 2016

mschwarzer added a comment to T142831: Review the current automatic recommendation mechanism.

The Wikipedia API with CirrusSearch extension is used:

Aug 16 2016, 9:18 AM · Discovery-ARCHIVED, CirrusSearch
mschwarzer added a watcher for Discovery-ARCHIVED: mschwarzer.
Aug 16 2016, 9:06 AM

Aug 15 2016

mschwarzer added a comment to T142831: Review the current automatic recommendation mechanism.

Clicks on article recommendations are tracked via the Analytics funnel API:

Aug 15 2016, 11:38 AM · Discovery-ARCHIVED, CirrusSearch

Aug 13 2016

mschwarzer added a comment to T142831: Review the current automatic recommendation mechanism.

The current automatic recommendation mechanism uses the MediaWikiApi ("morelike:"-query).

Aug 13 2016, 9:21 AM · Discovery-ARCHIVED, CirrusSearch
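
To illustrate the "morelike:" query mentioned in the comment above, here is a minimal sketch that builds a MediaWiki search API URL for related pages. It only constructs the URL and makes no request. The function name `related_pages_url`, the default endpoint, and the default limit are assumptions for illustration; the Android app's actual request parameters are not shown in this feed.

```python
from urllib.parse import urlencode


def related_pages_url(title, limit=3, prefix="morelike",
                      api="https://en.wikipedia.org/w/api.php"):
    """Build a search API URL using a CirrusSearch query prefix.

    `prefix` defaults to "morelike". Passing another prefix (for example the
    "citolytics" prefix proposed in T143197) assumes that prefix accepts the
    same "prefix:Title" syntax, which this feed does not confirm.
    """
    params = {
        "action": "query",
        "list": "search",
        "srsearch": f"{prefix}:{title}",
        "srlimit": limit,
        "format": "json",
    }
    return f"{api}?{urlencode(params)}"


if __name__ == "__main__":
    print(related_pages_url("Albert Einstein"))
```

Keeping the prefix as a parameter means the same request shape could be reused to compare two recommendation sources, provided both are exposed as query prefixes.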

Aug 12 2016

mschwarzer added a comment to T142831: Review the current automatic recommendation mechanism.

@Physikerwelt can you tell me where I can find the respective source code?

Aug 12 2016, 6:46 PM · Discovery-ARCHIVED, CirrusSearch

Aug 10 2016

mschwarzer created T142555: Recommendations interface.
Aug 10 2016, 7:48 AM · Discovery-ARCHIVED

Aug 9 2016

mschwarzer added a watcher for Tracking-Neverending: mschwarzer.
Aug 9 2016, 2:04 PM
mschwarzer added a member for Article-Recommendation: mschwarzer.
Aug 9 2016, 2:03 PM