FaFlo (F.Flöck)
User

Projects

User does not belong to any projects.

User Details

User Since
Jun 4 2015, 5:00 PM (281 w, 8 h)
Availability
Available
LDAP User
Unknown
MediaWiki User
FaFlo [ Global Accounts ]

Recent Activity

Jun 29 2020

FaFlo added a comment to T256533: Identify accounts with very high login rate.

Using this to log in with a bot password for the wikiwho API. Logins should not happen THAT often. Is there data available on if/when the login rate for my account drastically increased? Will try to fix.

Jun 29 2020, 3:29 PM · User-Urbanecm, Wikimedia-General-or-Unknown, Security, Security-Team

May 26 2020

FaFlo added a comment to T253420: Add 'deletedtext' right to user group 'researchers'.

Greetings. I do not really require it for any specific research, so I'd be fine with being removed. Iirc, back in the day, it was meant mostly for retrieving more than 50 revisions with full content via the API (nowadays mainly needed to keep the WikiWho API running smoothly). But I probably can achieve the same with a normal bot account, right?

May 26 2020, 3:07 PM · Wikimedia-Site-requests, MediaWiki-User-management, Trust-and-Safety, Research

Mar 6 2020

FaFlo added a comment to T231990: Spike: Internal links, Files, etc. may not show accurate contributor information [4 hours].

A short update on this: the https://github.com/wikiwho/WhoColor repository can be set up and tested locally by anyone, including the parser. It is the same parser we use for the WhoColor API. Any changes to the parsing there that have been tested by the Wikimedia team and shown to perform better can then be integrated by us into the deployed API. Feel free to open pull requests or apply for collaborator status on this repo.

Mar 6 2020, 12:43 PM · Spike, Community-Tech (Kanban-Q3-2019-20), Who-Wrote-That

Feb 5 2020

FaFlo added a comment to T231990: Spike: Internal links, Files, etc. may not show accurate contributor information [4 hours].

I would like to split this apart into distinct issues:

Feb 5 2020, 11:22 AM · Spike, Community-Tech (Kanban-Q3-2019-20), Who-Wrote-That

Jan 28 2020

FaFlo added a comment to T231990: Spike: Internal links, Files, etc. may not show accurate contributor information [4 hours].

Only now getting around to answering this. Will have a look this week and get back to you.

Jan 28 2020, 9:38 AM · Spike, Community-Tech (Kanban-Q3-2019-20), Who-Wrote-That

Dec 11 2019

FaFlo added a comment to T232131: [BUG] WWT breaks or removes some article elements.

Sorry for not responding to this earlier: Yes, most of these are known, but this overview and examples are certainly helpful. See also: https://github.com/wikiwho/WhoColor/issues
And the assessment of this having to do with the regexes used is also correct.
In regards to fixing it: We do not have the (wo)manpower on our side at this time to fix this. You are certainly invited to become a collaborator (or submit pull requests) on the github repo and we can deploy these changes, if they were tested beforehand, i.e. check if the parsing of the HTML breaks (more) for whatever fixes are administered. So I do not see why your team could not make and test these changes or not estimate it, but maybe I'm missing something.

Dec 11 2019, 10:15 AM · Who-Wrote-That, Community-Tech

Jul 16 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Sorry for the ambiguous wording. I meant if you are at revision 5 of an article with 10 revisions, and looking at a token that existed in revision 5 (e.g. with the call https://api.wikiwho.net/en/api/v1.0.0-beta/rev_content/OUR_DUMMY_ARTICLE/5/?......), and if that token has been deleted in, say, revision 8, then you would already see that in its "out" list, although you are at revision 5 currently.
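The behaviour described above can be sketched roughly as follows. This is only an illustration, not the API's implementation: the field names `o_rev_id`, `in` and `out`, the helper names, and the dummy revision ids 1 to 10 are assumptions chosen to match the example.

```python
from typing import Dict, List


def visible_at(token: Dict, rev_order: List[int], rev_id: int) -> bool:
    """Return True if the token is present in revision rev_id.

    rev_order lists the article's revision ids from oldest to newest.
    """
    pos = {r: i for i, r in enumerate(rev_order)}
    target = pos[rev_id]
    if pos[token["o_rev_id"]] > target:
        return False  # token was not yet written at that revision
    # Removal (False) and re-insertion (True) events up to the target revision.
    events = [(pos[r], False) for r in token["out"]]
    events += [(pos[r], True) for r in token["in"]]
    events = [e for e in events if e[0] <= target]
    # With no event so far, the token is still in its original state: present.
    return max(events)[1] if events else True


def future_removals(token: Dict, rev_order: List[int], rev_id: int) -> List[int]:
    """Revisions in the "out" list that lie after rev_id."""
    pos = {r: i for i, r in enumerate(rev_order)}
    return [r for r in token["out"] if pos[r] > pos[rev_id]]


# Dummy article with revisions 1..10; the token is deleted in revision 8.
rev_order = list(range(1, 11))
token = {"str": "example", "o_rev_id": 3, "in": [], "out": [8]}

print(visible_at(token, rev_order, 5))       # token exists at revision 5
print(future_removals(token, rev_order, 5))  # yet its "out" list already names revision 8
```

So a client that only wants the state as of revision 5 would have to ignore the later entries in "out" (and "in") itself.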

Jul 16 2019, 8:14 AM · Who-Wrote-That, Community-Tech

Jul 11 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

@Mooeypoo , is the memory error in the output still occurring in your testing? We tried a fix.

Jul 11 2019, 8:05 AM · Who-Wrote-That, Community-Tech
FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

@ifried Hey Ifried, nice to meet you and good to hear :)

Jul 11 2019, 7:56 AM · Who-Wrote-That, Community-Tech

Jun 26 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Sorry for the late reply. Valid point regarding the memory, will look into it.

Jun 26 2019, 10:49 AM · Who-Wrote-That, Community-Tech

Apr 9 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

@Niharika Alright. I won't be at the Hackathon unfortunately. But I'm in San Francisco in the week of May 13-17, so if you guys are based at the SF offices, I could simply swing by to talk about what is already there and what we could provide. Not strictly necessary, but might be helpful.

Apr 9 2019, 2:58 PM · Who-Wrote-That, Community-Tech

Apr 3 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

@Niharika what is the current status of this project, do you need any input? Do you need the token ids in the output as you requested?

Apr 3 2019, 4:39 PM · Who-Wrote-That, Community-Tech

Feb 19 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Now that I had time to look at it: the

id="token-805"

in the extended HTML is *not* actually the WikiWho token ID, but simply a positional index for the token for that revision. (I do admit that we have to update the documentation to that effect...)
The WhoColor userscript goes through the extended HTML in the conflict and age views and retrieves the respective conflict and age scores by order from the list in the inline model 2.
(the "class name" is simply the user id - or an ad-hoc user hash for IPs - but I guess you figured that out already)
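A minimal sketch of that lookup by order, assuming the scores arrive as a plain list aligned with token positions. The HTML snippet, the class values and the function name are made up for illustration, and this is not the userscript's actual code.

```python
import re
from typing import List, Tuple

# Matches the positional index in id="token-N" (not a WikiWho token ID).
TOKEN_ID = re.compile(r'id="token-(\d+)"')


def scores_by_position(extended_html: str, scores: List[float]) -> List[Tuple[int, float]]:
    """Pair each positional token index found in the HTML with its score."""
    pairs = []
    for match in TOKEN_ID.finditer(extended_html):
        idx = int(match.group(1))
        pairs.append((idx, scores[idx]))  # lookup is purely by position
    return pairs


# Hypothetical extended HTML: the class holds the user id, the id holds the position.
html = (
    '<span class="123" id="token-0">Hello</span> '
    '<span class="456" id="token-1">world</span>'
)
age_scores = [0.0, 2.5]  # assumed list of per-token scores, in token order

print(scores_by_position(html, age_scores))
```

Because the index is positional, it is only meaningful for the one revision the HTML was generated for.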

Feb 19 2019, 10:25 AM · Who-Wrote-That, Community-Tech

Feb 15 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Sorry, I didn't see the notification for that message...

Feb 15 2019, 11:33 AM · Who-Wrote-That, Community-Tech

Jan 20 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.
Jan 20 2019, 1:11 PM · Who-Wrote-That, Community-Tech

Jan 16 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

The annotations per token that the WikiWho APIs produce are always for the wiki markup, including all tokens, also those in tables and references. It does *not* expand templates or anything transcluded, which means that the content of those elements is not annotated for now, only the wiki markup that they are called with. That does not, therefore, pertain to tables, references and infoboxes in general, as long as nothing is transcluded. I.e. that is the first source of "error" in the sense that the API simply has no annotations for the transcluded content. (It could be added of course, but that would mean a couple more steps, including processing all templates. In practice I would rather add some nice-looking HTML in the frontend that says something along the lines of "could not color this template".)

Jan 16 2019, 11:20 AM · Who-Wrote-That, Community-Tech

Jan 7 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Hi, nice to see this is getting traction again, a proper browser extension/better interface/cleaner highlighting would be great!

Jan 7 2019, 4:39 PM · Who-Wrote-That, Community-Tech

Oct 9 2018

FaFlo added a comment to T199121: RFC: Spec for representing multiple content objects per revision (MCR) in XML dumps.

Of course checksums make a lot of sense for countless use cases, including many in research (the mentioned paper was never intended to make a sweeping point to the contrary, but yes, discussion for another time).
And I think MCR is awesome, JFTR.

Oct 9 2018, 4:57 PM · Structured-Data-Backlog, Platform Team Initiatives (MCR), Multi-Content-Revisions, Multimedia, Platform Team Workboards (Done with CPT), TechCom-RFC (TechCom-RFC-Closed), Structured Data Engineering, Dumps-Generation, User-ArielGlenn, User-Daniel, Wikidata

Apr 21 2018

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Hi, I can only comment on how we implemented it for api.wikiwho.net, but these are good points in general as well:

  • revisions/"when" vs. "who" : Regarding the actual implementation, you get the rev-ids of the origin and change revisions for a token and then fetch the meta-info for that revision in a second step, such as the editor and timestamp. So like you say, it is not "who" in the first instance, that is just derived.
  • whitespace: we split the text into tokens at the whitespace (and other special chars), so you would not attribute changes where someone just adds/removes whitespace (i.e., whitespaces are not tokens) without altering other text pieces. Alas, if someone were to split a word into two via a whitespace (or, conversely, concatenate two words), we would attribute the "new" tokens to that editor. If we talk about "formatting" in a wiki-markup sense, one would probably have to ex-post filter those changes that touched "cosmetic" markup like section headers or hrules, which is doable, but more tricky. Or simply run the whole thing in parallel on the parsed, front-end text with formatting ignored.
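
The whitespace point can be illustrated with a toy tokenizer. This is a simplified stand-in, not the tokenizer WikiWho actually uses; the regex and the function name are assumptions for the sake of the example.

```python
import re
from typing import List

# Words, or single non-space special characters; whitespace itself is never a token.
TOKEN = re.compile(r"\w+|[^\w\s]")


def tokenize(text: str) -> List[str]:
    """Split text into tokens at whitespace and special characters."""
    return TOKEN.findall(text)


before = tokenize("Hello world")
extra_space = tokenize("Hello   world")  # only whitespace was added
split_word = tokenize("Hel lo world")    # a word was split in two

print(before == extra_space)                           # same tokens, nothing to attribute
print([t for t in split_word if t not in before])      # tokens that would count as new
```

The first comparison shows why pure whitespace edits attribute nothing, the second why splitting a word credits the editor with the resulting pieces.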
Apr 21 2018, 7:38 AM · Who-Wrote-That, Community-Tech

Jan 17 2018

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Can you tell us why not all languages can be added to the service? Technically speaking, why is the tool dependent on which language is it being used for?

Jan 17 2018, 3:14 PM · Who-Wrote-That, Community-Tech

Jan 13 2018

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Hi, as an author of WikiWho/WhoColor:

  • Great that this is being picked up, I would be happy to be of assistance
  • "Note that unlike bisecting, blaming - essentially, content persistence - is a complex problem, which has been the topic of several research projects in the past. Trying to write a new tool from scratch is probably not a good idea" --> yup, and we have evaluated WikiWho in that regard, showing high accuracy especially also for longer, more complex revision histories, although only for English so far (see the paper)
  • Regarding speed: we are processing the EventStreams of several languages on the fly, that is not an issue. We just don't have any caching layer for the materialized json yet, but that is on the to do list. For the mid-term future (2-3 years), the upkeep and further development of the service is secured at GESIS (my employer) and also the extension to more languages (although maybe not all). But for the long term I also think hosting it at the WMF might make more sense.
Jan 13 2018, 4:03 PM · Who-Wrote-That, Community-Tech

Aug 2 2017

FaFlo closed T161696: Oauth for PAWS fails - presumably because of username change as Resolved.

Seems to work - thanks :)

Aug 2 2017, 11:46 AM · MediaWiki-extensions-OAuth, PAWS

Jul 31 2017

FaFlo added a comment to T161696: Oauth for PAWS fails - presumably because of username change.

any news on this?

Jul 31 2017, 11:50 AM · MediaWiki-extensions-OAuth, PAWS

Mar 30 2017

FaFlo added a comment to T161696: Oauth for PAWS fails - presumably because of username change.

so, is there a solution? sounds like a quick fix would be replacing the name in the paws record. I would also be fine with my user being deleted from paws and to then do a new first login (I don't have anything important on there yet).

Mar 30 2017, 8:46 AM · MediaWiki-extensions-OAuth, PAWS

Mar 29 2017

FaFlo created T161696: Oauth for PAWS fails - presumably because of username change.
Mar 29 2017, 11:29 AM · MediaWiki-extensions-OAuth, PAWS

Mar 2 2016

FaFlo added a comment to T89416: Accuracy review of Wikipedias.

So I'm not certain how "demand" is usually measured, but based on discussions I witnessed, and my understanding of the challenges Wikipedia faces regarding quality, I would concur with @Qgil that content curation and bringing in more eyes to spot errors is something that is certainly helpful for the editor community.

Mar 2 2016, 5:00 PM · Accuracy-Review-of-Wikipedias, Outreachy-Round-12, Google-Summer-of-Code (2016)

Nov 3 2015

FaFlo updated the task description for T116589: Make sure all mentors sign up in Outreachy'11 application system.
Nov 3 2015, 10:33 AM · Outreachy-Round-11

Oct 4 2015

FaFlo added a comment to T89416: Accuracy review of Wikipedias.

hi, what i certainly can do is to provide mentoring needed regarding the implementation/extension of wikiwho in this setting and some general feedback

Oct 4 2015, 2:50 PM · Accuracy-Review-of-Wikipedias, Outreachy-Round-12, Google-Summer-of-Code (2016)

Jun 4 2015

FaFlo added a comment to T89763: wikiwho on pywikibot.

hi, I just now became aware of this thread. Several things: (i) a collaborator just recently updated wikiwho to run with the newest version of the wikimedia-utilities and python3 (see https://github.com/maribelacosta/wikiwho/tree/python3 ). (ii) Be aware that wikiwho is the *only* solution (apart from Luca de Alfaro's A3 algo that we evaluated as well) that was soundly tested for the accuracy of the provenance attribution and that the solution is not trivial in many instances. Or I'm not aware of accuracy testing that might have been done so far for other approaches. This can be critical if used in a real editing scenario by an end-user. (iii) Not so related but maybe helpful: our API is already giving authorship information and you can use that too; example: wikiwho.net/wikiwho/wikiwho_api_api.py?revid=649876382&name=Laura_Bush&format=json&params=author. (iv) if anyone needs input or has feature requests we are happy to respond to them. it's just that the whole phabricator line of communication was not on my radar, but I will have an eye on it. Else, drop me a talk page message or an email to f.floeck@gmail.com or at github

Jun 4 2015, 5:21 PM · Pywikibot