FaFlo (F.Flöck)
User

Projects

User does not belong to any projects.

User Details

User Since
Jun 4 2015, 5:00 PM (228 w, 6 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
FaFlo [ Global Accounts ]

Recent Activity

Jul 16 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Sorry for the ambiguous wording. I meant if you are at revision 5 of an article with 10 revisions, and looking at a token that existed in revision 5 (e.g. with the call https://api.wikiwho.net/en/api/v1.0.0-beta/rev_content/OUR_DUMMY_ARTICLE/5/?......), and if that token has been deleted in, say, revision 8, then you would already see that in its "out" list, although you are at revision 5 currently.
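A minimal sketch of the scenario described above, using the dummy revision numbers from the comment. The token record and its field names (`str`, `o_rev_id`, `in`, `out`) are assumptions for illustration only, not a confirmed response schema:

```python
# Hypothetical token record as it might appear when viewing revision 5.
# Field names are assumed for illustration.
token = {"str": "example", "o_rev_id": 2, "in": [], "out": [8]}


def removed_after(token, current_rev_id):
    """Return the revisions in the token's "out" list that come after the viewed revision."""
    return [rev for rev in token["out"] if rev > current_rev_id]


# Viewing revision 5, the later deletion in revision 8 is already visible.
later_deletions = removed_after(token, current_rev_id=5)
```

A client that only wants the state as of the viewed revision would have to ignore such later entries itself.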

Jul 16 2019, 8:14 AM · Who-Wrote-That, Community-Tech

Jul 11 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

@Mooeypoo , is the memory error in the output still occurring in your testing? We tried a fix.

Jul 11 2019, 8:05 AM · Who-Wrote-That, Community-Tech
FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

@ifried Hey Ifried, nice to meet you and good to hear :)

Jul 11 2019, 7:56 AM · Who-Wrote-That, Community-Tech

Jun 26 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Sorry for the late reply. Valid point regarding the memory, will look into it.

Jun 26 2019, 10:49 AM · Who-Wrote-That, Community-Tech

Apr 9 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

@Niharika Alright. I won't be at the Hackathon unfortunately. But I'm in San Francisco in the week of May 13-17, so if you guys are based at the SF offices, I could simply swing by to talk about what is already there and what we could provide. Not strictly necessary, but might be helpful.

Apr 9 2019, 2:58 PM · Who-Wrote-That, Community-Tech

Apr 3 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

@Niharika What is the current status of this project? Do you need any input? Do you still need the token IDs in the output, as you requested?

Apr 3 2019, 4:39 PM · Who-Wrote-That, Community-Tech

Feb 19 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Now that I have had time to look at it: the id="token-805" in the extended HTML is *not* actually the WikiWho token ID, but simply a positional index for the token in that revision. (I do admit that we have to update the documentation to that effect...)
The WhoColor userscript goes through the extended HTML in the conflict and age views and retrieves the respective conflict and age scores by order from the list in the inline model 2.
(the "class name" is simply the user id - or an ad-hoc user hash for IPs - but I guess you figured that out already)
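A rough sketch of the lookup by order described above, assuming the element ids have the form `token-N` and that the scores arrive as a plain list ordered like the tokens. The helper name and the sample scores are hypothetical, and this is not the WhoColor userscript's actual code:

```python
import re


def positional_index(element_id):
    """Extract N from an element id of the form "token-N"."""
    match = re.fullmatch(r"token-(\d+)", element_id)
    if match is None:
        raise ValueError(f"unexpected id: {element_id!r}")
    return int(match.group(1))


# Hypothetical age scores for one revision, in token order.
age_scores = [0.0, 12.5, 3.0]

# The id gives a position in that list, not a persistent token ID.
score = age_scores[positional_index("token-1")]
```

Because the index is positional, the same id can point at a different token in another revision.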

Feb 19 2019, 10:25 AM · Who-Wrote-That, Community-Tech

Feb 15 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Sorry, I didn't see the notification for that message...

Feb 15 2019, 11:33 AM · Who-Wrote-That, Community-Tech

Jan 20 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.
Jan 20 2019, 1:11 PM · Who-Wrote-That, Community-Tech

Jan 16 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

The annotations per token that the WikiWho APIs produce are always for the wiki markup, including all tokens, also those in tables and references. WikiWho does *not* expand templates or anything transcluded, which means that the content of those elements is not annotated for now, only the wiki markup that they are called with. This therefore does not affect tables, references and infoboxes in general, as long as nothing is transcluded. That is the first source of "error", in the sense that the API simply has no annotations for the transcluded content. (It could be added of course, but that would mean a couple more steps, including processing all templates. In practice I would rather add some nice-looking HTML in the frontend that says something along the lines of "could not color this template".)
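As an illustration of what "only the wiki markup that they are called with" means, here is a small sketch that finds template calls in raw wikitext so that a frontend could flag them. The regex and function name are assumptions for this example. It only matches innermost, non-nested `{{...}}` calls and is not how WikiWho or WhoColor handle templates:

```python
import re

# Matches a template call with no nested braces inside it (simplifying assumption).
TEMPLATE_CALL = re.compile(r"\{\{[^{}]*\}\}")


def template_calls(wikitext):
    """Return the template invocations found in the raw wiki markup."""
    return TEMPLATE_CALL.findall(wikitext)


sample = "Text {{Infobox person|name=X}} more text {{cite web|url=u}}"
calls = template_calls(sample)
```

The tokens of each call can be annotated, but whatever the template renders to is outside the annotated text.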

Jan 16 2019, 11:20 AM · Who-Wrote-That, Community-Tech

Jan 7 2019

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Hi, nice to see this is getting traction again, a proper browser extension/better interface/cleaner highlighting would be great!

Jan 7 2019, 4:39 PM · Who-Wrote-That, Community-Tech

Oct 9 2018

FaFlo added a comment to T199121: RFC: Spec for representing multiple content objects per revision (MCR) in XML dumps.

Of course checksums make a lot of sense for countless use cases, including many in research (the mentioned paper was never intended to make a sweeping point to the contrary, but yes, discussion for another time).
And I think MCR is awesome, JFTR.

Oct 9 2018, 4:57 PM · CPT Initiatives (MCR), Multi-Content-Revisions, Multimedia, Core Platform Team Workboards (Done with CPT), TechCom-RFC (TechCom-Approved), Structured Data Engineering, Dumps-Generation, User-ArielGlenn, User-Daniel, Wikidata

Apr 21 2018

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Hi, I can only comment on how we implemented it for api.wikiwho.net, but these are good points in general as well:

  • revisions/"when" vs. "who": Regarding the actual implementation, you get the rev-ids of the origin and change revisions for a token and then, in a second step, fetch the meta-info for those revisions, such as the editor and timestamp. So, like you say, it is not "who" in the first instance; that is just derived.
  • whitespace: we split the text into tokens at whitespace (and other special chars), so changes where someone just adds/removes whitespace without altering other text pieces are not attributed (i.e., whitespaces are not tokens). Alas, if someone were to split a word in two via a whitespace (or, conversely, concatenate two words), we would attribute the "new" tokens to that editor. If we talk about "formatting" in a wiki-markup sense, one would probably have to filter out, ex post, those changes that touched "cosmetic" markup like section headers or hrules, which is doable but trickier. Or simply run the whole thing in parallel on the parsed, front-end text with formatting ignored.
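The whitespace behaviour can be illustrated with a deliberately naive sketch. The tokenizer regex and the bag-of-tokens comparison below are assumptions made for this example and are not WikiWho's actual tokenizer or attribution algorithm:

```python
import re
from collections import Counter


def tokenize(text):
    """Split at whitespace and make every other non-word character its own token."""
    return re.findall(r"\w+|[^\w\s]", text)


def new_tokens(old_text, new_text):
    """Tokens of new_text that cannot be matched to a token of old_text (multiset difference)."""
    remaining = Counter(tokenize(old_text))
    added = []
    for tok in tokenize(new_text):
        if remaining[tok] > 0:
            remaining[tok] -= 1
        else:
            added.append(tok)
    return added


# Pure whitespace changes produce no new tokens ...
only_whitespace = new_tokens("some text here", "some   text \n here")
# ... but splitting a word creates tokens that would be attributed to the editor.
split_word = new_tokens("football match", "foot ball match")
```

Since whitespace never becomes a token, the first edit leaves nothing to attribute, while the second yields two "new" tokens.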
Apr 21 2018, 7:38 AM · Who-Wrote-That, Community-Tech

Jan 17 2018

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Can you tell us why not all languages can be added to the service? Technically speaking, why is the tool dependent on which language is it being used for?

Jan 17 2018, 3:14 PM · Who-Wrote-That, Community-Tech

Jan 13 2018

FaFlo added a comment to T184144: Investigation: Who Wrote That revision search tool.

Hi, as an author of WikiWho/ WhoColor:

  • Great that this is being picked up, I would be happy to be of assistance
  • "Note that unlike bisecting, blaming - essentially, content persistence - is a complex problem, which has been the topic of several research projects in the past. Trying to write a new tool from scratch is probably not a good idea" --> yup, and we have evaluated WikiWho in that regard, showing high accuracy especially also for longer, more complex revision histories, although only for English so far (see the paper)
  • Regarding speed: we are processing the EventStreams of several languages on the fly, that is not an issue. We just don't have any caching layer for the materialized json yet, but that is on the to do list. For the mid-term future (2-3 years), the upkeep and further development of the service is secured at GESIS (my employer) and also the extension to more languages (although maybe not all). But for the long term I also think hosting it at the WMF might make more sense.
Jan 13 2018, 4:03 PM · Who-Wrote-That, Community-Tech

Aug 2 2017

FaFlo closed T161696: Oauth for PAWS fails - presumably because of username change as Resolved.

Seems to work - thanks :)

Aug 2 2017, 11:46 AM · MediaWiki-extensions-OAuth, PAWS

Jul 31 2017

FaFlo added a comment to T161696: Oauth for PAWS fails - presumably because of username change.

any news on this?

Jul 31 2017, 11:50 AM · MediaWiki-extensions-OAuth, PAWS

Mar 30 2017

FaFlo added a comment to T161696: Oauth for PAWS fails - presumably because of username change.

so, is there a solution? sounds like a quick fix would be replacing the name in the paws record. I would also be fine with my user being deleted from paws and to then do a new first login (I don't have anything important on there yet).

Mar 30 2017, 8:46 AM · MediaWiki-extensions-OAuth, PAWS

Mar 29 2017

FaFlo created T161696: Oauth for PAWS fails - presumably because of username change.
Mar 29 2017, 11:29 AM · MediaWiki-extensions-OAuth, PAWS

Mar 2 2016

FaFlo added a comment to T89416: Accuracy review of Wikipedias.

So I'm not certain how "demand" is usually measured, but based on discussions I witnessed, and my understanding of the challenges Wikipedia faces regarding quality, I would concur with @Qgil that content curation and bringing in more eyes to spot errors is something that is certainly helpful for the editor community.

Mar 2 2016, 5:00 PM · Accuracy-Review-of-Wikipedias, Outreachy-Round-12, Google-Summer-of-Code (2016)

Nov 3 2015

FaFlo updated the task description for T116589: Make sure all mentors sign up in Outreachy'11 application system.
Nov 3 2015, 10:33 AM · Outreachy-Round-11

Oct 4 2015

FaFlo added a comment to T89416: Accuracy review of Wikipedias.

Hi, what I can certainly do is provide the mentoring needed regarding the implementation/extension of wikiwho in this setting, plus some general feedback.

Oct 4 2015, 2:50 PM · Accuracy-Review-of-Wikipedias, Outreachy-Round-12, Google-Summer-of-Code (2016)

Oct 2 2015

Restricted Application updated subscribers of T4994: Automatically generated count and list of contributors to an article (authorship tracking).
Oct 2 2015, 4:31 PM · MediaWiki-Page-History

Jun 4 2015

FaFlo added a comment to T89763: wikiwho on pywikibot.

Hi, I only just became aware of this thread. Several things: (i) A collaborator recently updated wikiwho to run with the newest version of the wikimedia-utilities and python3 (see https://github.com/maribelacosta/wikiwho/tree/python3 ). (ii) Be aware that wikiwho is the *only* solution (apart from Luca de Alfaro's A3 algo, which we evaluated as well) that was soundly tested for the accuracy of the provenance attribution, and that the problem is not trivial in many instances. At least, I am not aware of any accuracy testing that might have been done so far for other approaches. This can be critical if the tool is used in a real editing scenario by an end-user. (iii) Not so related but maybe helpful: our API already provides authorship information and you can use that too (example: wikiwho.net/wikiwho/wikiwho_api_api.py?revid=649876382&name=Laura_Bush&format=json&params=author ). (iv) If anyone needs input or has feature requests, we are happy to respond to them. It's just that the whole Phabricator line of communication was not on my radar, but I will keep an eye on it. Otherwise, drop me a talk page message, send an email to f.floeck@gmail.com, or contact me on GitHub.

Jun 4 2015, 5:21 PM · Pywikibot