Jsalsman (James Salsman)
Statistician and software engineer

User Details

User Since
Feb 12 2015, 11:33 PM (239 w, 3 d)
Availability
Available
IRC Nick
jsalsman
LDAP User
Jsalsman
MediaWiki User
Unknown

James Salsman is a statistician, software engineer, speech recognition specialist, and World Wide Web Consortium Invited Expert in the Device API, Web Apps, and HTML working groups. Salsman has over 20 years of telephony, signal processing, C, Perl, JavaScript, Flash MXML, R, SQL, Tcl/Tk, Java and related experience. Salsman studied computer science and mathematics at Carnegie Mellon University and has worked or corresponded with EnglishCentral.com (Google Ventures), Scientific Learning ("Reading Assistant"), Rosetta Stone, 8DWorld.com ("Wiz World Online"), Transparent.com, DynEd.com, Ordinate.com (Pearson), Carnegie Speech, Netscape, Silicon Graphics, LeapFrog Enterprises, MobiTV, Cisco, Mindsource, and a variety of Silicon Valley startups. Salsman's contributions to open source software include substantial improvements to the phase vocoder algorithm efficiency, upgrades to Tcl and Android, and work to patch and extend MediaWiki. He is currently working on speech recognition for pronunciation evaluation, helping people learn to speak and read well. Other interests include:

Hack the Future (volunteer mentor)
Google Summer of Code (volunteer mentor)

Recent highlights

EF Education First, EF Learning Labs, Shanghai, China, 2013–2014: Improved automatic speech recognition (ASR) systems providing pronunciation assessment for English language learning by diagnosing Adobe Flash-based microphone upload channel faults, immediately reversing a 30% accuracy drop that predated my arrival. Architected, validated, and implemented further pronunciation assessment accuracy improvement using Sensory Fluentsoft ASR with phoneme duration and acoustic scores normalized by establishing a leaderboard of exemplar pronunciations from student uploads, achieving a 24% increase in scores’ agreement with a panel of human judges. Prototyped auditory feedback for pronunciation exercises, designed ASR QA systems, and delivered additional word and phrase score improvements on cross-platform mobile and desktop ASR implementations. Made several other contributions to processes, internal technical documentation, and online learning functions. Used C, JavaScript, sh, C#, and Objective-C on Android, iOS, Linux servers, Windows ASP.NET servers and desktop, and OS X.

Selected publications

J. Salsman (July 2014) “Development challenges in automatic speech recognition for computer assisted pronunciation teaching and language learning” in Proceedings of the Research Challenges in Computer Aided Language Learning Conference (CALL 2014) Antwerp, Belgium: http://talknicer.com/Salsman-CALL-2014.pdf

S. Ronanki, J. Salsman, and L. Bo (December 2012) “Automatic Pronunciation Evaluation and Mispronunciation Detection using CMU Sphinx.” in Proceedings of the Workshop on Speech and Language Processing Tools in Education, pp. 61–68. 24th International Conference on Computational Linguistics (COLING 2012) Mumbai, India: http://www.aclweb.org/anthology/W12-5808

K. Roast and J. Salsman (August 2011) “K3D JavaScript Canvas Library.” Software documentation: http://en.wikibooks.org/wiki/K3D_JavaScript_Canvas_Library

J. Salsman (May 2010) “Asynchronous Microphone Upload – for Pronunciation Assessment, High-Quality, Low-Bandwidth Voice, Speech Transcription, Translation, and Speaker Identification and Verification.” in the Proceedings of the World Wide Web Consortium Workshop on Conversational Applications (W3C CONVAPPS) June 18–19, 2010, Somerset, New Jersey: http://www.w3.org/2010/02/convapps/Papers/asynchMicUpload.pdf

J. Salsman (October 2010) “Teaching computers to teach people to read and speak.” One Laptop Per Child San Francisco Bay Area Community Summit (OLPC-SF 2010) presentation. San Francisco, California: http://talknicer.com/olpcsf.pdf

J. Salsman (2005) “ReadSay PROnounce English System.” Self-published commercial software and instructional modules: http://talknicer.com/pronounce

J. Salsman (August 2004) “Getting Sorted Indices out of lsort.” Tcl Improvement Proposal (TCL TIP) #217. Tcl Developer Xchange: http://www.tcl.tk/cgi-bin/tct/tip/217.html

J. P. Salsman (July 1999) “Form-based Device Input and Upload in HTML.” World Wide Web Consortium Note submission from Cisco Systems, San Jose, California: http://www.w3.org/TR/device-upload

J. Salsman and H. Alvestrand (May 1999) “The Audio/L16 MIME content type.” Internet Engineering Task Force Request for Comments (IETF RFC 2586) http://www.ietf.org/rfc/rfc2586.txt

Recent Activity

Mon, Sep 9

Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

Prompts and user interactions can be added to an intelligibility assessment and remediation system using Tolchorp, the Topic-Lesson-Choice-Response-Prompt (TLCRP) format, mediatype text/tlcrp. Tolchorp is not YAML but is similar.

Mon, Sep 9, 11:57 PM · Technical-Tool-Request

Sun, Sep 8

Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@Brijsri at this juncture we need to decide whether to include anything from https://www.docdroid.net/iWiA1ik/eybenetal2016ieeetransaffectcomput.pdf (which will get us pitch for Chinese, Vietnamese, etc.) and https://arxiv.org/pdf/1905.06533.pdf

Sun, Sep 8, 2:33 AM · Technical-Tool-Request
Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@Halfak I have been unable to recruit http://twiliojob.speakclearly.info but am trying again with a new approach.

Sun, Sep 8, 2:14 AM · Technical-Tool-Request

Jul 17 2019

Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@Aklapper Thanks anyway; if you find out let me know.

Jul 17 2019, 12:30 AM · Technical-Tool-Request

Jul 15 2019

Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@Aklapper I can think of nobody more likely than you to know about graphics standards for colorblindness compatibility, and nobody more likely than @Halfak to know whether there are people who want to work on an interactive speech-in and voice-out chatbot compatible with 0x0 pixel accessibility to the blind. I'm trying to get @Brijsri to transition our pronunciation assessment and intelligibility remediation system from Firebase to something more appropriate for Toolforge Labs, in a way that works with both single Wiktionary words and the phrases in which they appear.

Jul 15 2019, 9:39 PM · Technical-Tool-Request
Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@Aklapper do you want to be able to train dozens of languages or hundreds? I want to know if you know about graphics standards, and if you do, then I would like to talk to you about accessibility. If you don't, I would like to talk about accessibility with the person you think is most likely to know.

Jul 15 2019, 4:07 PM · Technical-Tool-Request

Jul 14 2019

Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@Aklapper if you aren't familiar with the answer, your idea of who would be most likely to know is someone with whom I want to talk about accessibility.

Jul 14 2019, 8:12 PM · Technical-Tool-Request
Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@Aklapper is there a graphics standard saying that visual representations should use heat map color palettes, such as Viridis, which convey the same information in greyscale as they do in color?

Jul 14 2019, 4:15 PM · Technical-Tool-Request
Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@Brijsri here is the full 987 word file:

Jul 14 2019, 8:04 AM · Technical-Tool-Request

Jul 13 2019

Jsalsman reassigned T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary from Jsalsman to Brijsri.

@Brijsri can you use

to continue work on ?

Jul 13 2019, 9:51 PM · Technical-Tool-Request

Jul 11 2019

Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@Halfak and http://englishprofile.org/english-grammar-profile/egp-online ?

Jul 11 2019, 10:48 PM · Technical-Tool-Request

Jul 10 2019

Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@Halfak can you find someone who wants to make a chatbot summarizing and critiquing automatic parses of various speech recognition engines' transcription results using, for example, the LOGON parser? http://erg.delph-in.net/logon

Jul 10 2019, 11:07 PM · Technical-Tool-Request

Jul 9 2019

Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@LucasWerkmeister thank you. https://dev.lingualibre.fr/demo/sandbox.html is apparently the demo pointed to https://dev.lingualibre.fr/demo/ there. I am a huge fan of https://dev.lingualibre.fr/demo/simple.html

Jul 9 2019, 9:45 PM · Technical-Tool-Request
Jsalsman updated subscribers of T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@Lucas_Werkmeister_WMDE @LucasWerkmeister is https://github.com/lingua-libre/RecordWizard/tree/master/modules/ui as cross-platform as the https://voice.sanalabs.com UI which uses https://www.npmjs.com/package/react-mic ?

Jul 9 2019, 4:57 PM · Technical-Tool-Request

Jul 8 2019

Jsalsman claimed T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

Thanks @Aklapper, and for your and @LucasWerkmeister's help on wikitech-l with e.g. https://github.com/lingua-libre/RecordWizard and https://meta.wikimedia.org/wiki/User:Urvaxhi/speechToText.js and my old strategy proposal. I am reviewing the first two.

Jul 8 2019, 6:34 PM · Technical-Tool-Request

Jul 16 2018

Jsalsman updated subscribers of T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@klove who is the correct resource to ask about this?

Jul 16 2018, 5:39 PM · Technical-Tool-Request
Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

If we stick with the Python architecture on a Google Cloud instance like http://sphinxcapt.org is now, we may be less than ten user stories from completion, working https://commons.wikimedia.org/wiki/File:Tasks_for_intelligibility_remediation_peer_learning_architecture.pdf into the new attached database schema.

Jul 16 2018, 3:19 PM · Technical-Tool-Request
Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

I forgot to include Brij's single-line widget for Wiktionary: https://brijmohan.github.io/iremedy/single_line.html

Jul 16 2018, 2:57 PM · Technical-Tool-Request
Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@Aklapper sure, https://github.com/pobedyn/featex is the GSoC 4 feature/phoneme feature extraction code from last year, as published. Since then we've added five more features per phoneme as per slide 13 of http://j.mp/irslides and soon we will have 10 features, adding the nasal flap. We're converting that from Python to Google Firebase, or at least we were before I started having latency problems with it, so we might just stick to Python Flask.

Jul 16 2018, 1:35 PM · Technical-Tool-Request

Jul 13 2018

Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

BAMyers, please accept my apologies for confusing you with someone who wishes to remain anonymous. Thankfully my mistake prevented a larger one.

Jul 13 2018, 2:45 PM · Technical-Tool-Request

Jul 12 2018

Jsalsman updated subscribers of T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

@Brijsri and I are working on this. Our paper from last year just got cited by some speech language pathologist instructional designers at Texas A&M and Sydney: https://psi.engr.tamu.edu/wp-content/uploads/2018/04/hair2018idc.pdf

Jul 12 2018, 5:19 PM · Technical-Tool-Request

Dec 4 2017

Jsalsman added a comment to T141295: Improve the UI design of the Accuracy Review of Wikipedias Flask app.

@Snd96 When you get the phrase list pulled out from https://github.com/SND96/twine-stories/tree/master/twine_tutorials please post it here. Or perhaps even better, can we show the original Twee text to have expert Wikipedians review the lessons in the format in which they might be able to write new lessons?

Dec 4 2017, 5:22 PM · Accuracy-Review-of-Wikipedias

Dec 3 2017

Jsalsman renamed T141295: Improve the UI design of the Accuracy Review of Wikipedias Flask app from Improve the UI design of the app to Improve the UI design of the Accuracy Review of Wikipedias Flask app.
Dec 3 2017, 8:38 PM · Accuracy-Review-of-Wikipedias
Jsalsman placed T141295: Improve the UI design of the Accuracy Review of Wikipedias Flask app up for grabs.

@Aklapper I got that. Sahith (@Snd96 on Github) produced a speech-enabled training system to help people use @prnk28's system last summer, but couldn't get it working for reasons that I have only determined in the past week (not his fault); please see:

Dec 3 2017, 8:37 PM · Accuracy-Review-of-Wikipedias

Sep 7 2017

Jsalsman added a comment to T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .

Please see https://en.wiktionary.org/wiki/Wiktionary:Grease_pit/2017/August#Pronunciation_evaluation_gadget_for_Wiktionary:_GSoC_2017

Sep 7 2017, 10:35 AM · Technical-Tool-Request
Jsalsman renamed T166929: Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary from Microphone upload from browser for reading tutoring with pronunciation assessment in Wikiversity to Microphone upload from browser for reading tutoring with pronunciation assessment in Wiktionary .
Sep 7 2017, 10:34 AM · Technical-Tool-Request

Jan 6 2017

Jsalsman added a comment to T150545: Hash IPs on webrequest table.

Please do not hijack this phab ticket.

Jan 6 2017, 9:00 PM · Research-Archive
Jsalsman added a comment to T150545: Hash IPs on webrequest table.

I'm not sure what a cut & paste error is in this context. The original task
description said, "research doesn't really need raw IPs on webrequest table"

Jan 6 2017, 8:52 PM · Research-Archive
Jsalsman added a comment to T150545: Hash IPs on webrequest table.

How is the value of being able to re-run metrics weighed against the risk of disclosing readers' personally identifying information when determining what to store and what to refrain from storing? Is there any standard used to determine what is an acceptable risk to readers in order to guarantee continuous metrics?

Jan 6 2017, 8:42 PM · Research-Archive

Jan 4 2017

Jsalsman added a comment to T150545: Hash IPs on webrequest table.

How is the 60 days figure derived?

Jan 4 2017, 7:34 PM · Research-Archive

Dec 27 2016

Jsalsman added a comment to T150545: Hash IPs on webrequest table.

@leila, do all of the stakeholders have sufficient resources to evaluate whether there are any remaining dependencies on readers' private log information? Have they been able to do so? E.g., are there any remaining contractual obligations to provide it to third parties? If we sufficiently keep IP addresses beyond the reach of a subpoena or national security letter, would dithering instead of discarding geolocation be sufficient?

Dec 27 2016, 3:21 PM · Research-Archive

Nov 22 2016

Jsalsman added a comment to T150545: Hash IPs on webrequest table.

@Tbayer, I proposed that the IP address and HTTPS proxy information both be
included in the hash. Please correct me if I am wrong, but that would not
be reversible.

Nov 22 2016, 4:37 PM · Research-Archive
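
The proposal in the comment above (hashing the IP address together with the proxy information) could look roughly like the sketch below. It is an illustration only, not the scheme discussed on the task: the function name `hash_client`, the use of HMAC-SHA256, and the secret key are all assumptions. A key is included because an unkeyed hash over the small IPv4 address space can be reversed by simply trying every address.

```python
import hashlib
import hmac
import secrets

# Hypothetical secret key (an assumption of this sketch). In practice it would
# be stored apart from the logs so the digests cannot be recomputed from them.
SECRET_KEY = secrets.token_bytes(32)


def hash_client(ip: str, proxy_info: str, key: bytes = SECRET_KEY) -> str:
    """Return a hex digest covering both the IP address and the proxy information."""
    # Length-prefix each field so ("1.2", "3") and ("1.", "23") hash differently.
    message = b"".join(
        len(part).to_bytes(4, "big") + part
        for part in (ip.encode("utf-8"), proxy_info.encode("utf-8"))
    )
    return hmac.new(key, message, hashlib.sha256).hexdigest()
```

With the same key, the same (IP, proxy) pair always yields the same digest, so per-client counts can still be computed without storing the raw address.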

Sep 18 2016

Jsalsman added a comment to T89761: Create new Python library to serialize Wikimedia Quiz format, GIFT quiz format, and allow round-trip conversions between the two formats. .

Is this for GIFT or Twee ( http://www.maximumverbosity.net/twine/Entweedle/ ) or both or something else?

Sep 18 2016, 12:59 AM · Pywikibot, Outreachy (Round-13), Pywikibot-Wikidata, MediaWiki-extensions-Quiz

Aug 26 2016

Jsalsman added a comment to T143494: Endterm evaluation for "Accuracy review of Wikipedia".

@prnk28 I am still waiting for
https://www.mediawiki.org/wiki/Accuracy_review/docs to be updated with a
non-technical preface for reviewers, and http://tools.wmflabs.org/arowf/
needs a paragraph summary of that. Also /ask should be just before /inspect
there, to emphasize that reviewing is the primary use?

Aug 26 2016, 2:12 PM · Accuracy-Review-of-Wikipedias, Google-Summer-of-Code (2016)
Jsalsman added a comment to T143494: Endterm evaluation for "Accuracy review of Wikipedia".

@Sumit roger wilco; @prnk28's final submission link is
https://priyankamandikal.wordpress.com/2016/08/22/gsoc-2016-project-overview/

Aug 26 2016, 2:07 PM · Accuracy-Review-of-Wikipedias, Google-Summer-of-Code (2016)

Aug 11 2016

Jsalsman added a comment to T139067: Internationalization and localization of message strings for Accuracy Review.

Blocked on still making natural language strings in the code and supporting question generators.

Aug 11 2016, 9:39 PM · I18n, Accuracy-Review-of-Wikipedias
Jsalsman added a parent task for T139067: Internationalization and localization of message strings for Accuracy Review: T139260: Run instances on labs, heroku, PythonAnywhere, and Google App Engine.
Aug 11 2016, 9:38 PM · I18n, Accuracy-Review-of-Wikipedias
Jsalsman added a subtask for T139260: Run instances on labs, heroku, PythonAnywhere, and Google App Engine: T139067: Internationalization and localization of message strings for Accuracy Review.
Aug 11 2016, 9:38 PM · Accuracy-Review-of-Wikipedias

Aug 5 2016

Jsalsman added a comment to T140785: Optional registration system.

@prnk28 do you need to substitute something for slashes to make filenames out of base64?

Aug 5 2016, 6:16 PM · Accuracy-Review-of-Wikipedias
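
On the question above about slashes in base64: one option, sketched here with hypothetical helper names, is Python's URL-safe base64 alphabet, which uses `-` and `_` in place of `+` and `/`, so the encoded string contains no path separator. Whether the project took this route is not stated in the thread.

```python
import base64


def token_to_filename(raw: bytes) -> str:
    """Encode bytes with the URL-safe alphabet, which never produces '/'."""
    return base64.urlsafe_b64encode(raw).decode("ascii")


def filename_to_token(name: str) -> bytes:
    """Invert token_to_filename."""
    return base64.urlsafe_b64decode(name.encode("ascii"))


# Standard base64 of these bytes is "////"; the URL-safe form is "____".
print(token_to_filename(b"\xff\xff\xff"))
```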
Jsalsman triaged T140683: Make the question creation scripts POST to /ask as Low priority.
Aug 5 2016, 6:14 PM · Accuracy-Review-of-Wikipedias
Jsalsman lowered the priority of T139067: Internationalization and localization of message strings for Accuracy Review from High to Normal.
Aug 5 2016, 6:04 PM · I18n, Accuracy-Review-of-Wikipedias
Jsalsman claimed T140684: Handle caching.
Aug 5 2016, 6:03 PM · Accuracy-Review-of-Wikipedias
Jsalsman added a comment to T140781: Implement an archiving mechanism for completed questions.

@prnk28 please log all files on f.close() AND (this is new!:) add all GET accesses to the same log which rotates daily.

Aug 5 2016, 6:02 PM · Accuracy-Review-of-Wikipedias
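
A minimal sketch of the logging request above, assuming the standard library's `TimedRotatingFileHandler` is acceptable for the daily rotation. The logger name, the log line formats, and the helper names are hypothetical, not taken from the project's code.

```python
import logging
from logging.handlers import TimedRotatingFileHandler


def make_daily_logger(path: str) -> logging.Logger:
    """Return a logger whose file rolls over at midnight."""
    logger = logging.getLogger("review.audit")  # hypothetical logger name
    logger.setLevel(logging.INFO)
    if not logger.handlers:  # avoid attaching a second handler on re-import
        handler = TimedRotatingFileHandler(path, when="midnight", backupCount=30)
        handler.setFormatter(logging.Formatter("%(asctime)s %(message)s"))
        logger.addHandler(handler)
    return logger


def write_record(logger: logging.Logger, filename: str, text: str) -> None:
    """Write a record file and log it once the file has been closed."""
    with open(filename, "w", encoding="utf-8") as f:
        f.write(text)
    # The with-block has closed the file by this point.
    logger.info("CLOSED %s", filename)


def log_get(logger: logging.Logger, path: str) -> None:
    """Record one GET access in the same log."""
    logger.info("GET %s", path)
```

In a Flask app, `log_get` could be called from a request hook so that every GET lands in the same file as the close events.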
Jsalsman triaged T140781: Implement an archiving mechanism for completed questions as High priority.
Aug 5 2016, 6:00 PM · Accuracy-Review-of-Wikipedias
Jsalsman added a comment to T139260: Run instances on labs, heroku, PythonAnywhere, and Google App Engine.

Just PythonAnywhere, then set the priority of this to "normal"

Aug 5 2016, 5:59 PM · Accuracy-Review-of-Wikipedias
Jsalsman raised the priority of T139260: Run instances on labs, heroku, PythonAnywhere, and Google App Engine from Normal to High.
Aug 5 2016, 5:59 PM · Accuracy-Review-of-Wikipedias
Jsalsman triaged T140785: Optional registration system as Unbreak Now! priority.
Aug 5 2016, 5:57 PM · Accuracy-Review-of-Wikipedias
Jsalsman added a comment to T141841: Decide on initial set of questions for user testing.

This is done, and I added the one part that isn't into a different task.

Aug 5 2016, 5:55 PM · Accuracy-Review-of-Wikipedias
Jsalsman triaged T141899: Create questions from student edits as Normal priority.
Aug 5 2016, 5:49 PM · Accuracy-Review-of-Wikipedias
Jsalsman added a comment to T141899: Create questions from student edits.

These also need to be filtered by only articles in the top 20% of pageviews to start with.

Aug 5 2016, 5:48 PM · Accuracy-Review-of-Wikipedias
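
The top-20% filter mentioned above can be sketched as follows, assuming pageview counts have already been fetched into a dictionary keyed by article title. The function name and the input shape are assumptions; how the counts are obtained is outside this sketch.

```python
def top_by_pageviews(pageviews: dict, percent: int = 20) -> list:
    """Return the titles in the top `percent` of articles by pageview count."""
    # Integer ceiling of len * percent / 100, avoiding floating-point rounding.
    keep = (len(pageviews) * percent + 99) // 100
    ranked = sorted(pageviews, key=pageviews.get, reverse=True)
    return ranked[:keep]


views = {"A": 100, "B": 50, "C": 10, "D": 5, "E": 1}
print(top_by_pageviews(views))  # ['A']
```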
Jsalsman triaged T142041: Update all the question scripts to use pageview data as High priority.
Aug 5 2016, 5:47 PM · Accuracy-Review-of-Wikipedias
Jsalsman added a comment to T142041: Update all the question scripts to use pageview data.

https://mediawiki.org/wiki/API:Categorymembers
will make the top 20% pageview cutoff easier.

Aug 5 2016, 5:47 PM · Accuracy-Review-of-Wikipedias

Aug 3 2016

Jsalsman added a comment to T141842: Link each GET endpoint from "help" to documentation .

Yes, their button labels and request contents are different.

Aug 3 2016, 3:57 PM · Accuracy-Review-of-Wikipedias
Jsalsman added a comment to T140684: Handle caching.

@prnk28, not yet. I have been working on the hypothesis. If you use the
hypothesis interesting to the machine learning problem, then you will want
to log each file on f.close() instead of weekly, or maybe in addition to
weekly. I will take a look at caching after that.

Aug 3 2016, 3:56 PM · Accuracy-Review-of-Wikipedias

Aug 2 2016

Jsalsman updated the task description for T141842: Link each GET endpoint from "help" to documentation .
Aug 2 2016, 7:30 PM · Accuracy-Review-of-Wikipedias
Jsalsman added a comment to T141842: Link each GET endpoint from "help" to documentation .

Good question! I probably screwed up. answer-a is for the first attempt at answering, answer-i is for the (i)nitial endorse or oppose review, answer-t is for tiebreaking an opposition, er... will edit....

Aug 2 2016, 7:29 PM · Accuracy-Review-of-Wikipedias
Jsalsman added a comment to T141899: Create questions from student edits.

@prnk28 please figure out which of these is best, but try to get just the
~50 most recent, and then the two most recent edits they made in diff
format (in the iframe) for the 100 questions from Wikipedia Education
Program student editors:

Aug 2 2016, 7:08 PM · Accuracy-Review-of-Wikipedias
Jsalsman updated subscribers of T141841: Decide on initial set of questions for user testing.

Are there four question generation scripts? (1) keyword (recently) by age, (2) template/category context by age, (3) diffs by usernames (i.e. student editor username lists), and (4) median F-K readability standard score plus pageview standard score? Can we generate 100 of each and use those 400 instead of the old questions with the bad formatting (headers to the left of the iframes instead of below them)?

Aug 2 2016, 7:03 AM · Accuracy-Review-of-Wikipedias
Jsalsman created T141842: Link each GET endpoint from "help" to documentation .
Aug 2 2016, 6:52 AM · Accuracy-Review-of-Wikipedias
Jsalsman created T141841: Decide on initial set of questions for user testing.
Aug 2 2016, 6:49 AM · Accuracy-Review-of-Wikipedias
Jsalsman created T141840: Project report.
Aug 2 2016, 6:47 AM · Accuracy-Review-of-Wikipedias

Jul 30 2016

Jsalsman added a comment to T141295: Improve the UI design of the Accuracy Review of Wikipedias Flask app.

@prnk28, did Bootstrap code still show up in the stack backtrace after you
reverted?

Jul 30 2016, 5:01 PM · Accuracy-Review-of-Wikipedias

Jul 27 2016

Jsalsman added a comment to T141295: Improve the UI design of the Accuracy Review of Wikipedias Flask app.

@prnk28, is Bootstrap still in your version? If so, did you replace the CSS
directives?

Jul 27 2016, 4:40 PM · Accuracy-Review-of-Wikipedias

Jul 22 2016

Jsalsman added a comment to T140781: Implement an archiving mechanism for completed questions.

For filenumber sets with (-d)one files indicating completion of a
recommendation, we can append them all to a log file that we rotate weekly,
with headers indicating their original filename and timestamp, in -q, -a,
-r, -e, -o, -t, -d order. (Priyanka is that the correct order?)

Jul 22 2016, 3:53 AM · Accuracy-Review-of-Wikipedias
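
The archiving idea above might be sketched like this. It assumes each record is a file named `<number>-<suffix>` inside one directory, which is a guess at the layout based on the suffixes listed in the comment; the header format is invented for illustration, and the weekly rotation of the log itself is left out.

```python
import os
import time

# Suffix order proposed in the comment above.
SUFFIX_ORDER = ["q", "a", "r", "e", "o", "t", "d"]


def archive_set(records_dir: str, number: str, log_path: str) -> int:
    """Append the files of one completed set to log_path; return how many were written."""
    if not os.path.exists(os.path.join(records_dir, f"{number}-d")):
        return 0  # no -d file, so the set is not complete yet
    count = 0
    with open(log_path, "a", encoding="utf-8") as log:
        for suffix in SUFFIX_ORDER:
            name = f"{number}-{suffix}"
            path = os.path.join(records_dir, name)
            if not os.path.exists(path):
                continue  # not every set has every suffix
            stamp = time.strftime(
                "%Y-%m-%dT%H:%M:%SZ", time.gmtime(os.path.getmtime(path))
            )
            with open(path, encoding="utf-8") as f:
                body = f.read()
            # Header carries the original filename and its modification time.
            log.write(f"=== {name} {stamp} ===\n{body}\n")
            count += 1
    return count
```

The sketch leaves the original files in place; removing them after a successful append would be a separate decision.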

Jul 18 2016

Jsalsman added a comment to T139261: Hyperlink URLs from form input with <a href="URL">URL</a> after escape()ing.

P.S. http://stackoverflow.com/a/37960256 has specific provisions for
Unicode, I should say.

Jul 18 2016, 11:43 PM · Accuracy-Review-of-Wikipedias
Jsalsman added a comment to T139261: Hyperlink URLs from form input with <a href="URL">URL</a> after escape()ing.

@Aklapper, @prnk28 is asking about regexps which match URLs with both
unicode and internal matched parentheses (as are relatively more common in
Wikipedia URLs) but also properly match a URL enclosed in external
parentheses. Here is what I found:

Jul 18 2016, 11:41 PM · Accuracy-Review-of-Wikipedias
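
As an illustration of the problem described above (URLs that contain matched parentheses but may also sit inside parentheses), here is one possible approach. It is not the Stack Overflow answer referenced in the thread: it matches greedily, then trims trailing punctuation and any closing parenthesis that has no opener inside the URL, and escapes everything before emitting the link. All names are hypothetical.

```python
import html
import re

# Greedy match: scheme plus any run of characters that cannot end a URL.
URL_RE = re.compile(r"https?://[^\s<>\"]+")


def trim_url(url: str) -> str:
    """Drop trailing punctuation and unbalanced closing parentheses."""
    while url:
        last = url[-1]
        if last in ".,;:!?'":
            url = url[:-1]
        elif last == ")" and url.count(")") > url.count("("):
            url = url[:-1]
        else:
            break
    return url


def linkify(text: str) -> str:
    """HTML-escape text and wrap each URL in an anchor element."""
    out, pos = [], 0
    for m in URL_RE.finditer(text):
        url = trim_url(m.group(0))
        out.append(html.escape(text[pos:m.start()]))
        safe = html.escape(url, quote=True)
        out.append(f'<a href="{safe}">{safe}</a>')
        pos = m.start() + len(url)  # trimmed characters stay in the plain text
    out.append(html.escape(text[pos:]))
    return "".join(out)
```

For example, `linkify("(see https://en.wikipedia.org/wiki/Mercury_(planet))")` keeps `(planet)` inside the link and leaves the final `)` outside it. Non-ASCII characters in URLs are matched because the pattern only excludes whitespace, angle brackets, and double quotes.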
Jsalsman added a comment to T89416: Accuracy review of Wikipedias.

@prnk28 Here is a copy of the email I just sent you which has a list of things. I thought I would paste it here because you might want to make phabricator tasks from many of these things, but many of them you have already done:

Jul 18 2016, 3:32 AM · Accuracy-Review-of-Wikipedias, Outreachy-Round-12, Google-Summer-of-Code (2016)
Jsalsman raised the priority of T89416: Accuracy review of Wikipedias from Normal to High.
Jul 18 2016, 3:23 AM · Accuracy-Review-of-Wikipedias, Outreachy-Round-12, Google-Summer-of-Code (2016)

Jul 11 2016

Jsalsman added a watcher for Datasets-Webstatscollector: Jsalsman.
Jul 11 2016, 12:43 PM

Jul 10 2016

Jsalsman added a comment to T139260: Run instances on labs, heroku, PythonAnywhere, and Google App Engine.

@prnk28 if http://webchat.freenode.net/?channels=#wikimedia-labs can't help you get it running on toollabs, please ask for help on wikitech-l.

Jul 10 2016, 11:48 PM · Accuracy-Review-of-Wikipedias
Jsalsman added a comment to T139067: Internationalization and localization of message strings for Accuracy Review.

@prnk28 as soon as you fix the division by zero error in inspect() in my Thursday checkin ( https://github.com/jsalsman/minireview/commit/376d42656098db23ca6cf57d8ab663679a73482b ) and extend /inspect and its template to handle displaying the most recent N completed recommendations (N=20? 50?) I think it will be ready for internationalization with Babel.

Jul 10 2016, 11:45 PM · I18n, Accuracy-Review-of-Wikipedias
Jsalsman added a comment to T139077: Write scripts to populate /ask from wp backlogs.

@prnk28 Don't forget to make these for

  • using arbitrary strings and regular expressions in place of "recently" and e.g. "last year" with Wikiwho age thresholds of those strings (i.e. from known paid editing incidents such as "hydraulic fracturing" etc.)
  • using diffs from particular usernames (e.g. the Wikipedia Education Program students)
  • your Flesch-Kincaid readability test implementation
Jul 10 2016, 11:40 PM · Accuracy-Review-of-Wikipedias
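
For reference, the Flesch-Kincaid grade level mentioned in the list above is 0.39 × (words per sentence) + 11.8 × (syllables per word) − 15.59. The sketch below is not @prnk28's implementation; the syllable counter in particular is a rough vowel-group heuristic chosen for brevity.

```python
import re


def count_syllables(word: str) -> int:
    """Rough heuristic: count vowel groups and discount a silent trailing 'e'."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    if word.endswith("e") and not word.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def flesch_kincaid_grade(text: str) -> float:
    """Return the Flesch-Kincaid grade level of an English text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text has no sentences or words")
    syllables = sum(count_syllables(w) for w in words)
    return 0.39 * len(words) / len(sentences) + 11.8 * syllables / len(words) - 15.59
```

Very short, simple sentences score below zero with this formula, so the scores are most useful for ranking passages against each other.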

Jul 8 2016

Jsalsman added a comment to T139080: Handle race conditions.

Random is better than round-robin, but we still might want to lock, so this is still valid.

Jul 8 2016, 12:52 PM · Accuracy-Review-of-Wikipedias
Jsalsman added a member for Accuracy-Review-of-Wikipedias: prnk28.
Jul 8 2016, 5:50 AM
Jsalsman raised the priority of T139077: Write scripts to populate /ask from wp backlogs from Normal to High.

@prnk28 is working on this; trying "recently" as a keyword to use with its wikiwho date for stale passages.

Jul 8 2016, 5:44 AM · Accuracy-Review-of-Wikipedias
Jsalsman lowered the priority of T139261: Hyperlink URLs from form input with <a href="URL">URL</a> after escape()ing from High to Low.

Most of this is done in https://github.com/jsalsman/minireview/commit/376d42656098db23ca6cf57d8ab663679a73482b

Jul 8 2016, 5:41 AM · Accuracy-Review-of-Wikipedias
Jsalsman closed T139078: Display diffs and permalinks in an iframe in /ask as Resolved.

done in https://github.com/jsalsman/minireview/commit/376d42656098db23ca6cf57d8ab663679a73482b

Jul 8 2016, 5:39 AM · Accuracy-Review-of-Wikipedias
Jsalsman closed T139079: Summary statistics for /inspect as Resolved.

done in https://github.com/jsalsman/minireview/commit/376d42656098db23ca6cf57d8ab663679a73482b

Jul 8 2016, 5:38 AM · Accuracy-Review-of-Wikipedias

Jul 7 2016

Jsalsman added a comment to T139261: Hyperlink URLs from form input with <a href="URL">URL</a> after escape()ing.

I'll get these. @Aklapper the phabricator tasks are out of synch with the
other tasks in the etherpad and chat logs, but this one is still valid so
let me do this.

Jul 7 2016, 12:27 PM · Accuracy-Review-of-Wikipedias

Jul 4 2016

Jsalsman added a comment to T139077: Write scripts to populate /ask from wp backlogs.

@prnk28, re-write them to POST to /ask instead of just creating files, once
you get the iframeurl form element working. That way you can let other
people create questions from their own methods without giving them write
access to records/

Jul 4 2016, 3:36 PM · Accuracy-Review-of-Wikipedias

Jul 3 2016

Jsalsman added a comment to T139260: Run instances on labs, heroku, PythonAnywhere, and Google App Engine.

Be sure to turn off use_debugger (and less importantly, use_reloader) wherever you run this in production!

Jul 3 2016, 12:09 PM · Accuracy-Review-of-Wikipedias
Jsalsman created T139261: Hyperlink URLs from form input with <a href="URL">URL</a> after escape()ing.
Jul 3 2016, 11:54 AM · Accuracy-Review-of-Wikipedias
Jsalsman moved T89416: Accuracy review of Wikipedias from Backlog to GSoC on the Accuracy-Review-of-Wikipedias board.
Jul 3 2016, 11:50 AM · Accuracy-Review-of-Wikipedias, Outreachy-Round-12, Google-Summer-of-Code (2016)
Jsalsman assigned T139260: Run instances on labs, heroku, PythonAnywhere, and Google App Engine to prnk28.
Jul 3 2016, 11:50 AM · Accuracy-Review-of-Wikipedias
Jsalsman created T139260: Run instances on labs, heroku, PythonAnywhere, and Google App Engine.
Jul 3 2016, 11:46 AM · Accuracy-Review-of-Wikipedias
Jsalsman raised the priority of T139067: Internationalization and localization of message strings for Accuracy Review from Normal to High.
Jul 3 2016, 11:41 AM · I18n, Accuracy-Review-of-Wikipedias
Jsalsman lowered the priority of T139077: Write scripts to populate /ask from wp backlogs from High to Normal.
Jul 3 2016, 11:40 AM · Accuracy-Review-of-Wikipedias
Jsalsman reassigned T139079: Summary statistics for /inspect from Jsalsman to prnk28.

@prnk28 please merge and test https://github.com/jsalsman/minireview/commit/586daf7ab0da847af748a04c258b2b31a8678b62 for this

Jul 3 2016, 7:13 AM · Accuracy-Review-of-Wikipedias
Jsalsman updated the task description for T139079: Summary statistics for /inspect.
Jul 3 2016, 6:04 AM · Accuracy-Review-of-Wikipedias
Jsalsman claimed T139079: Summary statistics for /inspect.

I am working on this one....

Jul 3 2016, 6:01 AM · Accuracy-Review-of-Wikipedias
Jsalsman renamed T139078: Display diffs and permalinks in an iframe in /ask from Display diffs in an iframe in /ask to Display diffs and permalinks in an iframe in /ask.
Jul 3 2016, 12:56 AM · Accuracy-Review-of-Wikipedias

Jul 2 2016

Jsalsman added a comment to T139079: Summary statistics for /inspect.

@prnk28 ...the point being, that the ratio of reviewer identification token strings occurring in file sets with -e instead of -o (and something mumble mumble about -t we will figure out) is the same as the reviewer's agreement score, and, well, you probably need more statistics to figure out the reputation score as we originally designed it from that, but I'm pretty sure that can be part of GET /inspect?has=RevewByWikiGnome543 somehow.

Jul 2 2016, 11:50 PM · Accuracy-Review-of-Wikipedias
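
One reading of the agreement score described above, sketched with in-memory data instead of the project's record files: among the file sets that mention a reviewer's token, the score is the share that ended with an endorsement (-e) as opposed to an opposition (-o). The data layout and function name are assumptions, and the tiebreak (-t) case is ignored because the comment leaves it open.

```python
def agreement_score(file_sets, token):
    """Return endorsed / (endorsed + opposed) over sets mentioning token, or None."""
    endorsed = opposed = 0
    for files in file_sets:
        # files maps a suffix such as "a", "e" or "o" to that file's text.
        if not any(token in text for text in files.values()):
            continue
        if "e" in files:
            endorsed += 1
        elif "o" in files:
            opposed += 1
    total = endorsed + opposed
    return endorsed / total if total else None


sets = [
    {"a": "answered by Gnome1", "e": "endorsed"},
    {"a": "answered by Gnome1", "o": "opposed"},
    {"a": "answered by Gnome2", "e": "endorsed"},
]
print(agreement_score(sets, "Gnome1"))  # 0.5
```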
Jsalsman added a comment to T139079: Summary statistics for /inspect.

@prnk28 HAVE YOU NOTICED HOW FAR AHEAD OF SCHEDULE THE REFACTORING GOT US? :D

Jul 2 2016, 11:45 PM · Accuracy-Review-of-Wikipedias
Jsalsman triaged T138953: Datetime analysis as Normal priority.
Jul 2 2016, 11:39 PM · Accuracy-Review-of-Wikipedias
Jsalsman triaged T139077: Write scripts to populate /ask from wp backlogs as High priority.
Jul 2 2016, 11:38 PM · Accuracy-Review-of-Wikipedias
Jsalsman triaged T139067: Internationalization and localization of message strings for Accuracy Review as Normal priority.
Jul 2 2016, 11:36 PM · I18n, Accuracy-Review-of-Wikipedias
Jsalsman added a comment to T139067: Internationalization and localization of message strings for Accuracy Review.

@prnk28 please see https://pythonhosted.org/Flask-Babel/

Jul 2 2016, 11:36 PM · I18n, Accuracy-Review-of-Wikipedias
Jsalsman triaged T139078: Display diffs and permalinks in an iframe in /ask as High priority.
Jul 2 2016, 11:32 PM · Accuracy-Review-of-Wikipedias
Jsalsman triaged T139080: Handle race conditions as Low priority.
Jul 2 2016, 11:32 PM · Accuracy-Review-of-Wikipedias
Jsalsman added a comment to T139080: Handle race conditions.

@prnk28 what kind of state do we need to store to use http://code.activestate.com/recipes/528936-roundrobin-generator/ ?

Jul 2 2016, 11:31 PM · Accuracy-Review-of-Wikipedias