Algorithmic dangers and transparency -- Best practices
Closed, ResolvedPublic
Actions

Assigned To

Authored By

	Halfak
	Oct 12 2016, 2:12 PM

Description

Type of activity: Scheduled session
Main topic: T147708: Facilitate Wikidev'17 main topic "Artificial Intelligence to build and navigate content"
Timing: Tuesday, January 10th at 1:10PM PST
Location: Room 2
Stream link: https://www.youtube.com/watch?v=w__x1p66y5U
Back channel: #wikimedia-ai
Etherpad: https://etherpad.wikimedia.org/p/devsummit17-AI_ethics

How do we make sure that our filtering and ranking algorithms do not perpetuate biases or cause other types of social problems? What aspects of AIs should we make transparent and what are some good strategies for doing so? In this session, we'll develop a call to action and gather resources for a best practices document

Problem statement

There's no best practices document for not causing problems with your algorithm. What are common problems we can cause? What are users' expectations?

Expected outcome

A document containing prescriptions for transparency around new AI projects. The beginning of a set of guidelines and best practices.

Summary of discussion

There's clear interest, but it seems like we'll probably want a brief summary of the critical algorithms literature as part of a session. We could probably compress a useful overview into less than 10 minutes so that it doesn't dominate the discussion.

Concerns were raised in regards to ORES (@Halfak) and ElasticSearch (@EBernhardson). @Tbayer has been reading some the recent literature. Generally, interest has been signaled (via token and subscriptions) by @Aklapper, @jmatazzoni, @Lydia_Pintscher, @Capt_Swing, @Arlolra, @gpaumier, and @Siznax.

(Updated Nov. 21st, 2016)

Related Objects
Search...

Status	Assigned	Task
Resolved	Qgil	T153007 Technical Collaboration annual plan FY2017-18
Resolved	Qgil	T159313 Draft WMF annual plan program about technical events
Resolved	Qgil	T149300 Future of the Wikimedia Developer Summit
Resolved	• Rfarrand	T153996 Wikimedia Developer Summit 2017: Feedback Survey
Resolved	• Rfarrand	T141926 Wikimedia Developer Summit 2017
Resolved	Qgil	T141938 Prepare a program for Wikimedia Developer Summit 2017 to effectively address current high level movement needs
Resolved	Halfak	T147708 Facilitate Wikidev'17 main topic "Artificial Intelligence to build and navigate content"
Resolved	Halfak	T147929 Algorithmic dangers and transparency -- Best practices

Event Timeline

Halfak created this task.Oct 12 2016, 2:12 PM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptOct 12 2016, 2:12 PM

Halfak mentioned this in T147708: Facilitate Wikidev'17 main topic "Artificial Intelligence to build and navigate content".Oct 12 2016, 2:14 PM

Halfak added a parent task: T147708: Facilitate Wikidev'17 main topic "Artificial Intelligence to build and navigate content".Oct 12 2016, 2:19 PM

Halfak claimed this task.Oct 12 2016, 8:03 PM

• gpaumier subscribed.Oct 12 2016, 8:46 PM

Qgil moved this task from Backlog to Missing proven interest on the Wikimedia-Developer-Summit (2017) board.Oct 13 2016, 11:07 AM

Halfak edited projects, added Machine-Learning-Team (Active Tasks); removed Machine-Learning-Team.Oct 13 2016, 2:49 PM

Halfak moved this task from Parked to Monitor (long term) on the Machine-Learning-Team (Active Tasks) board.

• Capt_Swing subscribed.Oct 28 2016, 10:34 PM

Siznax subscribed.Nov 15 2016, 8:01 PM

Arlolra subscribed.Nov 18 2016, 3:19 AM

Lydia_Pintscher subscribed.Nov 18 2016, 12:47 PM

@Mooeypoo has expressed concerns over tracking algorithmic bias around the Edit-Review-Improvements project. I think that @jmatazzoni and @Pginer-WMF should consider discussing this angle of algorithmic work in relation to ERI since it's a serious user-facing concern.

When I presented on my preliminary results regarding anonymous editor bias in damage detection, @Tbayer raised some good counter-points. I wonder if he'd want to bring that perspective to the dev summit.

I've been working on this idea around T148700: JADE: UI/API for reviewing/refuting how ORES classifies you and your stuff

From a blog I wrote about the idea:

So, I was listening to an NPR show titled "Digging into Facebook's File on You". At some point, there was some casual discussion of laws that some countries in the European Union have re. users' ability to review and correct mistakes in data that is stored about them. This made me realize that ORES needs a good mechanism for you to review how the system classifies you and your stuff.

• jmatazzoni awarded a token.Nov 18 2016, 10:56 PM

Halfak updated the task description. (Show Details)Nov 18 2016, 10:56 PM

Sounds like a fascinating topic.

Tgr mentioned this in T149373: Evaluating the user experience of AI systems .Nov 19 2016, 12:41 AM

• Capt_Swing awarded a token.Nov 19 2016, 1:32 AM

This is certainly of concern to me in search. I'm starting to evaluate ways to build ML re-ranking systems using user click through data as the input, and the ability of users to 'poison the well' so to speak will certainly become an issue to deal with. Would love to talk about how others are dealing with the issue of training with data from user behaviour.

In T147929#2807379, @Halfak wrote:

...

When I presented on my preliminary results regarding anonymous editor bias in damage detection, @Tbayer raised some good counter-points. I wonder if he'd want to bring that perspective to the dev summit.

I will be at the summit and am happy to participate in a session regarding this topic. (It's not my core work area currently, but I have been interested in it both as a volunteer editor who does quite of patrolling on Wikipedia and Wikidata, now with the help of ORES, and as a Wikipedia research topic in general - I also wrote a review of a related paper recently.)
In any case, I agree it's a worthwhile topic for a session at the summit.

Aklapper awarded a token.Nov 21 2016, 12:20 PM

Halfak updated the task description. (Show Details)Nov 21 2016, 3:14 PM

Halfak updated the task description. (Show Details)Nov 21 2016, 3:18 PM

Halfak updated the task description. (Show Details)

nshahquinn-wmf subscribed.Nov 21 2016, 5:45 PM

This has been a hot topic in the media recently (e.g. Cathy O'Neil's book Weapons of Math Destruction has gotten at lot of attention from the press). I don't know a lot about it, but I would really enjoy the chance to learn more.

• ZhouZ awarded a token.Nov 21 2016, 6:29 PM

• ssastry subscribed.Nov 21 2016, 8:17 PM

Niharika subscribed.Nov 22 2016, 4:53 PM

Complements T149373 quite nicely, IMO.

Qgil moved this task from Missing proven interest to Proposed Unconference Sessions on the Wikimedia-Developer-Summit (2017) board.Nov 28 2016, 10:26 AM

https://medium.com/@robot_MD/when-bias-in-product-design-means-life-or-death-ea3d16e3ddb2#.5a1ker5hd seems related.

bd808 moved this task from Proposed Unconference Sessions to To be pre-scheduled on the Wikimedia-Developer-Summit (2017) board.Dec 8 2016, 11:21 PM

Lucie subscribed.Dec 14 2016, 10:50 AM

Halfak updated the task description. (Show Details)Dec 16 2016, 11:17 PM

• jmatazzoni unsubscribed.Dec 16 2016, 11:23 PM

Tgr awarded a token.Dec 23 2016, 12:41 AM

Halfak updated the task description. (Show Details)Dec 28 2016, 8:12 PM

Halfak added a subscriber: • jmatazzoni.

Tarrow subscribed.Jan 4 2017, 12:48 PM

cscott awarded a token.Jan 5 2017, 12:07 AM

To the owner of this session: Here is the link to the session guidelines page: https://www.mediawiki.org/wiki/Wikimedia_Developer_Summit/2017/Session_Guidelines. We encourage you to recruit Note-taker(s) 2(min) and 3(max), Remote Moderator, and Advocate (optional) on the spot before the beginning of your session. Instructions about each role player's task are outlined in the guidelines. The physical version of the role cards will be made available in all the session rooms. Good luck prepping, see you at the summit! :)

Halfak updated the task description. (Show Details)Jan 7 2017, 4:19 AM

Just set up https://etherpad.wikimedia.org/p/devsummit17-AI_ethics. Looking forward to seeing you all in a few days.

I've added the link for the Youtube stream. See you all in person or on IRC tomorrow morning.

Note-taker(s) of this session: Follow the instructions here: https://www.mediawiki.org/wiki/Wikimedia_Developer_Summit/2017/Session_Guidelines#NOTE-TAKER.28S.29 After the session, DO NOT FORGET to copy the relevant notes and summary into a new wiki page following the template here: https://www.mediawiki.org/wiki/Wikimedia_Developer_Summit/2017/Your_Session and also link this from the All Session Notes page: https://www.mediawiki.org/wiki/Wikimedia_Developer_Summit/2017/All_Session_Notes. The EtherPad links are also now linked from the Schedule page (https://www.mediawiki.org/wiki/Wikimedia_Developer_Summit/2017/Schedule) for you!

https://www.mediawiki.org/wiki/Wikimedia_Developer_Summit/2017/AI_ethics

Halfak moved this task from Monitor (long term) to Completed on the Machine-Learning-Team (Active Tasks) board.Feb 7 2017, 7:17 PM

Halfak closed this task as Resolved.Feb 7 2017, 8:31 PM

Algorithmic dangers and transparency -- Best practicesClosed, ResolvedPublicActions