Wiktionary needs usable API
OpenPublic

Description

I can't seem to find a pre-existing bug, but Wiktionary needs a usable API.

Currently Wiktionary relies on MediaWiki's api.php, but that was (largely) built for Wikipedia. A proper Wiktionary API would allow retrieving definitions in a particular language from a language version of Wiktionary. Probably a few other things as well. ;-)


Version: unspecified
Severity: enhancement

bzimport added a subscriber: Unknown Object (MLST).
bzimport set Reference to bz36881.
MZMcBride created this task.Via LegacyMay 15 2012, 11:55 PM
MarkAHershberger added a comment.Via ConduitMay 16 2012, 4:11 PM

This should be a tracking bug. But I don't know of any other issues to put here.

Qgil added a comment.Via ConduitMar 25 2013, 1:02 AM

This idea has been suggested by Siebrand as a potential Google Summer of Code projects at http://www.mediawiki.org/wiki/Mentorship_programs/Possible_projects#Wiktionary_APIs

Does this make sense? Has there been any discussion in the Wiktionary community about specific API needs? I just want to know whether we would have a roughly defined project for a student. If the students should start by going to English Wiktionary and ask then this is not a feasible project proposal for GSOC 2013.

https://www.mediawiki.org/wiki/Summer_of_Code_2013#Project_ideas

If the idea makes sense we would also need at least one mentor.

Bawolff added a comment.Via ConduitMar 25 2013, 1:15 AM

(In reply to comment #2)

This idea has been suggested by Siebrand as a potential Google Summer of Code
projects at
http://www.mediawiki.org/wiki/Mentorship_programs/
Possible_projects#Wiktionary_APIs

Does this make sense? Has there been any discussion in the Wiktionary
community
about specific API needs? I just want to know whether we would have a roughly
defined project for a student. If the students should start by going to
English
Wiktionary and ask then this is not a feasible project proposal for GSOC

  1. https://www.mediawiki.org/wiki/Summer_of_Code_2013#Project_ideas

    If the idea makes sense we would also need at least one mentor.

Note Ive previously tried to do this. Well part of the reason my attempt semi failed was that I was a newbie at the time I would like to state this is not the easiest problem to solve (esp. If you intend to keep wiktionary the same as it is currently without any explicit machine readable annotations)

Bawolff added a comment.Via ConduitMar 25 2013, 1:24 AM

Btw for reference my http://en.wikinews.org/w/index.php?title=User:Bawolff/sandbox/Wiktionary_query (don't view on mobile site)

Its not exactly an api, but does similar things to an api. Part of the reason it sucks so much were naive design choices that were horrid (younger me was stupid. If you read the code don't judge too hard). Anyhow as a result of my experiance with that, I wouldn't reccomend this as a gsoc project unless the student already had quite a bit of proper experiance with parsing.

bzimport added a comment.Via ConduitMar 25 2013, 3:18 AM

wmf.amgine3691 wrote:

Side note: the usual first approach to this is look at existing dictionary api standards. There are a large number of existing, mostly proprietary, systems currently in production using en.Wiktionary mapped to existing standards. There are almost no efforts doing so with other languages.

MarkAHershberger added a comment.Via ConduitMar 25 2013, 3:14 PM

If someone would just implement RFC 2229, that would be awesome. https://tools.ietf.org/html/rfc2229

Alternatively, make the api calls as compatible as possible with that RFC.

bzimport added a comment.Via ConduitMar 28 2013, 12:46 AM

wmf.amgine3691 wrote:

A couple hours doodling for projects using wiktionary content, particularly going to DICT or WordNet, or code discussions on parsing wiktionary content (very popular whinge topic on stackoverflow):

https://github.com/onny/wikidict
http://extensions.libreoffice.org/extension-center/hunspell-is-the-icelandic-spelling-dictionary-project
http://www.trustlet.org/wiki/Wik2dict

(svn checkout http://wik2dict.googlecode.com/svn/trunk/ wik2dict-read-only)

https://code.google.com/p/wikokit/
http://inamidst.com/phenny/modules/wiktionary.py (ircbot module extracting data/metadata from wiktionary)
http://stackoverflow.com/search?q=wiktionary
http://goldendict.org/forum/viewtopic.php?f=5&t=1205 (en.WT for GoldenDict)
http://godlewski.free.fr/wiktionary-dict/
https://www.mediawiki.org/wiki/User:Gautham_shankar/Gsoc

Scholarly works:
http://scholar.google.ca/scholar?hl=en&q=Wiktionary&btnG=&as_sdt=1%2C5&as_sdtp=
http://www.aaaipress.org/Papers/AAAI/2008/AAAI08-137.pdf (Using Wiktionary for Computing Semantic Relatedness)
http://www.ukp.tu-darmstadt.de/data/lexical-resources/wordnet-wiktionary-alignment/

Aklapper added a comment.Via ConduitJun 13 2013, 5:26 PM
  • Bug 21450 has been marked as a duplicate of this bug. ***
Lydia_Pintscher added a comment.Via ConduitSep 24 2014, 1:41 PM

To make this happen Wiktionary needs to store its data in a structured and machine readable format. We have proposals for how to make this happen at https://www.wikidata.org/wiki/Wikidata:Wiktionary/Development. Once that is done the API will be done as well.

mxn added a subscriber: mxn.Via WebNov 24 2014, 9:00 PM
Glaisher added a subscriber: Glaisher.Via WebDec 30 2014, 10:24 AM
GPHemsley added a subscriber: GPHemsley.Via WebDec 30 2014, 12:22 PM
Aklapper added a project: Wiktionary.Via WebMar 7 2015, 4:36 PM
Liuxinyu970226 added a subscriber: Liuxinyu970226.Via WebMar 7 2015, 11:22 PM
MarkAHershberger removed a subscriber: MarkAHershberger.Via EmailMar 8 2015, 10:19 AM
jberkel added a subscriber: jberkel.Via WebMar 31 2015, 8:13 PM
Ricordisamoa awarded a token.Via WebApr 6 2015, 1:33 PM
Ricordisamoa added a subscriber: Ricordisamoa.

Add Comment