User Details
- User Since
- Apr 27 2016, 5:41 PM (259 w, 2 d)
- Availability
- Available
- LDAP User
- Unknown
- MediaWiki User
- NikoLind [ Global Accounts ]
Oct 1 2020
Sep 17 2020
There is now a first version of database filtering:
Sep 15 2020
Using a XML dump file to extract the raw text of Wikipedia articles seems very hard. Correctly expanding templates and handling links to produce text corresponding to what is seen in the web browser is problematic.
Sep 9 2020
WikiExtractor.py on github seem to work, as long as you don't use the latest commit, which is broken. You need to use the --templates option.
Sep 7 2020
There is something called gensim.
Somehow, the "same* script had probably been downloaded using different methods:
Sep 5 2020
Sep 4 2020
Sep 3 2020
Aug 20 2020
The component for generating manuscripts for recording sentences (rather than full articles):
Apr 3 2020
The work of adding MySQL/MariaDB support is about halfway through. There is a temporary branch with a version of pronlex that runs on MariaDB, passing all relevant test of the original Sqlite3 version.
Mar 19 2020
Declined due to notification of too long inactivity on open task.
Declined due to notification of too long inactivity on open task.
Feb 4 2020
I don't think we have any objections to the name, so please go ahead.
Dec 13 2019
Almost done, but a few minor things need some more attention.
Nov 14 2019
Oct 22 2019
Jun 11 2018
Fixed on not yet merged branch (dbapi.Query.TagLike)
Fixed on not yet merged branch
Dec 11 2017
I think incorrect json might map into an empty or incomplete entry struct on the server side? We should take a look at this... maybe there need to be a separate validation step.
Nov 24 2017
There is now a first version of API call to the lexicon server for deleting an entry, never to be seen again:
Nov 18 2017
Nov 14 2017
Nov 10 2017
Oct 31 2017
Joakim (KTH) mentioned the Montreal Forced Aligner:
Oct 17 2017
No changes in the API have been made, but inserting/updating an lex.Entry should now work with the lex.Entry.Tag field.
This will probably break things in other places...
Has now been merged with pronlex master branch. This will probably result in errors in other places...