Page MenuHomePhabricator

NikolajLindberg
User

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Sunday

  • Clear sailing ahead.

User Details

User Since
Apr 27 2016, 5:41 PM (259 w, 2 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
NikoLind [ Global Accounts ]

Recent Activity

Oct 1 2020

NikolajLindberg closed T262011: Filter out features ("sentences without digits", etc), a subtask of T261934: Filtering function of text, as Resolved.
Oct 1 2020, 1:04 PM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T262011: Filter out features ("sentences without digits", etc) as Resolved.
Oct 1 2020, 1:04 PM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T261934: Filtering function of text, a subtask of T261929: Indexed Relational Database of Text, as Resolved.
Oct 1 2020, 1:04 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T261934: Filtering function of text as Resolved.
Oct 1 2020, 1:04 PM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T261929: Indexed Relational Database of Text, a subtask of T261928: ☂Wikispeech Recording Manuscript Tool, as Resolved.
Oct 1 2020, 1:03 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T261929: Indexed Relational Database of Text as Resolved.
Oct 1 2020, 1:03 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T261938: Find out how to best extract raw text from Wikipedia articles (probably from a dump file), a subtask of T261928: ☂Wikispeech Recording Manuscript Tool, as Resolved.
Oct 1 2020, 1:03 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T261938: Find out how to best extract raw text from Wikipedia articles (probably from a dump file) as Resolved.
Oct 1 2020, 1:03 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T242421: Draft planning for Manuscript component as Resolved.
Oct 1 2020, 1:03 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg moved T261938: Find out how to best extract raw text from Wikipedia articles (probably from a dump file) from Backlog to Done on the Wikispeech-Jobrunner (Sprint) board.
Oct 1 2020, 10:38 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg moved T261929: Indexed Relational Database of Text from In progress to Done on the Wikispeech-Jobrunner (Sprint) board.
Oct 1 2020, 10:38 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg moved T242421: Draft planning for Manuscript component from In progress to Done on the Wikispeech-Jobrunner (Sprint) board.
Oct 1 2020, 10:38 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector

Sep 17 2020

NikolajLindberg added a subtask for T261928: ☂Wikispeech Recording Manuscript Tool: T261931: Create simplistic web GUI for basic filtering.
Sep 17 2020, 10:58 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg removed a subtask for T261929: Indexed Relational Database of Text: T261931: Create simplistic web GUI for basic filtering.
Sep 17 2020, 10:58 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg edited parent tasks for T261931: Create simplistic web GUI for basic filtering, added: T261928: ☂Wikispeech Recording Manuscript Tool; removed: T261929: Indexed Relational Database of Text.
Sep 17 2020, 10:58 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg added a comment to T262011: Filter out features ("sentences without digits", etc).

There is now a first version of database filtering:

Sep 17 2020, 10:54 AM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T262131: Figure out why two different runs of WikiExtractor.py used different paragraph delimiters?, a subtask of T261938: Find out how to best extract raw text from Wikipedia articles (probably from a dump file), as Resolved.
Sep 17 2020, 10:52 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T262131: Figure out why two different runs of WikiExtractor.py used different paragraph delimiters? as Resolved.
Sep 17 2020, 10:52 AM · Wikispeech-STTS, Wikispeech-Jobrunner, Wikispeech-Speech-Data-Collector

Sep 15 2020

NikolajLindberg added a comment to T261938: Find out how to best extract raw text from Wikipedia articles (probably from a dump file).

Using a XML dump file to extract the raw text of Wikipedia articles seems very hard. Correctly expanding templates and handling links to produce text corresponding to what is seen in the web browser is problematic.

Sep 15 2020, 10:29 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector

Sep 9 2020

NikolajLindberg added a comment to T261938: Find out how to best extract raw text from Wikipedia articles (probably from a dump file).

WikiExtractor.py on github seem to work, as long as you don't use the latest commit, which is broken. You need to use the --templates option.

Sep 9 2020, 10:13 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector

Sep 7 2020

NikolajLindberg added a comment to T261938: Find out how to best extract raw text from Wikipedia articles (probably from a dump file).

There is something called gensim.

Sep 7 2020, 1:55 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T262199: Improve regexp for splitting paragraphs into sentences .
Sep 7 2020, 10:09 AM · Wikispeech-STTS, Wikispeech-Jobrunner, Wikispeech-Speech-Data-Collector
NikolajLindberg added a comment to T262131: Figure out why two different runs of WikiExtractor.py used different paragraph delimiters?.

Somehow, the "same* script had probably been downloaded using different methods:

Sep 7 2020, 8:22 AM · Wikispeech-STTS, Wikispeech-Jobrunner, Wikispeech-Speech-Data-Collector

Sep 5 2020

NikolajLindberg closed T262121: chunkfeat punct as single characters not character sequence as Resolved.
Sep 5 2020, 3:23 PM · Wikispeech-STTS, Wikispeech-Jobrunner, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T262121: chunkfeat punct as single characters not character sequence, a subtask of T261929: Indexed Relational Database of Text, as Resolved.
Sep 5 2020, 3:23 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg updated the task description for T262115: load_db tool should filter articles and sentences before adding to db .
Sep 5 2020, 12:50 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg updated the task description for T261938: Find out how to best extract raw text from Wikipedia articles (probably from a dump file).
Sep 5 2020, 12:21 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg updated the task description for T261938: Find out how to best extract raw text from Wikipedia articles (probably from a dump file).
Sep 5 2020, 12:21 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T262131: Figure out why two different runs of WikiExtractor.py used different paragraph delimiters?.
Sep 5 2020, 12:12 PM · Wikispeech-STTS, Wikispeech-Jobrunner, Wikispeech-Speech-Data-Collector
NikolajLindberg updated the task description for T262115: load_db tool should filter articles and sentences before adding to db .
Sep 5 2020, 10:16 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T262121: chunkfeat punct as single characters not character sequence.
Sep 5 2020, 7:41 AM · Wikispeech-STTS, Wikispeech-Jobrunner, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T262117: generated word frequency table, a subtask of T261929: Indexed Relational Database of Text, as Resolved.
Sep 5 2020, 7:39 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T262117: generated word frequency table as Resolved.
Sep 5 2020, 7:39 AM · Wikispeech-STTS, Wikispeech-Jobrunner, Wikispeech-Speech-Data-Collector
NikolajLindberg created T262117: generated word frequency table.
Sep 5 2020, 6:16 AM · Wikispeech-STTS, Wikispeech-Jobrunner, Wikispeech-Speech-Data-Collector
NikolajLindberg moved T262014: Add tests for lookup from Sprint to Incoming on the Wikispeech-Jobrunner board.
Sep 5 2020, 6:14 AM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T261955: Add source feature: article length in paragraphs , a subtask of T261929: Indexed Relational Database of Text, as Resolved.
Sep 5 2020, 6:13 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T261955: Add source feature: article length in paragraphs as Resolved.
Sep 5 2020, 6:13 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg moved T262015: dbapi: clean up and remove/un-export some exposed functions from Sprint to Incoming on the Wikispeech-Jobrunner board.
Sep 5 2020, 6:13 AM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg moved T262115: load_db tool should filter articles and sentences before adding to db from Sprint to Incoming on the Wikispeech-Jobrunner board.
Sep 5 2020, 6:13 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg moved T262116: new db chunkfeat type: Unicode code block from Sprint to Incoming on the Wikispeech-Jobrunner board.
Sep 5 2020, 6:12 AM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T262116: new db chunkfeat type: Unicode code block.
Sep 5 2020, 6:12 AM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T262115: load_db tool should filter articles and sentences before adding to db .
Sep 5 2020, 6:09 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector

Sep 4 2020

NikolajLindberg closed T262010: dbapi.Add(text.Article) as Resolved.
Sep 4 2020, 9:40 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T262010: dbapi.Add(text.Article), a subtask of T261929: Indexed Relational Database of Text, as Resolved.
Sep 4 2020, 9:40 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T262015: dbapi: clean up and remove/un-export some exposed functions.
Sep 4 2020, 8:43 AM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T262014: Add tests for lookup.
Sep 4 2020, 8:40 AM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T262011: Filter out features ("sentences without digits", etc).
Sep 4 2020, 8:23 AM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T262010: dbapi.Add(text.Article).
Sep 4 2020, 8:21 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector

Sep 3 2020

NikolajLindberg created T261955: Add source feature: article length in paragraphs .
Sep 3 2020, 1:57 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T261930: Add article and its sentences in one single transaction , a subtask of T261929: Indexed Relational Database of Text, as Resolved.
Sep 3 2020, 1:54 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg closed T261930: Add article and its sentences in one single transaction as Resolved.
Sep 3 2020, 1:54 PM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg updated the task description for T261938: Find out how to best extract raw text from Wikipedia articles (probably from a dump file).
Sep 3 2020, 9:47 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T261938: Find out how to best extract raw text from Wikipedia articles (probably from a dump file).
Sep 3 2020, 9:41 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg moved T261928: ☂Wikispeech Recording Manuscript Tool from Backlog to In progress on the Wikispeech-Jobrunner (Sprint) board.
Sep 3 2020, 9:35 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg moved T261929: Indexed Relational Database of Text from Backlog to In progress on the Wikispeech-Jobrunner (Sprint) board.
Sep 3 2020, 9:35 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg moved T261928: ☂Wikispeech Recording Manuscript Tool from Incoming to Sprint on the Wikispeech-Jobrunner board.
Sep 3 2020, 9:35 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg moved T261929: Indexed Relational Database of Text from Incoming to Sprint on the Wikispeech-Jobrunner board.
Sep 3 2020, 9:35 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg moved T261928: ☂Wikispeech Recording Manuscript Tool from Unsorted to Manuscript creator on the Wikispeech-Speech-Data-Collector board.
Sep 3 2020, 9:33 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T261935: Selection algorithm for creating a balanced manuscript from a set of sentences.
Sep 3 2020, 9:32 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T261934: Filtering function of text.
Sep 3 2020, 9:31 AM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T261931: Create simplistic web GUI for basic filtering.
Sep 3 2020, 9:28 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T261930: Add article and its sentences in one single transaction .
Sep 3 2020, 9:25 AM · Wikispeech-Jobrunner, Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T261929: Indexed Relational Database of Text.
Sep 3 2020, 9:24 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg created T261928: ☂Wikispeech Recording Manuscript Tool.
Sep 3 2020, 9:21 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector

Aug 20 2020

NikolajLindberg claimed T242421: Draft planning for Manuscript component.
Aug 20 2020, 7:28 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector
NikolajLindberg added a comment to T242421: Draft planning for Manuscript component.

The component for generating manuscripts for recording sentences (rather than full articles):

Aug 20 2020, 7:26 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Speech-Data-Collector

Apr 3 2020

NikolajLindberg added a comment to T248383: Adapt pronlex to use mysql.

The work of adding MySQL/MariaDB support is about halfway through. There is a temporary branch with a version of pronlex that runs on MariaDB, passing all relevant test of the original Sqlite3 version.

Apr 3 2020, 5:26 PM · Wikispeech-Text-to-Speech, Wikispeech-Jobrunner (Sprint), Wikispeech-STTS
NikolajLindberg claimed T248383: Adapt pronlex to use mysql.
Apr 3 2020, 5:03 PM · Wikispeech-Text-to-Speech, Wikispeech-Jobrunner (Sprint), Wikispeech-STTS

Mar 19 2020

NikolajLindberg closed T159916: Entry status in lexicon db: check that an entry update call to the lexicon db does the right thing with the entry status as Declined.

Declined due to notification of too long inactivity on open task.

Mar 19 2020, 8:55 AM · Wikispeech-Jobrunner, User-HannaLindgren, Wikispeech-Text-to-Speech, Wikispeech-STTS
NikolajLindberg closed T159151: Investigate available standard phonetizers with appropriate licence as Declined.

Declined due to notification of too long inactivity on open task.

Mar 19 2020, 8:55 AM · Wikispeech-Jobrunner, Wikispeech-Text-to-Speech, Wikispeech-STTS

Feb 4 2020

NikolajLindberg added a comment to T199414: (Re)name "the TTS-server".

I don't think we have any objections to the name, so please go ahead.

Feb 4 2020, 8:49 AM · User-Sebastian_Berlin-WMSE, User-LokalProfil, Wikispeech-Jobrunner (Sprint), Wikispeech-WMSE, Wikispeech-Text-to-Speech

Dec 13 2019

NikolajLindberg added a comment to T238308: Run automatic checks on Go code.

Almost done, but a few minor things need some more attention.

Dec 13 2019, 11:36 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Text-to-Speech

Nov 14 2019

NikolajLindberg moved T177831: lexserver host "localhost" in wikispeech.py should be replaced by actual specified host from Incoming to Proposed for next sprint on the Wikispeech-Jobrunner board.
Nov 14 2019, 12:53 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-Text-to-Speech, Wikispeech-STTS
NikolajLindberg moved T238308: Run automatic checks on Go code from Incoming to Proposed for next sprint on the Wikispeech-Jobrunner board.
Nov 14 2019, 12:42 PM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Text-to-Speech
NikolajLindberg added a project to T238308: Run automatic checks on Go code: Wikispeech-STTS.
Nov 14 2019, 10:33 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Text-to-Speech
NikolajLindberg created T238308: Run automatic checks on Go code.
Nov 14 2019, 10:12 AM · Wikispeech-Jobrunner (Sprint), Wikispeech-STTS, Wikispeech-Text-to-Speech

Oct 22 2019

NikolajLindberg moved T134754: Admin GUI for lexicon database from In Progress to Backlog on the Wikispeech-STTS board.
Oct 22 2019, 8:57 AM · Wikispeech, Wikispeech-NLP, Wikispeech-STTS

Jun 11 2018

NikolajLindberg closed T180435: Add lex.Entry.Tag as search criterion to dbapi.Query as Resolved.

Fixed on not yet merged branch (dbapi.Query.TagLike)

Jun 11 2018, 2:20 PM · Wikispeech-Lexicon, Wikispeech-STTS, Wikispeech
NikolajLindberg closed T180436: Add Entry.Tag search criterion to DB search as Resolved.

Fixed on not yet merged branch

Jun 11 2018, 2:19 PM · Wikispeech-Lexicon, Wikispeech-STTS, Wikispeech
NikolajLindberg closed T180436: Add Entry.Tag search criterion to DB search, a subtask of T180435: Add lex.Entry.Tag as search criterion to dbapi.Query, as Resolved.
Jun 11 2018, 2:19 PM · Wikispeech-Lexicon, Wikispeech-STTS, Wikispeech

Dec 11 2017

NikolajLindberg added a comment to T181624: Entry added even if entry string is invalid.

I think incorrect json might map into an empty or incomplete entry struct on the server side? We should take a look at this... maybe there need to be a separate validation step.

Dec 11 2017, 4:14 PM · Wikispeech-STTS, Wikispeech-Lexicon, Wikispeech

Nov 24 2017

NikolajLindberg closed T181298: Add delete_entry call to lexicon server as Resolved.

There is now a first version of API call to the lexicon server for deleting an entry, never to be seen again:

Nov 24 2017, 12:39 PM · Wikispeech-Lexicon, Wikispeech-STTS, Wikispeech
NikolajLindberg created T181298: Add delete_entry call to lexicon server.
Nov 24 2017, 12:37 PM · Wikispeech-Lexicon, Wikispeech-STTS, Wikispeech

Nov 18 2017

NikolajLindberg updated the task description for T180484: Create a pre-release of wikispeech and all sub components, for testing and bug fixes.
Nov 18 2017, 10:00 AM · Wikispeech, Wikispeech-STTS
NikolajLindberg updated the task description for T180484: Create a pre-release of wikispeech and all sub components, for testing and bug fixes.
Nov 18 2017, 9:59 AM · Wikispeech, Wikispeech-STTS

Nov 14 2017

NikolajLindberg updated the task description for T180484: Create a pre-release of wikispeech and all sub components, for testing and bug fixes.
Nov 14 2017, 4:53 PM · Wikispeech, Wikispeech-STTS
NikolajLindberg created T180436: Add Entry.Tag search criterion to DB search.
Nov 14 2017, 10:00 AM · Wikispeech-Lexicon, Wikispeech-STTS, Wikispeech
NikolajLindberg created T180435: Add lex.Entry.Tag as search criterion to dbapi.Query.
Nov 14 2017, 9:57 AM · Wikispeech-Lexicon, Wikispeech-STTS, Wikispeech

Nov 10 2017

NikolajLindberg created T180227: [EntryTag] Test that the lex.Entry.Tag works in the lexicon server: it should be possible to add a lex.Entry.Tag to an entry via the HTTP api.
Nov 10 2017, 1:24 PM · Wikispeech-STTS, Wikispeech-Lexicon, Wikispeech
NikolajLindberg updated the task description for T180203: [WORK IN PROGRESS] Add language tag/locale as a lexicon attribute.
Nov 10 2017, 10:35 AM · Wikispeech, Wikispeech-STTS
NikolajLindberg updated the task description for T180203: [WORK IN PROGRESS] Add language tag/locale as a lexicon attribute.
Nov 10 2017, 10:33 AM · Wikispeech, Wikispeech-STTS

Oct 31 2017

NikolajLindberg added a comment to T159151: Investigate available standard phonetizers with appropriate licence.

Joakim (KTH) mentioned the Montreal Forced Aligner:

Oct 31 2017, 1:26 PM · Wikispeech-Jobrunner, Wikispeech-Text-to-Speech, Wikispeech-STTS

Oct 17 2017

NikolajLindberg closed T177587: [EntryTag] Lexicon database API calls for handling entry tags as Resolved.

No changes in the API have been made, but inserting/updating an lex.Entry should now work with the lex.Entry.Tag field.

Oct 17 2017, 10:14 AM · Wikispeech, Wikispeech-STTS
NikolajLindberg closed T177587: [EntryTag] Lexicon database API calls for handling entry tags , a subtask of T177586: [Story] [EntryTag] Choosing disambiguated variants of a word, as Resolved.
Oct 17 2017, 10:14 AM · Story, Wikispeech-Lexicon, Wikispeech, Wikispeech-STTS
NikolajLindberg closed T177588: [EntryTag] Add EntryTag field to the lex.Entry struct of the lexicon database API as Resolved.

This will probably break things in other places...

Oct 17 2017, 10:13 AM · Wikispeech, Wikispeech-STTS
NikolajLindberg closed T177588: [EntryTag] Add EntryTag field to the lex.Entry struct of the lexicon database API, a subtask of T177586: [Story] [EntryTag] Choosing disambiguated variants of a word, as Resolved.
Oct 17 2017, 10:13 AM · Story, Wikispeech-Lexicon, Wikispeech, Wikispeech-STTS
NikolajLindberg closed T177467: [EntryTag] Add database entry tag unique to a reading of an entry, to distinguish between homographs as Resolved.

Has now been merged with pronlex master branch. This will probably result in errors in other places...

Oct 17 2017, 10:12 AM · Wikispeech-Lexicon, Wikispeech-STTS
NikolajLindberg closed T177467: [EntryTag] Add database entry tag unique to a reading of an entry, to distinguish between homographs, a subtask of T177586: [Story] [EntryTag] Choosing disambiguated variants of a word, as Resolved.
Oct 17 2017, 10:12 AM · Story, Wikispeech-Lexicon, Wikispeech, Wikispeech-STTS

Oct 16 2017

NikolajLindberg added a project to T178287: Could rbg2p report an error if a variable in a rule is not previously defined?: Wikispeech-STTS.
Oct 16 2017, 2:16 PM · Wikispeech-STTS, Wikispeech
NikolajLindberg added a project to T178287: Could rbg2p report an error if a variable in a rule is not previously defined?: Wikispeech.
Oct 16 2017, 2:16 PM · Wikispeech-STTS, Wikispeech