A set of tools that scan a large text corpus (typically a Wikipedia dump file or similar) and extract sentences suitable for recording.