Background
The apps request page summaries of 5 sentences. Page Previews requests an 525 character extract. For performance reasons, we want to minimize the length of the extract as much as possible while satisfying pre-existing use-cases.
AC
- If the intro consists of one paragraph, then no more than N sentences of that paragraph are returned.
- If the intro consists of a short paragraph and a list, then it's returned as is.
Notes
- @bmansurov mentioned that CXServer's "segmentation" module can do SBD on HTML input for multiple languages. There could be an opportunity to work across teams to make an NPM library of it for consumption by CXServer and MCS.