The sole purpose of this extension deployed in production is to build the "Extracted page abstracts for Yahoo" dumps. I think providing such dumps doesn't make sense in 2024:
- Yahoo! doesn't have its own search engine anymore and it's backed by Bing now.
- If Yahoo! or any other company needs abstract dumps, they should either build it themselves (which is quite easy) or use WME. We are not here to serve big tech.
- This dump was made in 2005 when network was expensive and more limited than current internet capacity. These days, people can just download the full dump (current version only) and do any transformations they need.
- The downloads are quite low in numbers. Even those are probably mostly people who are curios or crawlers.
- In the current technologies, if we want to provide "summary" dumps, it's better to use something better than just getting the first x bytes of the article. An LLM based dump would be much more useful than the status quo.
- It is adding non-negligible cost on us:
- Extra database queries
- Maintenance of the extension: For example {T306629}
- A lot of extra complexity in the dump 1.0 infra
---
**Extension archival checklist**:
- On-wiki documentation
-- [X] Archive documentation on `mediawiki.org` (<https://www.mediawiki.org/wiki/Extension:ActiveAbstract>): replace page contents with `{{Archived extension|last revision id before archiving|task=T######}}` (for extensions); replace `T######` with this task's number.
-- [X] If documentation page was translatable, remove `<translate>`, visit Special:PageTranslation, and click "remove from translation" (if you don't have the translation administrator right, ask a [[ https://www.mediawiki.org/wiki/Special:ListUsers/translationadmin | user who does ]]).
-- [X] Update Wikidata item (https://www.wikidata.org/wiki/Q21676088) associated with documentation page
--- [X] add statement `Abandonware (Q281039)` to `instance of (P31)` together with qualifier `start time (P580)` = the YYYY-MM-DD date that you decided to archive extension (generally per edit history)
--- [X] add qualifier `end time (P582)` = the YYYY-MM-DD (same date as above) to `instance of (P31) = MediaWiki extension (Q6805426)`
- Phabricator
-- [x] Mark all Phabricator tasks for the extension either Declined or Invalid. Add a comment pointing to this task when doing so for reference.
-- [X] Archive Phabricator project #ActiveAbstract for the extension.
-- [X] Edit Phabricator project #ActiveAbstract description for the extension with a link to this ticket.
- Translatewiki.net/translations
-- [x] If the extension is deployed on Wikimedia sites, but it is known that it's not going to receive significant feature updates or deployed to new wikis, make sure that its project ids (usually "ext-//extensionname//") appear in the groups/MediaWiki/WikimediaLegacyAgg.yaml file in the translatewiki Gerrit repository and not in WikimediaMainAgg.yaml, WikimediaAdvancedAgg.yaml, etc. (If it also has an api group, it should remain in WikimediaTechnicalAgg.yaml).
-- [x] If the extension is no longer deployed on Wikimedia sites, remove it from all Wikimedia*Agg.yaml. (If it was ever deployed, by this time, it's most likely in WikimediaLegacyAgg.yaml or WikimediaTechnicalAgg.yaml.)
-- [x] If the extension is going to be completely archived and no longer developed, remove it completely from translatewiki.net by making sure that its project IDs don't appear in any of the following files:
--- [x] [[ https://gerrit.wikimedia.org/r/plugins/gitiles/translatewiki/+/refs/heads/master/groups/MediaWiki/ExtensionsAgg.yaml | groups/MediaWiki/ExtensionsAgg.yaml ]]
--- [x] [[ https://gerrit.wikimedia.org/r/plugins/gitiles/translatewiki/+/refs/heads/master/groups/MediaWiki/SkinsAgg.yaml | groups/MediaWiki/SkinsAgg.yaml ]]
--- [x] [[ https://gerrit.wikimedia.org/r/plugins/gitiles/translatewiki/+/refs/heads/master/groups/MediaWiki/mediawiki-extensions.txt | groups/MediaWiki/mediawiki-extensions.txt ]]
--- [x] [[ https://gerrit.wikimedia.org/r/plugins/gitiles/translatewiki/+/refs/heads/master/groups/MediaWiki/mediawiki-skins.txt | groups/MediaWiki/mediawiki-skins.txt ]]
--- [x] [[ https://gerrit.wikimedia.org/r/plugins/gitiles/translatewiki/+/refs/heads/master/repoconfig.yaml | repoconfig.yaml ]]
--- [x] [[ https://gerrit.wikimedia.org/r/plugins/gitiles/translatewiki/+/refs/heads/master/mw-config/validation-exclusion-list.php | mw-config/validation-exclusion-list.php ]]
- Configuration/tests/integrations/etc.
-- [X] Remove tests in [[ https://gerrit.wikimedia.org/g/integration/config | integration/config ]] ([[ https://gerrit.wikimedia.org/r/plugins/gitiles/integration/config/+/master/zuul/layout.yaml | zuul/layout.yaml ]]).
-- [X] Remove manifest/other references from [[ https://gerrit.wikimedia.org/r/plugins/gitiles/mediawiki/vagrant/+/master | mediawiki/vagrant ]] if they exist.
-- [ ] Remove from https://doc.wikimedia.org/cover-extensions/ if present (requires #continuous-integration-infrastructure shell user to delete directly from the server).
- Repositories
-- [X] Empty repository, leaving only the README indicating the repo is archived, and referring to this task by URL.
-- [ ] Remove from [[ https://gerrit.wikimedia.org/r/plugins/gitiles/mediawiki/extensions/+/master | mediawiki/extensions ]] / [[ https://gerrit.wikimedia.org/r/plugins/gitiles/mediawiki/skins/+/master | mediawiki/skins ]] (along with in `.gitmodules`). ([[ https://gerrit.wikimedia.org/r/c/mediawiki/extensions/+/1154871 | patch ]])
-- [ ] Mark the repository read-only in Gerrit (also edit description to prepend "[ARCHIVED]" and a reference to this task, remove all access lists and reparent on `All-Archived-Project`): https://gerrit.wikimedia.org/g/mediawiki/extensions/ActiveAbstract.
-- [x] Deactivate the Diffusion repository for the extension: rEAAB.
-- [ ] Delete the GitHub mirror: https://github.com/wikimedia/mediawiki-extensions-activeabstract.