- Crawler process that loads toolinfo.json source URLs, validates their content, and populates the Toolhub database
- API endpoint(s) to view crawler status information (run date, errors from run)
  - GET /api/v1/crawler/runs - paginated, filterable list of known runs
  - GET /api/v1/crawler/runs/{id} - run details
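As a rough illustration of how a client might address the two endpoints listed above, here is a minimal Python sketch. The base URL, the helper names (`runs_url`, `run_detail_url`, `fetch_json`), and any query parameter names such as `page` are assumptions made for the example; the task only states that the list is paginated and filterable, not how those options are spelled.

```python
import json
from urllib.parse import urlencode, urljoin
from urllib.request import urlopen

# Assumption: placeholder host for illustration; not taken from the task.
BASE_URL = "https://toolhub.example.org"


def runs_url(base_url, **params):
    """Build the URL for the paginated, filterable list of crawler runs.

    Query parameter names passed in ``params`` (for example ``page``) are
    hypothetical; the task does not spell them out.
    """
    url = urljoin(base_url, "/api/v1/crawler/runs")
    # Sort the parameters so the generated URL is deterministic.
    query = urlencode(sorted(params.items()))
    return f"{url}?{query}" if query else url


def run_detail_url(base_url, run_id):
    """Build the URL for the details of a single crawler run."""
    return urljoin(base_url, f"/api/v1/crawler/runs/{run_id}")


def fetch_json(url, timeout=10):
    """Fetch a URL and decode the response body as JSON."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

With a reachable Toolhub instance, `fetch_json(runs_url(BASE_URL, page=2))` would request a page of the run list and `fetch_json(run_detail_url(BASE_URL, 7))` a single run. The shape of the returned JSON is not described in this task, so the sketch does not assume one.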
Description
Details
Subject | Repo | Branch | Lines +/-
---|---|---|---
crawler: Move create/update logic to custom manager | wikimedia/toolhub | main | +98 -44
api: Endpoints for crawler run status data | wikimedia/toolhub | main | +204 -35
Add a basic crawler | wikimedia/toolhub | main | +653 -116
Status | Subtype | Assigned | Task
---|---|---|---
Resolved | | bd808 | T195679 [Toolhub Milestone 2] Initial API and UI; toolinfo crawler
Resolved | | bd808 | T264810 Toolinfo.json crawler
Resolved | | srishakatux | T264969 User interface to view crawler status information
Event Timeline
Change 639666 had a related patch set uploaded (by BryanDavis; owner: Bryan Davis):
[wikimedia/toolhub@main] Toolinfo storage model and basic API
Change 641559 had a related patch set uploaded (by BryanDavis; owner: Bryan Davis):
[wikimedia/toolhub@main] Add a basic crawler
Change 642131 had a related patch set uploaded (by BryanDavis; owner: Bryan Davis):
[wikimedia/toolhub@main] api: Endpoints for crawler run status data
Change 642131 merged by jenkins-bot:
[wikimedia/toolhub@main] api: Endpoints for crawler run status data
Change 650639 had a related patch set uploaded (by BryanDavis; owner: Bryan Davis):
[wikimedia/toolhub@main] crawler: Move create/update logic to custom manager
Change 650639 merged by jenkins-bot:
[wikimedia/toolhub@main] crawler: Move create/update logic to custom manager
We will need some follow-up work in the future to schedule crawler runs, but the core work is done.