Research Area: Database Population Workflow
The usefulness of Toolhub hinges on having thorough, current data. Beyond voluntary contributions from tool developers and tool users, it may be interesting to figure out what we can do to proactively fill Toolhub with information.
This may not necessarily be "clean" information -- it would probably be riddled with duplication and the kind of glitches you get when you impose structure on unstructured data. My current idea is that we would funnel data from various sources into a curation workflow where people can accept entries as-is, merge with existing entries, or reject them.
This problem is resolved when:
- There are proposed sources of data (both clean and dirty)
- There is a plan for cleaning up the unclean data (de-duplication, error checks, etc.)
- The social and technical processes involved are understood