Data Platform Request Form
Is this a request for a:
- Dataset
- Data Pipeline
- Data Feature
Is this a change to something existing:
- Yes - please provide details of existing datasets/data pipelines (wiki links, Git URL, names of jobs, etc.)
- No
If a new dataset, has this been through the essential metric review? (need link):
- Yes
- No
Please provide the description of your request:
Our list of referrers hasn't been updated since at least the launch of ChatGPT. Because newer referral sources aren't hardcoded into the referrer header parsing, they haven't been appearing in the pageviews datasets. We should have visibility of new referral sources, with these added to our datasets and categorized properly.
Example: chatbot providers (ChatGPT, Perplexity, Claude, Grok, etc.)
Use Case: (Please briefly explain what this feature will be used for):
The addition of new referral sources will help us determine whether AI tools are bringing more readers to our projects.
Ideal Delivery Date:
Q2
What is needed for this feature
- decide which new referral sources to add
- decide whether these should be included in the existing referer_class or created as a new category
- find the source for this data (webrequest)
- add to the hardcoded list?
- add to pageview tables in Hive, Iceberg, and Druid
- data QA
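To make the categorization decision above more concrete, here is a minimal sketch of how a hardcoded list could map a referrer header to a class. It is an illustration only, not the existing parsing code: the domain list, the function name `classify_referrer`, and the `external (ai chatbot)` label are all hypothetical placeholders, and the real list of sources and category naming are still to be decided as part of this request.

```python
from urllib.parse import urlparse

# Hypothetical hardcoded list; the actual sources are still to be decided.
AI_CHATBOT_DOMAINS = {"chatgpt.com", "perplexity.ai", "claude.ai", "grok.com"}


def classify_referrer(referrer):
    """Return an illustrative referrer class for a raw Referer header value."""
    if not referrer:
        return "none"
    # urlparse only yields a hostname when the value looks like a URL.
    host = (urlparse(referrer).hostname or "").lower()
    if not host:
        return "unknown"
    for domain in AI_CHATBOT_DOMAINS:
        # Match the domain itself or any of its subdomains.
        if host == domain or host.endswith("." + domain):
            return "external (ai chatbot)"  # hypothetical new category
    return "external"
```

Matching on the parsed hostname rather than on a substring of the full URL avoids misclassifying unrelated hosts that merely contain a chatbot name. The same function could be reused in data QA to spot-check a sample of webrequest referrers against the values written to the pageview tables.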
Data Feature Checklist
Please link to the following if applicable.
| Document Type | Required? | Document/Link |
| Related PHAB Tickets | Yes | <add link here> |
| Product One Pager | Yes | <add link here> |
| Product Requirements Document (PRD) | Yes | <add link here> |
| Product Roadmap | No | <add link here> |
| Product Planning/Business Case | No | <add link here> |
| Product Brief | No | <add link here> |
| Other Links | No | <add links here> |