AQS 2.0
Open, In Progress, HighPublic1 Estimated Story Points
Actions

Description

Analytics Query Service (AQS) is the software behind the /metrics family of endpoints in RESTBase. It is a read-only HTTP proxy to results served from Cassandra and Druid. It is currently based on a very outdated fork of RESTBase, and has received little updates over the years.

As a part of the goal to sunset RESTBase, AQS needs to be migrated to a bespoke service exposed via the API Gateway.

We propose to break down the rewrite largely along dataset boundaries — similar to the module structure in RESTBase — with a separate project used to implement each.

The services were renamed during development. The names as of Jan 2023 are:

Page Analytics (was pageviews)
Device Analytics (was unique devices)
Edit Analytics (the subset of endpoints previously considered under "wikistats2" that pertain to edits)
Editor Analytics (the subset of endpoints previously considered under "wikistats2" that pertain to editors)
Media Analytics (was mediarequests)
Geo Analytics (was called both geoeditors and editors in different contexts)

The breakdown of endpoints by service can be found here. The remainder of this task description has been left unedited, for comparison.

The resulting services will be proxied by RESTBase and/or the API Gateway (the former to eventually be deprecated in favor of the latter) in order to maintain complete compatibility with the existing API.

The target language for these implementations is Go. While a complete comparison of Javascript/NodeJS and Go is out of scope for this issue, the (simplified) rationale is:

Strong, static typing; Statically typed languages eliminate entire classes of bugs common to dynamic languages, improve security, and making code easier to reason about
Ease of use; Go is more obvious, more explicit, and easier to understand. Complicated concepts like concurrency are easier to get right
Performance; Service latency can be expected to be both lower, but more importantly, more predictable with Go

Overview

Implement the new, stand-alone AQS service(s)
Deploy to k8s
Expose the /metrics hierarchy from the new service(s) using the API Gateway
Switch RESTBase to proxying requests from the old AQS service, to the new k8s-based one
Deprecate the http://{project}/api/rest_v1/metrics resources
Eventually phase out the RESTBase /metrics hierarchy

Solving this will make us progress on multiple fronts: T198901 T262315

NOTE: This will be picked up by Platform Engineering, with support from Analytics.

Related Objects
Search...

View Standalone Graph

This task is connected to more than 200 other tasks. Only direct parents and subtasks are shown here. Use View Standalone Graph to show more of the graph.

Status	Subtype	Assigned	Task
			· · ·
In Progress		None	T262315 <CORE TECHNOLOGY> API Migration & RESTBase Sunset
In Progress		None	T263489 AQS 2.0
Invalid		None	T288156 Create a stand-alone OpenAPI specification for AQS
Resolved		None	T288160 Development and test environments for AQS 2.0 Cassandra services
Resolved		None	T288296 AQS 2.0: Page Analytics Service
Resolved		SGupta-WMF	T288298 AQS 2.0: Device Analytics service
Invalid		BPirkle	T288301 AQS 2.0:Wikistats 2 service
Resolved		None	T288303 AQS 2.0: Media Analytics Service
Resolved		SGupta-WMF	T288305 AQS 2.0: Geo Analytics Service
Resolved		None	T288661 Create k8s deployment of AQS 2.0
Resolved		None	T288663 Obtain a security review of AQS 2.0
Resolved		apaskulin	T288664 AQS 2.0 user documentation
Resolved		BPirkle	T288667 Create Dashboards for AQS 2.0
Resolved		BPirkle	T302536 Problem details for HTTP APIs (rfc7807)
Resolved		BPirkle	T303817 <AQS 2.0> Onboard API Platform Team to AQS 2.0
Declined		None	T303819 <API Management> Supporting Shared APIs in GitLab
Resolved		BPirkle	T311541 AQS 2.0: Create repository for shared functions
Resolved		BPirkle	T313513 Review AQS 2.0 behavior for discrepancies with existing production service
Resolved		codebug	T315113 Synchronize .gitignore files
Resolved		SGupta-WMF	T317428 AQS 2.0 Code Review: Sept 2022
Duplicate	Spike	VirginiaPoundstone	T318108 <spike> Define remaining scope of AQS 2.0
Resolved		codebug	T322590 Update copyright notices throughout AQS 2.0
Resolved		None	T327817 Edit Analytics Service
Resolved		None	T327818 Editor Analytics Service
Resolved	Spike	VirginiaPoundstone	T319687 Define Product Requirements for remaining scope of AQS 2.0
Open		daniel	T323295 Strategize mapping AQS 2.0 urls to api.wikimedia.org
Resolved		BPirkle	T328969 AQS 2.0: Revisit in-service testing approach
Resolved		BPirkle	T335692 AQS 2.0: Makefile Improvements
			· · ·

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

Eevans merged a task: T288153: Analytics Query Service rewrite.Aug 4 2021, 8:38 PM

Eevans updated the task description. (Show Details)

Eevans added a subtask: T288156: Create a stand-alone OpenAPI specification for AQS.

Eevans added a subtask: T288160: Development and test environments for AQS 2.0 Cassandra services.Aug 4 2021, 8:40 PM

• Clarakosi updated the task description. (Show Details)Aug 5 2021, 12:06 AM

Eevans updated the task description. (Show Details)Aug 5 2021, 9:30 PM

Eevans added a subtask: T288298: AQS 2.0: Device Analytics service.Aug 5 2021, 9:53 PM

Eevans closed subtask T288156: Create a stand-alone OpenAPI specification for AQS as Invalid.Aug 11 2021, 8:25 PM

Eevans updated the task description. (Show Details)Aug 19 2021, 10:58 PM

We can use https://gitlab.wikimedia.org/eevans/aqs to get started, and open merge-requests for code review there. Depending on the state of the Gitlab rollout as we get nearer completion, we can either move it to a dedicated project, or set it up in Gerrit in the usual way.

If there are no objections, we'll implement the new service Go.

odimitrijevic moved this task from Analytics Query Service to Incoming on the Analytics board.Aug 23 2021, 5:59 PM

odimitrijevic moved this task from Incoming to Analytics Query Service on the Analytics board.Aug 23 2021, 6:08 PM

Eevans updated the task description. (Show Details)Oct 5 2021, 8:41 PM

Eevans updated the task description. (Show Details)Nov 15 2021, 8:06 PM

BTullis subscribed.Jan 20 2022, 9:58 AM

Eevans removed a subtask: T299731: Implement aggregate endpoint of the pageviews API.Jan 21 2022, 1:09 AM

AQS 2.0 to API Platform

as per team meeting AQS 2.0 work will begin to fall under the API Platform workstream
@Eevans will start to include @nnikkhoui and @BPirkle in code reviews & in 2x weekly standups for AQS 2.0

• DAbad moved this task from Incoming to Should do next on the API Platform board.Mar 15 2022, 12:19 PM

• DAbad edited projects, added API Platform (API Platform Roadmap); removed API Platform.

• DAbad added a subtask: T303817: <AQS 2.0> Onboard API Platform Team to AQS 2.0 .Mar 15 2022, 12:29 PM

• DAbad added a subtask: T303819: <API Management> Supporting Shared APIs in GitLab.Mar 15 2022, 12:37 PM

BPirkle mentioned this in T303822: Evaluate API Platform Tasks Related to RestBase Sunset.Mar 17 2022, 6:13 PM

• DAbad moved this task from Backlog to Develop on the API Platform (API Platform Roadmap) board.Apr 12 2022, 2:06 PM

• DAbad changed the status of subtask T288296: AQS 2.0: Page Analytics Service from Open to In Progress.Apr 26 2022, 2:11 PM

• DAbad changed the task status from Open to In Progress.Apr 26 2022, 2:22 PM

• DAbad claimed this task.

• DAbad raised the priority of this task from Medium to High.

• DAbad closed subtask T303817: <AQS 2.0> Onboard API Platform Team to AQS 2.0 as Resolved.May 17 2022, 12:41 PM

• DAbad changed the status of subtask T288298: AQS 2.0: Device Analytics service from Open to In Progress.May 17 2022, 1:10 PM

• DAbad changed the status of subtask T288303: AQS 2.0: Media Analytics Service from Open to In Progress.Jun 28 2022, 1:15 PM

• DAbad changed the status of subtask T288305: AQS 2.0: Geo Analytics Service from Open to In Progress.Jun 28 2022, 1:20 PM

BPirkle mentioned this in T311190: Establish testing procedure for Druid-based endpoints.Jul 26 2022, 9:30 PM

BPirkle added a subtask: T313513: Review AQS 2.0 behavior for discrepancies with existing production service.Aug 3 2022, 8:43 PM

BPirkle added a subtask: T311190: Establish testing procedure for Druid-based endpoints.Aug 30 2022, 1:17 PM

August 30, 2022

Not completely done with Cassandra-based endpoints
Druid endpoints still need to be done
Tracking doc: https://docs.google.com/spreadsheets/d/1nl-4zjd5OfbgINsVGwEc5jh5_xEexz8H7-c5ZIFpopk/edit#gid=0
Implemented AQS assist (shared library)
working on testing environment
looking at how we productionize this
- use this as an example w/ service ops

BPirkle added a subtask: T316849: Audit tests for Druid-based endpoints.Sep 1 2022, 3:06 AM

BPirkle added a subscriber: Unknown Object (User).Sep 7 2022, 1:43 PM

BPirkle added a subscriber: SGupta-WMF.Sep 8 2022, 4:57 PM

BPirkle added a subtask: T317428: AQS 2.0 Code Review: Sept 2022.Sep 9 2022, 4:44 PM

BPirkle removed a subtask: T311190: Establish testing procedure for Druid-based endpoints.Sep 13 2022, 2:29 AM

BPirkle removed a subtask: T316849: Audit tests for Druid-based endpoints.Sep 14 2022, 7:30 PM

BPirkle closed subtask T302536: Problem details for HTTP APIs (rfc7807) as Resolved.Sep 27 2022, 2:08 PM

VirginiaPoundstone added a subtask: T318108: <spike> Define remaining scope of AQS 2.0.Oct 6 2022, 4:30 PM

BPirkle mentioned this in T320739: Provide API module in GrowthExperiments to allow querying image suggestion API for titles.Oct 13 2022, 7:24 PM

VirginiaPoundstone moved this task from Develop to Define on the API Platform (API Platform Roadmap) board.Oct 25 2022, 2:45 PM

VirginiaPoundstone moved this task from Define to Develop on the API Platform (API Platform Roadmap) board.Oct 25 2022, 2:47 PM

BPirkle closed subtask T311541: AQS 2.0: Create repository for shared functions as Resolved.Oct 26 2022, 2:22 AM

SGupta-WMF closed subtask T317428: AQS 2.0 Code Review: Sept 2022 as Resolved.Oct 31 2022, 11:15 AM

JArguello-WMF edited projects, added API Platform (API AQS 2.0); removed API Platform (API Platform Roadmap), Analytics.Nov 7 2022, 12:59 PM

BPirkle added a subtask: T322590: Update copyright notices throughout AQS 2.0.Nov 9 2022, 2:46 PM

VirginiaPoundstone moved this task from API AQS 2.0 to API Platform Roadmap on the API Platform board.Nov 14 2022, 9:56 AM

VirginiaPoundstone edited projects, added API Platform (API Platform Roadmap); removed API Platform (API AQS 2.0).

JArguello-WMF added a project: AQS2.0.Nov 14 2022, 7:10 PM

VirginiaPoundstone edited projects, added AQS 2.0 Roadmap; removed AQS2.0.Nov 14 2022, 7:46 PM

JArguello-WMF closed subtask T322590: Update copyright notices throughout AQS 2.0 as Resolved.Dec 1 2022, 3:08 PM

VirginiaPoundstone mentioned this in T314771: Implement a compatibility layer between RESTBase and native PCS responses.Dec 16 2022, 5:57 PM

BPirkle updated the task description. (Show Details)Jan 24 2023, 8:49 PM

BPirkle added a subtask: T327817: Edit Analytics Service.Jan 24 2023, 9:06 PM

BPirkle added a subtask: T327818: Editor Analytics Service.

BPirkle closed subtask T288301: AQS 2.0:Wikistats 2 service as Invalid.Jan 24 2023, 11:31 PM

Aklapper added a subtask: T319687: Define Product Requirements for remaining scope of AQS 2.0.Jan 30 2023, 4:29 PM

JArguello-WMF closed subtask T319687: Define Product Requirements for remaining scope of AQS 2.0 as Resolved.Feb 1 2023, 7:19 PM

JArguello-WMF closed subtask T315113: Synchronize .gitignore files as Resolved.Feb 1 2023, 7:31 PM

VirginiaPoundstone added a subtask: T323295: Strategize mapping AQS 2.0 urls to api.wikimedia.org .Feb 3 2023, 6:24 AM

BPirkle added a subtask: T328969: AQS 2.0: Revisit in-service testing approach.Feb 6 2023, 9:06 PM

JArguello-WMF closed subtask T328969: AQS 2.0: Revisit in-service testing approach as Resolved.Feb 21 2023, 10:32 PM

BPirkle mentioned this in T334851: Define a procedure/pattern to populate test environments.Apr 17 2023, 3:11 PM

JArguello-WMF edited projects, added API Platform (AQS 2.0 Roadmap); removed AQS 2.0 Roadmap, API Platform (API Platform Roadmap).Apr 18 2023, 5:24 PM

BPirkle added a subtask: T335692: AQS 2.0: Makefile Improvements.May 1 2023, 3:42 PM

VirginiaPoundstone moved this task from Backlog to In Progress on the API Platform (AQS 2.0 Roadmap) board.May 18 2023, 7:58 AM

VirginiaPoundstone edited projects, added API Platform (API Platform Roadmap); removed API Platform (AQS 2.0 Roadmap).

BPirkle closed subtask T335692: AQS 2.0: Makefile Improvements as Resolved.Jul 20 2023, 8:50 PM

VirginiaPoundstone added a project: AQS2.0.Aug 25 2023, 9:35 PM

VirginiaPoundstone removed a project: API Platform (API Platform Roadmap).

VirginiaPoundstone closed subtask T288298: AQS 2.0: Device Analytics service as Resolved.Sep 10 2023, 1:27 AM

VirginiaPoundstone moved this task from Incoming to AQS 2.0 Backlog on the AQS2.0 board.Sep 28 2023, 7:51 PM

VirginiaPoundstone closed subtask T288303: AQS 2.0: Media Analytics Service as Resolved.Jan 4 2024, 9:19 PM

VirginiaPoundstone closed subtask T288305: AQS 2.0: Geo Analytics Service as Resolved.

VirginiaPoundstone closed subtask T288296: AQS 2.0: Page Analytics Service as Resolved.

VirginiaPoundstone closed subtask T327818: Editor Analytics Service as Resolved.

VirginiaPoundstone closed subtask T327817: Edit Analytics Service as Resolved.

VirginiaPoundstone closed subtask T288160: Development and test environments for AQS 2.0 Cassandra services as Resolved.

VirginiaPoundstone closed subtask T303819: <API Management> Supporting Shared APIs in GitLab as Declined.Jan 24 2024, 9:19 PM

VirginiaPoundstone moved this task from AQS 2.0 Backlog to AQS 2.0 Epics on the AQS2.0 board.Jan 24 2024, 9:25 PM

VirginiaPoundstone closed subtask T288661: Create k8s deployment of AQS 2.0 as Resolved.Jan 24 2024, 9:41 PM

VirginiaPoundstone closed subtask T288663: Obtain a security review of AQS 2.0 as Resolved.

VirginiaPoundstone closed subtask T288667: Create Dashboards for AQS 2.0 as Resolved.Mar 1 2024, 9:19 PM

As API Gateway is nowadays owned by serviceops, adding the serviceops project tag to open API Gateway tasks tagged with the deprecated/archived "Platform Team Initiatives (API Gateway)" tag at https://phabricator.wikimedia.org/project/profile/4321/, as part of Phabricator Housekeeping.

Removing inactive task assignee. (Please do so as part of offboarding - thanks.)

apaskulin closed subtask T288664: AQS 2.0 user documentation as Resolved.Tue, Jul 9, 3:08 PM

	• Pchelolo
	Sep 21 2020, 6:18 PM

AQS 2.0Open, In Progress, HighPublic1 Estimated Story PointsActions

Description

Overview

Related ObjectsSearch...

Event Timeline

AQS 2.0
Open, In Progress, HighPublic1 Estimated Story Points
Actions

Related Objects
Search...