
Enable retention of daily metrics for longer periods of time in Graphite
Closed, Resolved (Public)

Description

We are looking at tracking lots of metrics about Wikidata at a daily scale.
We would also like to be able to track them for longer periods of time.

Graphite has customizable retention (currently a year at most).
Because daily metrics produce far fewer data points, we can retain them for longer without a big impact on disk usage!
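
To put a rough number on the disk-usage point, here is a minimal sketch that estimates Whisper file sizes from a retention string. It assumes the standard Whisper layout (16-byte header, 12 bytes of archive info per archive, 12 bytes per data point). The two retention strings are illustrative assumptions, not the values configured on this Graphite instance.

```python
# Back-of-the-envelope Whisper file sizes for two retention policies.
# Assumed layout: 16-byte metadata header, 12 bytes of archive info
# per archive, and 12 bytes per stored data point.

UNIT_SECONDS = {"s": 1, "m": 60, "h": 3600, "d": 86400,
                "w": 604800, "y": 86400 * 365}


def parse_duration(text):
    """Convert a Graphite-style duration such as '1d' or '25y' to seconds."""
    return int(text[:-1]) * UNIT_SECONDS[text[-1]]


def whisper_file_bytes(retentions):
    """Estimate the file size for a comma-separated retention string."""
    archives = retentions.split(",")
    size = 16 + 12 * len(archives)
    for archive in archives:
        precision, duration = archive.strip().split(":")
        points = parse_duration(duration) // parse_duration(precision)
        size += 12 * points
    return size


# Both policies below are hypothetical examples.
daily_long = whisper_file_bytes("1d:25y")
minutely_short = whisper_file_bytes("1m:1y")
print(daily_long, minutely_short)
```

Under these assumptions, one data point per day kept for decades takes far less space per metric than one data point per minute kept for a single year.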

Event Timeline

Addshore raised the priority of this task from to Medium.
Addshore updated the task description.
Addshore subscribed.
Addshore set Security to None.
Addshore moved this task from Incoming to Doing on the WMDE-Analytics-Engineering board.

Change 247866 had a related patch set uploaded (by Addshore):
Retain daily.* graphite metrics for longer

https://gerrit.wikimedia.org/r/247866

For long-term data warehousing or analytics-type workflows, using our Hadoop/analytics infrastructure would be more appropriate, I think. Graphite is more focused on operational metrics from applications, services, and so on.

After a short discussion, I'm going to look at creating a small wrapper for storing the metrics in SQL tables / Hadoop / flat files instead of Graphite.
I'll leave this ticket open until we have a concrete path forward / solution! :)

After many discussions in many places, we have decided to push forward with this config change and start storing data in this Graphite instance.

Since those discussions revealed that this Graphite instance offers no real guarantee of persistence, we will simply back up all the data we add here.
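
One possible shape for such a backup, as a minimal sketch only: pull the series through Graphite's render API in JSON form and flatten them to CSV. The host name, the `daily.*` target, and the helper names `build_render_url` and `series_to_csv` are hypothetical. The actual backup mechanism is not described in this task.

```python
import csv
import io
import json
from urllib.parse import urlencode


def build_render_url(base_url, target, since="-25y"):
    """Build a render API URL that asks for JSON output."""
    query = urlencode({"target": target, "from": since, "format": "json"})
    return f"{base_url}/render?{query}"


def series_to_csv(payload):
    """Flatten a render API JSON payload into CSV text.

    Each series carries a 'target' name and a list of
    [value, timestamp] pairs; empty slots (null values) are skipped.
    """
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["metric", "timestamp", "value"])
    for series in json.loads(payload):
        for value, timestamp in series["datapoints"]:
            if value is not None:
                writer.writerow([series["target"], timestamp, value])
    return out.getvalue()


# Hypothetical host; fetching the URL is left to the caller.
url = build_render_url("https://graphite.example.org", "daily.*")

# Hand-written sample payload in the render API's JSON shape.
sample = ('[{"target": "daily.example.count", '
          '"datapoints": [[5.0, 1445299200], [null, 1445385600]]}]')
backup_csv = series_to_csv(sample)
```

Keeping the fetch separate from the flattening step means the CSV conversion can be checked against a saved payload without touching the live instance.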

We will also look for a better solution on the analytics cluster in the longer term.

Change 247866 merged by Filippo Giunchedi:
Retain daily.* graphite metrics for longer (25y)

https://gerrit.wikimedia.org/r/247866
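
For readers unfamiliar with how Graphite applies such a rule, here is a hedged sketch of what a retention stanza of this kind might look like and how it would match metric names. Only the `daily.*` prefix and the 25y duration come from this task. The stanza name, the exact regular expression, and the `1d` resolution are assumptions, and the real contents of change 247866 are not reproduced here.

```python
import configparser
import re

# Hypothetical stanza in the style of Graphite's storage-schemas.conf.
SCHEMA_TEXT = r"""
[daily]
pattern = ^daily\..*
retentions = 1d:25y
"""

parser = configparser.ConfigParser(interpolation=None)
parser.read_string(SCHEMA_TEXT)


def retention_for(metric_name):
    """Return the retentions of the first stanza whose pattern matches."""
    for section in parser.sections():
        if re.search(parser[section]["pattern"], metric_name):
            return parser[section]["retentions"]
    return None


# Metric names below are made up for illustration.
matched = retention_for("daily.wikidata.example")
unmatched = retention_for("servers.example.cpu")
print(matched, unmatched)
```

In this sketch only metrics under the `daily.` prefix pick up the long retention; other metrics fall through and would keep whatever rule applies to them elsewhere.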

Addshore moved this task from Doing to Done on the WMDE-Analytics-Engineering board.