Histogram metrics in 2.2.6 lack the recency bias they had in 2.1.13, resulting in strangely consistent values.
See also: https://issues.apache.org/jira/browse/CASSANDRA-11752
Status | Assigned | Task
---|---|---
Invalid | None | T93751 RFC: Next steps for long-term revision storage -- space needs, storage hierarchies
Declined | Eevans | T93496 Improve revision compression in Cassandra / Brotli or LZMA support
Declined | Eevans | T125904 Brotli compression for Cassandra
Declined | None | T120171 RFC: Differentiate storage strategies for archival storage vs. hot current data
Declined | None | T122028 RFC: Chunked storage algorithms for archival data vs. large-window brotli compression
Declined | Eevans | T125906 Evaluate Brotli compression for Cassandra
Invalid | None | T126582 Log input from cassandra caused logstash process to crash repeatedly
Resolved | GWicke | T111746 [future] Keep an eye on materialized views in Cassandra 3.0
Resolved | Eevans | T126629 Cassandra 2.2.6
Resolved | Eevans | T137474 Investigate lack of recency bias in Cassandra histogram metrics
Mentioned in SAL [2016-06-09T19:59:15Z] <urandom> Restarting Cassandra on xenon.eqiad.wmnet (removing patched test build; restoring state) : T137474
Mentioned in SAL [2016-06-10T13:13:00Z] <urandom> Testing patched Cassandra (dpkg -i ...; service cassandra-a restart) on xenon : T137474
Mentioned in SAL [2016-06-10T13:15:03Z] <urandom> Starting html dump(s) in RESTBase staging : T137474
Mentioned in SAL [2016-06-10T13:58:27Z] <urandom> Testing patched Cassandra (dpkg -i ...; service cassandra-a restart) on cerium : T137474
Mentioned in SAL [2016-06-10T13:59:54Z] <urandom> Testing patched Cassandra (dpkg -i ...; service cassandra-a restart) on praseodymim : T137474
Mentioned in SAL [2016-06-10T14:06:51Z] <urandom> Testing patched Cassandra (dpkg -i ...; service cassandra-a restart) on restbase-test2001 : T137474
Mentioned in SAL [2016-06-10T14:17:43Z] <urandom> Testing patched Cassandra (dpkg -i ...; service cassandra-{a,b} restart) on restbase-test200[1-2] : T137474
The root cause here is a deliberate change to the histogram implementation in 2.2, made to address concerns about the lossy nature of the forward-decaying priority reservoir sampling used prior to Cassandra 2.2. Options are still being discussed on CASSANDRA-11752, but the consensus seems to be that, at a minimum, percentile accessors should be recency-biased without requiring a reset-on-read.
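For background, here is a minimal sketch of the forward-decay scheme (illustrative only; this is neither Cassandra's nor Dropwizard's actual code, and all names are made up). Each sample's weight grows exponentially with time since a fixed landmark, and the fixed-size reservoir keeps the highest-priority samples, so recent values dominate:

```java
import java.util.Random;
import java.util.concurrent.ConcurrentSkipListMap;

// Illustrative sketch of forward-decaying priority reservoir sampling.
// A sample's weight grows exponentially with time since a fixed landmark,
// so recent samples almost always out-prioritize old ones.
class ForwardDecaySketch
{
    private static final int CAPACITY = 1028;  // reservoir size
    private static final double ALPHA = 0.015; // decay rate
    private final long landmarkSec = System.currentTimeMillis() / 1000;
    private final Random random = new Random();
    // Sorted by priority; the lowest-priority sample is evicted first.
    private final ConcurrentSkipListMap<Double, Long> reservoir = new ConcurrentSkipListMap<>();

    void update(long value)
    {
        long nowSec = System.currentTimeMillis() / 1000;
        double weight = Math.exp(ALPHA * (nowSec - landmarkSec));
        double priority = weight / random.nextDouble();
        reservoir.put(priority, value);
        if (reservoir.size() > CAPACITY)
            reservoir.pollFirstEntry(); // drop the lowest-priority sample
        // (A production implementation also rescales the landmark
        // periodically to keep the weights from overflowing.)
    }
}
```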
Until a resolution to CASSANDRA-11752 is available, I propose we patch our build to make use of the Dropwizard ExponentiallyDecayingReservoir (the implementation used in 2.1). The patch for this is very simple:
```diff
diff --git a/src/java/org/apache/cassandra/metrics/CassandraMetricsRegistry.java b/src/java/org/apache/cassandra/metrics/CassandraMetricsRegistry.java
index 6fdb2ff..308a65b 100644
--- a/src/java/org/apache/cassandra/metrics/CassandraMetricsRegistry.java
+++ b/src/java/org/apache/cassandra/metrics/CassandraMetricsRegistry.java
@@ -60,7 +60,7 @@ public class CassandraMetricsRegistry extends MetricRegistry
 
     public Histogram histogram(MetricName name, boolean considerZeroes)
     {
-        Histogram histogram = register(name, new ClearableHistogram(new EstimatedHistogramReservoir(considerZeroes)));
+        Histogram histogram = register(name, new Histogram(new ExponentiallyDecayingReservoir()));
 
         registerMBean(histogram, name.getMBeanName());
         return histogram;
@@ -68,7 +68,7 @@ public class CassandraMetricsRegistry extends MetricRegistry
 
     public Timer timer(MetricName name)
     {
-        Timer timer = register(name, new Timer(new EstimatedHistogramReservoir(false)));
+        Timer timer = register(name, new Timer(new ExponentiallyDecayingReservoir()));
 
         registerMBean(timer, name.getMBeanName());
         return timer;
```
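To illustrate the behavioral difference this restores, here is a self-contained sketch against the Dropwizard metrics-core API (the class and reservoir names are the library's; the inflated alpha of 0.15, versus the 0.015 default, is only so the effect shows up within a one-minute run):

```java
import com.codahale.metrics.ExponentiallyDecayingReservoir;
import com.codahale.metrics.Histogram;
import com.codahale.metrics.UniformReservoir;

// Demo: after the value distribution shifts, the decaying reservoir's
// percentiles follow the recent values, while an unbiased reservoir
// keeps reflecting the entire history.
public class RecencyBiasDemo
{
    public static void main(String[] args) throws InterruptedException
    {
        // 1028 samples; alpha raised from the 0.015 default so decay
        // is visible within a one-minute demo.
        Histogram decaying = new Histogram(new ExponentiallyDecayingReservoir(1028, 0.15));
        Histogram uniform = new Histogram(new UniformReservoir(1028));

        // Phase 1: 100k "slow" measurements (~1000 units).
        for (int i = 0; i < 100_000; i++)
        {
            decaying.update(1000);
            uniform.update(1000);
        }

        // Let wall-clock time pass so the old samples decay.
        Thread.sleep(60_000);

        // Phase 2: 10k "fast" measurements (~10 units).
        for (int i = 0; i < 10_000; i++)
        {
            decaying.update(10);
            uniform.update(10);
        }

        // Expected: the decaying p99 has dropped to ~10, while the
        // uniform p99 still sits at ~1000.
        System.out.printf("decaying p99: %.0f%n", decaying.getSnapshot().get99thPercentile());
        System.out.printf("uniform  p99: %.0f%n", uniform.getSnapshot().get99thPercentile());
    }
}
```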
I have built Debian packages that apply this patch as part of the package build; they can be found here. I've manually installed these packages on the staging cluster, and have a couple of dump processes running.
I propose that we keep the dumps running over the weekend, and upgrade restbase1007.eqiad.wmnet on Monday if everything continues to look OK.
Hmm, we have around 10% free space left on the eqiad staging nodes (~40 GB). Since the idea is to run the dumps, perhaps it'd be worth truncating the largest CFs so we're in the clear?
This looks awesome! Thank you, @Eevans for investigating and coming up with a working solution so quickly!
It's going to be difficult to move the needle by a whole lot without truncating the wikipedia parsoid tables (which we've been reluctant to do so far). Truncating local_group_wikipedia_T_mobileapps_remaining would free up ~23G, and local_group_wikipedia_T_mobileapps_lead another ~8G (combined, that's about 10% of the current data). Are we OK truncating the mobileapps tables?
```
[ ... ]
43165683      data/local_group_wikisource_T_parsoid_html
45114701      data/local_group_phase0_T_parsoid_html
2296759180    data/local_group_wikipedia_T_summary
8347359051    data/local_group_wikipedia_T_title__revisions
8465207914    data/local_group_wikipedia_T_mobileapps_lead
9147278915    data/local_group_wikipedia_T_parsoid_section_offsets
24520085710   data/local_group_wikipedia_T_mobileapps_remaining
76365898022   data/local_group_wikipedia_T_parsoid_dataW4ULtxs1oMqJ
188561917420  data/local_group_wikipedia_T_parsoid_html
318081577290  total
```
> It's going to be difficult to move the needle by a whole lot without truncating the wikipedia parsoid tables (which we've been reluctant to do so far).
I know, but we are coming to a point where we have to decide what to do next; otherwise we won't be able to store anything any more (but let's not discuss this here, it's kind of OT for this task).
For the time being, keep the dumps running so that we collect as much data as possible. I will monitor the nodes over the weekend, and if we come dangerously close to filling the disk I'll stop the dump. Where is it running from?
> Truncating local_group_wikipedia_T_mobileapps_remaining would free up ~23G, and local_group_wikipedia_T_mobileapps_lead another ~8G (combined, that's about 10% of the current data). Are we OK truncating the mobileapps tables?
Sure, go ahead and do that.
Thanks, @mobrovac! The dumps are running on xenon and cerium:
```
eevans@xenon:~$ screen -ls
There is a screen on:
	15544.dump	(06/09/2016 10:20:00 AM)	(Detached)
1 Socket in /var/run/screen/S-eevans.
```

```
eevans@cerium:~$ screen -ls
There is a screen on:
	18952.dump	(06/09/2016 10:20:09 AM)	(Detached)
1 Socket in /var/run/screen/S-eevans.
```
>> Truncating local_group_wikipedia_T_mobileapps_remaining would free up ~23G, and local_group_wikipedia_T_mobileapps_lead another ~8G (combined, that's about 10% of the current data). Are we OK truncating the mobileapps tables?
>
> Sure, go ahead and do that.
Done.
```
$ df -h
Filesystem                 Size  Used Avail Use% Mounted on
udev                        10M     0   10M   0% /dev
tmpfs                      3.2G  331M  2.8G  11% /run
/dev/md0                    28G  6.3G   20G  24% /
tmpfs                      7.8G     0  7.8G   0% /dev/shm
tmpfs                      5.0M     0  5.0M   0% /run/lock
tmpfs                      7.8G     0  7.8G   0% /sys/fs/cgroup
/dev/mapper/xenon--vg-srv  355G  271G   66G  81% /srv
```
Dumps ran continuously over the weekend in staging, and the metrics appear reasonable. I'm going to proceed with the upgrade of restbase1007 (the only production node currently running 2.2.6).
Mentioned in SAL [2016-06-13T17:37:11Z] <urandom> Upgrading restbase1007.eqiad.wmnet w/ https://people.wikimedia.org/~eevans/debian/cassandra_2.2.6-wmf1_all.deb : T137474
Mentioned in SAL [2016-06-13T17:38:00Z] <urandom> Restarting restbase1007-a.eqiad.wmnet : T137474
Mentioned in SAL [2016-06-13T17:52:42Z] <urandom> Restarting restbase1007-b.eqiad.wmnet : T137474
Mentioned in SAL [2016-06-13T17:55:20Z] <urandom> Restarting restbase1007-c.eqiad.wmnet : T137474
Mentioned in SAL [2016-06-13T17:58:01Z] <urandom> Upgrade of restbase1007.eqiad.wmnet (https://people.wikimedia.org/~eevans/debian/cassandra_2.2.6-wmf1_all.deb) complete : T137474
Thanks, @Eevans! Is there anything left on this task (upstreaming?), or should we resolve it?
A patch for this has been submitted upstream. We should test this out, and provide feedback if necessary, before it becomes a part of the 2.2.8 release.
A Debian package with a backported patch from https://issues.apache.org/jira/browse/CASSANDRA-11752 can be found at https://people.wikimedia.org/~eevans/debian/
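For anyone who wants to spot-check a node running the backported package, something along these lines should work. This is a sketch only: the aggregate MBean name and the 99thPercentile attribute are assumptions based on the metric paths graphed below, so verify the exact names in jconsole first.

```java
import javax.management.MBeanServerConnection;
import javax.management.ObjectName;
import javax.management.remote.JMXConnector;
import javax.management.remote.JMXConnectorFactory;
import javax.management.remote.JMXServiceURL;

// Polls the aggregate read-latency p99 over JMX; on a node with recency
// bias restored, the value should move with traffic instead of sitting
// eerily flat. The MBean/attribute names below are assumptions -- check
// them against your build in jconsole.
public class PercentileProbe
{
    public static void main(String[] args) throws Exception
    {
        String host = args.length > 0 ? args[0] : "localhost";
        // 7199 is Cassandra's default JMX port.
        JMXServiceURL url = new JMXServiceURL(
                "service:jmx:rmi:///jndi/rmi://" + host + ":7199/jmxrmi");
        try (JMXConnector connector = JMXConnectorFactory.connect(url))
        {
            MBeanServerConnection conn = connector.getMBeanServerConnection();
            ObjectName p99Bean = new ObjectName(
                    "org.apache.cassandra.metrics:type=ColumnFamily,name=ReadLatency");
            for (int i = 0; i < 10; i++)
            {
                Object p99 = conn.getAttribute(p99Bean, "99thPercentile");
                System.out.printf("ReadLatency p99: %s%n", p99);
                Thread.sleep(5000);
            }
        }
    }
}
```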
Mentioned in SAL [2016-08-24T20:03:54Z] <urandom> T137474 Starting htmldumper in RESTBase Staging
Mentioned in SAL [2016-08-24T20:59:02Z] <urandom> T137474: Upgrading xenon.eqiad.wmnet to cassandra_2.2.6-wmf2
Mentioned in SAL [2016-08-25T00:51:14Z] <urandom> T137474: Stopping dumps in RESTBase staging, and reverting xenon.eqiad.wmnet to Cassandra 2.2.6-wmf1
Some explanation:
When the test begins at ~20:00, all 3 nodes are running a version of Cassandra 2.2.6 patched to reinstate the Dropwizard ExponentiallyDecayingReservoir that was used prior to Cassandra 2.2.
At ~21:00, traffic generation was stopped long enough to upgrade xenon-a to a version of Cassandra 2.2.6 patched to include what was merged as a part of CASSANDRA-11752.
Write rate (org.apache.cassandra.metrics.ColumnFamily.all.WriteLatency.1MinuteRate).
Read rate (org.apache.cassandra.metrics.ColumnFamily.all.ReadLatency.1MinuteRate).
Write latency (org.apache.cassandra.metrics.ColumnFamily.all.WriteLatency.99percentile).
Read latency (org.apache.cassandra.metrics.ColumnFamily.all.ReadLatency.99percentile).
Here you can spot what looks like a bit of a difference: xenon-a generally trends close to the other two nodes, but the larger spikes appear to be smoothed out somewhat.
I'm satisfied that this is Good Enough (and others have indicated the same), so I'm closing this issue.