consider moving Cassandra to G1GC in production
Closed, ResolvedPublic
Actions

Assigned To

Authored By

	Eevans
	Jun 19 2015, 7:14 PM

Description

More and more people are weighing in with their experiences using the G1 garbage collector and Cassandra. Rumor has it that it enables the use of enormous heap sizes, with little to no tuning required, while still out-performing CMS. If true, it could have significant impact on our node-density story, and seems worth looking into.

G1GC will be the default in Cassandra 3.0.

Details

	Subject	Repo	Branch	Lines +/-
	Move Cassandra to g1gc collector and increase heap size	operations/puppet	production	+19 -12

Customize query in gerrit

Related Objects

Mentioned In: rOPUP331b66cea99b: Move Cassandra to g1gc collector and increase heap size
Mentioned Here: T221026: Gerrit thread use GC thrashing
T104888: upgrade to latest openjdk 8 8u66-b01-1

Event Timeline

Eevans created this task.Jun 19 2015, 7:14 PM

Eevans raised the priority of this task from to Needs Triage.

Eevans updated the task description. (Show Details)

Eevans added a project: RESTBase-Cassandra.

Eevans added subscribers: Eevans, • GWicke, • mobrovac, fgiunchedi.

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptJun 19 2015, 7:14 PM

I have done some testing with g1gc in staging and now on restbase1004. It's looking promising so far; at least it seems to let us survive the current overload a bit better than CMS.

Settings:

commented out "GC tuning options" section
instead, added JVM_OPTS="$JVM_OPTS -XX:+UseG1GC -XX:G1RSetUpdatingPauseTimePercent=5 -XX:MaxGCPauseMillis=500"
MAX_HEAP_SIZE="14g"
commented out JVM_OPTS="$JVM_OPTS -Xmn${HEAP_NEWSIZE}"; we should not set new gen size for g1gc

Change 221993 had a related patch set uploaded (by GWicke):
Move Cassandra to g1gc collector and increase heap size

https://gerrit.wikimedia.org/r/221993

gerritbot added a project: Patch-For-Review.Jun 30 2015, 9:09 PM

Change 221993 merged by Filippo Giunchedi:
Move Cassandra to g1gc collector and increase heap size

https://gerrit.wikimedia.org/r/221993

• GWicke mentioned this in rOPUP331b66cea99b: Move Cassandra to g1gc collector and increase heap size.Jun 30 2015, 11:49 PM

Graphs: http://grafana.wikimedia.org/#/dashboard/db/restbase-cassandra-gc

restbase1004 has been running with a 16GB heap size and MaxGCPauseMillis=250 for two days now, and from what I can tell, there is no significant impact on I/O wait times. Moreover, it's the only node that hasn't been restarted during this period, despite being the node with the biggest amount of storage to handle.

We are applying https://gerrit.wikimedia.org/r/#/c/222899/ as a tentative way of improving stability.

• mobrovac added a project: acl*sre-team.Jul 5 2015, 12:51 PM

• mobrovac set Security to None.

Restricted Application added a subscriber: Matanya. · View Herald TranscriptJul 5 2015, 12:51 PM

See also T104888 for ongoing JDK8 testing.

fgiunchedi claimed this task.Jul 20 2015, 9:37 AM

fgiunchedi triaged this task as Medium priority.Jul 20 2015, 2:52 PM

we're running g1gc everywhere

also T221026#5143639 for using G1 GC on Gerrit

consider moving Cassandra to G1GC in productionClosed, ResolvedPublicActions

Description

Details

Related Objects

Event Timeline

consider moving Cassandra to G1GC in production
Closed, ResolvedPublic
Actions