This task was generated as a sub-task to T190540. On T190540, we discovered multiple cp systems in codfw with memory errors.
The hosts have undergone memory testing and report no faults. However, ECC errors in the SEL have led us to want to replace DIMMs B2 and B6 in cp2006.
2018-03-23 14:55:12 Multi-bit memory errors detected on a memory device at location(s) DIMM_B2.
2018-03-23 14:55:08 A problem was detected in Memory Reference Code (MRC).
2018-01-08 07:59:09 Correctable memory error rate exceeded for DIMM_B6.
2018-01-08 07:58:32 Correctable memory error rate exceeded for DIMM_B6.
2018-01-08 07:58:24 Correctable memory error rate exceeded for DIMM_B2.
2017-11-07 09:55:56 Correctable memory error rate exceeded for DIMM_B6.
2017-11-07 09:55:42 Correctable memory error rate exceeded for DIMM_B2.
2017-11-07 09:55:39 Correctable memory error rate exceeded for DIMM_B6.
2016-06-01 15:52:24 Multi-bit memory errors detected on a memory device at location(s) DIMM_B2.
2016-06-01 15:52:21 A problem was detected in Memory Reference Code (MRC).
2016-02-06 06:30:51 Correctable memory error rate exceeded for DIMM_B2.
2016-02-06 06:30:49 Correctable memory error rate exceeded for DIMM_B6.
2016-02-06 06:30:47 Correctable memory error rate exceeded for DIMM_B2.
2016-02-05 17:06:00 Multi-bit memory errors detected on a memory device at location(s) DIMM_B2.
2016-02-05 17:05:57 A problem was detected in Memory Reference Code (MRC).
2015-07-28 23:03:06 Correctable memory error rate exceeded for DIMM_B2.
2015-07-28 23:03:03 Correctable memory error rate exceeded for DIMM_B2.
2015-06-20 06:57:50 Correctable memory error rate exceeded for DIMM_B6.
2015-06-20 06:57:49 Correctable memory error rate exceeded for DIMM_B2.
2015-06-20 06:57:46 Correctable memory error rate exceeded for DIMM_B6.
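
For anyone wanting to cross-check the per-DIMM counts on the other affected cp hosts, here is a minimal Python sketch that tallies SEL memory events by DIMM and error type. `tally_sel` is a hypothetical helper written for this task, not existing tooling, and it assumes the SEL text is pasted in the same timestamped format as above. `SAMPLE` is only an excerpt (the five 2018 entries from this log).

```python
import re
from collections import Counter

# Timestamp format used by the SEL entries quoted in this task.
TIMESTAMP = r"\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}"


def tally_sel(text):
    """Count SEL memory events per (DIMM, kind). Hypothetical helper."""
    # Split before each timestamp, so the log may be pasted either as
    # one entry per line or as a single flattened line.
    entries = re.split(r"\s+(?=" + TIMESTAMP + r" )", text.strip())
    counts = Counter()
    for entry in entries:
        dimm = re.search(r"DIMM_[A-Z]\d+", entry)
        if dimm is None:
            continue  # e.g. MRC entries, which name no DIMM
        if "Multi-bit memory errors" in entry:
            kind = "multi-bit"
        elif "Correctable memory error rate exceeded" in entry:
            kind = "correctable"
        else:
            kind = "other"
        counts[(dimm.group(0), kind)] += 1
    return counts


# Excerpt only: the five 2018 entries from the cp2006 SEL above.
SAMPLE = (
    "2018-03-23 14:55:12 Multi-bit memory errors detected on a memory "
    "device at location(s) DIMM_B2. "
    "2018-03-23 14:55:08 A problem was detected in Memory Reference Code (MRC). "
    "2018-01-08 07:59:09 Correctable memory error rate exceeded for DIMM_B6. "
    "2018-01-08 07:58:32 Correctable memory error rate exceeded for DIMM_B6. "
    "2018-01-08 07:58:24 Correctable memory error rate exceeded for DIMM_B2."
)

for (dimm, kind), n in sorted(tally_sel(SAMPLE).items()):
    print(dimm, kind, n)
```

Counting the full cp2006 log above by hand gives 3 multi-bit events on DIMM_B2 and 7 correctable-rate events each on DIMM_B2 and DIMM_B6, which is why both DIMMs are flagged for replacement.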