Page MenuHomePhabricator

cr1-esams:fpc0 errors
Closed, ResolvedPublic

Description

Logs are full of:

Sep 19 14:41:32  re0.cr1-esams fpc0 dispatch_event_handler(691): EA[0:0].disp[3] SECONDARY_TIMEOUT (PPE 5 Zone 19).
Sep 19 14:41:35  re0.cr1-esams fpc0 dispatch_event_handler(691): EA[0:0].disp[3] SECONDARY_TIMEOUT (PPE 5 Zone 19).
Sep 19 14:41:38  re0.cr1-esams fpc0 dispatch_event_handler(691): EA[0:0].disp[3] SECONDARY_TIMEOUT (PPE 5 Zone 19).

relevant doc: https://supportportal.juniper.net/s/article/Syslog-message-SECONDARY-TIMEOUT-or-PRIMARY-TIMEOUT?language=en_US

TLDR, FPC restart is the first step.

Event Timeline

ayounsi triaged this task as High priority.
Restricted Application added a subscriber: Aklapper. · View Herald Transcript

Mentioned in SAL (#wikimedia-operations) [2023-11-15T13:55:56Z] <XioNoX> disable peering/transit on cr1-esams for linecard reboot - T346779

Change 987782 had a related patch set uploaded (by Ayounsi; author: Ayounsi):

[operations/dns@master] Depool esams for cr1 maintenance

https://gerrit.wikimedia.org/r/987782

Change 987782 merged by Ayounsi:

[operations/dns@master] Depool esams for cr1 maintenance

https://gerrit.wikimedia.org/r/987782

Mentioned in SAL (#wikimedia-operations) [2024-01-04T15:01:28Z] <XioNoX> depool esams for router work - T346779

Change 987732 had a related patch set uploaded (by Ayounsi; author: Ayounsi):

[operations/dns@master] Repool esams after maintenance

https://gerrit.wikimedia.org/r/987732

Mentioned in SAL (#wikimedia-operations) [2024-01-04T15:16:09Z] <XioNoX> drain esams-eqiad transport - T346779

Mentioned in SAL (#wikimedia-operations) [2024-01-04T15:26:55Z] <XioNoX> disable peering/transit on cr1-esams for linecard reboot - T346779

Mentioned in SAL (#wikimedia-operations) [2024-01-04T15:37:10Z] <XioNoX> re-enable peering/transit on cr1-esams - T346779

Mentioned in SAL (#wikimedia-operations) [2024-01-04T15:38:57Z] <XioNoX> undrain esams-eqiad transport - T346779

Change 987732 merged by Ayounsi:

[operations/dns@master] Repool esams after maintenance

https://gerrit.wikimedia.org/r/987732

Error logs stopped showing up after the linecard reboot. Monitoring it for a bit before closing the task.