Thu, May 13
The last Dell tech that came in identified the problem as a riser card. Oddly enough this was replaced already, but maybe the second time is the charm. Dell is sending the part directly to me and I will replace it; fingers crossed this works.
Tue, May 11
Added Airport Express and connected to mr1 in netbox
@Jclark-ctr moss-be1001 cables are wrong, the ports you have them connected to are already labeled for cloudcephosd1016 but I see that the server is not connected to the switch and also listed as decommission in Netbox (unsure about that status as well). I am confused about what's going on with this switch and available ports. Can you let me know which ports are available?
Dell is supposed to be here today to replace several more parts. We will see how it goes
Thanks @ayounsi, could you add a device type to the list? That is a netbox requirement for me to save.
All four are removed from the rack and decom'd on netbox.
@wiki_willy I can add to netbox but how do I classify it? I could add as an access switch but Apple is not a manufacturer listed in our devices. Please advise.
It appears the power cable was loose and not properly seated; I pushed it back in and the LED lit up. These servers are over 5 years old.
Thu, May 6
Mon, Apr 26
Dell sent an email with a list of things they want done. Considering that they've had 2 technicians out to fix the issue with zero resolution, I replied that they will need to send one of their technicians out to perform these tasks. I do not feel it is wise to start poking around myself in case we need to RMA this server.
Fri, Apr 23
Assigning to @RobH for installs
Assigning this to @RobH to complete install
Loose power cable
Password was incorrect, fixed
The disk has been swapped. I am resolving this task because the on-site work has been completed.
Replaced cpu1 and cleared the idrac log. Resolving; if the issue returns please re-open.
Thu, Apr 15
@Jclark-ctr netbox script ran for wcqs1001 and 1002. I'm not sure why 1003 is in C4, that's a 10G rack. If it is, can you please move it to a standard rack?
@RobH the 2nd interface was added to these, can you try the install again please?
mc1041-50 netbox and network port updates have been completed; I still need to go on-site and set up idrac.
I need to be able to login to servers and run megacli commands as well as cat /proc/
ticket opened with Dell! You have successfully submitted request SR1057103007.
Dell has us on a wild goose chase. Responded to their questions with the following:
Apr 13 2021
@RobH all the secondary ports are updated and added to the private vlan per the instructions above. Feel free to do the installs whenever you have a moment.
Apr 12 2021
Another Dell tech arrived today with what was believed to be the replacement part. The part was replaced and the error persisted. Several reboots and TSR reports later, we do not know what is going on. At one point reseating the CMOS battery worked, but then the PCI error returned on the next reboot. The Dell technician is still on-site attempting to troubleshoot with Dell tech support now. Once something has been decided I will update the task.
Apr 9 2021
The Dell tech came out and replaced the motherboard, but that did not fix the issue. It turns out that there is a bad cable to the backplane. A new part has been ordered.
Apr 8 2021
Fixed, the report has zero errors.
Updated the BIOS and submitted a Dell ticket: "You have successfully submitted request SR1056516502."
@aborrero The 2nd interfaces are
cloudgw1001 cloudsw1-c8 xe-0/0/19 cable id 5321
These are all connected, but the 2nd interfaces are not set up. It seems that we're all confused on how to do this, so I didn't do anything. Maybe @Papaul can let us know if the 2nd interface is automated or requires a manual setup.
Apr 7 2021
@elukey that's a first! Maybe the raid bios settings are wrong?
Apr 5 2021
Apr 1 2021
@RobH assigning this to you, 1040-1045 are ready for installs. I set up both ports in netbox. Since we're waiting on a new nic card for 1046 please reassign to John after (assuming no issues that need me to get involved). Thanks!
Dell Ticket Created
@wiki_willy That will work! Thanks
@elukey I have not forgotten about this. A7 is a rack for the possible move, but we are already maxing out our power utilization in that rack and adding another R740XD is probably not a good idea.
Looks like a possible DIMM error, since the server is already depooled I will run a couple of tests to determine if it's a DIMM, CPU or motherboard issue.
The DIMM only reported the error that one day and the error has not returned. I am clearing the system log and resolving this for now; if the issue persists please re-open.
There are new servers installed in this rack, most of these are the new cloudvirts. The servers will be racked and connected and then netbox gets updated so there is a lag between these things happening.
@elukey that was my fault, I left it in its BIOS settings when I left yesterday. I rebooted and it's back.
@RobH these are ready for installs when you have the time.
@RobH this is ready for install when you have time.
Mar 23 2021
Fixed the primary port for cloudgw1001
Disk replaced with a disk from decom'd db host
Mar 19 2021
@elukey can I move the 2 servers anytime or does this need to be scheduled?
Mar 18 2021
@ayounsi I verified that all of the ports listed in https://librenms.wikimedia.org/ports/state=down/hostname=asw/format=list_basic/ are not in service at the moment. There were 2 in fundraising that I already took care of. Please disable these ports. Thanks!