Page MenuHomePhabricator

Phamhi (Phamhi)
User

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Thursday

  • Clear sailing ahead.

User Details

User Since
Aug 5 2019, 1:02 PM (6 w, 19 h)
Availability
Available
LDAP User
Phamhi
MediaWiki User
HPham (WMF) [ Global Accounts ]

Recent Activity

Yesterday

Phamhi claimed T232264: Change db password or tools.machtsinn.
Mon, Sep 16, 9:18 PM · Data-Services, cloud-services-team (Kanban)

Fri, Sep 13

Phamhi added a comment to T232769: Document some etcd cluster operations for Toolforge.

I have started the documentation which can be found here: https://wikitech.wikimedia.org/wiki/Etcd

Fri, Sep 13, 11:11 AM · cloud-services-team (Kanban), Toolforge, Wikimedia-Incident

Wed, Sep 11

Phamhi closed T230147: Toolforge: collect prometheus node exporter metrics from new k8s worker nodes as Resolved.

Documentation at https://wikitech.wikimedia.org/wiki/Portal:Toolforge/Admin/Kubernetes#Worker_nodes completed as per request. Marking as resolved.

Wed, Sep 11, 3:35 PM · cloud-services-team (Kanban)
Phamhi updated the task description for T228942: Onboard Hieu Pham to Wikimedia Foundation as SRE in Cloud Services.
Wed, Sep 11, 3:23 PM · cloud-services-team (Kanban)

Fri, Sep 6

Phamhi added a comment to T194859: Toolforge maintain-kubeusers doesn't fail well when LDAP servers are unreachable.

As a workaround, I'm going to use /usr/bin/timeout utility to wrap the command.

Fri, Sep 6, 5:44 PM · cloud-services-team (Kanban), Toolforge
Phamhi added a comment to T194859: Toolforge maintain-kubeusers doesn't fail well when LDAP servers are unreachable.
$ sudo grep RuntimeMaxSec /var/log/daemon.log
Sep  4 16:29:27 tools-k8s-master-01 systemd[1]: [/lib/systemd/system/maintain-kubeusers-timer.service:7] Unknown lvalue 'RuntimeMaxSec' in section 'Service'
Fri, Sep 6, 4:13 PM · cloud-services-team (Kanban), Toolforge

Thu, Aug 29

Phamhi added a comment to T230147: Toolforge: collect prometheus node exporter metrics from new k8s worker nodes.

I have updated the docs located at https://wikitech.wikimedia.org/wiki/Portal:Toolforge/Admin/Kubernetes#Worker_nodes to include the command to update the prometheus-node-exporter package after the build.

Thu, Aug 29, 4:14 PM · cloud-services-team (Kanban)

Wed, Aug 28

Phamhi updated the task description for T228942: Onboard Hieu Pham to Wikimedia Foundation as SRE in Cloud Services.
Wed, Aug 28, 5:37 PM · cloud-services-team (Kanban)

Tue, Aug 27

Phamhi updated the task description for T228942: Onboard Hieu Pham to Wikimedia Foundation as SRE in Cloud Services.
Tue, Aug 27, 6:22 PM · cloud-services-team (Kanban)

Mon, Aug 26

Phamhi updated the task description for T228942: Onboard Hieu Pham to Wikimedia Foundation as SRE in Cloud Services.
Mon, Aug 26, 12:40 PM · cloud-services-team (Kanban)
Phamhi updated the task description for T228942: Onboard Hieu Pham to Wikimedia Foundation as SRE in Cloud Services.
Mon, Aug 26, 12:38 PM · cloud-services-team (Kanban)
Phamhi added a comment to T230147: Toolforge: collect prometheus node exporter metrics from new k8s worker nodes.

Does it make more sense to close this ticket as the original issue has been resolved? We then create a new ticket to prevent this issue from re-occurring?

Mon, Aug 26, 12:35 PM · cloud-services-team (Kanban)

Aug 15 2019

Phamhi added a comment to T229871: relocate/reimage cloudvirt1023 with 10G interfaces.

Blocked due to https://phabricator.wikimedia.org/T212855

Aug 15 2019, 3:33 PM · ops-eqiad, DC-Ops, Operations, Epic, cloud-services-team (Kanban)
Phamhi updated subscribers of T212855: debian installer prompts in cloudvirt servers partman configuration.
Aug 15 2019, 3:32 PM · cloud-services-team (Kanban)

Aug 14 2019

Phamhi added a comment to T229871: relocate/reimage cloudvirt1023 with 10G interfaces.

I managed to bypass that issue by running

Aug 14 2019, 5:16 PM · ops-eqiad, DC-Ops, Operations, Epic, cloud-services-team (Kanban)

Aug 13 2019

Phamhi added a comment to T230147: Toolforge: collect prometheus node exporter metrics from new k8s worker nodes.

I created a new instance "toolsbeta-test-puppet-sandbox" with jessie image and it looks like it came with prometheus-node-exporter version 0.14.0 not 0.17.0. As per Arturo's suggestion, I am looking into create a Puppet patch for this issue.

Aug 13 2019, 12:13 PM · cloud-services-team (Kanban)

Aug 12 2019

Phamhi added a comment to T230147: Toolforge: collect prometheus node exporter metrics from new k8s worker nodes.

The metrics are now exposed

Aug 12 2019, 5:08 PM · cloud-services-team (Kanban)
Phamhi added a comment to T230147: Toolforge: collect prometheus node exporter metrics from new k8s worker nodes.

During the prometheus-node-exporter.service startup, the following error occurs

Aug 12 2019, 3:15 PM · cloud-services-team (Kanban)
Phamhi added a comment to T230147: Toolforge: collect prometheus node exporter metrics from new k8s worker nodes.

In horizon, "Instance Console Log", I can see the following logs for tools-worker-1030

Aug 12 2019, 11:45 AM · cloud-services-team (Kanban)

Aug 9 2019

Phamhi claimed T230147: Toolforge: collect prometheus node exporter metrics from new k8s worker nodes.
Aug 9 2019, 5:43 PM · cloud-services-team (Kanban)
Phamhi committed rLPRIa651e13db225: admin: add phamhi public key (authored by Phamhi).
admin: add phamhi public key
Aug 9 2019, 4:39 PM

Aug 8 2019

Phamhi added a comment to T230126: LDAP: multiples accounts for Phamhi.
  • uid=hpham,ou=people,dc=wikimedia,dc=org has no Cloud VPS memberships
    • Created by the OIT group on the first day.. I think.. according to them.. it's for "your Gmail, SF Office WiFi, VPN, and Fileserver".. if this is the case is it a good idea to block it?
Aug 8 2019, 6:54 PM · Patch-For-Review, LDAP, cloud-services-team (Kanban)
Phamhi added a comment to T230126: LDAP: multiples accounts for Phamhi.

If I can only keep one, I prefer the first one
uid=phamhi,ou=people,dc=wikimedia,dc=org

Aug 8 2019, 4:02 PM · Patch-For-Review, LDAP, cloud-services-team (Kanban)

Aug 6 2019

Phamhi added a comment to T229833: SRE: root access for Hieu Pham, SRE @ WMCS.

My SSH public key:

Aug 6 2019, 12:00 PM · Operations, SRE-Access-Requests, cloud-services-team (Kanban)

Aug 5 2019

Phamhi updated the task description for T228942: Onboard Hieu Pham to Wikimedia Foundation as SRE in Cloud Services.
Aug 5 2019, 6:50 PM · cloud-services-team (Kanban)
Phamhi updated the task description for T228942: Onboard Hieu Pham to Wikimedia Foundation as SRE in Cloud Services.
Aug 5 2019, 3:54 PM · cloud-services-team (Kanban)
Phamhi claimed T228942: Onboard Hieu Pham to Wikimedia Foundation as SRE in Cloud Services.
Aug 5 2019, 3:42 PM · cloud-services-team (Kanban)
Phamhi updated the task description for T228942: Onboard Hieu Pham to Wikimedia Foundation as SRE in Cloud Services.
Aug 5 2019, 2:42 PM · cloud-services-team (Kanban)
Phamhi updated the task description for T228942: Onboard Hieu Pham to Wikimedia Foundation as SRE in Cloud Services.
Aug 5 2019, 2:38 PM · cloud-services-team (Kanban)
Phamhi updated the task description for T228942: Onboard Hieu Pham to Wikimedia Foundation as SRE in Cloud Services.
Aug 5 2019, 2:28 PM · cloud-services-team (Kanban)
Phamhi updated the task description for T228942: Onboard Hieu Pham to Wikimedia Foundation as SRE in Cloud Services.
Aug 5 2019, 1:55 PM · cloud-services-team (Kanban)
Phamhi updated the task description for T228942: Onboard Hieu Pham to Wikimedia Foundation as SRE in Cloud Services.
Aug 5 2019, 1:06 PM · cloud-services-team (Kanban)