Page MenuHomePhabricator

crusnov (Cas Rusnov)
User

Projects

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Tuesday

  • Clear sailing ahead.

User Details

User Since
Oct 15 2018, 5:56 PM (105 w, 5 d)
Availability
Available
LDAP User
CRusnov
MediaWiki User
CRusnov (WMF) [ Global Accounts ]

Recent Activity

Wed, Oct 21

crusnov moved T265905: Netbox: ImportPuppetDB should warn if it imported a SLAAC IP from Backlog to Discussion / Design / Consensus Making on the netbox board.
Wed, Oct 21, 9:35 PM · netbox
crusnov moved T209182: Setup Swift Storage for Netbox image (was: netbox won't allow me to upload photos of the rack) from Patches / Reviews / WIP to Complete on the netbox board.
Wed, Oct 21, 3:52 PM · Patch-For-Review, netbox, Operations
crusnov closed T241074: Upgrade pynetbox package to latest as Resolved.

This has been done several times over, and so this ticket isn't actually that useful.

Wed, Oct 21, 3:50 PM · SRE-tools, netbox, User-crusnov

Thu, Oct 1

crusnov moved T240176: OKR: Provide a mechanism to obtain the FQDN given a short hostname from Up next to In Progress on the SRE-tools board.
Thu, Oct 1, 10:09 PM · netbox, SRE-tools

Tue, Sep 29

crusnov triaged T264106: Make netbox-extras repository auto-merge on +2 as High priority.
Tue, Sep 29, 4:04 PM · SRE-tools
jcrespo awarded T264100: check_systemd_state fails unexpectedly when run with Python 3 a Piece of Eight token.
Tue, Sep 29, 3:51 PM · SRE-tools, User-crusnov
crusnov closed T264100: check_systemd_state fails unexpectedly when run with Python 3, a subtask of T247364: Forward port Python2 files to Python3 in Puppet Repository, as Resolved.
Tue, Sep 29, 3:50 PM · Patch-For-Review, User-MoritzMuehlenhoff, User-crusnov, User-jbond, Python3-Porting, SRE-tools, Puppet
crusnov closed T264100: check_systemd_state fails unexpectedly when run with Python 3 as Resolved.

Fix deployed!

Tue, Sep 29, 3:50 PM · SRE-tools, User-crusnov
crusnov triaged T264105: Investigate switching diffscan to ndiff as Low priority.
Tue, Sep 29, 3:48 PM · SRE-tools
crusnov added a subtask for T247364: Forward port Python2 files to Python3 in Puppet Repository: T264100: check_systemd_state fails unexpectedly when run with Python 3.
Tue, Sep 29, 3:17 PM · Patch-For-Review, User-MoritzMuehlenhoff, User-crusnov, User-jbond, Python3-Porting, SRE-tools, Puppet
crusnov added a parent task for T264100: check_systemd_state fails unexpectedly when run with Python 3: T247364: Forward port Python2 files to Python3 in Puppet Repository.
Tue, Sep 29, 3:17 PM · SRE-tools, User-crusnov
crusnov triaged T264100: check_systemd_state fails unexpectedly when run with Python 3 as High priority.
Tue, Sep 29, 3:16 PM · SRE-tools, User-crusnov
crusnov created T264100: check_systemd_state fails unexpectedly when run with Python 3.
Tue, Sep 29, 3:16 PM · SRE-tools, User-crusnov

Mon, Sep 28

crusnov archived P12820 Masterwork From Distant Lands.
Mon, Sep 28, 11:31 PM
crusnov moved T258729: netbox DNS Automation Workflow checklist for Commissioning and Decommissioning 2020Q1 from Discussion / Design / Consensus Making to Work in Progress / Tasks to Do on the netbox board.
Mon, Sep 28, 6:03 PM · Patch-For-Review, SRE-tools, User-crusnov, netbox

Fri, Sep 25

crusnov closed T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces, a subtask of T205897: Netbox: fill network topology, as Resolved.
Fri, Sep 25, 5:03 PM · netbox, Operations
crusnov closed T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces, a subtask of T250429: Homer: Netbox driven switch interfaces, as Resolved.
Fri, Sep 25, 5:03 PM · netops, Operations, homer
crusnov closed T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces, a subtask of T258729: netbox DNS Automation Workflow checklist for Commissioning and Decommissioning 2020Q1, as Resolved.
Fri, Sep 25, 5:03 PM · Patch-For-Review, SRE-tools, User-crusnov, netbox
crusnov closed T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces as Resolved.

I believe this is complete, any further patches should point at the parent task T258729

Fri, Sep 25, 5:03 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov

Sep 24 2020

crusnov added projects to T148976: Strongswan Icinga check: do not report issues about depooled hosts: observability, serviceops.
Sep 24 2020, 4:02 PM · Patch-For-Review, serviceops, observability, Operations
crusnov triaged T262587: recommendation-api alerting and api errors as Medium priority.
Sep 24 2020, 3:53 PM · serviceops, Operations, Recommendation-API

Sep 23 2020

crusnov triaged T263291: experiment with a "unified" ATS-BE pool as Medium priority.
Sep 23 2020, 7:51 PM · Performance-Team (Radar), Traffic, Operations
crusnov triaged T263437: Allow easier ICU transitions in MediaWiki as Medium priority.
Sep 23 2020, 7:03 PM · Platform Engineering, MediaWiki-General, Operations
crusnov assigned T263536: Aggregated metrics for ats-tls <-> clients ttfb percentiles to fgiunchedi.
Sep 23 2020, 7:02 PM · Patch-For-Review, User-fgiunchedi, Operations, observability, Traffic
crusnov triaged T263545: Decide on logging in k8s for ShellBox as Medium priority.
Sep 23 2020, 6:56 PM · MW-on-K8s, serviceops, Operations
crusnov added a comment to T247364: Forward port Python2 files to Python3 in Puppet Repository.

These all pass Python 3 tox and have no patches for automated porting, so seem ready for the Python 3 future. They are low priority to touch at all so I'll let the lie for now.

Sep 23 2020, 4:23 PM · Patch-For-Review, User-MoritzMuehlenhoff, User-crusnov, User-jbond, Python3-Porting, SRE-tools, Puppet

Sep 22 2020

crusnov triaged T263423: librenms page didn't auto-resolve in VO as Medium priority.
Sep 22 2020, 11:18 PM · Operations, observability
crusnov triaged T135226: Wikimedia-RU mailing list page has wrong encoding as Medium priority.
Sep 22 2020, 6:44 PM · Operations, Wikimedia-Mailing-lists

Sep 21 2020

crusnov added a comment to T262468: wikitech-static access for Sam Reed.

This has been approved in the team meeting and followed up on IRC. Let us know if there's anything that needs to be done further.

Sep 21 2020, 8:33 PM · cloud-services-team (Kanban), Operations, SRE-Access-Requests
crusnov added a comment to T263191: Allow Nicholas Skaggs to issue icinga commands.

I see the patch is already merged, @nskaggs please test icinga command and followup so we can close ticket. Thanks!

Sep 21 2020, 8:30 PM · SRE-Access-Requests, Operations
crusnov added a comment to T233937: Add U2F/FIDO as second factor for CAS.

@Volans just asked if there is a way to register multiple u2f devices to the same account. Of the top of my head im not sure how to achive that but placing a not here as something to investigate

Sep 21 2020, 5:22 PM · CAS-SSO, User-jbond, Patch-For-Review, Operations

Sep 16 2020

crusnov added a comment to T236582: "automation-framework" Cloud VPS project jessie deprecation.

We shall have to destroy this and recreate it as buster which should be relatively painless. We actually do need this since we need "secrets" for this cluster of boxes afaik.

Sep 16 2020, 4:11 PM · Cloud-VPS (Debian Jessie Deprecation)

Sep 14 2020

crusnov committed rOSNE527e7b870e91: interface_automation: Fix the messages when setting IP as primary (authored by crusnov).
interface_automation: Fix the messages when setting IP as primary
Sep 14 2020, 9:16 PM
crusnov committed rOSNEf034a36bc475: reports/cables.py: Exclude servers from interface name report (authored by crusnov).
reports/cables.py: Exclude servers from interface name report
Sep 14 2020, 6:34 PM

Sep 11 2020

crusnov updated subscribers of T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.

Update on offline VMs
https://netbox.wikimedia.org/virtualization/virtual-machines/?q=&status=offline&mac_address=

Sep 11 2020, 11:40 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov

Sep 10 2020

crusnov added a subtask for T229397: Puppet: get row/rack info from Netbox: T262446: Import row information into Netbox for Ganeti instances.
Sep 10 2020, 2:57 PM · observability, User-crusnov, User-jbond, Patch-For-Review, Puppet, Operations
crusnov added a parent task for T262446: Import row information into Netbox for Ganeti instances: T229397: Puppet: get row/rack info from Netbox.
Sep 10 2020, 2:57 PM · netbox

Sep 9 2020

crusnov triaged T262446: Import row information into Netbox for Ganeti instances as Medium priority.
Sep 9 2020, 5:14 PM · netbox
crusnov awarded T262393: Improve process to add/update keys for pwstore repo a 100 token.
Sep 9 2020, 4:10 PM · Operations

Sep 2 2020

crusnov closed T258942: Upgrade Netbox to >= 2.8.8 as Resolved.

Done and done.

Sep 2 2020, 2:41 PM · Patch-For-Review, netbox

Sep 1 2020

crusnov added a comment to T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.

Some notes about the current progress:

  • Imports seem 99% good.
  • Massimport miniscript deals with the manual cases
  • there is a hack in the interface automation script to deal with some mis-netmasking on some extra ip addresses on aqs*,restbase*,sessionstore*
Sep 1 2020, 8:43 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov

Aug 31 2020

crusnov committed rOBPY509628b20e89: Update for 5.0.7 and fix some Debian issues (authored by crusnov).
Update for 5.0.7 and fix some Debian issues
Aug 31 2020, 11:18 PM
crusnov committed rOBPYcaa7fcc6fea7: Update for 5.0.7 and fix some Debian issues (authored by crusnov).
Update for 5.0.7 and fix some Debian issues
Aug 31 2020, 11:17 PM
crusnov committed rOBPY8ec45e00baf3: Update for 5.0.7 and fix some Debian issues (authored by crusnov).
Update for 5.0.7 and fix some Debian issues
Aug 31 2020, 8:02 PM
crusnov committed rOBPY5022a7be3d4c: Make minor debian package fixes to fix some lintian issues (authored by crusnov).
Make minor debian package fixes to fix some lintian issues
Aug 31 2020, 7:06 PM

Aug 28 2020

crusnov committed rOBPY3cec858b11dd: changelog: Add entry for 5.0.7 release. (authored by crusnov).
changelog: Add entry for 5.0.7 release.
Aug 28 2020, 9:35 PM
crusnov committed rOBPYbd39a1b168fc: Revert local hack to sources (authored by hashar).
Revert local hack to sources
Aug 28 2020, 9:35 PM
crusnov committed rOBPY57d979d53a79: Configuration for gbp buildpackage (authored by hashar).
Configuration for gbp buildpackage
Aug 28 2020, 9:35 PM
crusnov committed rOBPY2c790d5a6ed5: add final newline (authored by crusnov).
add final newline
Aug 28 2020, 9:35 PM
crusnov committed rOBPYe4c18ae85f58: Patch requirements in debian branch (authored by crusnov).
Patch requirements in debian branch
Aug 28 2020, 9:35 PM
crusnov committed rOBPY91b6e14b4134: Fix patches (authored by crusnov).
Fix patches
Aug 28 2020, 9:35 PM
crusnov committed rOBPY094b1f89e6a8: Update changelog (authored by crusnov).
Update changelog
Aug 28 2020, 9:35 PM
crusnov committed rOBPYb81684c3d198: Add git build-dep. Add vcs links (authored by crusnov).
Add git build-dep. Add vcs links
Aug 28 2020, 9:35 PM
crusnov committed rOBPY48c123a90cc6: Bump dh compat to 9 for stretch. (authored by crusnov).
Bump dh compat to 9 for stretch.
Aug 28 2020, 9:35 PM
crusnov committed rOBPY13db8fd5c7df: debian/rules: skip autotest (authored by crusnov).
debian/rules: skip autotest
Aug 28 2020, 9:35 PM
crusnov committed rOBPY5ba478661c2c: update gpb.conf (authored by crusnov).
update gpb.conf
Aug 28 2020, 9:35 PM
crusnov committed rOBPYb1dc44ab05a6: update gpb.conf (authored by crusnov).
update gpb.conf
Aug 28 2020, 9:35 PM
crusnov committed rOBPYbc96d13596df: initial debian stuff (authored by crusnov).
initial debian stuff
Aug 28 2020, 9:35 PM

Aug 21 2020

crusnov updated subscribers of T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.
DNS vs host mismatch:
Host ganeti5001 has inet6 2001:df2:e500:101:d294:66ff:fe81:9090/64 while DNS has  1H  IN AAAA 2001:df2:e500:101:10:132:0:21
similar for ganeti5002 and ganeti5003
Aug 21 2020, 9:33 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov
crusnov added a comment to T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.
FAILED not in puppetdb:
Found origin 21.64.10
Missing line: 125 1H IN PTR   an-presto1004.eqiad.wmnet.
Found origin 5.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2
Missing line: 5.2.1.0.1.2.0.0.4.6.0.0.0.1.0.0 1H IN PTR   an-presto1004.eqiad.wmnet.
Missing line: an-presto1004    1H IN A             10.64.21.125
Missing line: an-presto1004    1H IN AAAA          2620:0:861:105:10:64:21:125
Found origin 16.192.10
Missing line: 47  1H IN PTR   wtp2005.codfw.wmnet.
Missing line: wtp2005         1H  IN A    10.192.16.47
Aug 21 2020, 9:28 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov
crusnov renamed T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces from Import PuppetDB interfaces into Netbox to Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.
Aug 21 2020, 12:00 AM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov

Aug 20 2020

crusnov committed rOSNBbb8d6a3c8f5b: Fix imports for settings mod. (authored by crusnov).
Fix imports for settings mod.
Aug 20 2020, 7:08 PM
crusnov committed rOSNB501d84bcc65f: Add a passthrough configuration system (authored by crusnov).
Add a passthrough configuration system
Aug 20 2020, 7:08 PM
crusnov committed rOSNBe28fed247d94: switch swagger to nonpublic mode (authored by crusnov).
switch swagger to nonpublic mode
Aug 20 2020, 7:08 PM
crusnov added a comment to T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.

Missing line: 1 1H IN PTR vrrp-gw-103.esams.wikimedia.org. vs vrrp-gw-103.esams.wmnet in https://netbox.wikimedia.org/ipam/ip-addresses/2691/
Fixed.

Found origin 1.0.0.0.0.0.5.e.2.f.d.0.1.0.0.2
Missing line: 2.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   ae1-510.cr3-eqsin.wikimedia.org.
Missing line: 3.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   ae1-510.cr2-eqsin.wikimedia.org.
Found origin 1.0.1.0.0.0.5.e.2.f.d.0.1.0.0.2
Missing line: 2.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   ae1-520.cr3-eqsin.eqsin.wmnet.
Missing line: 3.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   ae1-520.cr2-eqsin.eqsin.wmnet.
Found origin 1.0.2.0.0.0.5.e.2.f.d.0.1.0.0.2
Missing line: 2.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   ae1-530.cr3-eqsin.wikimedia.org.
Missing line: 3.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   ae1-530.cr2-eqsin.wikimedia.org.
Found origin 7.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2
Missing line: 2.0.0.0.0.0.0.0.0.0.0.0.0.0.e.f 1H IN PTR   ae4-1020.cr2-eqiad.wikimedia.org.

Invalid (don't match a real IP).

Missing zonefile 1.0.e.f.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa for 1.0.e.f.1.6.8.0.0.0.0.0.0.2.6.2
Missing line: 1.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   gr-4-3-0-1.cr1-eqiad.wikimedia.org.
Missing line: 2.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   gr-0-0-0-1.cr2-eqord.wikimedia.org.

Decommissioned some time ago.

Processing zone 2.6.8.0.0.0.0.0.0.2.6.2 in zonefile 2.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa
Found origin 1.0.0.0.2.6.8.0.0.0.0.0.0.2.6.2
Missing line: 1.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   vrrp-gw-100.esams.wikimedia.org.
Found origin 2.0.1.0.2.6.8.0.0.0.0.0.0.2.6.2
Missing line: 1.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   vrrp-gw-103.esams.wikimedia.org.
Found origin 1.0.2.0.2.6.8.0.0.0.0.0.0.2.6.2
Missing line: 1.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   vrrp-gw-102.wikimedia.org.
Processing zone 3.6.8.0.0.0.0.0.0.2.6.2 in zonefile 3.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa
Found origin 1.0.0.0.3.6.8.0.0.0.0.0.0.2.6.2
Missing line: 1.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   vrrp-gw-1201.wikimedia.org.
Found origin 1.0.1.0.3.6.8.0.0.0.0.0.0.2.6.2
Missing line: 1.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   vrrp-gw-1211.wikimedia.org.
Found origin 1.0.2.0.3.6.8.0.0.0.0.0.0.2.6.2
Missing line: 1.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 1H IN PTR   vrrp-gw-1221.wikimedia.org.

Now correct in Netbox.

Aug 20 2020, 3:57 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov

Aug 10 2020

crusnov closed T258912: netbox1001's root partition is filling up as Resolved.

I have fixed this problem and placed a long term fix in place.

Aug 10 2020, 6:02 PM · Operations
crusnov committed rOSNEf6970c0085f3: rotatedump: Enhance to retain period copies (authored by crusnov).
rotatedump: Enhance to retain period copies
Aug 10 2020, 3:54 PM

Aug 4 2020

crusnov updated subscribers of T247364: Forward port Python2 files to Python3 in Puppet Repository.

update on this project:

Aug 4 2020, 10:08 PM · Patch-For-Review, User-MoritzMuehlenhoff, User-crusnov, User-jbond, Python3-Porting, SRE-tools, Puppet

Jul 31 2020

crusnov added a comment to T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.

I have dumped and reloaded from a more recent dump (2020-07-31) and rerun import with preserving all IP addresses in the 'bindings' list instead of just primary IPv4 and IPv6.

Jul 31 2020, 9:56 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov

Jul 30 2020

crusnov added a comment to T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.

In terms of the extra IP addresses, the import only looks at the ip/ip6 fields and not the binding fields. It would be "relatively" easy to make it look at interface bindings instead but it would mean restructuring some things. A big problem is that the names aren't stored in puppetdb at all, and would have to be manually added to those IP addresses anyway, which makes me think that extra things like these should be handled in a manual fashion or at least separately.

Jul 30 2020, 3:46 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov
crusnov added a comment to T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.

@crusnov was a new dump used? From a quick run I didn't get any diff from the last run of yesterday.

Jul 30 2020, 3:43 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov
crusnov added a comment to T259166: Move device attributes.

This sounds completely doable, and not to big of a deal.

Jul 30 2020, 3:42 PM · SRE-tools, netbox

Jul 29 2020

crusnov added a comment to T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.

Oh also note recent update fixes devices which use previously reserved ip addresses now set the state to active and clear the comment if the comment is 'reserved for infra'

Jul 29 2020, 10:53 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov
crusnov added a comment to T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.

I have dumped and reloaded, the results are similar but now all VMs have their interfaces set to the virtual type since the API explodes if you don't.

Jul 29 2020, 9:50 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov

Jul 28 2020

crusnov committed rOSNB6c23bc127423: Fix imports for settings mod. (authored by crusnov).
Fix imports for settings mod.
Jul 28 2020, 7:58 PM
crusnov committed rOSNB9ae6a4ae3236: Add a passthrough configuration system (authored by crusnov).
Add a passthrough configuration system
Jul 28 2020, 7:58 PM
crusnov committed rOSNBe0d4d9ee3ab0: switch swagger to nonpublic mode (authored by crusnov).
switch swagger to nonpublic mode
Jul 28 2020, 7:58 PM
crusnov added a comment to T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.

The output and logs (starting on line 2823) of the last test run can be found in P11972. The data was imported into netbox-next.wikimedia.org after its data was reset using the last production DB dump.

After checking the above paste and spot-checking devices in netbox-next, here my comments/questions/suggestions/issues found:

  • Devices with Netbox status of Planned and Inventory are reported as they were missing facts in PuppetDB, let's skip them entirely to reduce noise
  • Devices with a tenant must be managed in a different way as they are not managed by our production Puppet hence not in PuppetDB.
    • Do not lookup devices with a tenant (not null) in PuppetDB
    • Manage Frack devices
    • Manage OIT host (just one for now) or skip if we decide to manage them manually or not manage them
    • Manage RIPE Atlas anchors or skip if we decide to manage them manually or not manage them
  • d-i-test is a special case VM as will never be in PuppetDB and is most of the time poweroff in Ganeti
  • install4001 has some issues, seems unrelated to this script, see T254157#6241107
  • people2001 is a VM marked as offline in Netbox, to be double checked
  • LVS hosts have all the cross-VLAN interfaces, for which we don't currently manage their DNS records, maybe we should, TBD. See for example https://netbox-next.wikimedia.org/dcim/devices/121/
  • Loopback interfaces need some more thoughts on what to do with them. As they are virtual and mostly managed by Puppet we might even decide to skip them entirely, although we have some DNS records related to them, in that case we might want to manage them but we'll need to manage their DNS records too. Notable cases:
    • Every host behind LVS has a lo:LVS with the VIP IP address to make LVS-DR work. This end up being set on multiple devices, creating duplicate IP addresses in Netbox (see https://netbox-next.wikimedia.org/ipam/ip-addresses/2775/ for example). I think we should probably skip those.
    • Some category of hosts have a long dynamic list of loopback interfaces that most likely has no sense to have in Netbox, but we should double check with their owners probably. In particular:
      • Kubernetes hosts with cali* interfaces
      • Ganeti hosts with tap* interfaces
      • Openstack hosts (cloudvirt*) with tap* interfaces
Jul 28 2020, 6:27 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov

Jul 27 2020

crusnov added a comment to T258942: Upgrade Netbox to >= 2.8.8.

Yep, will do. Moving that forward to asap.

Jul 27 2020, 9:09 PM · Patch-For-Review, netbox

Jul 23 2020

crusnov updated the task description for T258729: netbox DNS Automation Workflow checklist for Commissioning and Decommissioning 2020Q1.
Jul 23 2020, 5:32 PM · Patch-For-Review, SRE-tools, User-crusnov, netbox
crusnov updated the task description for T258729: netbox DNS Automation Workflow checklist for Commissioning and Decommissioning 2020Q1.
Jul 23 2020, 5:13 PM · Patch-For-Review, SRE-tools, User-crusnov, netbox
crusnov added a parent task for T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces: T258729: netbox DNS Automation Workflow checklist for Commissioning and Decommissioning 2020Q1.
Jul 23 2020, 5:12 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov
crusnov added a subtask for T258729: netbox DNS Automation Workflow checklist for Commissioning and Decommissioning 2020Q1: T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.
Jul 23 2020, 5:12 PM · Patch-For-Review, SRE-tools, User-crusnov, netbox
crusnov triaged T258729: netbox DNS Automation Workflow checklist for Commissioning and Decommissioning 2020Q1 as High priority.
Jul 23 2020, 5:11 PM · Patch-For-Review, SRE-tools, User-crusnov, netbox
crusnov created T258729: netbox DNS Automation Workflow checklist for Commissioning and Decommissioning 2020Q1.
Jul 23 2020, 5:10 PM · Patch-For-Review, SRE-tools, User-crusnov, netbox

Jul 22 2020

crusnov added a comment to T244153: Import IP addresses, interfaces and DNS names into Netbox for Primary Interfaces.

The output and logs (starting on line 2823) of the last test run can be found in P11972. The data was imported into netbox-next.wikimedia.org after its data was reset using the last production DB dump.

After checking the above paste and spot-checking devices in netbox-next, here my comments/questions/suggestions/issues found:

[ x] Devices with Netbox status of Planned and Inventory are reported as they were missing facts in PuppetDB, let's skip them entirely to reduce noise

  • Devices with a tenant must be managed in a different way as they are not managed by our production Puppet hence not in PuppetDB.
    • Do not lookup devices with a tenant (not null) in PuppetDB
    • Manage Frack devices
    • Manage OIT host (just one for now) or skip if we decide to manage them manually or not manage them
    • Manage RIPE Atlas anchors or skip if we decide to manage them manually or not manage them
Jul 22 2020, 6:39 PM · Patch-For-Review, DC-Ops, SRE-tools, netbox, User-crusnov

Jul 13 2020

crusnov committed rOSNB12ae28e0e2ed: Fix imports for settings mod. (authored by crusnov).
Fix imports for settings mod.
Jul 13 2020, 11:21 PM
crusnov committed rOSNB8137a7d69045: switch swagger to nonpublic mode (authored by crusnov).
switch swagger to nonpublic mode
Jul 13 2020, 11:21 PM
crusnov committed rOSNB80da9dac167c: Add a passthrough configuration system (authored by crusnov).
Add a passthrough configuration system
Jul 13 2020, 11:21 PM
crusnov moved T253140: Create Scratch instance of Netbox from Discussion / Design / Consensus Making to Complete on the netbox board.
Jul 13 2020, 10:16 PM · netbox

Jul 6 2020

crusnov closed T253140: Create Scratch instance of Netbox as Resolved.

This is complete.

Jul 6 2020, 7:07 PM · netbox

Jun 30 2020

crusnov committed rOSNE80ac106bc303: offline_device: Clear primary IP addresses from device before deleting them. (authored by crusnov).
offline_device: Clear primary IP addresses from device before deleting them.
Jun 30 2020, 10:54 PM

Jun 23 2020

crusnov added a comment to T253140: Create Scratch instance of Netbox.

Thank you, I'll take care of it.

Jun 23 2020, 4:00 PM · netbox

Jun 17 2020

crusnov moved T255654: PhysicalHosts report erroring because of dubious host from Backlog to Work in Progress / Tasks to Do on the netbox board.
Jun 17 2020, 3:33 PM · User-crusnov, netbox
crusnov claimed T255654: PhysicalHosts report erroring because of dubious host .
Jun 17 2020, 3:33 PM · User-crusnov, netbox
crusnov added a comment to T255654: PhysicalHosts report erroring because of dubious host .

It should be straight forward to add a filter as you suggest although it's notable that the puppetdb proxy already gives us the short hostname so it'll have to altered very slightly but nbd i think.

Jun 17 2020, 3:31 PM · User-crusnov, netbox
crusnov added a comment to T233183: Automate generation of Management DNS records from Netbox.

Nice, this is what we pretty much had in mind, although in the future of course if we add more prefixes or change them we'll have to reserve them manually or whatever.

Jun 17 2020, 3:25 PM · netbox, Patch-For-Review, User-jbond, Operations, Traffic, User-crusnov, Goal, SRE-tools

Jun 16 2020

crusnov closed T253833: Netbox CSV dump failed as Resolved.
Jun 16 2020, 11:43 PM · netbox

May 28 2020

crusnov committed rOSNEb7ad2472fd90: rotatedump: Call correct dumpbackup script (authored by crusnov).
rotatedump: Call correct dumpbackup script
May 28 2020, 5:57 PM
crusnov added a comment to T253833: Netbox CSV dump failed.

Thanks, the log error is not the cause however, it is a longstanding issue that the dumps have had, I'll look at it.

May 28 2020, 3:44 PM · netbox