Page MenuHomePhabricator

Re-import full planet data into eqiad and codfw
Closed, ResolvedPublic

Description

Background information

Re-import full planet data into maps1009 and then maps2009

Process documented at Run a planet import

Open questions

  • How much SRE intervention will be needed?

Acceptance criteria

  • Planet imported on eqiad
  • Planet imported on codfw
  • DB is refreshed with most updated data
  • perform DC switchover

Event Timeline

Restricted Application added a subscriber: Aklapper. ยท View Herald TranscriptAug 3 2022, 11:41 AM

Hi @Jgiannelos,

Which of the above steps would require an SRE to intervene? Merging puppet changes is one that comes to mind, but others may too?

One question that does come to mind is whether the machine have enough disk space to have both schemas around. A quick

ssh maps1009.eqiad.wmnet df -h
Filesystem            Size  Used Avail Use% Mounted on
/dev/mapper/vg0-root   73G   37G   33G  54% /
/dev/mapper/vg0-srv   2.7T  2.0T  591G  78% /srv

appears to suggest that no?

I think that eqiad is left in a stale state and we never deleted the backup schema after an issue we had with the import.

On maps1009:

    schema_name     | pg_size_pretty 
--------------------+----------------
 import             | 728 GB
 information_schema | 96 kB
 pg_catalog         | 9000 kB
 pg_toast           | 95 GB
 public             | 805 GB
(5 rows)

and maps2009:

    schema_name     | pg_size_pretty 
--------------------+----------------
 information_schema | 96 kB
 pg_catalog         | 7584 kB
 pg_toast           | 53 GB
 public             | 800 GB

On maps1009 there are 700G of import data that are going to be replaced on next import, so i think we are good with storage.

Regarding SRE support, planet import is probably the most common task we do for maps maintenance so I think it might be a good opportunity for folks that are or will be involved with maps operation to get familiarized with. I was thinking that it can be a good example of knowledge transfer so we can do it together (me or @MSantos) with somebody from serviceops.

I think that eqiad is left in a stale state and we never deleted the backup schema after an issue we had with the import.

On maps1009:

    schema_name     | pg_size_pretty 
--------------------+----------------
 import             | 728 GB
 information_schema | 96 kB
 pg_catalog         | 9000 kB
 pg_toast           | 95 GB
 public             | 805 GB
(5 rows)

and maps2009:

    schema_name     | pg_size_pretty 
--------------------+----------------
 information_schema | 96 kB
 pg_catalog         | 7584 kB
 pg_toast           | 53 GB
 public             | 800 GB

On maps1009 there are 700G of import data that are going to be replaced on next import, so i think we are good with storage.

OK, good enough for me.

Regarding SRE support, planet import is probably the most common task we do for maps maintenance so I think it might be a good opportunity for folks that are or will be involved with maps operation to get familiarized with. I was thinking that it can be a good example of knowledge transfer so we can do it together (me or @MSantos) with somebody from serviceops.

Let me see what I can do about that.

jijiki renamed this task from Re-import full planet data into eqiad to Re-import full planet data into eqiad and codfw.Nov 22 2022, 2:49 PM
jijiki changed the task status from Open to In Progress.
jijiki updated the task description. (Show Details)

Planet import in eqiad (on maps1009) started at 11:53 UTC

Mentioned in SAL (#wikimedia-operations) [2022-11-30T15:33:58Z] <effie> jiji@maps1009 imposm-removebackup-import - T314472

@Jgiannelos and I have successfully completed re-imported full planet data on eqiad. Next up, we are working on ways we can warm up eqiad's tile cache; we are leaning towards mirroring a percentage of codfw's (live) traffic to eqiad, which will trigger tile creation

Change 866377 had a related patch set uploaded (by Effie Mouzeli; author: Effie Mouzeli):

[operations/deployment-charts@master] tegola-vector-tiles: use new tegola swift container in eqiad

https://gerrit.wikimedia.org/r/866377

Change 866379 had a related patch set uploaded (by Effie Mouzeli; author: Effie Mouzeli):

[operations/puppet@production] hieradata: enable maps replication and tile_generation timers

https://gerrit.wikimedia.org/r/866379

Change 866377 merged by Effie Mouzeli:

[operations/deployment-charts@master] tegola-vector-tiles: use new tegola swift container in eqiad

https://gerrit.wikimedia.org/r/866377

Change 866379 merged by Effie Mouzeli:

[operations/puppet@production] hieradata: enable maps replication and tile_generation timers

https://gerrit.wikimedia.org/r/866379

Change 866442 had a related patch set uploaded (by Jgiannelos; author: Jgiannelos):

[operations/puppet@production] maps: Use new swift container for eqiad pregeneration

https://gerrit.wikimedia.org/r/866442

Change 866442 merged by Effie Mouzeli:

[operations/puppet@production] maps: Use new swift container for eqiad pregeneration

https://gerrit.wikimedia.org/r/866442

Change 868343 had a related patch set uploaded (by Effie Mouzeli; author: Effie Mouzeli):

[operations/deployment-charts@master] tegola-vector-tiles: add more replicas

https://gerrit.wikimedia.org/r/868343

Change 868343 merged by jenkins-bot:

[operations/deployment-charts@master] tegola-vector-tiles: add more replicas

https://gerrit.wikimedia.org/r/868343

Import to eqiad has been completed and traffic is being served via eqiad.

Change 879556 had a related patch set uploaded (by Effie Mouzeli; author: Effie Mouzeli):

[operations/puppet@production] hieradata: disable maps tile_generation timers for planet import

https://gerrit.wikimedia.org/r/879556

Change 879556 merged by Effie Mouzeli:

[operations/puppet@production] hieradata: disable maps tile_generation timers for planet import

https://gerrit.wikimedia.org/r/879556

Mentioned in SAL (#wikimedia-operations) [2023-01-12T15:28:47Z] <effie> Planet import in codfw (on maps2009) started at 15:26 UTC - T314472

Change 880933 had a related patch set uploaded (by Effie Mouzeli; author: Effie Mouzeli):

[operations/deployment-charts@master] tegola-vector-tiles: use new tegola swift container in codfw

https://gerrit.wikimedia.org/r/880933

Change 880934 had a related patch set uploaded (by Effie Mouzeli; author: Effie Mouzeli):

[operations/deployment-charts@master] tegola-vector-tiles: disable tile pregeneration in codfw

https://gerrit.wikimedia.org/r/880934

Change 880933 merged by jenkins-bot:

[operations/deployment-charts@master] tegola-vector-tiles: use new tegola swift container in codfw

https://gerrit.wikimedia.org/r/880933

Change 880934 merged by jenkins-bot:

[operations/deployment-charts@master] tegola-vector-tiles: disable tile pregeneration in codfw

https://gerrit.wikimedia.org/r/880934

Change 881597 had a related patch set uploaded (by Effie Mouzeli; author: Effie Mouzeli):

[operations/deployment-charts@master] tegola-vector-tiles: tile pregeneration and replicas bump

https://gerrit.wikimedia.org/r/881597

Change 881599 had a related patch set uploaded (by Effie Mouzeli; author: Effie Mouzeli):

[operations/puppet@production] hieradata: enable maps timers in codfw

https://gerrit.wikimedia.org/r/881599

Change 881599 merged by Effie Mouzeli:

[operations/puppet@production] hieradata: enable maps timers in codfw

https://gerrit.wikimedia.org/r/881599

Change 881597 merged by jenkins-bot:

[operations/deployment-charts@master] tegola-vector-tiles: tile pregeneration and replicas bump

https://gerrit.wikimedia.org/r/881597

Import to codfw has been completed, and we have bootstrapped its tile storage using https://gerrit.wikimedia.org/r/c/operations/puppet/+/875973

We are happily serving maps from codfw, and both datacentres are up to date ๐ŸŽ‰

Lots hope itโ€™s finally more predictable from this point forward. Thank you for the perseverance team!

We still have some docs pending in order to close this ticket.

Closing, I will add the URL of the relevant documentation when I finish writing it