Page MenuHomePhabricator

Connection timeout from 195.77.175.64/29 to text-lb.esams.wikimedia.org
Closed, ResolvedPublic

Description

Issue initially reported to OTRS (#2018031310009017)

A customer of Telefónica de España with the range 195.77.175.64/29 is getting persistent timeouts when attempting to access es.wikipedia.org (text-lb.esams.wikimedia.org).

From the ticket:

TRACERT:
 
Tracing route to 91.198.174.192:80

1         43 ms        195.77.175.65   TimeExceeded
2          8 ms        225.red-193-152-56.static.ccgg.telefonica.net [193.152.56.225]  TimeExceeded
3       2006 ms        timed out
3       2005 ms        timed out
3       2003 ms        timed out
4       2003 ms        timed out
4       2004 ms        timed out
4       2001 ms        timed out
5       2005 ms        timed out
5       2002 ms        timed out
5       2002 ms        timed out
6         19 ms        ae-0-400-grtmadde3.net.telefonicaglobalsolutions.com [213.140.51.58]    TimeExceeded
7         61 ms        213.140.35.141  TimeExceeded
8         59 ms        213.140.53.57   TimeExceeded
9         50 ms        ldn-bb2-link.telia.net [62.115.143.26]  TimeExceeded
10        56 ms        adm-bb3-link.telia.net [213.155.136.99] TimeExceeded
11        53 ms        adm-b3-link.telia.net [62.115.122.191]  TimeExceeded
12        58 ms        wikimedia-ic-316335-adm-b3.c.telia.net [62.115.145.25]  TimeExceeded
13      2002 ms        timed out
13      2003 ms        timed out
13      2005 ms        timed out
14      2005 ms        timed out
14      2004 ms        timed out
14      2003 ms        timed out
15      2005 ms        timed out
15      2003 ms        timed out
15      2002 ms        timed out
16      2007 ms        timed out
16      2002 ms        timed out
16      2001 ms        timed out
17      2003 ms        timed out
17      2002 ms        timed out
17      2002 ms        timed out
18      2006 ms        timed out
18      2004 ms        timed out
18      2003 ms        timed out
19      2004 ms        timed out
19      2004 ms        timed out
19      2005 ms        timed out
20      2003 ms        timed out
20      2003 ms        timed out
20      2003 ms        timed out
21      2003 ms        timed out
21      2004 ms        timed out
21      2005 ms        timed out
22      2008 ms        timed out

Event Timeline

First, the questions and troubleshooting commands listed on https://wikitech.wikimedia.org/wiki/Reporting_a_connectivity_issue are useful to us.

Especially, please provide a curl output, as well as a pings (ideally also mtr).

In addition please run the same tests toward bast3002.wikimedia.org

For information, we can reach a random IP in the mentioned prefix so it doesn't seem to be a routing issue:

ayounsi@bast3002:~$ mtr 195.77.175.66 -z --report-wide
Start: Wed Mar 14 23:03:03 2018
HOST: bast3002                                                       Loss%   Snt   Last   Avg  Best  Wrst StDev
  1. AS14907 ae1-100.cr2-esams.wikimedia.org                          0.0%    10    0.2   2.2   0.2  20.0   6.2
  2. AS1299  adm-b3-link.telia.net                                    0.0%    10    0.5   0.4   0.2   1.2   0.0
  3. AS1299  adm-bb4-link.telia.net                                   0.0%    10    1.0   1.1   1.0   1.2   0.0
  4. AS1299  ldn-bb2-link.telia.net                                   0.0%    10    5.9   6.2   5.8   8.9   0.9
  5. AS1299  ldn-b1-link.telia.net                                    0.0%    10    6.6   6.5   6.4   6.8   0.0
  6. AS12956 et-7-0-0-0-grtlontl1.net.telefonicaglobalsolutions.com   0.0%    10    7.5  24.9   7.5  95.5  26.3
  7. AS12956 213.140.35.136                                           0.0%    10   48.2  48.2  48.2  48.3   0.0
  8. AS12956 213.140.51.59                                           90.0%    10   50.0  50.0  50.0  50.0   0.0
  9. AS???   ???                                                     100.0    10    0.0   0.0   0.0   0.0   0.0
 10. AS???   ???                                                     100.0    10    0.0   0.0   0.0   0.0   0.0
 11. AS3352  234.red-217-124-112.static.ccgg.telefonica.net           0.0%    10   57.5  57.6  57.4  58.3   0.0
 12. AS3352  226.red-193-152-56.static.ccgg.telefonica.net           90.0%    10   58.6  58.6  58.6  58.6   0.0
 13. AS3352  195.77.175.66                                            0.0%    10   51.0  51.1  51.0  51.9   0.0

Note that the timeouts in tracert don't explicitly means that there is an issue.

In fact, those timeouts appear even on the early hops. The ticket also includes a successful tcp traceroute, and no explicit mention of timeouts.

The original claim is that the user on behalf of which they mailed us "can not access the web: es.wikipedia.org", which could mean pretty much anything on higher-level layers, too, from a varnish block to a stuck entry on their browser cache. :(

@ayounsi I'm happy to pass on comments and suggestions to the OTRS ticket.

@Platonides Please feel free to update the task description to something more accurate - I attempted to summarise information (given we can't directly post OTRS ticket content) and I apologise if I summarised incorrectly :-)

No new updates since March 2018, feel free to reopen if the issue is still there.