Page MenuHomePhabricator

Character encoding issue with wm-bot's *.txt files
Closed, ResolvedPublic

Description

View the raw text file http://bots.wmflabs.org/~wm-bot/logs/%23wikimedia-office/20150806.txt
Both Firefox and chromium on my Ubuntu display

This week our compère

because the HTTP response doesn't specify an encoding,

From
http://stackoverflow.com/questions/913869/how-to-change-the-default-encoding-to-utf-8-for-server , the fix is either

  • Add a .htaccess file to http://bots.wmflabs.org/~wm-bot/ specifying charset
  • Specify AddDefaultCharset utf-8 in the httpd.conf of the server. I bet no tool relies on the browsers' default iso8859/Windows code page 1252 encoding, it dates from the Windows 98 era.

The workaround in Firefox is View > Text Encoding and choose Unicode or one of the Auto-Detect values. In chromium, choose [hamburger menu] > More tools > Encoding > Unicode (UTF-8).

(This bug sounds like T71662: Character encoding issues on wm-bot logs but that bug's example URL is a web page whose HTML has <meta charset="UTF-8">, so it displays fine.)

Event Timeline

Spage raised the priority of this task from to Needs Triage.
Spage updated the task description. (Show Details)
Spage added projects: Tool-WMT-bots, WM-Bot.
Spage subscribed.
Petrb claimed this task.
Petrb subscribed.

Yay just 2 years to fix this!