User Details
- User Since
- Apr 24 2016, 12:52 AM (501 w, 5 d)
- Availability
- Available
- LDAP User
- Antigng
- MediaWiki User
- Unknown
Jul 7 2016
Jun 15 2016
Jun 14 2016
I don't think api.php?action=query&prop=revisions&rvprop=content can be the same performant as index.php?action=raw, and the latter is the easiest way to get the source code of a page. I would appreciate it if there was a way to perform api.php?action=raw.
Also, there doesn't exist a clear request rate limit for mediawiki api, as[[T135240| the rest api]] does. If you want to set one, you should document it.
Most of my tasks don't generate such " unacceptable amount of traffic". They usually send a few hundred to thousand requests before exit. But they still need a way to bypass the TLS redirect.
If you don't give me a good reason why cp1008.wikimedia.org:3128 / index.php?action=raw shouldn't be used, I will start some of my jobs that don't involve mass page content fetching, such as projectstat.
Labs replicas can't do that job, as revision tables are removed on such databases. Dumps are not updated such often.
Jun 13 2016
My bot was using /w/index.php?action=raw to fetch the content of each page/redirect at zhwiki, then it will do some simple search/replace/template addition work.
Jun 2 2016
I could reduce the concurrency by lowering the number of threads in the pool. (Current is 50.) But what if another bot task running on the same node exceeds the rate limit?
The rate limiting is breaking my bot.
May 2 2016
As reported by User:Kanashimi, some api query output is broken, either. For example, https://zh.wikipedia.org/w/api.php?action=query&prop=revisions&rvprop=content|timestamp&titles=LGBT%E7%9B%B8%E5%85%B3%E7%94%B5%E5%BD%B1%E5%88%97%E8%A1%A8&rvlimit=1&format=json&utf8 returns unnecessary "w6" at the end.
Apr 24 2016
I'm still seeing these problems:
