
marcmiquel (marcmiquel)
User

Projects

User does not belong to any projects.

User Details

User Since
Mar 18 2015, 2:27 PM (238 w, 6 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
Marcmiquel [ Global Accounts ]

Recent Activity

Jul 18 2019

marcmiquel added a comment to T228450: Mount Dumps NFS share on instances in the wcdo Cloud VPS project.

It works! Thanks!

Jul 18 2019, 9:13 PM · cloud-services-team (Kanban), VPS-Projects, Data-Services
marcmiquel created T228450: Mount Dumps NFS share on instances in the wcdo Cloud VPS project.
Jul 18 2019, 4:42 PM · cloud-services-team (Kanban), VPS-Projects, Data-Services

Feb 15 2019

marcmiquel added a comment to T191639: Wikidata JSON dumps do not have the 'ns' (namespace).

I need all the Wikidata qitems that relate to Wikipedia articles. If I understand it correctly, these are qitems that have namespace 0. However, not all qitems with namespace 0 necessarily have sitelinks (they could be just qitems without an article).

Feb 15 2019, 7:09 PM · Datasets-General-or-Unknown, Dumps-Generation, Wikidata

Feb 8 2019

marcmiquel added a comment to T191639: Wikidata JSON dumps do not have the 'ns' (namespace).

The use case is to process the dumps and filter out qitems which do not relate to articles, which is why we mentioned NS0. The JSON dump sample shows an ns field, but the final dump has no such field.
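
As a rough illustration of the filtering described above, here is a minimal sketch that keeps only entities with at least one Wikipedia sitelink. It assumes the one-entity-per-line layout of the Wikidata JSON dump; the function names and the exclusion list of non-Wikipedia site ids are hypothetical and not exhaustive.

```python
import json

# Site ids ending in "wiki" that are not Wikipedia editions.
# Assumption: this list is illustrative and not exhaustive.
NON_WIKIPEDIA_SITES = {"commonswiki", "wikidatawiki", "specieswiki", "metawiki", "mediawikiwiki"}


def has_wikipedia_sitelink(entity):
    """Return True if the entity has at least one sitelink to a Wikipedia edition."""
    for site in entity.get("sitelinks", {}):
        if site.endswith("wiki") and site not in NON_WIKIPEDIA_SITES:
            return True
    return False


def iter_article_items(lines):
    """Yield parsed entities that relate to at least one Wikipedia article."""
    for line in lines:
        # Each dump line holds one entity followed by a comma; the first and
        # last lines are the opening and closing brackets of the JSON array.
        line = line.strip().rstrip(",")
        if not line or line in ("[", "]"):
            continue
        entity = json.loads(line)
        if has_wikipedia_sitelink(entity):
            yield entity


# Tiny in-memory sample standing in for the real dump file.
sample = [
    "[",
    '{"id": "Q1", "sitelinks": {"enwiki": {"title": "Universe"}}},',
    '{"id": "Q2", "sitelinks": {}},',
    '{"id": "Q3", "sitelinks": {"commonswiki": {"title": "Category:X"}}}',
    "]",
]
kept_ids = [entity["id"] for entity in iter_article_items(sample)]
```

With a real dump, the `lines` argument could be an open file handle, so the entities are processed one at a time without loading the whole file.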

Feb 8 2019, 10:26 AM · Datasets-General-or-Unknown, Dumps-Generation, Wikidata

Jan 11 2019

marcmiquel added a comment to T213420: Surface translation suggestions based on the Wikipedia Cultural Diversity Observatory.

Current version of the Top CCC articles is at https://wcdo.wmflabs.org/databases/ with the name top_ccc_articles_current.db.

Jan 11 2019, 9:16 AM · Language-Team (Language-2019-July-September), Design, ContentTranslation

Dec 12 2018

marcmiquel added a comment to T210485: Investigate high usage of Apertium and V2 endpoint.

Nope. I haven't for days. Since your last email.

Dec 12 2018, 4:57 PM · CX-deployments, Language-Team (Language-2018-October-December)

Dec 2 2018

marcmiquel added a comment to T210485: Investigate high usage of Apertium and V2 endpoint.

Thanks for the message akosiaris. I'm sorry the posts were heavy. This process runs only once a month.

Dec 2 2018, 8:52 PM · CX-deployments, Language-Team (Language-2018-October-December)

May 6 2018

marcmiquel edited projects for T193984: project Wikipedia Cultural Diversity Observatory (WCDO) / datasets, visualizations and vital articles lists, added: Wikimedia-Hackathon-2018; removed Wikimania-Hackathon-2018.
May 6 2018, 4:55 PM · Wikimedia-Hackathon-2018, Datasets-General-or-Unknown
marcmiquel updated the task description for T193984: project Wikipedia Cultural Diversity Observatory (WCDO) / datasets, visualizations and vital articles lists.
May 6 2018, 2:32 PM · Wikimedia-Hackathon-2018, Datasets-General-or-Unknown
marcmiquel created T193984: project Wikipedia Cultural Diversity Observatory (WCDO) / datasets, visualizations and vital articles lists.
May 6 2018, 2:21 PM · Wikimedia-Hackathon-2018, Datasets-General-or-Unknown
marcmiquel added a comment to T192825: HELP: Calculate the number of inlinks/outlinks from/to a large group of articles / Bottlenecks.

The solution was to do the join in application code rather than in SQL, even though this meant the script had to run for longer.
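
A minimal sketch of what such a code-side join could look like, assuming the selected titles live in a local SQLite table and the link rows arrive as (source page id, linked title) pairs. The table name `selected_articles`, the function name, and the sample rows are hypothetical, not the script actually used.

```python
import sqlite3
from collections import Counter


def count_inlinks(target_titles, link_rows):
    """Count links pointing at the selected titles using a set lookup instead of a SQL join."""
    targets = set(target_titles)
    counts = Counter()
    for _source_page_id, linked_title in link_rows:
        if linked_title in targets:
            counts[linked_title] += 1
    return counts


# Hypothetical local table holding the previously selected articles.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE selected_articles (page_title TEXT PRIMARY KEY)")
conn.executemany(
    "INSERT INTO selected_articles VALUES (?)",
    [("Barcelona",), ("Girona",)],
)
titles = [row[0] for row in conn.execute("SELECT page_title FROM selected_articles")]

# Stand-in for rows streamed from the replica's link table.
link_rows = [(10, "Barcelona"), (11, "Barcelona"), (12, "Paris"), (13, "Girona")]
inlinks = count_inlinks(titles, link_rows)
```

The set lookup keeps each membership test cheap, so the cost is dominated by streaming the link rows once.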

May 6 2018, 12:48 PM · cloud-services-team, Data-Services

Apr 27 2018

marcmiquel renamed T192825: HELP: Calculate the number of inlinks/outlinks from/to a large group of articles / Bottlenecks from Calculate the number of inlinks/outlinks from/to a large group of articles / Bottlenecks to HELP: Calculate the number of inlinks/outlinks from/to a large group of articles / Bottlenecks.
Apr 27 2018, 6:49 PM · cloud-services-team, Data-Services

Apr 26 2018

marcmiquel updated the task description for T192825: HELP: Calculate the number of inlinks/outlinks from/to a large group of articles / Bottlenecks.
Apr 26 2018, 7:59 PM · cloud-services-team, Data-Services

Apr 24 2018

marcmiquel added a comment to T192825: HELP: Calculate the number of inlinks/outlinks from/to a large group of articles / Bottlenecks.

The size/number of parameters depends on the max_allowed_packet value.
I checked that with SELECT @@max_allowed_packet; and it is 33554432 (32M).
It can go as high as 1G. The good news is that this buffer is only allocated as needed, so setting it to 1G is fairly harmless. 1G will allow you to insert all those 2 million tuples at once.
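
If raising the limit is not an option, one alternative is to split the tuples into batches whose statement size stays under the packet limit. The sketch below only estimates sizes from the Python representation of each row, which is an assumption and not how the server measures a packet; `batch_by_size` and the `overhead` value are hypothetical.

```python
def batch_by_size(rows, max_bytes, overhead=64):
    """Group rows into batches whose estimated size stays at or below max_bytes.

    The size of a row is approximated by the UTF-8 length of its repr plus one
    separator byte; `overhead` stands in for the fixed part of the statement.
    A single row larger than max_bytes still gets a batch of its own.
    """
    batch, size = [], overhead
    for row in rows:
        row_size = len(repr(row).encode("utf-8")) + 1
        if batch and size + row_size > max_bytes:
            yield batch
            batch, size = [], overhead
        batch.append(row)
        size += row_size
    if batch:
        yield batch


# Value reported by SELECT @@max_allowed_packet in the comment above.
MAX_ALLOWED_PACKET = 33554432

rows = [(i, "Title_%d" % i) for i in range(1000)]
# A deliberately small limit so the sample data splits into several batches.
batches = list(batch_by_size(rows, max_bytes=4096))
```

Each batch could then be passed to a separate multi-row insert, leaving some margin below `MAX_ALLOWED_PACKET` because the estimate is approximate.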

Apr 24 2018, 10:24 PM · cloud-services-team, Data-Services
marcmiquel added a comment to T192825: HELP: Calculate the number of inlinks/outlinks from/to a large group of articles / Bottlenecks.

a) OK. Bad news.

Apr 24 2018, 10:12 PM · cloud-services-team, Data-Services
marcmiquel added a comment to T192825: HELP: Calculate the number of inlinks/outlinks from/to a large group of articles / Bottlenecks.

p.pl_title IN (SELECT page_title FROM u3532__.'+item+'_page_titleswithredirects)

Apr 24 2018, 9:48 PM · cloud-services-team, Data-Services
marcmiquel added a comment to T192825: HELP: Calculate the number of inlinks/outlinks from/to a large group of articles / Bottlenecks.
  1. OK to not possible. I had already assumed that I could not do that join.
Apr 24 2018, 9:33 PM · cloud-services-team, Data-Services
marcmiquel added a comment to T192825: HELP: Calculate the number of inlinks/outlinks from/to a large group of articles / Bottlenecks.

I previously selected these two million articles using several criteria. I have them in a SQLite3 database on the VPS.

Apr 24 2018, 9:15 PM · cloud-services-team, Data-Services

Apr 23 2018

marcmiquel created T192825: HELP: Calculate the number of inlinks/outlinks from/to a large group of articles / Bottlenecks.
Apr 23 2018, 5:52 PM · cloud-services-team, Data-Services

Apr 6 2018

marcmiquel created T191639: Wikidata JSON dumps do not have the 'ns' (namespace).
Apr 6 2018, 2:40 PM · Datasets-General-or-Unknown, Dumps-Generation, Wikidata

Mar 17 2018

marcmiquel added a comment to T189897: Reset wikitech 2FA for user Marcmiquel.

I had no problem accessing tools-login.wmflabs.org before. I created the 2fa_reset.txt in the marcmiquel account in Toolforge, not in the wcdo tool account.

Mar 17 2018, 1:34 PM · cloud-services-team (Kanban), wikitech.wikimedia.org

Mar 16 2018

marcmiquel created T189897: Reset wikitech 2FA for user Marcmiquel.
Mar 16 2018, 6:00 PM · cloud-services-team (Kanban), wikitech.wikimedia.org

Mar 11 2018

marcmiquel added a comment to T189165: Request creation of WCDO VPS project.

OK. Sorry if this is getting boring, but let me reframe it and explain it a bit better. I describe the current situation along with some new experiments with MySQL in Toolforge.

Mar 11 2018, 2:34 PM · Cloud-VPS (Project-requests)

Mar 7 2018

marcmiquel updated subscribers of T189165: Request creation of WCDO VPS project.
Mar 7 2018, 9:56 PM · Cloud-VPS (Project-requests)
marcmiquel created T189165: Request creation of WCDO VPS project.
Mar 7 2018, 9:54 PM · Cloud-VPS (Project-requests)

Mar 6 2018

marcmiquel created T189058: Performance with Wikidata parsing and storing into sqlite and toolsdb (MySQL).
Mar 6 2018, 11:09 PM · Data-Services, Toolforge

Aug 2 2017

marcmiquel added a comment to T105964: HELP! Database is getting Slow: A test which took less than 10 min, now it takes 3 hours. I cannot do my experiments..

Nope, closed. Thank you

Aug 2 2017, 2:43 PM · Data-Services, DBA

Mar 28 2017

marcmiquel added a comment to T133322: u3532__ (=marcmiquel) table using 64G on labsdb1001 and 108 GB on labsdb1003.

You can delete the databases. I am sorry for not replying, I did not see the e-mail. Thanks.

Mar 28 2017, 10:06 AM · Cloud-Services, Toolforge, DBA

Feb 6 2017

marcmiquel closed T133322: u3532__ (=marcmiquel) table using 64G on labsdb1001 and 108 GB on labsdb1003 as Resolved.

Ok! Thanks.

Feb 6 2017, 9:36 AM · Cloud-Services, Toolforge, DBA
marcmiquel closed T133322: u3532__ (=marcmiquel) table using 64G on labsdb1001 and 108 GB on labsdb1003, a subtask of T132431: labsdb1001 and labsdb1003 short on available space, as Resolved.
Feb 6 2017, 9:36 AM · Cloud-Services, Toolforge, DBA

Aug 9 2016

marcmiquel added a comment to T142482: u3532 is executing several concurrent, highly-intensive, innefficient long-running queries on at least labsdb1003, potentially hurting the stability of the system.

I just fixed the problem, and now I'm going to code a workaround so as not to saturate the server. Thanks.

Aug 9 2016, 4:02 PM · Toolforge, Cloud-Services

Jun 5 2016

marcmiquel added a comment to T133322: u3532__ (=marcmiquel) table using 64G on labsdb1001 and 108 GB on labsdb1003.

Hi chasemp, I cleaned up 30 GB more. In a few weeks I'll clean up more. Thanks! :)

Jun 5 2016, 6:41 PM · Cloud-Services, Toolforge, DBA

Apr 22 2016

marcmiquel added a comment to T133322: u3532__ (=marcmiquel) table using 64G on labsdb1001 and 108 GB on labsdb1003.

Done! I cleaned up more than 50 GB. I hope it's enough. However, I will clean up more in the following weeks. Cheers
Marc

Apr 22 2016, 1:49 PM · Cloud-Services, Toolforge, DBA

Apr 21 2016

marcmiquel added a comment to T133322: u3532__ (=marcmiquel) table using 64G on labsdb1001 and 108 GB on labsdb1003.

Thanks for the message. I created new files a few days ago, in particular a very big one from the English Wiki. I will free the space as soon as I can. I hope to be done next week :)

Apr 21 2016, 6:32 PM · Cloud-Services, Toolforge, DBA

Apr 14 2016

marcmiquel added a comment to T130041: June 2016 Research Showcase.

Hello @DarTar.
I requested a slot for the showcase, as @Halfak said. However, I am superbusy in the last months of my thesis and I am not able to prepare it properly. So I'd prefer waiting for another occasion to present you my research. Also when I have my material a bit more mature :) Thank you very much.

Apr 14 2016, 11:09 AM · Research-outreach, Research

Mar 15 2016

marcmiquel added a comment to T125245: March Research Showcase.

@DarTar no problem. I am quite busy, so waiting until April or May won't be bad. I will let you know! Thanks! :)

Mar 15 2016, 2:53 PM · Research-Archive, Research-outreach

Feb 24 2016

marcmiquel added a comment to T127992: tools-bastion-05 is super slow.

My apologies, I ran a script on the bastion when I should have sent it to the grid instead. I was told about this in the IRC channel, and now it is clear.

Feb 24 2016, 7:04 PM · Toolforge, Cloud-Services

Feb 21 2016

marcmiquel added a comment to T125245: March Research Showcase.

Thanks Halfak!

Feb 21 2016, 11:00 AM · Research-Archive, Research-outreach

Aug 11 2015

marcmiquel added a comment to T104457: Sessions for several languages.

Excellent! I see some differences and more information in it. Great. Thank you!

Aug 11 2015, 10:22 AM · Data-release, Research

Jul 16 2015

marcmiquel added a comment to T105964: HELP! Database is getting Slow: A test which took less than 10 min, now it takes 3 hours. I cannot do my experiments..

Thanks for checking MZMcBride!

Jul 16 2015, 8:30 PM · Data-Services, DBA
marcmiquel added a comment to P997 various sql queries for T105964.

SELECT "eswiki",user_editcount FROM eswiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "enwiki",user_editcount FROM enwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "frwiki",user_editcount FROM frwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "itwiki",user_editcount FROM itwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "plwiki",user_editcount FROM plwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "nlwiki",user_editcount FROM nlwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "dewiki",user_editcount FROM dewiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "ruwiki",user_editcount FROM ruwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "ptwiki",user_editcount FROM ptwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "euwiki",user_editcount FROM euwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "zhwiki",user_editcount FROM zhwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "glwiki",user_editcount FROM glwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "ocwiki",user_editcount FROM ocwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "anwiki",user_editcount FROM anwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "huwiki",user_editcount FROM huwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "ukwiki",user_editcount FROM ukwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "warwiki",user_editcount FROM warwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "arwiki",user_editcount FROM arwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "viwiki",user_editcount FROM viwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "svwiki",user_editcount FROM svwiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "rowiki",user_editcount FROM rowiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "mswiki",user_editcount FROM mswiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "fawiki",user_editcount FROM fawiki_p.user WHERE user_name LIKE %s
UNION ALL SELECT "cawiki",user_editcount FROM cawiki_p.user WHERE user_name LIKE %s
ORDER BY user_editcount DESC
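
A query of this shape could be generated from a list of wiki codes instead of being written by hand. The following is a minimal sketch under that assumption; `build_editcount_query` is a hypothetical helper, and the wiki names are validated because they are interpolated into the SQL as identifiers while the user name stays a bound `%s` parameter.

```python
def build_editcount_query(wikis):
    """Build one UNION ALL query with a SELECT per wiki and one %s placeholder each."""
    parts = []
    for wiki in wikis:
        # Database names cannot be bound as parameters, so accept only
        # plain alphanumeric wiki codes before formatting them in.
        if not wiki.isalnum():
            raise ValueError("unexpected wiki name: %r" % wiki)
        parts.append(
            'SELECT "{0}",user_editcount FROM {0}_p.user WHERE user_name LIKE %s'.format(wiki)
        )
    return " UNION ALL ".join(parts) + " ORDER BY user_editcount DESC"


wikis = ["eswiki", "enwiki", "cawiki"]
query = build_editcount_query(wikis)
# One bound value per placeholder; "SomeUser" is a made-up example name.
params = ["SomeUser"] * len(wikis)
```

The resulting `query` and `params` could then be passed together to a DB-API cursor's `execute` call.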

Jul 16 2015, 2:37 PM
marcmiquel assigned T105964: HELP! Database is getting Slow: A test which took less than 10 min, now it takes 3 hours. I cannot do my experiments. to jcrespo.
Jul 16 2015, 2:13 PM · Data-Services, DBA

Jul 15 2015

marcmiquel created T105964: HELP! Database is getting Slow: A test which took less than 10 min, now it takes 3 hours. I cannot do my experiments..
Jul 15 2015, 9:44 PM · Data-Services, DBA

Jul 13 2015

marcmiquel added a comment to T105503: Tables corrupted or impossible to work with them.

Apparently all issues are solved. I'm a bit surprised by all these changes. I haven't changed any line of code.

Jul 13 2015, 9:03 AM · Cloud-Services, DBA
marcmiquel added a comment to T105503: Tables corrupted or impossible to work with them.

Now it does work. Thanks!

Jul 13 2015, 8:40 AM · Cloud-Services, DBA

Jul 11 2015

marcmiquel added a comment to T105503: Tables corrupted or impossible to work with them.

A while ago I couldn't connect...

Jul 11 2015, 11:52 PM · Cloud-Services, DBA

Jul 10 2015

marcmiquel updated subscribers of T105503: Tables corrupted or impossible to work with them.
Jul 10 2015, 3:44 PM · Cloud-Services, DBA
marcmiquel created T105503: Tables corrupted or impossible to work with them.
Jul 10 2015, 3:42 PM · Cloud-Services, DBA

Jul 3 2015

marcmiquel added a comment to T104457: Sessions for several languages.

Would it be possible to obtain first cawiki? This is the language I usually take for initial testing, since I can understand results better.

Jul 3 2015, 1:15 PM · Data-release, Research

Jul 1 2015

marcmiquel created T104457: Sessions for several languages.
Jul 1 2015, 3:36 PM · Data-release, Research

Jun 24 2015

marcmiquel created T103708: Recovering my Python files in the root.
Jun 24 2015, 4:45 PM · Cloud-Services, Incident-20150617-LabsNFSOutage, Labs-Sprint-103

May 13 2015

marcmiquel edited P647 PRoblem installing Scipy with VirtualEnv.
May 13 2015, 6:09 PM

Mar 26 2015

marcmiquel added a comment to T93074: Memory Exhausted Near / Tool labs error while querying with Python.

Indeed, it broke:

Mar 26 2015, 8:36 AM · Cloud-Services, Toolforge

Mar 25 2015

marcmiquel added a comment to T93074: Memory Exhausted Near / Tool labs error while querying with Python.

Dear Springle,

Mar 25 2015, 12:00 PM · Cloud-Services, Toolforge

Mar 23 2015

marcmiquel added a comment to T93074: Memory Exhausted Near / Tool labs error while querying with Python.

Yes. I do. Now I got a memory error after days working well.

Mar 23 2015, 3:40 PM · Cloud-Services, Toolforge

Mar 18 2015

marcmiquel created T93074: Memory Exhausted Near / Tool labs error while querying with Python.
Mar 18 2015, 2:33 PM · Cloud-Services, Toolforge