Sebotic
User

Projects

User does not belong to any projects.

User Details

User Since
Sep 12 2015, 6:46 PM (210 w, 9 h)
Availability
Available
LDAP User
Unknown
MediaWiki User
Sebotic [ Global Accounts ]

Recent Activity

Jan 13 2017

Sebotic added a comment to T154660: Increase length limit for external identifier, string and URL datatype.

@thiemowmde agreed, it would bring down the error rate for this specific identifier/string. But currently, that's the only one we have a size distribution for. I think Lydia intended to solve this issue here for every property of data type string. So if it has to be limited to 768 for technical reasons (MySQL index field length), that would also be fine for chemistry for now, but how about other properties?

Jan 13 2017, 6:18 PM · User-Addshore, MW-1.33-notes (1.33.0-wmf.9; 2018-12-18), Patch-For-Review, Wikidata-Campsite (Wikidata-Campsite-Iteration-∞), MediaWiki-extensions-WikibaseRepository, Wikidata

Jan 12 2017

Sebotic added a comment to T154660: Increase length limit for external identifier, string and URL datatype.

I calculated the numbers above; they are valid solely for the chemical structure property InChI (P234), based on ~68 million InChI values in PubChem, the largest public chemistry database (they also hold for other chemical structure properties like canonical and isomeric SMILES). For any other data of Wikidata datatype string/text, I cannot provide numbers, as I lack the distribution of string lengths for other data that should be represented as strings in Wikidata. And as you can see from the distribution above, increasing the limit would only affect the representation of the top ~1% of total chemical structure data.

Jan 12 2017, 6:19 PM · User-Addshore, MW-1.33-notes (1.33.0-wmf.9; 2018-12-18), Patch-For-Review, Wikidata-Campsite (Wikidata-Campsite-Iteration-∞), MediaWiki-extensions-WikibaseRepository, Wikidata
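
The kind of length-distribution check described in the comment above could be sketched roughly as follows. This is only an illustration, not the calculation actually used: the function name `share_over_limit` and the sample strings are hypothetical, the sample is made up rather than taken from PubChem, and lengths are counted in characters, which may differ from how a database index measures them.

```python
def share_over_limit(values, limit):
    """Return the fraction of strings in `values` longer than `limit` characters."""
    if not values:
        return 0.0
    too_long = sum(1 for v in values if len(v) > limit)
    return too_long / len(values)


# Made-up placeholder strings standing in for identifier values (not real data).
sample = ["x" * 50, "x" * 300, "x" * 700, "x" * 1200]

# 768 is the limit mentioned in the discussion; any other limit can be passed in.
print(share_over_limit(sample, 768))
```

Running the same function with several candidate limits over a real value list would show how much of the data each limit excludes.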

Jan 6 2017

Sebotic added a comment to T154660: Increase length limit for external identifier, string and URL datatype.

Thanks Lydia!

Jan 6 2017, 12:24 AM · User-Addshore, MW-1.33-notes (1.33.0-wmf.9; 2018-12-18), Patch-For-Review, Wikidata-Campsite (Wikidata-Campsite-Iteration-∞), MediaWiki-extensions-WikibaseRepository, Wikidata

Oct 25 2016

Sebotic created T149129: External ID formatter URLs for EC number does not get rendered as HTML link for logged in users.
Oct 25 2016, 9:41 PM · TestMe, Wikidata

Jul 14 2016

Sebotic added a comment to T112397: WDQS returns current AND old data.

thanks, here are the headers for r1 and r2, respectively:

Jul 14 2016, 10:42 PM · User-Smalyshev, Discovery, Wikidata, Wikidata-Query-Service
Sebotic added a comment to T112397: WDQS returns current AND old data.

I have a quick follow-up for this. I made 2 slightly differing SPARQL queries, one accessing values directly and one indirectly. They should give the same return values, but it seems that if each query is executed on a different server, the 2 result sets differ: one gives back 54320 values, the other 54315. Irrespective of the counts, some values differ. See my code here: https://gist.github.com/sebotic/a92f9291175f4968ce265ffe31e0e9c2

Jul 14 2016, 10:20 PM · User-Smalyshev, Discovery, Wikidata, Wikidata-Query-Service
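
A comparison like the one described in the comment above can be sketched as a set difference. This is a minimal illustration and not the code from the linked gist: the function name `compare_result_sets` and the placeholder values are hypothetical, and `r1` and `r2` are assumed to be lists of values already extracted from the two query responses.

```python
def compare_result_sets(r1, r2):
    """Report which values appear in only one of two query result lists."""
    s1, s2 = set(r1), set(r2)
    return {
        "only_in_r1": sorted(s1 - s2),  # values missing from the second result
        "only_in_r2": sorted(s2 - s1),  # values missing from the first result
        "common": len(s1 & s2),         # number of values both results share
    }


# Placeholder values standing in for the two result sets (not real query output).
r1 = ["A1", "A2", "A3", "A4"]
r2 = ["A2", "A3", "A5"]

diff = compare_result_sets(r1, r2)
print(diff["only_in_r1"], diff["only_in_r2"], diff["common"])
```

Looking at the values unique to each side, rather than only the counts, is what reveals that the results differ even where the totals happen to be close.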

Sep 15 2015

Sebotic added a comment to T112397: WDQS returns current AND old data.

@Smalyshev I just tested the query once again. Some of the old data is gone now, but one still comes up. It is this item: http://www.wikidata.org/entity/Q402633. I currently do not have other queries to execute, but I will think of some.

Sep 15 2015, 8:57 PM · User-Smalyshev, Discovery, Wikidata, Wikidata-Query-Service

Sep 12 2015

Sebotic created T112397: WDQS returns current AND old data.
Sep 12 2015, 7:15 PM · User-Smalyshev, Discovery, Wikidata, Wikidata-Query-Service