Smalyshev (Stas Malyshev)
Engineer in Discovery team

Projects (7)

User Details

User Since
Nov 28 2014, 7:04 AM (146 w, 5 d)
Availability
Available
IRC Nick
Smalyshev
LDAP User
Smalyshev
MediaWiki User
Smalyshev (WMF)

Recent Activity

Yesterday

Smalyshev added a comment to T121274: Provide an RDF mapping for external identifiers.

My point was: expanding IDs to resource references is not really the same thing as normalizing values

Tue, Sep 19, 10:57 PM · User-Smalyshev, User-Daniel, Wikidata-Sprint, Wikidata-Sprint-2015-12-01, Patch-For-Review, MediaWiki-extensions-WikibaseRepository, Wikidata, Story
Smalyshev added a comment to T175199: Index certain statements for Wikidata items.

I've renamed it to statement_keywords. Hopefully it's better.

Tue, Sep 19, 10:44 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev closed T176160: Puppet fails on newly created WDQS labs instance as Resolved.
Tue, Sep 19, 10:20 PM · Discovery, Wikidata, Wikidata-Query-Service
Smalyshev added a comment to T168041: Assign different favicons to query.wikidata.org and test.wikidata.org.

Looks like SVG support for icons is kinda spotty: https://en.wikipedia.org/wiki/Favicon#File_format_support

Tue, Sep 19, 9:08 PM · Patch-For-Review, User-Smalyshev, Design, WMDE-Design, Wikidata, Wikidata-Query-Service, Discovery
Smalyshev added a comment to T176239: WDQS updater should have a keep running after error mode.

Right now it's hardcoded in RdfRepository class:

Tue, Sep 19, 8:20 PM · Discovery, Wikidata-Query-Service, Wikidata
Smalyshev updated the task description for T176239: WDQS updater should have a keep running after error mode.
Tue, Sep 19, 8:14 PM · Discovery, Wikidata-Query-Service, Wikidata
Smalyshev moved T121274: Provide an RDF mapping for external identifiers from Backlog to Next on the User-Smalyshev board.
Tue, Sep 19, 5:29 PM · User-Smalyshev, User-Daniel, Wikidata-Sprint, Wikidata-Sprint-2015-12-01, Patch-For-Review, MediaWiki-extensions-WikibaseRepository, Wikidata, Story
Smalyshev moved T175741: Set ElasticSearch implementation as default for wbsearchentites on Wikidata from Waiting to Next on the User-Smalyshev board.
Tue, Sep 19, 5:29 PM · User-Smalyshev, Discovery-Search, Wikidata
Smalyshev moved T175741: Set ElasticSearch implementation as default for wbsearchentites on Wikidata from Next to Waiting on the User-Smalyshev board.
Tue, Sep 19, 5:28 PM · User-Smalyshev, Discovery-Search, Wikidata
Smalyshev added a comment to T176195: WDQS UI should show a warning if the backend is down.

I think you meant T=10s, N=3.

Tue, Sep 19, 4:27 AM · Wikidata Query UI, Discovery, Wikidata
Smalyshev triaged T176195: WDQS UI should show a warning if the backend is down as Normal priority.

I would propose a warning banner or something like that, which contains:

  • Warning that the server is not responding
  • Link to a page that describes what to do, who to contact, etc. (probably should be configurable)
Tue, Sep 19, 3:36 AM · Wikidata Query UI, Discovery, Wikidata
Smalyshev edited projects for T176190: Support non-default namespaces in GUI, added: Wikidata-Query-Service; removed Wikidata Query UI.
Tue, Sep 19, 12:51 AM · Discovery, Wikidata, Wikidata-Query-Service
Smalyshev created T176190: Support non-default namespaces in GUI.
Tue, Sep 19, 12:49 AM · Discovery, Wikidata, Wikidata-Query-Service

Mon, Sep 18

Smalyshev added a comment to T175578: Wikidata Query Service function geof:distance is not working outside Earth.

the actual calculation is just “Earth distance × (globe radius / Earth radius)”, right?

Mon, Sep 18, 11:52 PM · Discovery, Wikidata-Query-Service, Wikidata
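
The scaling suggested in the T175578 comment above, "Earth distance × (globe radius / Earth radius)", can be illustrated with a minimal sketch. It assumes a perfectly spherical globe and uses the haversine formula as a stand-in for the distance calculation; the function names and the Earth radius constant are hypothetical and are not taken from the WDQS code.

```python
import math

# Mean Earth radius in km; an assumed constant for this sketch only.
EARTH_RADIUS_KM = 6371.0


def haversine_km(lat1, lon1, lat2, lon2, radius_km=EARTH_RADIUS_KM):
    """Great-circle distance between two points on a sphere of the given radius."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * radius_km * math.asin(math.sqrt(a))


def scaled_distance_km(lat1, lon1, lat2, lon2, globe_radius_km):
    """Compute the Earth distance, then scale it by globe radius / Earth radius."""
    earth_distance = haversine_km(lat1, lon1, lat2, lon2)
    return earth_distance * (globe_radius_km / EARTH_RADIUS_KM)
```

On a sphere the distance is linear in the radius, so scaling the Earth result gives the same value as running the formula with the other globe's radius directly. This holds only under the spherical assumption, which the later comments note is poor for dwarf planets and asteroids.
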
Smalyshev closed T174930: Problem with federated SPARQL query using UK Ordnance Survey open data -- bad prefix URL being sent ? as Resolved.
Mon, Sep 18, 7:31 PM · Patch-For-Review, User-Smalyshev, Discovery, Wikidata, Wikidata-Query-Service
Smalyshev added a project to T176160: Puppet fails on newly created WDQS labs instance: Wikidata-Query-Service.
Mon, Sep 18, 5:50 PM · Discovery, Wikidata, Wikidata-Query-Service
Smalyshev created T176160: Puppet fails on newly created WDQS labs instance.
Mon, Sep 18, 5:50 PM · Discovery, Wikidata, Wikidata-Query-Service

Sun, Sep 17

Smalyshev added a comment to T164773: Error replicating wikidata blazegraph setup.

Not sure whether it is related; it's hard to diagnose from this. Did you look at the GC logs? What did the status page for Blazegraph show?

Sun, Sep 17, 8:08 AM · Discovery, Wikidata-Query-Service, Wikidata

Sat, Sep 16

Smalyshev closed T135241: SPARQL queryes for test.wikidata are not implemented as Declined.

I don't think we have plans to implement the service for test.wikidata, so closing this one.

Sat, Sep 16, 1:12 AM · Wikidata-Query-Service, Discovery, Pywikibot-pagegenerators.py, Wikidata
Smalyshev moved T175448: Provide mechanism(s) to avoid WDQS timeouts for certain pre-approved queries from Ready for work to Need investigation on the Wikidata-Query-Service board.
Sat, Sep 16, 1:09 AM · Discovery, Wikidata, Wikidata-Query-Service
Smalyshev moved T175448: Provide mechanism(s) to avoid WDQS timeouts for certain pre-approved queries from All WDQS-related tasks to Ready for work on the Wikidata-Query-Service board.
Sat, Sep 16, 1:09 AM · Discovery, Wikidata, Wikidata-Query-Service
Smalyshev moved T174981: Add pageviews total counts to WDQS from All WDQS-related tasks to Need investigation on the Wikidata-Query-Service board.
Sat, Sep 16, 12:46 AM · Discovery, Analytics, Wikidata-Query-Service, Wikidata
Smalyshev moved T175312: WDQS word cloud view gets broken on zoom from All WDQS-related tasks to GUI on the Wikidata-Query-Service board.
Sat, Sep 16, 12:45 AM · Discovery, Wikidata-Query-Service, Wikidata
Smalyshev added a comment to T175148: uppercase schema:inLanguage on Wikidata Query Service.

The language code for links works this way (see SiteLinksRdfBuilder class):

Sat, Sep 16, 12:44 AM · Discovery, Wikidata, Wikidata-Query-Service
Smalyshev triaged T175578: Wikidata Query Service function geof:distance is not working outside Earth as Normal priority.

geof:distance is definitely assuming Earth for now. Doing it for other globes is tricky, since even if we assume they are all spherical (which may be a good approximation for larger planets but less so for dwarf planets, and completely wrong for things like asteroids), we'd need to account for the radius, etc., which is now hardcoded for Earth data. I'm not even sure how to implement it efficiently for an arbitrary globe. It may be possible to do it for a predefined set of globes.

Sat, Sep 16, 12:30 AM · Discovery, Wikidata-Query-Service, Wikidata
Smalyshev moved T175578: Wikidata Query Service function geof:distance is not working outside Earth from All WDQS-related tasks to Need investigation on the Wikidata-Query-Service board.
Sat, Sep 16, 12:27 AM · Discovery, Wikidata-Query-Service, Wikidata
Smalyshev added a comment to T175448: Provide mechanism(s) to avoid WDQS timeouts for certain pre-approved queries.

Blazegraph has a mechanism of stored queries; however, what is not entirely clear to me is how abuse prevention would work in such a case. I.e., let's assume we have a heavy query, and we have found a way to run it past the common limits. What would happen if somebody, by mistake or out of malice, runs it 100 times? This may take down the whole service, at least temporarily. We need some way to prevent this from happening.

Sat, Sep 16, 12:19 AM · Discovery, Wikidata, Wikidata-Query-Service
Smalyshev closed T172710: send wdqs logs to logstash as Resolved.
Sat, Sep 16, 12:14 AM · Patch-For-Review, Discovery, Wikidata, Operations, Wikidata-Query-Service
Smalyshev moved T175919: investigate GC times on wikidata query service from All WDQS-related tasks to Operations on the Wikidata-Query-Service board.
Sat, Sep 16, 12:13 AM · Patch-For-Review, Discovery-Wikidata-Query-Service-Sprint, Discovery, Wikidata, Wikidata-Query-Service
Smalyshev moved T175948: Add normalized predicates to Blazegraph vocabulary from All WDQS-related tasks to Ready for work on the Wikidata-Query-Service board.
Sat, Sep 16, 12:13 AM · Discovery, Wikidata, Wikidata-Query-Service
Smalyshev moved T175976: Ctrl+Enter should only work inside text editor, not on links from All WDQS-related tasks to GUI on the Wikidata-Query-Service board.
Sat, Sep 16, 12:13 AM · Discovery, Wikidata, Accessibility, Wikidata-Query-Service
Smalyshev closed T174414: WDQS incorrectly calculates geof:distance as Resolved.
Sat, Sep 16, 12:03 AM · Discovery-Wikidata-Query-Service-Sprint, Patch-For-Review, Upstream, Discovery, Wikidata-Query-Service, Wikidata

Fri, Sep 15

Smalyshev updated subscribers of T175982: Askplatyp.us SPARQL query integration.
Fri, Sep 15, 6:20 PM · Patch-For-Review, Wikidata, Wikidata Query UI
Smalyshev added a comment to T175919: investigate GC times on wikidata query service.

The heap was bumped from 8G because there were some OOMs with heavy queries (some of them still use a bit of heap even if most of the data uses Blazegraph's own allocator). So let's not be over-zealous in reducing it yet. 12G could still be fine.

Fri, Sep 15, 4:53 PM · Patch-For-Review, Discovery-Wikidata-Query-Service-Sprint, Discovery, Wikidata, Wikidata-Query-Service

Thu, Sep 14

Smalyshev added a project to T121274: Provide an RDF mapping for external identifiers: User-Smalyshev.
Thu, Sep 14, 11:16 PM · User-Smalyshev, User-Daniel, Wikidata-Sprint, Wikidata-Sprint-2015-12-01, Patch-For-Review, MediaWiki-extensions-WikibaseRepository, Wikidata, Story
Smalyshev moved T168041: Assign different favicons to query.wikidata.org and test.wikidata.org from Backlog to Waiting on the User-Smalyshev board.
Thu, Sep 14, 11:16 PM · Patch-For-Review, User-Smalyshev, Design, WMDE-Design, Wikidata, Wikidata-Query-Service, Discovery
Smalyshev moved T175199: Index certain statements for Wikidata items from Doing to Waiting on the User-Smalyshev board.
Thu, Sep 14, 11:16 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev moved T175199: Index certain statements for Wikidata items from Backlog to Needs review on the Discovery-Search (Current work) board.
Thu, Sep 14, 11:15 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev moved T175199: Index certain statements for Wikidata items from Up Next to Current work on the Discovery-Search board.
Thu, Sep 14, 11:15 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev triaged T175948: Add normalized predicates to Blazegraph vocabulary as Normal priority.
Thu, Sep 14, 6:36 PM · Discovery, Wikidata, Wikidata-Query-Service
Smalyshev created T175948: Add normalized predicates to Blazegraph vocabulary.
Thu, Sep 14, 6:35 PM · Discovery, Wikidata, Wikidata-Query-Service
Smalyshev added a comment to T121274: Provide an RDF mapping for external identifiers.

I'm reviewing the patch now and will update by the end of the day.

Thu, Sep 14, 6:17 PM · User-Smalyshev, User-Daniel, Wikidata-Sprint, Wikidata-Sprint-2015-12-01, Patch-For-Review, MediaWiki-extensions-WikibaseRepository, Wikidata, Story
Smalyshev moved T173231: Wikidata Elastic search drops results with matches in different language label from Needs review to Done on the Discovery-Search (Current work) board.
Thu, Sep 14, 5:51 PM · User-Smalyshev, Discovery-Search (Current work), Patch-For-Review, Wikidata
Smalyshev moved T173231: Wikidata Elastic search drops results with matches in different language label from Next to Waiting on the User-Smalyshev board.
Thu, Sep 14, 5:50 PM · User-Smalyshev, Discovery-Search (Current work), Patch-For-Review, Wikidata
Smalyshev added a comment to T140131: Show text from Wikidata usage instructions property (P2559) when auto-suggesting properties or items.

What I would suggest doing is maybe putting this as a non-indexed field in the index and returning it together with the response. Of course, it could also be done purely client-side but at the cost of one extra round-trip.

Thu, Sep 14, 5:27 PM · MediaWiki-extensions-WikibaseRepository, Wikidata
Smalyshev added a comment to T175199: Index certain statements for Wikidata items.

We may also want to store some values as non-indexed data, e.g. see T140131

Thu, Sep 14, 5:26 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev added a comment to T175199: Index certain statements for Wikidata items.

Now, if you decide to add P1559 (monolingual text), we should not index it in the "statements" elastic field: they'll require totally different analyzers (one is an identifier, the other is written language).

Thu, Sep 14, 4:50 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata

Wed, Sep 13

Smalyshev added a comment to T175199: Index certain statements for Wikidata items.

Moving the filtering to the mapping (which I'd find more flexible in the future) will require some custom mapper/analyzer.

Wed, Sep 13, 10:45 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev added a comment to T168041: Assign different favicons to query.wikidata.org and test.wikidata.org.

@Lydia_Pintscher Can we use the icon from https://gerrit.wikimedia.org/r/377912 ?

Wed, Sep 13, 10:38 PM · Patch-For-Review, User-Smalyshev, Design, WMDE-Design, Wikidata, Wikidata-Query-Service, Discovery
Smalyshev closed T171807: Create ontology URL for mediawiki as Resolved.
Wed, Sep 13, 6:25 PM · Patch-For-Review, Wikidata, Discovery, Wikidata-Query-Service
Smalyshev closed T171807: Create ontology URL for mediawiki, a subtask of T157676: Provide access to category information from WDQS SPARQL, as Resolved.
Wed, Sep 13, 6:25 PM · Patch-For-Review, MW-1.30-release-notes (WMF-deploy-2017-08-29 (1.30.0-wmf.16)), Discovery-Wikidata-Query-Service-Sprint, Wikidata, Wikidata-Query-Service, Discovery
Smalyshev added a comment to T175595: decommission wdqs100[12].

Was the LDF server moved off wdqs1001? If not, we should move it first thing.

Wed, Sep 13, 4:34 PM · Patch-For-Review, hardware-requests, Operations, Discovery-Wikidata-Query-Service-Sprint
Smalyshev added a comment to T121274: Provide an RDF mapping for external identifiers.

I think this use of psn::

p:P227 [ # full statement
   ps:P227 "4015139-6"; # simple value
   psn:P227 <http://d-nb.info/gnd/4015139-6> # normalized simple value
 ].

is OK. I'll take time to review it more thoroughly in the coming days, but on the face of it, it looks OK. Also, please note that psn:P123 and psn:P345 do not have to be of the same type: you have to preserve consistency within the same predicate, but different predicates with the same prefix can have different types. In this case, they even happen to have the same type, due to how we represent values, but in general that's not a requirement as long as the overall semantics are close.

Wed, Sep 13, 4:25 PM · User-Smalyshev, User-Daniel, Wikidata-Sprint, Wikidata-Sprint-2015-12-01, Patch-For-Review, MediaWiki-extensions-WikibaseRepository, Wikidata, Story
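
As an illustration of the ps:/psn: pair in the T121274 comment above, here is a minimal sketch of expanding an external identifier into a resource URI. It assumes the URI comes from a formatter pattern with a "$1" placeholder; the function names are hypothetical, and escaping of special characters in the identifier is deliberately left out.

```python
def normalize_external_id(value: str, formatter_uri: str) -> str:
    """Expand an external ID into a URI by substituting it for the "$1" placeholder."""
    if "$1" not in formatter_uri:
        raise ValueError("formatter URI must contain the $1 placeholder")
    return formatter_uri.replace("$1", value)


def statement_value_lines(prop: str, value: str, formatter_uri: str) -> list[str]:
    """Render the simple (ps:) and normalized (psn:) values for one statement."""
    uri = normalize_external_id(value, formatter_uri)
    return [
        f'ps:{prop} "{value}"',   # simple value: the literal identifier
        f"psn:{prop} <{uri}>",    # normalized value: a resource reference
    ]


# Values taken from the example in the comment above.
for line in statement_value_lines("P227", "4015139-6", "http://d-nb.info/gnd/$1"):
    print(line)
```

The sketch keeps the literal and the expanded URI as two separate values under two predicates, which is the arrangement the comment accepts.
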

Tue, Sep 12

Sjoerddebruin awarded T175741: Set ElasticSearch implementation as default for wbsearchentites on Wikidata a Love token.
Tue, Sep 12, 10:46 PM · User-Smalyshev, Discovery-Search, Wikidata
Smalyshev moved T173231: Wikidata Elastic search drops results with matches in different language label from Waiting to Next on the User-Smalyshev board.
Tue, Sep 12, 8:35 PM · User-Smalyshev, Discovery-Search (Current work), Patch-For-Review, Wikidata
Smalyshev moved T175741: Set ElasticSearch implementation as default for wbsearchentites on Wikidata from Backlog to Next on the User-Smalyshev board.
Tue, Sep 12, 8:34 PM · User-Smalyshev, Discovery-Search, Wikidata
Smalyshev closed T119067: Adjust rescoring config for Wikidata to consider sitelink count, a subtask of T110648: [Bug] high-ranking items seemed to have dropped significantly in Special:Search results for wikidata, as Resolved.
Tue, Sep 12, 8:34 PM · MW-1.27-release (WMF-deploy-2016-01-12_(1.27.0-wmf.10)), Patch-For-Review, CirrusSearch, Elasticsearch, Discovery, Wikidata
Smalyshev closed T119067: Adjust rescoring config for Wikidata to consider sitelink count as Resolved.
Tue, Sep 12, 8:34 PM · Wikidata-Sprint-2016-04-12, Wikidata-Sprint-2016-03-01, Wikidata
Smalyshev added a comment to T175741: Set ElasticSearch implementation as default for wbsearchentites on Wikidata.

Sounds good. Let's announce the intent to do it on Monday next week and then flip the switch as soon as the T173231 fix is merged and both it and the code for T172467 are deployed.

Tue, Sep 12, 8:24 PM · User-Smalyshev, Discovery-Search, Wikidata
Smalyshev updated the task description for T125500: [Epic] Index Wikidata labels and descriptions as separate fields in ElasticSearch.
Tue, Sep 12, 8:03 PM · Epic, Discovery-Search (Current work), Wikidata-Sprint-2016-08-02, Wikidata-Sprint-2016-07-19, Wikidata-Sprint-2016-07-05, Wikidata-Sprint-2016-05-24, Wikidata-Sprint-2016-05-10, Patch-For-Review, Wikidata-Sprint-2016-04-26, Wikidata-Sprint-2016-04-12, Wikidata-Sprint-2016-03-01, Wikidata-Sprint-2016-02-16, Wikidata, Discovery, CirrusSearch, Wikidata-Sprint-2016-02-02
Smalyshev added a comment to T119067: Adjust rescoring config for Wikidata to consider sitelink count.

Now the ElasticSearch configs account for sitelinks (and in general any field can be used in a search profile with various functions and weights). Do we still need to do anything for this one? Is this for full-text search (which does not feature ElasticSearch yet)?

Tue, Sep 12, 8:01 PM · Wikidata-Sprint-2016-04-12, Wikidata-Sprint-2016-03-01, Wikidata
Smalyshev added a subtask for T125500: [Epic] Index Wikidata labels and descriptions as separate fields in ElasticSearch: T175741: Set ElasticSearch implementation as default for wbsearchentites on Wikidata.
Tue, Sep 12, 8:00 PM · Epic, Discovery-Search (Current work), Wikidata-Sprint-2016-08-02, Wikidata-Sprint-2016-07-19, Wikidata-Sprint-2016-07-05, Wikidata-Sprint-2016-05-24, Wikidata-Sprint-2016-05-10, Patch-For-Review, Wikidata-Sprint-2016-04-26, Wikidata-Sprint-2016-04-12, Wikidata-Sprint-2016-03-01, Wikidata-Sprint-2016-02-16, Wikidata, Discovery, CirrusSearch, Wikidata-Sprint-2016-02-02
Smalyshev added a parent task for T175741: Set ElasticSearch implementation as default for wbsearchentites on Wikidata: T125500: [Epic] Index Wikidata labels and descriptions as separate fields in ElasticSearch.
Tue, Sep 12, 8:00 PM · User-Smalyshev, Discovery-Search, Wikidata
Smalyshev added a project to T175741: Set ElasticSearch implementation as default for wbsearchentites on Wikidata: User-Smalyshev.
Tue, Sep 12, 7:59 PM · User-Smalyshev, Discovery-Search, Wikidata
Smalyshev created T175741: Set ElasticSearch implementation as default for wbsearchentites on Wikidata.
Tue, Sep 12, 7:59 PM · User-Smalyshev, Discovery-Search, Wikidata
Smalyshev updated the task description for T125500: [Epic] Index Wikidata labels and descriptions as separate fields in ElasticSearch.
Tue, Sep 12, 7:54 PM · Epic, Discovery-Search (Current work), Wikidata-Sprint-2016-08-02, Wikidata-Sprint-2016-07-19, Wikidata-Sprint-2016-07-05, Wikidata-Sprint-2016-05-24, Wikidata-Sprint-2016-05-10, Patch-For-Review, Wikidata-Sprint-2016-04-26, Wikidata-Sprint-2016-04-12, Wikidata-Sprint-2016-03-01, Wikidata-Sprint-2016-02-16, Wikidata, Discovery, CirrusSearch, Wikidata-Sprint-2016-02-02
Smalyshev updated subscribers of T175199: Index certain statements for Wikidata items.
Tue, Sep 12, 5:24 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev added a comment to T175199: Index certain statements for Wikidata items.

In the patch, an option was raised to index all statements of a certain type, instead of just named properties. I am not sure yet whether it is a good idea or not; it needs some thought. Probably not in the initial iteration, but possibly later.

Tue, Sep 12, 5:24 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev added a subtask for T169798: Create UDFs for analyzing SPARQL queries: T164020: Use hive dynamic partitioning to split webrequest on tags.
Tue, Sep 12, 5:20 PM · User-Smalyshev, Discovery, Wikidata, Wikidata-Query-Service
Smalyshev added a parent task for T164020: Use hive dynamic partitioning to split webrequest on tags: T169798: Create UDFs for analyzing SPARQL queries.
Tue, Sep 12, 5:20 PM · Patch-For-Review, Analytics-Kanban
Smalyshev removed a parent task for T169798: Create UDFs for analyzing SPARQL queries: T164020: Use hive dynamic partitioning to split webrequest on tags.
Tue, Sep 12, 5:20 PM · User-Smalyshev, Discovery, Wikidata, Wikidata-Query-Service
Smalyshev removed a subtask for T164020: Use hive dynamic partitioning to split webrequest on tags: T169798: Create UDFs for analyzing SPARQL queries.
Tue, Sep 12, 5:20 PM · Patch-For-Review, Analytics-Kanban
Smalyshev added a parent task for T169798: Create UDFs for analyzing SPARQL queries: T164020: Use hive dynamic partitioning to split webrequest on tags.
Tue, Sep 12, 5:19 PM · User-Smalyshev, Discovery, Wikidata, Wikidata-Query-Service
Smalyshev added a subtask for T164020: Use hive dynamic partitioning to split webrequest on tags: T169798: Create UDFs for analyzing SPARQL queries.
Tue, Sep 12, 5:19 PM · Patch-For-Review, Analytics-Kanban
Smalyshev closed T171210: rack/setup/install wdqs100[45].eqiad.wmnet as Resolved.
Tue, Sep 12, 5:17 PM · Patch-For-Review, Discovery-Search (Current work), Discovery, Wikidata, Operations, Wikidata-Query-Service
Smalyshev claimed T165982: Investigate using blazegraph for deep category searching / returning of results.
Tue, Sep 12, 5:15 PM · Discovery-Search (Current work), Patch-For-Review, Wikidata-Query-Service, Wikidata, TCB-Team, German-Community-Wishlist, Discovery, CirrusSearch
Smalyshev closed T172467: Make good prefix search profile for Wikidata entities, a subtask of T125500: [Epic] Index Wikidata labels and descriptions as separate fields in ElasticSearch, as Resolved.
Tue, Sep 12, 5:05 PM · Epic, Discovery-Search (Current work), Wikidata-Sprint-2016-08-02, Wikidata-Sprint-2016-07-19, Wikidata-Sprint-2016-07-05, Wikidata-Sprint-2016-05-24, Wikidata-Sprint-2016-05-10, Patch-For-Review, Wikidata-Sprint-2016-04-26, Wikidata-Sprint-2016-04-12, Wikidata-Sprint-2016-03-01, Wikidata-Sprint-2016-02-16, Wikidata, Discovery, CirrusSearch, Wikidata-Sprint-2016-02-02
Smalyshev closed T172467: Make good prefix search profile for Wikidata entities as Resolved.
Tue, Sep 12, 5:05 PM · Patch-For-Review, User-Smalyshev, Discovery, Discovery-Search (Current work), Wikidata
Smalyshev claimed T171921: Use correct timestamp when indexing deletes.
Tue, Sep 12, 12:37 AM · Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Discovery, CirrusSearch
Smalyshev claimed T173231: Wikidata Elastic search drops results with matches in different language label.
Tue, Sep 12, 12:37 AM · User-Smalyshev, Discovery-Search (Current work), Patch-For-Review, Wikidata
Smalyshev moved T174930: Problem with federated SPARQL query using UK Ordnance Survey open data -- bad prefix URL being sent ? from Doing to Waiting on the User-Smalyshev board.
Tue, Sep 12, 12:36 AM · Patch-For-Review, User-Smalyshev, Discovery, Wikidata, Wikidata-Query-Service
Smalyshev moved T174930: Problem with federated SPARQL query using UK Ordnance Survey open data -- bad prefix URL being sent ? from Next to Doing on the User-Smalyshev board.
Tue, Sep 12, 12:18 AM · Patch-For-Review, User-Smalyshev, Discovery, Wikidata, Wikidata-Query-Service

Sat, Sep 9

Liuxinyu970226 awarded T112715: Enable different URL shorteners for WDQS a Love token.
Sat, Sep 9, 11:51 AM · Discovery, Wikidata, Wikidata-Query-Service
Smalyshev added a comment to T167565: Wikidata allows invalid URIs to be entered as units.

Another example here:

Sat, Sep 9, 7:00 AM · Need-volunteer, MediaWiki-extensions-WikibaseRepository, Wikidata

Fri, Sep 8

Smalyshev added a comment to T175199: Index certain statements for Wikidata items.

I'm not sure we should really go as far as indexing all statements now. Most of them would not be very useful for search purposes for now, and are already served by Query Service. The most useful ones would be those that legitimately limit the searches to relevant items, which I would imagine are mostly P31/P279. In fact, right now I don't even have much of a use case for using anything but those two, but maybe we'd have it in the future. I think maybe it'd be OK for now to just index those explicitly mentioned. The idea of using analyzer/filters may still be workable in the future, but I'd postpone it for now.

Fri, Sep 8, 5:49 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev added a comment to T171911: Create WDQS-Optimizer tag.

It's not our component, so having a workboard etc. for it would not be very useful. It's just a tag to easily identify tickets that are related to this functionality.
It's also not Wikidata Query Service Optimizer, it's Blazegraph Optimizer. But since many people do not know what Blazegraph is, I chose to use WDQS. If this does not fit some naming guidelines, please choose a suitable name.

Fri, Sep 8, 5:44 PM · Project-Admins
Smalyshev added a comment to T175199: Index certain statements for Wikidata items.

@EBernhardson yes, this looks like what I've done in the patch, I just wondered if it's correct. Looks like it is then :)

Fri, Sep 8, 1:14 AM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev added a comment to T175199: Index certain statements for Wikidata items.

@dcausse Could you explain a bit more how to set up the analyzer? I tried to figure out how to do it but I'm not sure whether I did it right.

Fri, Sep 8, 12:42 AM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata

Thu, Sep 7

Smalyshev moved T175199: Index certain statements for Wikidata items from Backlog to Doing on the User-Smalyshev board.
Thu, Sep 7, 6:48 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev claimed T175199: Index certain statements for Wikidata items.
Thu, Sep 7, 6:48 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev moved T174930: Problem with federated SPARQL query using UK Ordnance Survey open data -- bad prefix URL being sent ? from Backlog to Next on the User-Smalyshev board.
Thu, Sep 7, 6:47 PM · Patch-For-Review, User-Smalyshev, Discovery, Wikidata, Wikidata-Query-Service
Pintoch awarded T112715: Enable different URL shorteners for WDQS a Love token.
Thu, Sep 7, 12:36 PM · Discovery, Wikidata, Wikidata-Query-Service
Smalyshev moved T173231: Wikidata Elastic search drops results with matches in different language label from In progress to Needs review on the Discovery-Search (Current work) board.
Thu, Sep 7, 6:28 AM · User-Smalyshev, Discovery-Search (Current work), Patch-For-Review, Wikidata
Smalyshev added a comment to T175199: Index certain statements for Wikidata items.

I also wonder: is it possible to do the (de)boosting at the rescore stage? The reason is that we can select different rescore profiles from the URL (which means different widgets can use different boosts), while getting stuff added to the search query itself is more complicated. Of course, we can add more query params or query syntax, but it seems that tuning profiles may be easier to do?

Thu, Sep 7, 6:13 AM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata

Wed, Sep 6

Smalyshev added a comment to T175108: Does WDQS pre-declare <https://fr.wikipedia.org/> for schema:isPartOf ?.

I am not sure these micro-optimizations are worth the increased complexity... Maybe we need a test to see if it really produces any noticeable difference.

Wed, Sep 6, 11:14 PM · Discovery, Wikidata-Query-Service, Wikidata
Smalyshev moved T173231: Wikidata Elastic search drops results with matches in different language label from Next to Waiting on the User-Smalyshev board.
Wed, Sep 6, 9:55 PM · User-Smalyshev, Discovery-Search (Current work), Patch-For-Review, Wikidata
Smalyshev updated the task description for T175199: Index certain statements for Wikidata items.
Wed, Sep 6, 7:00 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev added a comment to T175199: Index certain statements for Wikidata items.

I wonder if we could rather have some sort of relationship (name tbd) keyword field that encodes both parts.

Wed, Sep 6, 6:57 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
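
The idea of a single keyword field that encodes both parts of a statement, discussed in the T175199 comments above, could look roughly like the sketch below. The "property=value" separator, the function names, and the sample data are assumptions made for illustration; they are not taken from the patch.

```python
def statement_keyword(property_id: str, value_id: str) -> str:
    """Encode a property and its value as one exact-match keyword (assumed format)."""
    return f"{property_id}={value_id}"


def keywords_for_item(statements, indexed_properties):
    """Keep only statements for explicitly listed properties and encode each one."""
    return [
        statement_keyword(prop, value)
        for prop, value in statements
        if prop in indexed_properties
    ]


# Hypothetical item data: (property, value) pairs.
statements = [("P31", "Q5"), ("P106", "Q82594"), ("P279", "Q215627")]
print(keywords_for_item(statements, {"P31", "P279"}))
```

Restricting the output to an explicit property set mirrors the suggestion elsewhere in the thread to index only P31/P279 in a first iteration.
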
Smalyshev added parent tasks for T175199: Index certain statements for Wikidata items: T148411: Item search for statements ranks disambiguation items too highly, T78157: [Story] Use ElasticSearch for entity search on wikidata.org.
Wed, Sep 6, 6:41 PM · Wikidata-Sprint, Discovery-Search (Current work), Patch-For-Review, User-Smalyshev, Wikidata
Smalyshev added a subtask for T78157: [Story] Use ElasticSearch for entity search on wikidata.org: T175199: Index certain statements for Wikidata items.
Wed, Sep 6, 6:41 PM · Wikidata-Sprint-2016-04-12, Discovery-Search, Wikidata-Sprint-2016-03-01, Story, Discovery, MediaWiki-extensions-WikibaseRepository, CirrusSearch, Wikidata, Wikidata-Sprint-2014-12-09§, Performance