Page MenuHomePhabricator
Feed Search

Yesterday

EBernhardson moved T187307: Special:Undelete title search should not be case sensitive from needs triage to Bugs on the Discovery-Search board.
Tue, Apr 21, 6:10 PM · Discovery-Search, CirrusSearch, MediaWiki-Page-deletion
EBernhardson edited projects for T187307: Special:Undelete title search should not be case sensitive, added: Discovery-Search; removed Discovery-Search (2026.04.06 - 2026.05.01).

We talked about this at our wednesday meeting and decided it's going to be a mid-sized investment to get this working. We need to pick a new unique id (plausibly log_id, but needs verification), then we would need to migrate to the new ids. We've never done a migration of doc ids so while we have some ideas, it will need further exploration and evaluation to determine how that change can be done in production without disabling archive search while the change is in progress.

Tue, Apr 21, 6:10 PM · Discovery-Search, CirrusSearch, MediaWiki-Page-deletion
EBernhardson moved T420239: Regex pattern in SearchHighlighter::highlightSimple must be escaped from Needs Review to Done on the Discovery-Search (2026.04.06 - 2026.05.01) board.
Tue, Apr 21, 4:29 PM · MW-1.45-notes, Discovery-Search (2026.04.06 - 2026.05.01), MW-1.46-notes (1.46.0-wmf.23; 2026-04-07), Performance Issue, MW-1.45-release, MediaWiki-Search
EBernhardson added a comment to T420239: Regex pattern in SearchHighlighter::highlightSimple must be escaped.

Thanks for the fix!

Will it be backported to 1.45.x and released sooner than 1.46?

Tue, Apr 21, 4:29 PM · MW-1.45-notes, Discovery-Search (2026.04.06 - 2026.05.01), MW-1.46-notes (1.46.0-wmf.23; 2026-04-07), Performance Issue, MW-1.45-release, MediaWiki-Search
EBernhardson created P91297 (An Untitled Masterwork).
Tue, Apr 21, 2:28 PM

Mon, Apr 20

EBernhardson added a comment to T417694: Perform a one-time clean up of retained data sets in event_sanitize.

I don't believe I've used the event_santized tables either. We do use some of the data beyond 90 days, but that's in a separate rollup table. It should be safe, afaik, to drop searchsatisfaction from the event_sanitized database.

Mon, Apr 20, 1:03 PM · Patch-For-Review, Essential-Work, Data-Engineering (Q4 FS25/26 April 1st - June 30st)

Thu, Apr 2

EBernhardson claimed T420239: Regex pattern in SearchHighlighter::highlightSimple must be escaped.
Thu, Apr 2, 5:36 PM · MW-1.45-notes, Discovery-Search (2026.04.06 - 2026.05.01), MW-1.46-notes (1.46.0-wmf.23; 2026-04-07), Performance Issue, MW-1.45-release, MediaWiki-Search

Wed, Apr 1

EBernhardson added a comment to T420239: Regex pattern in SearchHighlighter::highlightSimple must be escaped.

By chance are you using postgresql? SearchHighlighter::highlightSimple is documented as using the result of SearchDatabase::regexTerm. That looks to be applied in sqlite and mysql, but i suspect it is not being applied in the postgresql context.

Wed, Apr 1, 8:09 PM · MW-1.45-notes, Discovery-Search (2026.04.06 - 2026.05.01), MW-1.46-notes (1.46.0-wmf.23; 2026-04-07), Performance Issue, MW-1.45-release, MediaWiki-Search
EBernhardson claimed T421192: Cindy The Browser Test Bot fails with 0 failures.

The patch is not 100% related, but also addresses this issue as part of updating the messages posted to gerrit.

Wed, Apr 1, 1:05 PM · Discovery-Search (2026.04.06 - 2026.05.01), ci-test-error (WMF-deployed Build Failure), CirrusSearch

Tue, Mar 31

EBernhardson claimed T187307: Special:Undelete title search should not be case sensitive.
Tue, Mar 31, 6:33 PM · Discovery-Search, CirrusSearch, MediaWiki-Page-deletion
EBernhardson added a comment to T187307: Special:Undelete title search should not be case sensitive.

For Integrated Technology Group(ITG) the problem looks to be that log_page is 0, but we use log_page as the unique id of the page. There is a relevant ar_page_id in the archive table, but for reasons i don't remember the archive indexing works off the logging table, not off the archive table. These particular rows are from 2014, querying enwiki shows there are 0 delete logs since jan 1 2026 with log_type='delete' and log_action='delete' and log_page = 0, making me suspect this is a historical artifact. We could potentially change ForceSearchIndex to recognize log_page = 0 and try and look it up in the archive table.

Tue, Mar 31, 6:23 PM · Discovery-Search, CirrusSearch, MediaWiki-Page-deletion

Mon, Mar 30

EBernhardson moved T420886: The search token is no longer propagated in autocomplete search satisfaction logs from Needs Review to To be Deployed on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mon, Mar 30, 7:13 PM · Discovery-Search (2026.04.06 - 2026.05.01), MW-1.46-notes (1.46.0-wmf.23; 2026-04-07), CirrusSearch
EBernhardson added a comment to T421395: Evaluate analyzer changes between opensearch 1.3.x and 2.19.5.

One change the integration test suite found:

Mon, Mar 30, 7:00 PM · Discovery-Search (2026.04.06 - 2026.05.01), CirrusSearch
EBernhardson moved T420859: EntityHandlerTestCase causes invalid data provider failures under PHPUnit 10 from Incoming to Ready for Dev on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mon, Mar 30, 3:30 PM · MW-1.46-notes (1.46.0-wmf.24; 2026-04-14), Wikidata Lexicographical data, Wikidata
EBernhardson moved T187307: Special:Undelete title search should not be case sensitive from Incoming to Ready for Dev on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mon, Mar 30, 3:30 PM · Discovery-Search, CirrusSearch, MediaWiki-Page-deletion
EBernhardson set the point value for T420859: EntityHandlerTestCase causes invalid data provider failures under PHPUnit 10 to 1.
Mon, Mar 30, 3:30 PM · MW-1.46-notes (1.46.0-wmf.24; 2026-04-14), Wikidata Lexicographical data, Wikidata
EBernhardson moved T421395: Evaluate analyzer changes between opensearch 1.3.x and 2.19.5 from Incoming to Ready for Dev on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mon, Mar 30, 3:28 PM · Discovery-Search (2026.04.06 - 2026.05.01), CirrusSearch
EBernhardson moved T421192: Cindy The Browser Test Bot fails with 0 failures from Incoming to Ready for Dev on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mon, Mar 30, 3:28 PM · Discovery-Search (2026.04.06 - 2026.05.01), ci-test-error (WMF-deployed Build Failure), CirrusSearch
EBernhardson set the point value for T421395: Evaluate analyzer changes between opensearch 1.3.x and 2.19.5 to 5.
Mon, Mar 30, 3:27 PM · Discovery-Search (2026.04.06 - 2026.05.01), CirrusSearch
EBernhardson set the point value for T421192: Cindy The Browser Test Bot fails with 0 failures to 2.
Mon, Mar 30, 3:25 PM · Discovery-Search (2026.04.06 - 2026.05.01), ci-test-error (WMF-deployed Build Failure), CirrusSearch
EBernhardson moved T421192: Cindy The Browser Test Bot fails with 0 failures from needs triage to 2026.03.03 - 2026.04.03 on the Discovery-Search board.
Mon, Mar 30, 3:23 PM · Discovery-Search (2026.04.06 - 2026.05.01), ci-test-error (WMF-deployed Build Failure), CirrusSearch
EBernhardson added a comment to T421192: Cindy The Browser Test Bot fails with 0 failures.

This could have a better error message, what happens is no junit logs were created which is where the count comes from, but the pass/fail comes from the return code of running the tests. We could at least have a better message. The 0 failures seems to happen when docker gets wedged and refuses to bring up new containers.

Mon, Mar 30, 3:23 PM · Discovery-Search (2026.04.06 - 2026.05.01), ci-test-error (WMF-deployed Build Failure), CirrusSearch
EBernhardson edited projects for T421718: Search Platform: Re-IP eqiad private baremetal hosts to new per-rack vlans/subnets, added: Data-Platform-SRE; removed Discovery-Search.
Mon, Mar 30, 3:19 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17)
EBernhardson moved T420407: Migrate opensearch plugins to 2.19.5 from In Progress to Done on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mon, Mar 30, 3:18 PM · Patch-For-Review, Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch
EBernhardson moved T420759: Update wmf-opensearch-search-plugins in apt.wikimedia.org to 2.19.5 from Blocked / Waiting to Done on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mon, Mar 30, 3:17 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch

Thu, Mar 26

EBernhardson created T421395: Evaluate analyzer changes between opensearch 1.3.x and 2.19.5.
Thu, Mar 26, 3:12 PM · Discovery-Search (2026.04.06 - 2026.05.01), CirrusSearch

Mon, Mar 23

EBernhardson created T420965: Templated insource:// queries.
Mon, Mar 23, 5:00 PM · User-EBernhardson
EBernhardson closed T415398: Wikimedia Commons deepcategory searches return unexpected results for categories with "&" in name as Invalid.

It looks like the search components are doing as expected, the issue is in an external add-on. The issue will need to be addressed in that add on.

Mon, Mar 23, 4:29 PM · Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch
EBernhardson moved T420759: Update wmf-opensearch-search-plugins in apt.wikimedia.org to 2.19.5 from Incoming to Blocked / Waiting on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mon, Mar 23, 4:24 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch
EBernhardson set the point value for T420886: The search token is no longer propagated in autocomplete search satisfaction logs to 3.
Mon, Mar 23, 4:24 PM · Discovery-Search (2026.04.06 - 2026.05.01), MW-1.46-notes (1.46.0-wmf.23; 2026-04-07), CirrusSearch
EBernhardson renamed T420886: The search token is no longer propagated in autocomplete search satisfaction logs from The search token is not longer propagated in search satisfaction logs to The search token is not longer propagated in autocomplete search satisfaction logs.
Mon, Mar 23, 4:23 PM · Discovery-Search (2026.04.06 - 2026.05.01), MW-1.46-notes (1.46.0-wmf.23; 2026-04-07), CirrusSearch
EBernhardson set the point value for T417648: [MEX] M4 - improve findability of properties on lookups to 3.
Mon, Mar 23, 4:22 PM · Discovery-Search (2026.04.06 - 2026.05.01), Wikidata-Omega, Wikidata
EBernhardson set the point value for T420239: Regex pattern in SearchHighlighter::highlightSimple must be escaped to 2.
Mon, Mar 23, 4:21 PM · MW-1.45-notes, Discovery-Search (2026.04.06 - 2026.05.01), MW-1.46-notes (1.46.0-wmf.23; 2026-04-07), Performance Issue, MW-1.45-release, MediaWiki-Search
EBernhardson set the point value for T420427: Search shouldn't trim trailing space when suggesting suggestions to 3.
Mon, Mar 23, 4:19 PM · Discovery-Search (2026.04.06 - 2026.05.01), MW-1.46-notes (1.46.0-wmf.23; 2026-04-07), Patch-For-Review, CirrusSearch
EBernhardson moved T420582: Migrate Airflow Search instance code away from deprecated VariableProperties from Incoming to Ready for Dev on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mon, Mar 23, 4:18 PM · Discovery-Search (2026.04.06 - 2026.05.01)
EBernhardson set the point value for T420582: Migrate Airflow Search instance code away from deprecated VariableProperties to 1.
Mon, Mar 23, 4:18 PM · Discovery-Search (2026.04.06 - 2026.05.01)
EBernhardson moved T417648: [MEX] M4 - improve findability of properties on lookups from needs triage to 2026.03.03 - 2026.04.03 on the Discovery-Search board.
Mon, Mar 23, 4:16 PM · Discovery-Search (2026.04.06 - 2026.05.01), Wikidata-Omega, Wikidata
EBernhardson moved T420230: Migrate to OpenSearch 3.x from needs triage to [epic] on the Discovery-Search board.
Mon, Mar 23, 4:15 PM · Discovery-Search, CirrusSearch, Epic
EBernhardson moved T420239: Regex pattern in SearchHighlighter::highlightSimple must be escaped from needs triage to 2026.03.03 - 2026.04.03 on the Discovery-Search board.
Mon, Mar 23, 4:15 PM · MW-1.45-notes, Discovery-Search (2026.04.06 - 2026.05.01), MW-1.46-notes (1.46.0-wmf.23; 2026-04-07), Performance Issue, MW-1.45-release, MediaWiki-Search
EBernhardson moved T420427: Search shouldn't trim trailing space when suggesting suggestions from needs triage to 2026.03.03 - 2026.04.03 on the Discovery-Search board.
Mon, Mar 23, 4:13 PM · Discovery-Search (2026.04.06 - 2026.05.01), MW-1.46-notes (1.46.0-wmf.23; 2026-04-07), Patch-For-Review, CirrusSearch
EBernhardson moved T420582: Migrate Airflow Search instance code away from deprecated VariableProperties from needs triage to 2026.03.03 - 2026.04.03 on the Discovery-Search board.
Mon, Mar 23, 4:11 PM · Discovery-Search (2026.04.06 - 2026.05.01)
EBernhardson moved T420859: EntityHandlerTestCase causes invalid data provider failures under PHPUnit 10 from needs triage to 2026.03.03 - 2026.04.03 on the Discovery-Search board.
Mon, Mar 23, 4:11 PM · MW-1.46-notes (1.46.0-wmf.24; 2026-04-14), Wikidata Lexicographical data, Wikidata
EBernhardson moved T414095: Configure opensearch ML connectors/models from Needs Review to Done on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mon, Mar 23, 4:06 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Discovery-Search (2026.03.03 - 2026.04.03), Semantic Search
EBernhardson moved T418241: RevisionSearchResultTrait::initFromTitle() crashes on titles that are not proper pages from To be Deployed to Reported on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mon, Mar 23, 4:04 PM · MW-1.46-notes (1.46.0-wmf.20; 2026-03-17), Discovery-Search (2026.03.03 - 2026.04.03), MediaWiki-Search
EBernhardson moved T419590: Semantic Search fails on frwiki Special:Search from To be Deployed to Done on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mon, Mar 23, 4:04 PM · Semantic Search, MW-1.46-notes (1.46.0-wmf.20; 2026-03-17), CirrusSearch, Discovery-Search (2026.03.03 - 2026.04.03)

Mar 20 2026

EBernhardson created T420759: Update wmf-opensearch-search-plugins in apt.wikimedia.org to 2.19.5.
Mar 20 2026, 5:09 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch

Mar 18 2026

EBernhardson created T420473: Propagate article topics from clicked pages to source query.
Mar 18 2026, 1:59 PM · User-EBernhardson

Mar 17 2026

EBernhardson moved T420407: Migrate opensearch plugins to 2.19.5 from Incoming to In Progress on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mar 17 2026, 6:27 PM · Patch-For-Review, Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch
EBernhardson created T420407: Migrate opensearch plugins to 2.19.5.
Mar 17 2026, 6:26 PM · Patch-For-Review, Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch
EBernhardson moved T414103: Mjolnir feature collection failing in mjolnir_weekly Airflow DAG from To be Deployed to Done on the Discovery-Search (2026.03.03 - 2026.04.03) board.

Problem was traced down to null timestamps coming out of query_clicks_hourly. This was due to an overly specific format specifier and the source data adding millisecond precision to the timestamp. Timestamp conversion was changed to a more permissive conversion. The last three months of query_clicks_hourly and query_clicks_daily were backfilled. mjolnir dag was unpaused and completed a run.

Mar 17 2026, 4:52 PM · Discovery-Search (2026.03.03 - 2026.04.03)

Mar 16 2026

EBernhardson moved T414623: [Vector Search] Estimate resource consumption at scale from Reported to Done on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mar 16 2026, 4:07 PM · Semantic Search, Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch
EBernhardson moved T414623: [Vector Search] Estimate resource consumption at scale from Needs Review to Reported on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mar 16 2026, 4:07 PM · Semantic Search, Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch
EBernhardson moved T419590: Semantic Search fails on frwiki Special:Search from Needs Review to To be Deployed on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mar 16 2026, 4:06 PM · Semantic Search, MW-1.46-notes (1.46.0-wmf.20; 2026-03-17), CirrusSearch, Discovery-Search (2026.03.03 - 2026.04.03)
EBernhardson created P89865 (An Untitled Masterwork).
Mar 16 2026, 3:26 PM

Mar 13 2026

EBernhardson moved T414103: Mjolnir feature collection failing in mjolnir_weekly Airflow DAG from Ready for Dev to To be Deployed on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mar 13 2026, 3:27 PM · Discovery-Search (2026.03.03 - 2026.04.03)

Mar 12 2026

EBernhardson closed T419029: Grant Access to ops for ebernhardson as Invalid.
Mar 12 2026, 9:39 PM · Data-Platform-SRE (2026-03-06 - 2026-03-27), SRE-Access-Requests, SRE

Mar 11 2026

EBernhardson set the point value for T419590: Semantic Search fails on frwiki Special:Search to 2.
Mar 11 2026, 6:37 PM · Semantic Search, MW-1.46-notes (1.46.0-wmf.20; 2026-03-17), CirrusSearch, Discovery-Search (2026.03.03 - 2026.04.03)
EBernhardson claimed T419590: Semantic Search fails on frwiki Special:Search.
Mar 11 2026, 6:36 PM · Semantic Search, MW-1.46-notes (1.46.0-wmf.20; 2026-03-17), CirrusSearch, Discovery-Search (2026.03.03 - 2026.04.03)
EBernhardson edited projects for T419727: Stand up semantic search cluster in codfw, added: Data-Platform-SRE (2026-03-06 - 2026-03-27); removed Data-Platform-SRE.
Mar 11 2026, 4:00 PM · Data-Platform-SRE (2026-03-06 - 2026-03-27), Discovery-Search, CirrusSearch
EBernhardson added a project to T419727: Stand up semantic search cluster in codfw: Data-Platform-SRE.
Mar 11 2026, 4:00 PM · Data-Platform-SRE (2026-03-06 - 2026-03-27), Discovery-Search, CirrusSearch
EBernhardson created T419727: Stand up semantic search cluster in codfw.
Mar 11 2026, 3:59 PM · Data-Platform-SRE (2026-03-06 - 2026-03-27), Discovery-Search, CirrusSearch

Mar 10 2026

EBernhardson added a comment to T414103: Mjolnir feature collection failing in mjolnir_weekly Airflow DAG.

Had a bit of time to start looking into this, some findings:

Mar 10 2026, 8:16 PM · Discovery-Search (2026.03.03 - 2026.04.03)
EBernhardson added a comment to T187307: Special:Undelete title search should not be case sensitive.

Checked with some people that have admin, it is still returning different results for each. The cased query returns 4 results, one of the results found is the lower cased variant. The uncased query returns 2 results, one cased and one uncased. Curiously of the two results that go missing one is an exact match other than casing.

Mar 10 2026, 6:26 PM · Discovery-Search, CirrusSearch, MediaWiki-Page-deletion
EBernhardson moved T419590: Semantic Search fails on frwiki Special:Search from needs triage to 2026.03.03 - 2026.04.03 on the Discovery-Search board.
Mar 10 2026, 6:21 PM · Semantic Search, MW-1.46-notes (1.46.0-wmf.20; 2026-03-17), CirrusSearch, Discovery-Search (2026.03.03 - 2026.04.03)
EBernhardson created T419590: Semantic Search fails on frwiki Special:Search.
Mar 10 2026, 6:20 PM · Semantic Search, MW-1.46-notes (1.46.0-wmf.20; 2026-03-17), CirrusSearch, Discovery-Search (2026.03.03 - 2026.04.03)
EBernhardson added a comment to T187307: Special:Undelete title search should not be case sensitive.

I poked through the code and tested this localy, I wasn't able to reproduce. Potentially the problem has been fixed over the last number of years, or maybe it requires more specific conditions to be triggered. I don't have rights to test on enwiki directly, would appreciate if someone could re-verify the links in the ticket.

Mar 10 2026, 5:56 PM · Discovery-Search, CirrusSearch, MediaWiki-Page-deletion
EBernhardson moved T418241: RevisionSearchResultTrait::initFromTitle() crashes on titles that are not proper pages from Needs Review to To be Deployed on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mar 10 2026, 3:49 PM · MW-1.46-notes (1.46.0-wmf.20; 2026-03-17), Discovery-Search (2026.03.03 - 2026.04.03), MediaWiki-Search

Mar 9 2026

EBernhardson moved T414623: [Vector Search] Estimate resource consumption at scale from In Progress to Needs Review on the Discovery-Search (2026.03.03 - 2026.04.03) board.

index/memory ratio has been a bit vague, to be more concrete:

Mar 9 2026, 8:45 PM · Semantic Search, Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch
EBernhardson claimed T418241: RevisionSearchResultTrait::initFromTitle() crashes on titles that are not proper pages.

Thanks for the report! It looks like your fix should do the trick, i've put it up into gerrit for review.

Mar 9 2026, 6:40 PM · MW-1.46-notes (1.46.0-wmf.20; 2026-03-17), Discovery-Search (2026.03.03 - 2026.04.03), MediaWiki-Search
EBernhardson moved T419041: Enable custom readahead settings for Ceph block devices serving workload on the dse-k8s clusters from Incoming to Blocked / Waiting on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mar 9 2026, 6:21 PM · Patch-For-Review, Discovery-Search (2026.03.03 - 2026.04.03), Data-Platform-SRE (2026-03-06 - 2026-03-27)
EBernhardson added a comment to T414095: Configure opensearch ML connectors/models.

This has expanded a bit, it now also handles roles and role groups, since we need to set permissions such that the incoming requests can execute msearch and load models.

Mar 9 2026, 5:44 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Discovery-Search (2026.03.03 - 2026.04.03), Semantic Search
EBernhardson updated subscribers of T418469: latest Elastica release does not include Elastica\Client?.

@Reedy It looks like in 1.42 the vendor/ directory was included, but in 43-45 it was not. This feels like a change in the way mediawiki is packaged, but i didn't find anything in the release notes. Any ideas?

Mar 9 2026, 4:48 PM · VPS-project-Extdist, Discovery-Search, CirrusSearch
EBernhardson moved T187307: Special:Undelete title search should not be case sensitive from needs triage to 2026.03.03 - 2026.04.03 on the Discovery-Search board.
Mar 9 2026, 4:34 PM · Discovery-Search, CirrusSearch, MediaWiki-Page-deletion
EBernhardson moved T419397: Get search results for different embedding models from semantic search from needs triage to 2026.03.03 - 2026.04.03 on the Discovery-Search board.
Mar 9 2026, 4:20 PM · Discovery-Search (2026.04.06 - 2026.05.01), Research, Semantic Search
EBernhardson moved T419409: Get search results from semantic search using MIRACL benchmark dataset from needs triage to 2026.03.03 - 2026.04.03 on the Discovery-Search board.
Mar 9 2026, 4:19 PM · Discovery-Search (2026.04.06 - 2026.05.01), Research, Semantic Search
EBernhardson claimed T414623: [Vector Search] Estimate resource consumption at scale.
Mar 9 2026, 4:07 PM · Semantic Search, Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch
EBernhardson moved T418388: Upgrade DSE k8s opensearch clusters to 3.5.0 from Blocked / Waiting to Done on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mar 9 2026, 4:06 PM · Discovery-Search (2026.03.03 - 2026.04.03), Data-Platform-SRE (2026-03-06 - 2026-03-27)
EBernhardson moved T414095: Configure opensearch ML connectors/models from To be Deployed to Needs Review on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mar 9 2026, 4:03 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Discovery-Search (2026.03.03 - 2026.04.03), Semantic Search
EBernhardson moved T413969: Make semantic search accessible through Action API from To be Deployed to Done on the Discovery-Search (2026.03.03 - 2026.04.03) board.
Mar 9 2026, 4:03 PM · MW-1.46-notes (1.46.0-wmf.20; 2026-03-17), Discovery-Search (2026.03.03 - 2026.04.03), Semantic Search, CirrusSearch

Mar 6 2026

EBernhardson added a comment to T413969: Make semantic search accessible through Action API.

There are probably still a number of rough edges, but this is generally working now: frwiki example

Mar 6 2026, 8:59 PM · MW-1.46-notes (1.46.0-wmf.20; 2026-03-17), Discovery-Search (2026.03.03 - 2026.04.03), Semantic Search, CirrusSearch
EBernhardson created T419283: MediaWiki Telemetry::getRequestId() has different format between cli (mwscript) and web requests.
Mar 6 2026, 7:39 PM · MW-1.46-notes (1.46.0-wmf.19; 2026-03-10), Observability-Tracing
EBernhardson closed T419174: qwen3-embedding:predict returning 503 to all requests as Resolved.
Mar 6 2026, 7:26 PM · Machine-Learning-Team, Discovery-Search (2026.02.02 - 2026.02.27), Semantic Search, CirrusSearch
EBernhardson closed T419174: qwen3-embedding:predict returning 503 to all requests, a subtask of T413969: Make semantic search accessible through Action API, as Resolved.
Mar 6 2026, 7:26 PM · MW-1.46-notes (1.46.0-wmf.20; 2026-03-17), Discovery-Search (2026.03.03 - 2026.04.03), Semantic Search, CirrusSearch

Mar 5 2026

EBernhardson updated subscribers of T419174: qwen3-embedding:predict returning 503 to all requests.
Mar 5 2026, 10:06 PM · Machine-Learning-Team, Discovery-Search (2026.02.02 - 2026.02.27), Semantic Search, CirrusSearch
EBernhardson added a comment to T419174: qwen3-embedding:predict returning 503 to all requests.

Potentially related: T418976. Not certainly, but that ticket involved changes to the helm bits that serve qwen3 and had deployments today.

Mar 5 2026, 10:05 PM · Machine-Learning-Team, Discovery-Search (2026.02.02 - 2026.02.27), Semantic Search, CirrusSearch
EBernhardson created T419174: qwen3-embedding:predict returning 503 to all requests.
Mar 5 2026, 10:00 PM · Machine-Learning-Team, Discovery-Search (2026.02.02 - 2026.02.27), Semantic Search, CirrusSearch
EBernhardson added a comment to T415299: Incomplete deepcategory search results despite of no warning message.

I put together a self-contained .html page that will request the querys that are executed for two different deepcat queries and report on the differences in categories that will be included/excluded:

Mar 5 2026, 8:07 PM · Discovery-Search (2026.03.03 - 2026.04.03), Commons, CirrusSearch
EBernhardson added a comment to T415299: Incomplete deepcategory search results despite of no warning message.

If we adjust the second query to exclude English-language SVG maps of the world instead of English-language SVG maps we get matching result counts of 1086 for both:

Mar 5 2026, 6:45 PM · Discovery-Search (2026.03.03 - 2026.04.03), Commons, CirrusSearch
EBernhardson moved T415299: Incomplete deepcategory search results despite of no warning message from Ready for Dev to Needs Review on the Discovery-Search (2026.02.02 - 2026.02.27) board.

I finally had a chance to dig into this one. As far as i can tell, English-language SVG maps is not excluded in the first query, but is explicitly added as an exclusion in the second query. So the result descripency is likely to be due to this addition.

Mar 5 2026, 6:43 PM · Discovery-Search (2026.03.03 - 2026.04.03), Commons, CirrusSearch
EBernhardson moved T413969: Make semantic search accessible through Action API from Needs Review to To be Deployed on the Discovery-Search (2026.02.02 - 2026.02.27) board.
Mar 5 2026, 5:26 PM · MW-1.46-notes (1.46.0-wmf.20; 2026-03-17), Discovery-Search (2026.03.03 - 2026.04.03), Semantic Search, CirrusSearch
EBernhardson moved T414095: Configure opensearch ML connectors/models from Needs Review to To be Deployed on the Discovery-Search (2026.02.02 - 2026.02.27) board.
Mar 5 2026, 5:26 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Discovery-Search (2026.03.03 - 2026.04.03), Semantic Search

Mar 4 2026

EBernhardson added a comment to T419029: Grant Access to ops for ebernhardson.

I was thinking of access as not solving the current issue, as we have a plan forward for that, but as more of addressing possibilities on a longer-term basis.  It seems like once or twice a year I run into something that would go easier if I had more access.  I see from the puppet data.yaml file that we have a couple, but very few, engineers with ops access. This isn't the first time the question of ops level access has come up, but in the past I've pushed off requesting access as it seemed not strictly necessary. It's still not strictly necessary, but I'm leaning towards this easing some of the work I do. The full solutions, like the readahead support being setup now, would still be the end-state we would be looking for, but the additional access would better allow figuring out where these things need to be before the full solution is ready to be deployed.

Mar 4 2026, 7:30 PM · Data-Platform-SRE (2026-03-06 - 2026-03-27), SRE-Access-Requests, SRE
EBernhardson created T419029: Grant Access to ops for ebernhardson.
Mar 4 2026, 4:16 PM · Data-Platform-SRE (2026-03-06 - 2026-03-27), SRE-Access-Requests, SRE

Mar 3 2026

EBernhardson created P89748 (An Untitled Masterwork).
Mar 3 2026, 6:36 PM
EBernhardson claimed T414095: Configure opensearch ML connectors/models.
Mar 3 2026, 2:44 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Discovery-Search (2026.03.03 - 2026.04.03), Semantic Search
EBernhardson moved T414095: Configure opensearch ML connectors/models from To be Deployed to Needs Review on the Discovery-Search (2026.02.02 - 2026.02.27) board.
Mar 3 2026, 2:44 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Discovery-Search (2026.03.03 - 2026.04.03), Semantic Search

Mar 2 2026

EBernhardson added a comment to T414623: [Vector Search] Estimate resource consumption at scale.

First tests with the full frwiki semantic search dataset showed high latency and significant ceph IO at ~4GB/sec. This appears to be a problem with readahead on the ceph-backed storage system. It defaults to 8MB which is far too much for the random-access nature of knn search.

Mar 2 2026, 6:15 PM · Semantic Search, Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch
EBernhardson moved T418130: Analyse Wrong-Keyboard-Detection Usage from Incoming to Ready for Dev on the Discovery-Search (2026.02.02 - 2026.02.27) board.
Mar 2 2026, 4:32 PM · Discovery-Search (2026.04.06 - 2026.05.01), CirrusSearch
EBernhardson set the point value for T414623: [Vector Search] Estimate resource consumption at scale to 8.
Mar 2 2026, 4:31 PM · Semantic Search, Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch
EBernhardson moved T414623: [Vector Search] Estimate resource consumption at scale from Incoming to In Progress on the Discovery-Search (2026.02.02 - 2026.02.27) board.
Mar 2 2026, 4:29 PM · Semantic Search, Discovery-Search (2026.03.03 - 2026.04.03), CirrusSearch