Page MenuHomePhabricator
Feed Advanced Search

Fri, Jun 7

Isaac added a comment to T361637: Support for topic infrastructure work.

Weekly updates:

  • None (at ICWSM during the week)
Fri, Jun 7, 1:50 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T354559: Put together diff blogpost on AI + Wikimedia + Datasets.

Weekly update:

  • No progress on blogpost but lots of notes from ICWSM about datasets for computational social science. In particular, some clear trends about the importance of having wiki-specific characteristics available for regression modeling (like MI's Wiki Comparison sheet which we could better share to those communities) and enabling more multilingual datasets for certain phenomena of interest such as things related to patrolling/moderation.
Fri, Jun 7, 1:49 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T360572: Extend Article Quality Model to use HTML.

Weekly updates:

  • Merged Destinie's tutorial notebook MR (!) and assigned the bug as a next step
  • Put together an exploration of a different type of model (ordinal logistic regression instead of linear regression). This came out of Destinie's results and pair-plot analyses which were showing that as the number of features grew, the "effect while holding other variables constant" aspect of linear regression was leading to coefficients whose interpretation didn't match reality -- e.g., fewer sources -> higher quality. At first the thought was to go for something like Naive Bayes where the coefficients are learned independently but I could not find a model of that type that did linear regression so I went back to the drawing board to reconsider what a ordinal logistic regression model would look like. I had originally abandoned because the coefficients were less interpretable, it wasn't clear to me that I could effectively convert the class probabilities to a single point prediction between 0 and 1 (the desired output), and I was hoping to avoid the semi-complicated statsmodel dependency on LiftWing. In this revisiting, I figured out how to reproduce the model outputs without the statsmodel dependency and convert the logits generated by the model into a reasonable 0-1 range in a reproducible and not purely arbitrary way. The coefficients still are harder to interpret than the linear model ones but they're pretty straightforward (positive = good; negative = bad) and they match expectations. And an initial eval of model performance suggests that it matches or beats the linear approach. Notebook: https://public-paws.wmcloud.org/User:Isaac%20(WMF)/Quality/html-qual-exploration.ipynb#Ordinal-Logistic-Regression
Fri, Jun 7, 1:46 PM · Research (FY2023-24-Research-April-June), Epic

Tue, Jun 4

Isaac added a comment to T363022: Have Event Invitations scoring model analyzed by Research.

Just acknowledging -- thanks @ifried for engaging! All of this makes sense to me. Thinking about not spamming high-edit-count editors though I don't have anything that obviously fixes it if reaching some of these relevant highly-active editors is a priority:

  • Simplest way is probably to just pass along global edit count to organizers and encourage them to practice care before inviting -- e.g., checking user pages to see if they express an interest in the topic.
  • I raised the prospect of filtering out edits with edit tags that suggested that they're semi-automated (autowikibrowser etc.) which could be a small step in filtering out some of this
  • Maybe there's a way to use watchlists to help filter this out -- e.g., for high edit-count editors, check if the worklist articles are on their watchlists? The challenge here is one of privacy -- this is private information and you wouldn't want to leak it. So maybe this is instead future functionality that actually privately notifies editors if an upcoming event has a high overlap with their watchlist?
  • Maybe for each worklist, you also collect a small random sample of articles (e.g., 20) and filter out anyone who also appears heavily in those?
  • Larger than this one project, but we probably will eventually need some sort of opt-out system for these things (invitations but also surveys etc.)? And then it would be just a matter of cross-referencing the invitations with e.g., the user preference table to see who can be contacted
Tue, Jun 4, 1:50 PM · Campaign-Registration, CampaignEvents, Campaign-Tools (Campaign-Tools-Current-Sprint)

Fri, May 31

Isaac added a comment to T354559: Put together diff blogpost on AI + Wikimedia + Datasets.

Weekly updates:

  • No progress
Fri, May 31, 6:09 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T360572: Extend Article Quality Model to use HTML.

Weekly updates:

  • Provided feedback on Destinie's tutorial notebook for mwparserfromhtml (MR) and was excited that her work surfaced a bug in the library (though it's my code that is buggy). Many eyes!
Fri, May 31, 5:43 PM · Research (FY2023-24-Research-April-June), Epic
Isaac added a comment to T361637: Support for topic infrastructure work.

Weekly updates:

  • Put together a task (T366273) and meta page for my Q1 hypothesis of the article-country model. Hypotheses will be officially shared in the next few weeks and the request was put out for Meta pages with additional information for interested community members.
  • Provided feedback on Campaign's worklist -> editor invitation candidates scoring approach (T363022)
  • Shared some thoughts on topic infrastructure with Inuka team
Fri, May 31, 5:40 PM · Research (FY2023-24-Research-April-June)

Thu, May 30

Isaac updated the task description for T366273: Article country model.
Thu, May 30, 6:49 PM · Research, Epic
Isaac added a comment to T363022: Have Event Invitations scoring model analyzed by Research.

Thanks for engaging! Sounds like we're on the same page, and again I wanted to stress that I think what you all have built is quite reasonable.

Thu, May 30, 4:00 PM · Campaign-Registration, CampaignEvents, Campaign-Tools (Campaign-Tools-Current-Sprint)
Isaac moved T366273: Article country model from Backlog to Epics on the Research board.
Thu, May 30, 12:52 PM · Research, Epic
Isaac created T366273: Article country model.
Thu, May 30, 12:51 PM · Research, Epic

Wed, May 29

Isaac updated subscribers of T363022: Have Event Invitations scoring model analyzed by Research.

Ok, I got a chance to take a look. Thanks to @Daimona for the very well-documented code and function graphs, @Iflorez for your notes, and @ifried for the excellent Meta overview -- this made my job far simpler! Broad feedback:

  • Overall:
    • Generally I assume your approach is working pretty well (though this is mostly based on hunches as we don't have enough data to really evaluate yet). I have some thoughts on smaller tweaks but I suspect that the report is spot on when it notes that the main feature that seems to impact the effectiveness of these recommendations/invitations is the quality of the landing page for the event and clarity of what they're doing. My main actionable recommendation is to consider flipping the edit-count feature so it prioritizes newcomers over experienced editors (more below).
  • Modeling choices:
    • When I think about designing an algorithm (or learned ML model) to solve this sort of task of classifying data, I'm generally think about the trade-offs between simplicity and precision. Where on one end, you have a really simple model that anyone can hopefully understand and modify. Maybe its not perfect but that's okay because it's transparent to users and easy to adjust. On the other hand, are much more complex models that can't fully be explained but maybe are far more accurate at the task at hand. While it's tempting to think there's a good compromise solution that has a balance of both, I actually generally find myself either going with maximally simple or maximally performant depending on the task. The middle ground unfortunately usually just means that it's sorta hard to explain to folks/adjust but it still isn't so good that folks will happily accept the outputs.
    • For this model, I think we're clearly in the space of wanting a maximally simple approach because the task is inherently hazy (there is no "right" answer for who to invite). I think you're largely in that space, which is great. I think the features are pretty straightforward at a high-level. The geometric mean might confuse some folks but it still is pretty easy to explain. I'll admit that personally I find some of the more complicated feature normalization functions to be more than is necessary but that's a nitpick and the graphs provided in the comments help a lot.
  • Evaluation data:
    • The main challenge is not having a method to evaluate whether the approach is working and potentially train a model to learn what the more empirical balance of feature weights is (right now they're set to 5 recent-activity vs. 4 bytes vs. 1 edit-count). So most of my feedback is based on hunches. With a fair bit of work, I assume you could encode who was recommended vs. who was invited and use that to adjust the model weights -- e.g., via a logistic regression model as Irene raised in her feedback -- but there isn't a lot of data yet so I don't know that that's useful at this stage. If you expand the pilot, I'd recommend tracking this somewhere (user, overallScore, rank, bytesScore, editCountScore, recentActivityScore, selected-for-invite, did-join-event).
    • As an alternative, it's hard to think of reasonable proxies on the wikis for this sort of task that would have more data. You could seek to re-generate past campaign participant lists but these are quite messy in terms of gathering that data and you'd have to find campaigns that recruited experienced contributors and had some sort of topical focus. WikiProjects don't currently have a good way of recording participants or they'd be a natural fit.
  • Feature feedback:
    • Bytes:
      • Simple and while certainly not perfect, in practice it does map pretty well to edit difficulty / engagement once bots and reverts are filtered out as you do (quick analysis). This also makes sense -- adding a new reference/sentence/etc. generally requires adding a fair amount of bytes. The big maintenance edits are often via bots -- e.g., IABot as is mentioned somewhere. If you wanted, you could also do things like remove any edits that match a set of tools often used for basic maintenance -- e.g., for enwiki, I've used ['AWB', 'twinkle', 'huggle', 'WPCleaner', 'canned edit summary', 'OAuth CID: 1805', 'RedWarn', 'Ultraviolet'] in the past as edit tags that indicate tool-assisted edits but that's a manual list I put together based on scanning Special:Tags.
      • The main challenge I see with this is that you're not distinguishing between editors who are active in the topic space and editors who are just very active. For example, when I was doing some work on finding similar users to a given editor (as a proxy for potential sockpuppets), I was building a list of editors who most overlapped with a given editor based on edit history. But if you just use this info, User:Ser Amantio di Nicolao appears on everyone's list because of how prolific they are. Instead, you need to normalize for how many edits someone makes in general (akin to tf-idf). This is challenging to do via the APIs but you might actually consider weakly "penalizing" editors for high-edit count instead of rewarding them. Where the logic is that while high-edit-count editors might be good contributors, they also are more likely to have just incidentally edited these pages and not have an actual topical interest and we want to reduce spam to them too (them ending up on everyone's invitation list).
    • Edit count:
      • Building further on the above about "penalizing" high-edit-count editors. Given the nature of geometric mean (tends towards the lowest value in the set), this means that if you have an editor who signed up and maybe made a few edits via Newcomer Homepage about a topic of interest to them but isn't sure what to do next, they're going to have a really small value for the edit-count feature and also probably for bytes-changed (assuming they made simple edits). This is going to put them always at the bottom of the list when I feel like maybe they're the most important people to invite (as they need that nudge to continue and structure of a campaign and they're least likely to discover the campaign themselves). As I said above, I feel like you might want to actually take the inverse of someone's edit count for this feature so lower-edit-count folks who overlapped with the worklist are prioritized.
    • Recent activity:
      • Seems reasonable to me!
  • Expansions:
    • Honestly I feel like you have a reasonable set of features based on what can be efficiently gathered from the databases. I've done some work on classifying edits by what they changed so you could e.g., isolate editors who added new references to articles and prioritize them, but that's a much more expensive feature to compute and it's not cached anywhere yet unfortunately.
    • We've discussed using the list-building tool (or even just Search's morelike API for something production-ready) to expand the worklist (when they're tiny) to include more articles in the same topical space (and therefore more potential editors). We don't have any way of evaluating how impactful that would be. My feeling would be let's explore it if you're hearing that the invitation lists are too short or overlap too much with who they already were going to invite. But otherwise not necessary yet.
Wed, May 29, 8:26 PM · Campaign-Registration, CampaignEvents, Campaign-Tools (Campaign-Tools-Current-Sprint)

Mon, May 27

Isaac added a comment to T354565: Edit summaries: dissemination of findings.

Paper: https://arxiv.org/abs/2404.03428

Mon, May 27, 5:06 PM · Research (FY2023-24-Research-January-March)
Isaac added a comment to T360455: Add Article Quality Model to LiftWing.

Just adding another note of where these quality scores could be useful (filtering machine translation candidates): T293648#9816202

Mon, May 27, 2:33 PM · Research, Machine-Learning-Team

Fri, May 17

Isaac added a comment to T354559: Put together diff blogpost on AI + Wikimedia + Datasets.

Weekly updates:

Fri, May 17, 9:46 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T360572: Extend Article Quality Model to use HTML.

Weekly updates:

  • Sources is proving tricky as a feature so I left some ideas for how to "debug" what's going on there in: T364014#9810129
  • Documentation work is continuning! Details: T365269
Fri, May 17, 8:50 PM · Research (FY2023-24-Research-April-June), Epic
Isaac added a comment to T361637: Support for topic infrastructure work.

Not too much movement in this space this week on my end though some additional tasks that I signed up for:

  • Helping with upcoming Community Growth contract hiring
  • Review for T363022
Fri, May 17, 8:28 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T364014: Incorporate in new HTML features to quality model.

Thanks @DJames-WMF for your first pass (notebook)! Next steps now that we have some initial findings:

  • You're extracting message-boxes/infoboxes correctly but you'll update the code to make sure they're retained in the final model as well.
  • You can trim out some of the old summary at the top of the notebook to focus on just this expanded model and its coefficients (with a link back to Copy-2 where folks can find more details about the earlier iterations).
  • What to do about this mysterious negative sources coefficient?
    • I've been wondering how consistent the different languages are. One way to test this would be to train a separate model for each language and see how similar the coefficients are. This might help identify features that are less stable and worth investigating. Perhaps some insight on the sources feature or maybe even some of the others too. I expect the magnitudes might vary a bit but if any switch from positive to negative (or vice versa), that would be the interesting sign to pay attention to.
    • In these sorts of models, each feature coefficient is the impact of that feature when all other features are held constant. Because of this dependence on other features, the coefficient for a feature like sources doesn't just depend on its relationship to quality but also on its relationship to all the other features. I think this is what is going on. This (quite long) tutorial on linear model coefficients/interpretation/fine-tuning has some useful examples of this and ways to chart things out. In particular, I would love to see similar charts for our data/model to:
      • The pairplots in this section, which will help us see which coefficients are correlated with each other.
      • The coefficient variability plot in this section, which will help us see which model coefficients are unstable.
      • If it turns out that sources is highly correlated with the other features, we might have to take it out. It might also be possible to e.g., switch it to a simpler boolean (>= 5 sources) and that that would help reduce the correlation while still retaining most of the benefits of counting unique sources separately from references.
Fri, May 17, 8:12 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T308164: Migrate Content Translation Recommendation API to Lift Wing.

@ngkountas I think you're right. Let me know if you have a task for making the switch because once you all have completed the transfer and verified it's working for a bit, I'll look into deprecating the current endpoint that you're using (as it's long overdue for being shut down).

Fri, May 17, 5:01 PM · Language-Team, Machine-Learning-Team, Epic
Isaac created T365269: Improve documentation for mwparserfromhtml.
Fri, May 17, 3:59 PM · Research (FY2023-24-Research-April-June)
Isaac reassigned T364014: Incorporate in new HTML features to quality model from Isaac to DJames-WMF.
Fri, May 17, 3:26 PM · Research (FY2023-24-Research-April-June)

Fri, May 10

Isaac added a comment to T354559: Put together diff blogpost on AI + Wikimedia + Datasets.

Weekly updates:

Fri, May 10, 8:28 PM · Research (FY2023-24-Research-April-June)

May 9 2024

Isaac added a comment to T343241: Build a taxonomy for "impactful topics".

Summary of some data analysis I did for evaluating the current topic taxonomy and gathering some thoughts about potential changes (google doc with more data/notes):

  • We'll have to do some cleaning up of the WikiProject->Topic mapping as WikiProject names etc. have shifted since it was created in 2020. For example, WikiProject Climate Change used to be a task force of WikiProject Environment (I think) and then became its own project so is not currently tracked in the data. This seems pretty doable though as a one-time manual pass and allocation of larger WikiProjects to specific topics.
  • Big changes that I think we are pretty certain about:
    • Shifting of geographic topics to a country-based model. This will allow for more granularity than current regions and incorporate data from Wikidata so build on that community work.
    • Shifting of model-based outputs for people (biography/women topics) to a Wikidata-based output (deterministic based on instance-of:human and gender properties). This will lose some of the hazier, women-related topics that the model-based women topic could surface but be clearer (less likely to provide problematic predictions) and we will see about addressing some of this change with the new topics.
  • A number of small changes to the arts/science topics -- e.g., perhaps merge a few categories that get low usage and have low coverage.
  • The larger discussion will be around how to handle some of the existing history/society topics and what topics are possible for folks engaged in sustainability and human rights work.
  • Expanding the data pipeline to incorporate WikiProjects from other language editions wouldn't have a large effect at the moment (most major wikiprojects with coverage of non-English articles are for geographic/biographical topics and only a few are in areas where we probably do need more diverse data like history/society topics). But this might be useful for certain topics if we do have low data volume/diversity from English and we know there are relevant WikiProjects in other language editions supported by PageAssessments.
May 9 2024, 2:02 PM · Research
Isaac added a parent task for T361637: Support for topic infrastructure work: T343241: Build a taxonomy for "impactful topics".
May 9 2024, 1:55 PM · Research (FY2023-24-Research-April-June)
Isaac added a subtask for T343241: Build a taxonomy for "impactful topics": T361637: Support for topic infrastructure work.
May 9 2024, 1:55 PM · Research
Isaac added a comment to T361637: Support for topic infrastructure work.

@Miriam I don't mind either way but I'll be bold. This is my quarterly goal task so it touches on the topic classification evolution but also other related aspects and I mainly see as a personal tracking task that I intend to close out at the end of this quarter. I think best thing would be to make this a subtask of T343241 (as in I'm playing a supporting role for the taxonomy work) and I'll shift my updates over there when they're about the topic taxonomy.

May 9 2024, 1:54 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T361637: Support for topic infrastructure work.

Weekly updates (adding early while it's fresh):

  • I put together some data and thoughts around the next iteration of the topic classification model in discussion with Alex as far as what steps Community Growth should be leading to do the community consultations on making improvements to it (google doc). Summary:
    • We'll have to do some cleaning up of the WikiProject->Topic mapping as WikiProject names etc. have shifted since it was created in 2020. This seems pretty doable though.
    • Big changes we both agree on are the shifting of geographic topics to a country-based model and shifting of model-based outputs for biography/gender to a Wikidata-based output (deterministic).
    • A number of small changes to the arts/science topics -- e.g., perhaps merge a few categories that get low usage and have low coverage.
    • The larger discussion will be around how to handle some of the existing history/society topics and what topics are possible for folks engaged in sustainability and human rights work.
    • Expanding the data pipeline to incorporate WikiProjects from other language editions wouldn't have a large effect at the moment (most major wikiprojects with coverage of non-English articles are for geographic/biographical topics and only a few are in areas where we probably do need more diverse data like history/society topics).
May 9 2024, 1:15 PM · Research (FY2023-24-Research-April-June)

May 3 2024

Isaac added a comment to T354559: Put together diff blogpost on AI + Wikimedia + Datasets.

Weekly updates:

May 3 2024, 9:00 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T360572: Extend Article Quality Model to use HTML.

Weekly updates:

  • New normalization values generated! code and values.
  • Destinie has made good progress on incorporating these into the notebook and I think we're essentially in a place where we can begin to add new features to the model. I put together a task for that (T364014). She also has begun work on improving some of the mwparserfromhtml documentation (MRs) so she'll have a better sense of what the library can do re: potential new features.
May 3 2024, 1:34 PM · Research (FY2023-24-Research-April-June), Epic
Isaac added a comment to T361637: Support for topic infrastructure work.

Weekly updates:

  • Added basic form of link-based inference to the country-article prototype -- e.g., https://wiki-region.wmcloud.org/regions?lang=en&title=Japanese%20iris. That was a final feasibility check for me and I'm going to pause on development for that for now until Q1 begins. The next steps for when I pick back up that work:
    • Evaluation:
      • Offline: probably a large stratified sample by geo + language edition to test link-based logic -- i.e. whether it can reproduce what's already in Wikidata. I think I should be able to easily re-write the API logic to use the cluster instead so it's fast to test/iterate.
      • Human: a small sample of articles with Wikidata properties to just verify that those are indeed accurate and complete when present but I think it's fair to assume ~100% precision/recall for those. Focus then would be on articles lacking Wikidata-based country properties. For those, just have folks go through the corresponding Wikipedia article and tag with any relevant countries. Might need to stratify by continent to make sure even-ish coverage but I want to keep the sample size manageable.
    • Guardrails (how to handle links):
      • Motivating challenge here is something like the biodiversity articles -- e.g., https://en.wikipedia.org/wiki/Limonium_strictissimum. This plant is native to Italy/France, which is mentioned in the article, so ideally those two countries would be predicted. There are actually more links, however, to US/UK because many of the orgs linked to in the Taxonbar at the end of the article who track information about plant species are based in those countries.
      • Why it's not trivial to fix: we use the pagelinks API to get info on links because it can easily be run as a generator so with a single API call we can get all the links and their corresponding Wikidata IDs (for looking up countries associated with each). So we can't e.g., exclude links based on how they're presented in the page. We could in theory maintain a list of links to ignore based on how many articles they're present in -- the list is probably not super long and would be effective and filtering out these sorts of links but it's also an additional layer of complexity.
      • My current approach is two-fold:
        • I do apply a tf-idf transformation (code) to the link proportions for each country so e.g., a few links to Ecuador will be treated as a strong signal than a few links to the US. This helps a bit with the US/UK problem (also dampens France quite a bit).
        • I require a minimum count (3) and minimum proportion (0.25) of links in order to elevate a linked country to a prediction (code). This was aimed pretty directly at the taxonbar issue but I'm sure it could be fine-tuned. The challenge is balancing a requirement of enough support to be "real" without making the bar too high for stub articles to exceed. The minimum proportion part also makes it hard for articles that are relevant to many countries to ever reach the threshold, which I don't love but also might be acceptable behavior. For example, the WWII article is certainly relevant to many countries but isn't necessarily a useful result if you're filtering by country to find content to edit.
      • Another possible guardrail is restricting where we apply the link-based logic. One approach could be only running the code for those articles lacking coordinates / any Wikidata-geo-property? This would reduce the possibility of false positives and probably let us better fine-tune the model to articles in topics lacking Wikidata properties about countries. Might confuse things for the end-user but also better latency and maybe nudges editors to improve Wikidata if they find issues with the predictions.
May 3 2024, 1:26 PM · Research (FY2023-24-Research-April-June)

May 2 2024

Isaac created T364014: Incorporate in new HTML features to quality model.
May 2 2024, 3:03 PM · Research (FY2023-24-Research-April-June)

Apr 26 2024

Isaac added a comment to T354559: Put together diff blogpost on AI + Wikimedia + Datasets.

Weekly update:

  • Presented initial ideas to Gap Team (presentation). Both help me process all my thoughts and led to some good feedback and sparking of ideas. Writing should be far easier now.
  • Things I'm pondering:
    • Folks appreciated my attempt to describe the different types of AI models at Wikimedia as background to why these datasets are important. This is a lot to fit into a blogpost but maybe I write a separate one or even just a wiki page as a explainer
    • The blogpost is definitely ballooning in size. I probably don't have to decide right now what to include/exclude, but TA made the good suggestion that it could be a series instead of single blogpost. So something like: 1. Intro / Current State; 2. Data Gaps; 3. Benchmarks
Apr 26 2024, 5:00 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T360572: Extend Article Quality Model to use HTML.

Weekly updates:

  • I started on loading the HTML dumps into HDFS (code courtesy of Fabian) -- this is working well and I tested with Arabic and was quite happy with how quickly it processed. Though loading in English is taking some time...
  • Destinie is working out some kinks in our HTML features
Apr 26 2024, 4:39 PM · Research (FY2023-24-Research-April-June), Epic
Isaac added a comment to T361637: Support for topic infrastructure work.

Weekly updates:

  • ML Platform and Search Platform indicated that my plans were fine for the article-country hypothesis and they can support deployment. In particular, EB on Search indicated that the broader expansion of tags on Search index for recommender systems shouldn't pose any issues.
  • Put together basic API for using just the Wikidata properties: https://wiki-topic.toolforge.org/countries
  • Good meeting put together by Miriam in which we charted out that Community Growth could do some outreach to get feedback on the current topic taxonomy and we'd work to make updates based on that but then try to freeze the taxonomy.
Apr 26 2024, 4:36 PM · Research (FY2023-24-Research-April-June)

Apr 25 2024

Isaac updated subscribers of T363514: Requesting access to analytics-privatedata-users for YLiou_WMF (no server access).

@YLiou_WMF here's the task -- please sign L3

Apr 25 2024, 6:54 PM · SRE, SRE-Access-Requests
Isaac created T363514: Requesting access to analytics-privatedata-users for YLiou_WMF (no server access).
Apr 25 2024, 6:51 PM · SRE, SRE-Access-Requests

Apr 24 2024

Isaac added a comment to T308164: Migrate Content Translation Recommendation API to Lift Wing.

Ahh this is great news @kevinbazira ! @KartikMistry is there any reason from the Content Translation side why we can't switch over to the LiftWing endpoint? My read is that the code is quite simple -- e.g., if I go to Content Translation on Spanish Wikipedia, the tool hits this endpoint:
https://recommend.wmflabs.org/types/translation/v1/articles?source=en&target=es&seed=Music%20Modernization%20Act|Felony%20disenfranchisement&search=morelike&application=CX

Apr 24 2024, 2:48 PM · Language-Team, Machine-Learning-Team, Epic

Apr 23 2024

Isaac added a comment to T308164: Migrate Content Translation Recommendation API to Lift Wing.

hey all (not sure who exactly to tag but maybe I'll start with @kevinbazira just because I know you did a lot of good work on this) -- I'm working on some planning for improvements to our recommender systems for next fiscal year around what topic filters we provide to editors. Content Translation is of special interest but Android's SuggestedEdits is important too. The recommendation logic for both of these systems is still hosted on GapFinder as far as I can tell, but deploying any improvements is going to require moving them to a proper service (LiftWing). Does anyone know why this effort to move Content Translation's recommendation API over to LiftWing (along with Android's endpoints T340854) stalled last year?

Apr 23 2024, 8:26 PM · Language-Team, Machine-Learning-Team, Epic

Apr 19 2024

Isaac added a comment to T354559: Put together diff blogpost on AI + Wikimedia + Datasets.

Weekly update:

  • Spoke with Stephanie (Enterprise) who will be attending a symposium on ML benchmarking with Wikimedia data and shared my early thoughts on the subject.
  • Spoke with Adam B about his 10% project around a Commons dump and some of the likely challenges / needs associated with that. It left me feeling optimistic though it won't be solved overnight.
  • Forgot that I was to present on this this week to our team though and postponed to this upcoming Thursday so will be doing some deep thinking and writing in the meantime
Apr 19 2024, 10:28 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T361637: Support for topic infrastructure work.

Weekly updates:

  • I put forth a draft hypothesis for next year related to a country-level article prediction model: If we build a country-level inference model for Wikipedia articles, we will be able to filter lists of articles to those about a specific region with >70% precision and >50% recall. I had a conversation with Fabian about this too and it'd be easy to pull in the cultural/geographic code that currently exists for inferring countries based on Wikidata properties. To take it a step further and cover articles without Wikidata items or with incomplete items or for geographic aspects that are not really covered in Wikidata -- e.g., geographic extent of flora/fauna -- I'd want to do some inference based on the country topics of the links in an article. Doing this online would be challenging (likely high latency as you'd need to evaluate many articles at once). There are ways to build a cache of predictions for articles and use that for evaluating the links but then you run into challenges with cache invalidation etc. Because the intent is to load the model predictions into the Search index as weighted tags, however, we can actually probably use the Search APIs to gather the country predictions for an article's links (analogous example for articletopic for en:Japanese_iris) and infer from there. This is nice because the Search index will always have up-to-date information and so we won't need to store this source of truth in multiple places.
Apr 19 2024, 10:25 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T360572: Extend Article Quality Model to use HTML.

@DJames-WMF has made progress on converting the wikitext over to HTML features. We're finding that the old normalization values -- e.g., how many references are expected in a top-quality article for a given wiki -- are no longer well-aligned for a few features. This seems to be most relevant for page-length which then affects wikilinks and references as well. I'll need to look into re-generating these normalization values. A few options:

  • Use the APIs to fetch HTML for a random sample of articles to re-calibrate the values. Sample size could be a challenge though because we're looking at the 95th percentile so we need a large enough sample for that to be stable.
  • Slowly loop through the whole Enterprise HTML dump -- this would take a very long time and in my experience the article ordering is not random so we can't stop early unfortunately without biasing the result.
  • Load a snapshot of the HTML dumps for the relevant languages into HDFS and process in parallel -- this is probably the most sensible solution because then we can re-use the data if we ever need to come back and recompute a value.
Apr 19 2024, 10:19 PM · Research (FY2023-24-Research-April-June), Epic
Isaac added a comment to T219903: Keep research.wikimedia.org landing page updated.

Confirmed -- thanks @DDeSouza !

Apr 19 2024, 11:50 AM · periodic-update, Research

Apr 16 2024

Isaac added a comment to T343228: Changes to Research Showcase MediaWiki.

Would it be possible to add the theme of the showcase to its subpage title

Good idea. I think it should be doable (just need to move the pages to the new titles). The downside would be it's harder to guess the page title but maybe that's not an issue. One alternative too: when we picked up this task, there was also a question about whether we wanted some sort of "summary" of our archive too. Maybe the listing of pages isn't the place to do that and instead we add a basic table to the page with each month, a link to the full description, and the theme?

Apr 16 2024, 10:00 PM · Research, Research-outreach, Research-foundational
Isaac updated the task description for T362416: Attend ICWSM 2024 conference.
Apr 16 2024, 9:29 PM · Research-foundational, address-knowledge-gaps, Research-outreach, Research
Isaac added a comment to T219903: Keep research.wikimedia.org landing page updated.

@DDeSouza I went ahead and made a merge request for a new paper and some small adjustments to the other papers (seemed easier than trying to explain in this case): https://gitlab.wikimedia.org/repos/sre/miscweb/research-landing-page/-/merge_requests/25

Apr 16 2024, 3:37 PM · periodic-update, Research

Apr 15 2024

Isaac added a comment to T343228: Changes to Research Showcase MediaWiki.

Thanks all for the patience on this -- I have now moved all the past showcases onto monthly archive pages and added the search functionality to the main page: https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase#Archive

Apr 15 2024, 7:57 PM · Research, Research-outreach, Research-foundational

Apr 12 2024

Isaac added a comment to T361623: Swap out wikitext for HTML in training quality model.

Next steps for this notebook based on Destinie's assessment (notebook) of how well-distributed each model feature is after switching to HTML. We have three features that are poorly distributed (values all lumped together) so the model cannot learn much from them. They are:

  • Page length: the values are all lumped around 1 because Parsoid HTML (with all of its syntax) is far more verbose that wikitext and by definition a superset of the wikitext. We don't have any perfect way of getting back to the wikitext length but probably a more reasonable assessment of article length is how much text is in it. So instead of len(article_html), let's use the get_plaintext() function and take the length of that. That function has a bunch of settings for it to work appropriately so let's use the approach used by html_to_plaintext() in this notebook with a few small tweaks:
    • Don't exclude List elements (they often have valid content from an article quality standpoint)
    • Take out the if len(paragraph.strip()) > 15 clause for each paragraph (we're just counting up things so I'm okay with the occasional "weird" paragraph)
    • Rather than doing the final if paragraphs: check, just use '\n'.join(paragraphs) for computing length -- this will just be an empty string (length 0) if no paragraphs.
  • Media: the reason they're lumping to 1s is probably because many articles have lots of little icons that aren't defined in the wikitext (transcluded via templates). These are inflating our counts of images in the article. I put together one heuristic to filter these out in the test cases and I think we can re-use that pixel-size logic here too (code). This should reduce our media counts back to where they're more evenly distributed.
  • Categories: Here the lumping towards 1 values likely is the result of hidden categories (usually transcluded via templates again and not in the wikitext). One way around this is to check each category returned by get_categories() to see if it was transcluded. There's an existing function in the library (example import statement) and then we can do something like len([1 for c in article.wikistew.get_categories() if not is_transcluded(c)]).
Apr 12 2024, 8:43 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T354559: Put together diff blogpost on AI + Wikimedia + Datasets.

Weekly update: no progress

Apr 12 2024, 6:49 PM · Research (FY2023-24-Research-April-June)
Isaac updated the task description for T361623: Swap out wikitext for HTML in training quality model.
Apr 12 2024, 6:48 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T361637: Support for topic infrastructure work.

Weekly updates:

  • Asked for input from Search on adding in the different topic tags we're considering (countries, quality, wikiprojects): https://etherpad.wikimedia.org/p/recsys-search-tags-future
  • Talked with Inuka Team about challenges/opportunities in this space as they consider potential projects to take on
  • Part of discussions with EH at Wikimedia Uruguay and others around their new templates for WikiProjects, which automatically find tasks to surface to editors: https://es.wikipedia.org/wiki/Wikiproyecto:Cambio_clim%C3%A1tico. This is an exciting replication of the infrastructure that Growth has worked on for Newcomer Homepage but by community members within the WikiProject context. It's further motivation for adding WikiProject tags to Search as well because without that, it's much harder to use our structured task filters (add-a-link; add-an-image) because there's no single query that filters by Wikiproject and task availability.
  • Began exploring feasibility of geography model on LiftWing. Ascertained that there could be key-value store support in the future that might be useful (if we use links to infer countries, we'll need to quickly look up the associated countries with each article link). In the meantime, it should be easy to just grab an item's Wikidata JSON and just check the country-related properties as we do with the culture metrics.
Apr 12 2024, 6:37 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T346089: Investigate Isaac article quality ML model as option .

Yep, that work is happening under T360455 and T360572

Apr 12 2024, 2:33 PM · Wikimedia Enterprise - Content Integrity, Wikimedia Enterprise

Apr 10 2024

Isaac added a comment to T361623: Swap out wikitext for HTML in training quality model.

Added another step for the bug-fixing we're working on right now with 0-values for some of the features. I also unchecked the optional exploration -- that actually is separate from the notebook (it involves updating a README file in a code repository) so we can talk about it in a future meeting and decide whether to pick it up or not.

Apr 10 2024, 6:49 PM · Research (FY2023-24-Research-April-June)
Isaac updated the task description for T361623: Swap out wikitext for HTML in training quality model.
Apr 10 2024, 6:47 PM · Research (FY2023-24-Research-April-June)

Apr 3 2024

Isaac added a comment to T318384: Put API on Cloud VPS .

Thanks! Unlikely to happen soon but when we reach a stage where we are re-training the model, I'll see if we can experiment with nudging the model away from these sorts of responses (because agreed that it's ideal to solve it via model architecture / training as opposed to post-hoc filters if possible). And please continue to share if you see other patterns in incorrect recommendations.

Apr 3 2024, 8:43 PM · Wikipedia-Android-App-Backlog (Android Release - FY2023-24)
Isaac reassigned T361623: Swap out wikitext for HTML in training quality model from Isaac to DJames-WMF.
Apr 3 2024, 4:31 PM · Research (FY2023-24-Research-April-June)
Isaac closed T360815: Replicate Article Quality Training Notebook as Resolved.

Excellent work @DJames-WMF ! Took a readthrough of your notebook and everything looked good. Closing this as resolved. We didn't pursue the Nepalese Wikipedia extension but that's okay -- we can always come back to it later. For now, I'd like to progress to the HTML work that you've started in T361623.

Apr 3 2024, 4:30 PM · Research (FY2023-24-Research-April-June)
Isaac updated the task description for T360815: Replicate Article Quality Training Notebook.
Apr 3 2024, 4:30 PM · Research (FY2023-24-Research-April-June)
Isaac closed T360815: Replicate Article Quality Training Notebook, a subtask of T360572: Extend Article Quality Model to use HTML, as Resolved.
Apr 3 2024, 4:30 PM · Research (FY2023-24-Research-April-June), Epic

Apr 2 2024

Isaac added a comment to T318384: Put API on Cloud VPS .

(Shouldn't this be a factor for machine learning? I mean, if matching the title produced a wrong description as a general rule, wouldn't the machine learning algorithm infer it from the training set?)

Apr 2 2024, 6:25 PM · Wikipedia-Android-App-Backlog (Android Release - FY2023-24)
Isaac moved T360815: Replicate Article Quality Training Notebook from Backlog to FY2023-24-Research-April-June on the Research board.
Apr 2 2024, 6:21 PM · Research (FY2023-24-Research-April-June)
Isaac moved T348329: [Stretch] Support evaluation of text summarization for potential harms from FY2023-24-Research-January-March to FY2023-24-Research-April-June on the Research board.
Apr 2 2024, 6:19 PM · Research (FY2023-24-Research-April-June)
Isaac moved T361623: Swap out wikitext for HTML in training quality model from Backlog to FY2023-24-Research-April-June on the Research board.
Apr 2 2024, 6:19 PM · Research (FY2023-24-Research-April-June)
Isaac moved T361637: Support for topic infrastructure work from Backlog to FY2023-24-Research-April-June on the Research board.
Apr 2 2024, 6:16 PM · Research (FY2023-24-Research-April-June)
Isaac created T361637: Support for topic infrastructure work.
Apr 2 2024, 6:15 PM · Research (FY2023-24-Research-April-June)
Isaac closed T354565: Edit summaries: dissemination of findings, a subtask of T293465: Edit Types Research, as Resolved.
Apr 2 2024, 5:49 PM · Research, Epic
Isaac closed T354565: Edit summaries: dissemination of findings as Resolved.
Apr 2 2024, 5:49 PM · Research (FY2023-24-Research-January-March)
Isaac added a comment to T354565: Edit summaries: dissemination of findings.

Closing this task out. We can re-open or create a new one in case substantial new work is required as a result of COLM etc. I'll still update with an arXiv link when available.

Apr 2 2024, 5:49 PM · Research (FY2023-24-Research-January-March)
Isaac moved T354559: Put together diff blogpost on AI + Wikimedia + Datasets from FY2023-24-Research-January-March to FY2023-24-Research-April-June on the Research board.
Apr 2 2024, 5:26 PM · Research (FY2023-24-Research-April-June)
Isaac moved T360572: Extend Article Quality Model to use HTML from Backlog to FY2023-24-Research-April-June on the Research board.
Apr 2 2024, 5:26 PM · Research (FY2023-24-Research-April-June), Epic
Isaac added a comment to T361623: Swap out wikitext for HTML in training quality model.

@DJames-WMF can claim and start this task when T360815 is complete.

Apr 2 2024, 4:33 PM · Research (FY2023-24-Research-April-June)
Isaac created T361623: Swap out wikitext for HTML in training quality model.
Apr 2 2024, 4:32 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T318384: Put API on Cloud VPS .

Human
3 beams:

Ethnic group
Ethnic group of humanes
Ethnic group of humans

Thanks for passing this along @Jack_who_built_the_house! I checked a number of other very high-level topics and didn't find it in Civilization or Primates but did get "Class of plants" for Plants. This sort of error seems most likely with article about very high-level concepts (which often already have article descriptions thankfully) but would still be nice to fix obviously. We might be able to address this sort of tautological output by adding a simple string-matching check to ensure that the output doesn't contain the title itself. Before we implement anything, I'd want to think about what sort of issues this might cause though with e.g., very simple titles where text matching might introduce a bunch of false positives (and therefore not return results).

Apr 2 2024, 12:58 PM · Wikipedia-Android-App-Backlog (Android Release - FY2023-24)

Mar 22 2024

Isaac added a comment to T354559: Put together diff blogpost on AI + Wikimedia + Datasets.

Weekly updates:

  • I'm behind on this in part because my thinking is still pretty wide-ranging but I'm continuing to process what I read and mull over the different angles of this. I've been attending to more urgent aspects too with Annual Planning / mentorship / etc.
  • Chris A. put together a nice spreadsheet of a few knowledge integrity tasks that he, Marshall, and Maryana P. put together for benchmarking some LLMs: https://docs.google.com/spreadsheets/d/1b2eG8ZlWVJa5LQDSJMACivWzZ9DCxiYjv6oImucnc20/edit#gid=0
Mar 22 2024, 8:19 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T354565: Edit summaries: dissemination of findings.

Paper submitted to COLM and we'll hear May 24. I'll link to arXiv paper when posted.

Mar 22 2024, 8:16 PM · Research (FY2023-24-Research-January-March)
Isaac closed T360576: Extend evaluation data to include Chinese Wikipedia as Resolved.

Notebook looks good - thanks for the hard work and patience on this!

Mar 22 2024, 7:41 PM · Chinese-Sites, Research
Isaac closed T360576: Extend evaluation data to include Chinese Wikipedia, a subtask of T360572: Extend Article Quality Model to use HTML, as Resolved.
Mar 22 2024, 7:40 PM · Research (FY2023-24-Research-April-June), Epic
Isaac created T360815: Replicate Article Quality Training Notebook.
Mar 22 2024, 7:32 PM · Research (FY2023-24-Research-April-June)

Mar 20 2024

Isaac created T360576: Extend evaluation data to include Chinese Wikipedia.
Mar 20 2024, 8:28 PM · Chinese-Sites, Research
Isaac added a parent task for T360455: Add Article Quality Model to LiftWing: T360572: Extend Article Quality Model to use HTML.
Mar 20 2024, 8:15 PM · Research, Machine-Learning-Team
Isaac added a subtask for T360572: Extend Article Quality Model to use HTML: T360455: Add Article Quality Model to LiftWing.
Mar 20 2024, 8:15 PM · Research (FY2023-24-Research-April-June), Epic
Isaac created T360572: Extend Article Quality Model to use HTML.
Mar 20 2024, 8:14 PM · Research (FY2023-24-Research-April-June), Epic

Mar 19 2024

Isaac added a comment to T360455: Add Article Quality Model to LiftWing.

Task created -- @isarantopoulos just let me know if any details are missing or anything I can do to help with next steps when you are ready!

Mar 19 2024, 5:17 PM · Research, Machine-Learning-Team
Isaac created T360455: Add Article Quality Model to LiftWing.
Mar 19 2024, 5:16 PM · Research, Machine-Learning-Team

Mar 15 2024

Isaac added a comment to T354559: Put together diff blogpost on AI + Wikimedia + Datasets.

Weekly updates:

  • Sent thoughts over to SB about wiki benchmark datasets so we'll see if he's available to give feedback or has suggestions for others folk to talk with.
  • Bellagio draft points to importance of the ideas I had of writing a history of machine translation at Wikimedia with Eli. Specifically relating to the call for: A review of currently deployed systems, including (where available) quantitative and qualitative evidence of use and impact.
  • https://foundation.mozilla.org/en/research/library/generative-ai-training-data/common-crawl/ elicits to me a lot of parallels between Wikimedia data for AI and Common Crawl -- e.g., neither was intended as an AI dataset but are heavily used for that purpose, which can bring some considerations for appropriate usage. The major difference is that while there is no moderation of the internet (Common Crawl), Wikimedia does have excellent moderation and therefore the possibility for harm is far lower. In general, a good blogpost to cite when talking about why it's important to think critically about how content is used for training AI.
Mar 15 2024, 5:54 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T354565: Edit summaries: dissemination of findings.

Weekly update:

  • Paper solidifying -- just style / readability aspects at this point. Should be good to submit to COLM this month. We'll upload the submitted paper to arXiv as well.
Mar 15 2024, 4:43 PM · Research (FY2023-24-Research-January-March)

Mar 8 2024

Isaac added a comment to T354559: Put together diff blogpost on AI + Wikimedia + Datasets.

Weekly updates:

  • Reached out to Sam Bowman who has worked on natural language benchmarks / LLM alignment so I could pick his brain over what it would mean to work on a Wikimedia benchmark and/or instruction-tuning dataset.
  • Connecting with Enterprise as they have some potential planned work in next FY around datasets on HuggingFace and benchmarks! They're also talking about the feasibility of a Commons dataset as there was another high-profile request for imagery data.
Mar 8 2024, 9:23 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T354565: Edit summaries: dissemination of findings.

Weekly update:

  • Made Leila's requested fixes including adding a future work section that touched on different dataset filtering processes and ways of representing an edit to language models. We're a bit over 9 pages (the limit) but working on bringing it within requirements.
Mar 8 2024, 9:20 PM · Research (FY2023-24-Research-January-March)

Mar 7 2024

Isaac added a comment to T252227: Mobile redirects drop provenance parameters.

Very excited to see this gaining some traction (thanks @mpopov and @dr0ptp4kt)! Commenting on the analytics side of things (I don't know enough about Varnish to comment on implementation details):

Mar 7 2024, 1:24 PM · Data-Engineering, Data Pipelines, Traffic-Icebox, SRE

Mar 6 2024

Isaac added a comment to T343123: Migrate Machine-generated Article Descriptions from toolforge to liftwing..

This is really wonderful news! Thanks @kevinbazira for slogging through this with us and @isarantopoulos for your support as well! Those endpoints were working for me too so I'll let Android indicate what the next steps are.

Mar 6 2024, 5:10 PM · Wikipedia-Android-App-Backlog (Android Release - FY2023-24), Machine-Learning-Team
Isaac added a comment to T359340: Requesting GitLab account activation for desianabae1.

Thanks @taavi! Indeed, unblocked now

Mar 6 2024, 1:28 PM · GitLab (Account Approval), Release-Engineering-Team
Isaac created T359340: Requesting GitLab account activation for desianabae1.
Mar 6 2024, 1:18 PM · GitLab (Account Approval), Release-Engineering-Team

Mar 5 2024

Isaac added a comment to T257638: Topic Dataset : Model, Threshold, Post-processing.

Connecting to another ticket focused on producing these topic snapshots: T351118

Mar 5 2024, 1:33 PM · Movement-Insights

Mar 4 2024

Isaac updated subscribers of T358095: Outreachy Application Task: Tutorial for Wikipedia language-agnostic article quality modeling data.
Mar 4 2024, 3:25 PM · Outreachy (Round 28)
Isaac updated the task description for T358095: Outreachy Application Task: Tutorial for Wikipedia language-agnostic article quality modeling data.
Mar 4 2024, 3:15 PM · Outreachy (Round 28)

Mar 1 2024

Isaac added a comment to T354559: Put together diff blogpost on AI + Wikimedia + Datasets.

Weekly updates:

  • iteration on recommender system vision
  • began writing some of my thoughts around AI datasets though haven't gotten to benchmark data part, which I think is the piece I most need to think through (the rest is things that I've been thinking about and working on for a while now)
  • Read the STORM paper on generating Wikipedia articles from scratch using LLMs. There's a lot in there but in particular relevant to this work, they put together a dataset of newly-created, high-quality articles as a means of benchmarking a large-scale LLM on just new content. Good example for the benchmark datasets. I'd want to take their approach probably but combined with some details on how recent the references are perhaps (would add some complications around extracting that info unfortunately).
  • Watched a nice, long overview of tokenizers that would have been useful before we did the mwtokenizer project but also largely validated a lot of our thinking there and gives some more food for thought. Looking back, I think I would have dropped the sentences aspect and stuck to just paragraphs (easy) and words (hard). With increasing computational power, most things that use sentences can instead use paragraphs and that's way less prone to issues. Sentences become very important really only in the context of citations because the citation generally refers to the prior sentence and so it's useful to be able to extract that for determining entailment.
Mar 1 2024, 10:00 PM · Research (FY2023-24-Research-April-June)
Isaac added a comment to T354565: Edit summaries: dissemination of findings.

Weekly update:

  • Marija moved paper onto new template for COLM so going to do a pass on that next week
Mar 1 2024, 9:29 PM · Research (FY2023-24-Research-January-March)
Isaac added a comment to T305688: Make HTML Dumps available in hadoop.

I'll let others chime in but that would be my feeling about the correct scope. Going historical indeed adds a lot of complications and I think current snapshots are a huge first step. I'd coordinate with Enterprise obviously just to see if any changes are going to happen with schema etc. but hopefully relatively straightforward.

Mar 1 2024, 3:21 PM · Research, Data-Engineering (Q4 2024 April 1st - June 30th), Structured-Data-Backlog
Isaac added a comment to T305688: Make HTML Dumps available in hadoop.

Moving this to discuss with the team. Seems reasonable to have 1 or 2 versions of this if we source it from the Enterprise dumps.

Thanks @lbowmaker for considering and @mfossati for raising! Just chiming in to add my support that having a current snapshot of Parsoid HTML from Enterprise would be very helpful. We've developed a Python library (mwparserfromhtml) that enables us to extract lots of features (references, infoboxes, plaintext, etc.) easily from the HTML so are in a good position to make use of it. Within Research, we're working on switching more of our models to using it too because the gap between wikitext and HTML is definitely growing (example with references). For example, we have an intern who will be working on converting the quality model used for knowledge gap metrics from using wikitext to HTML for this reason, so having a regular snapshot that could be used for computing article quality for all articles would be very helpful.

Mar 1 2024, 1:17 PM · Research, Data-Engineering (Q4 2024 April 1st - June 30th), Structured-Data-Backlog

Feb 28 2024

Isaac added a comment to T358195: Investigate increased preprocessing latencies on LW of article-descriptions model.

Can we investigation reducing the computational need to just the language requested?

The model definitely benefits from some translations so "just the language requested" I would say is not the right approach. Are you suggesting capping it at 5 for example? It no longer seems to be an issue from the pre-processing perspective with Ilias' fix at least so hopefully not a blocker at this point and just a bonus for capping model latency. That said, if there's a desire to further constrain, I can look into it in the next few weeks but please don't let it be a blocker given that Android has said they're comfortable moving forward per T343123#9558740.

Feb 28 2024, 7:45 PM · Wikipedia-Android-App-Backlog, Machine-Learning-Team
Isaac added a comment to T352177: Proposal: Improve how Wiki Education Dashboard counts references added.

we currently have 4 workers consuming tasks for programs and events dashboard, and 3 for wikieducational dashboard. So I guess that the max number of concurrent requests would be 7 (if all the workers are working at the same time).

Yeah, that's quite reasonable! Thanks for looping back about it.

Feb 28 2024, 12:37 PM · Outreachy (Round 27)