
Cross-team Blog Content: Running a desktop-based LLM with an Enterprise RAG index
Open, Needs Triage · Public · 5 Estimated Story Points

Description

User Story:
As a user, I would like a written example of using the Enterprise APIs to generate embeddings for a RAG Index.
As a user, I would like a written example of using a Wikipedia RAG index in a desktop-based LLM.

Objective (O2.KR1):
Documentation and content for Enterprise products is expanded to reduce the barrier to use and to enable further outreach toward a broader range of organizational reusers.

Acceptance criteria

  1. An EN Wikipedia based RAG index of N (est. <1000) embeddings has been created using the structured contents endpoint.
  2. A desktop-based foundational language model (e.g. Ollama) has used a Wikipedia-based RAG index for N (est. <50) test queries.
  3. Results of generating a Wikipedia-based RAG index and using the index in a desktop LLM experiment have been written up and summarized for content use by the product and growth marketing teams.

ToDo

  • Select a page set of N articles and use it to generate results from the structured content endpoint (~500 articles to start experimenting)
  • Use the results to generate embeddings and store them in a queryable vector database
  • Select and configure a desktop-based LLM/runner that queries the vector database as part of its response mechanism
  • Select and run N queries to test RAG-based Q&A and log the results
  • [50%] Summarize the steps to reproduce the testing framework and review with product and product marketing for handoff
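The pipeline in the list above can be sketched end to end. The helper names, chunk size, and the stub embedder below are illustrative assumptions, not the project's actual code; a real run would replace the stub with a call to a local embedding model (e.g. served by Ollama).

```python
# Sketch of the ingestion pipeline: chunk article text, embed each chunk,
# and collect (id, embedding, text) rows ready for a vector database.
# chunk_text, embed, and build_index are hypothetical helpers.

def chunk_text(text, max_words=200):
    """Split text into word-bounded chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]

def embed(chunk):
    """Stub embedder; a real pipeline would call a local model instead."""
    # Deterministic toy vector so the sketch is runnable without a model server.
    return [float(len(chunk)), float(chunk.count(" "))]

def build_index(articles):
    """articles maps title -> body text; returns rows for a vector store."""
    rows = []
    for title, body in articles.items():
        for n, chunk in enumerate(chunk_text(body)):
            rows.append((f"{title}#{n}", embed(chunk), chunk))
    return rows
```

With ~500 articles this stays small enough to run on a laptop, which is the point of the zero-cost constraint noted below.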
Test Strategy

Notes from engineering discussion [To be refined]:

  • Run the ingestion and embedding on Apple M2 laptops to have zero costs
  • Potentially use an Ollama blog post and model as a framework to follow
  • Use either Simple Wiki or Wikipedia as a data source and keep the page list small for ease of reuse and lower LoE
  • Secondary objective (P2): publish the dataset on Hugging Face as an initial PoC for other datasets in the future and to set up the WME posting process

Checklist for testing
We need good example chat prompts that show different responses when RAG is enabled and disabled
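One way to make the RAG-enabled vs. RAG-disabled comparison concrete is to build both prompt variants from the same question. The template wording below is a hypothetical sketch, not the project's actual prompt.

```python
# Build the same question as a plain prompt (RAG disabled) and as a
# context-grounded prompt (RAG enabled), so responses can be compared
# side by side. The template text is an illustrative assumption.

def make_prompt(question, context_chunks=None):
    if not context_chunks:
        # RAG disabled: the model answers from its own weights.
        return question
    context = "\n\n".join(context_chunks)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )
```

Running both variants of each test query through the same model makes the effect of the index directly visible in the logged results.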

Things to consider:
  • Scope of work for the post and size of dataset
  • Do we want to document this elsewhere as well?

Event Timeline

I have a work POC that I shared with Chuck.

I'll work on the Python code to save the CSV dataset; the first version is in Go. A blog post in Python would have broader appeal.

I'll do a second draft of the blog post steps, including:

  1. The dataset steps
  2. The dependency installation steps
  3. Importing the dataset into ChromaDB
  4. CLI query testing
  5. Bonus: building a web UI with Streamlit
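Steps 3 and 4 can be sketched without ChromaDB installed: the tiny in-memory store below stands in for it, ranking stored chunks by cosine similarity for a CLI-style query. The class and its method names are hypothetical and are not ChromaDB's API; they only illustrate the flow.

```python
import math

# Minimal in-memory stand-in for the vector-store steps: add embedded
# chunks, then answer a query embedding by cosine similarity. In the
# blog post this role would be played by ChromaDB.

class TinyVectorStore:
    def __init__(self):
        self.rows = []  # list of (id, embedding, document) tuples

    def add(self, id_, embedding, document):
        self.rows.append((id_, embedding, document))

    def query(self, embedding, n_results=3):
        """Return the n_results documents most similar to the query embedding."""
        def cosine(a, b):
            dot = sum(x * y for x, y in zip(a, b))
            norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
            return dot / norm if norm else 0.0
        ranked = sorted(self.rows, key=lambda r: cosine(embedding, r[1]), reverse=True)
        return [doc for _, _, doc in ranked[:n_results]]
```

The retrieved documents would then be passed as context chunks into the chat prompt before it goes to the local model.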

There is a new repo in the Experiments group: for-blog-LLM-RAG

@creynolds Can you please let us know if the information presented is enough for you to do your part? Do you need anything else? FYI, the sprint ends next Thursday and we'd like to know if engineering work is done on this one. Thanks so much!

@JArguello-WMF The code/readme is great. I only asked ROd for some intro talk as a precursor to help write the copy; then I'm good and can take it from here.

Moved to done, because engineering work is done.

I think we covered the intro text in our chat last night. @creynolds do you have what you need?