We went with Simple Wikipedia because it is much smaller than English Wikipedia, which makes it more reasonable to process via these notebooks.
Apr 21 2021
Apr 20 2021
- I also had one question in mind. English articles already exist on en.wikipedia, and people searching for a topic will frequently come across en.wikipedia articles in search engine results, so why was simple.wikipedia required?
In T276274#7019989, @Isaac wrote: Only thing left is: how can I make this content readable? This is the revision diff from the Craig Noone article:
@DaneshwariK I'm not sure if this is what you're asking, but the Compare API will provide you with HTML diffs for edits -- e.g., https://en.wikipedia.org/w/api.php?action=compare&fromrev=930870273&torev=933163076
The diff that you pasted there is wikitext, which is the raw code that is used for writing Wikipedia articles. It must then be parsed into HTML to be "readable". You can also just screenshot diffs in the visual mode on Wikipedia if that's easier and what you want -- e.g., https://en.wikipedia.org/w/index.php?title=Craig_Noone&diff=933163076&oldid=930870273&diffmode=visual
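A minimal sketch of the Compare API suggestion above, in case it helps. It builds the same request URL with `format=json` added, and turns the returned HTML table rows into plain "+"/"-" lines. The helper names (`build_compare_url`, `fetch_diff_html`, `readable_diff`) and the User-Agent string are made up for illustration, and reading the HTML from the `body` key assumes `formatversion=2`; check the API response before relying on it.

```python
import json
from html.parser import HTMLParser
from urllib.parse import urlencode
from urllib.request import Request, urlopen

API_URL = "https://en.wikipedia.org/w/api.php"


def build_compare_url(from_rev, to_rev, api_url=API_URL):
    """Build a Compare API URL for two revision IDs."""
    params = {
        "action": "compare",
        "fromrev": from_rev,
        "torev": to_rev,
        "format": "json",
        "formatversion": 2,
    }
    return f"{api_url}?{urlencode(params)}"


def fetch_diff_html(from_rev, to_rev):
    """Fetch the HTML diff (needs network access).

    Assumption: with formatversion=2 the HTML sits under compare -> body.
    """
    request = Request(
        build_compare_url(from_rev, to_rev),
        headers={"User-Agent": "diff-tutorial-sketch/0.1 (hypothetical)"},
    )
    with urlopen(request) as response:
        return json.load(response)["compare"]["body"]


class DiffTextExtractor(HTMLParser):
    """Collect the text of added and deleted cells from diff table rows."""

    def __init__(self):
        super().__init__()
        self.rows = []  # list of (kind, text) pairs
        self._kind = None
        self._buf = []

    def handle_starttag(self, tag, attrs):
        if tag != "td":
            return
        classes = (dict(attrs).get("class") or "").split()
        if "diff-addedline" in classes:
            self._kind, self._buf = "added", []
        elif "diff-deletedline" in classes:
            self._kind, self._buf = "deleted", []

    def handle_data(self, data):
        if self._kind is not None:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "td" and self._kind is not None:
            self.rows.append((self._kind, "".join(self._buf).strip()))
            self._kind = None


def readable_diff(diff_html):
    """Return the diff as a list of '+ text' / '- text' lines."""
    parser = DiffTextExtractor()
    parser.feed(diff_html)
    parser.close()
    return [("+ " if kind == "added" else "- ") + text for kind, text in parser.rows]
```

In a notebook, something like `print("\n".join(readable_diff(fetch_diff_html(930870273, 933163076))))` would then show the changed lines as text. Note that the text is still wikitext line content, just without the table markup.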
Hi @Isaac and @srodlund, I'm about to finish my tutorial notebook. The only thing left is: how can I make this content readable? This is the revision diff from the Craig Noone article:
https://en.wikipedia.org/w/index.php?title=Craig_Noone&diff=933163076&oldid=930870273
Apr 17 2021
Hi @Nizz009! As a first step, you can try printing the page IDs to see whether these are the same or different pages, and then use those page IDs to see which revision IDs belong to each page. I feel this will help to solve the problem to some extent.
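A rough sketch of that check, assuming the revisions are already available as (page ID, revision ID) pairs. The function name `revisions_by_page` and the sample IDs are made up for illustration.

```python
from collections import defaultdict


def revisions_by_page(records):
    """Group revision IDs under the page ID they belong to."""
    pages = defaultdict(list)
    for page_id, rev_id in records:
        pages[page_id].append(rev_id)
    return dict(pages)


# Made-up IDs, only to show the shape of the output.
records = [(101, 9001), (101, 9002), (202, 9003)]
for page_id, rev_ids in revisions_by_page(records).items():
    print(page_id, rev_ids)
```

If two revisions that were expected to be on the same page end up under different page IDs, they belong to different pages.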
Apr 13 2021
Hello @Fonyuy237, maybe you can work on PAWS, since Python, bash, and Jupyter notebook support is already provided there. The only configuration required is for pywikibot and the user-config.py file. If you need any help with the setup on PAWS, maybe I can help you.
Hello @Mike_Peel and @MSGJ. Here is the link to my page:
https://www.wikidata.org/wiki/User:DaneshwariK/Outreachy_2
Please review and provide feedback regarding this. Thanks!
Apr 9 2021
In T276274#6983819, @Isaac wrote: Does anyone know if Wikimedia SQL dumps always contain a single table, or if there can be several in the same file?
@Slst2020 the SQL dumps should only ever have a single table in them, though obviously some tables are small while others are much larger.
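One way to double-check this for a given file is to list the `CREATE TABLE` statements in it. This is only a sketch: it assumes a gzip-compressed, mysqldump-style file where each `CREATE TABLE` starts its own line, and the function names are made up for illustration.

```python
import gzip
import re

# Matches lines such as: CREATE TABLE `change_tag_def` (
CREATE_TABLE_RE = re.compile(r"^CREATE TABLE `?(\w+)`?", re.IGNORECASE)


def tables_in_dump(lines):
    """Return the table names created in an iterable of dump lines."""
    names = []
    for line in lines:
        match = CREATE_TABLE_RE.match(line)
        if match:
            names.append(match.group(1))
    return names


def tables_in_dump_file(path):
    """Same check, reading a .sql.gz dump from disk line by line."""
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as dump:
        return tables_in_dump(dump)
```

A result with exactly one name matches the single-table expectation; anything else would be worth reporting.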
In T276274#6980978, @srodlund wrote: Hi everyone,
First of all, thanks for your interest in this project! I wanted to share some additional resources about technical documentation that you may find helpful when designing your tutorial.
We don't yet have a tutorial template just for Jupyter notebooks. This is something we will work on (or maybe include in the scope of the Outreachy project) but for now, this is okay. We'd like to see how you would approach formatting your tutorials, so we can better understand your approach to documentation overall.
While you're working on your tutorial, you may find some of the following resources useful:
- The Wikimedia technical style guide generally addresses best practices for documenting on wikis.
- This Walkthroughs/tutorials template may help you think about how to develop and organize your tutorial -- it is more oriented towards software, but you can adapt it for your purposes.
- Here are some documentation resources specific to Wikimedia projects.
- This blog post has some useful tips for working with Jupyter notebooks in general and an additional list of resources at the bottom.
Finally, here are some notebook tutorials for a variety of Wikimedia projects. It may be helpful to take a look at them to see how they are formatted.
- https://public.paws.wmcloud.org/User:SRodlund_(WMF)/PAWS-Tutorial.ipynb
- https://public.paws.wmcloud.org/User:SRodlund_(WMF)/*2021%20UPDATED*%20API%20Connections%20With%20PAWS.ipynb
- https://public.paws.wmcloud.org//59320301/en-wikipedia-images-tile-app.ipynb
- https://public.paws.wmcloud.org//59320301/en-wikipedia-search-content.ipynb
- https://public.paws.wmcloud.org/User:JHernandez_(WMF)/Accessing%20the%20new%20replicas,%20changes%20from%20the%20previous%20cluster.ipynb
- https://public.paws.wmcloud.org/User:JHernandez_%28WMF%29/Accessing%20Wikireplicas%20from%20PAWS.ipynb
Apr 6 2021
Hello @Mike_Peel. I have completed my page. Kindly review it and give your feedback.
Link is https://www.wikidata.org/wiki/User:DaneshwariK/Outreachy_1
Apr 3 2021
I'm currently working on "#TODO: Loop through TAG_DESC_DUMP_FN and identifies the ctd_id associated with mobile edit". I completed the task using the .sql file (approach 1), but I later realized that my approach might be wrong.
Hi Isaac and Sarah! I'm Daneshwari and I'm looking to contribute to this project for Outreachy 2021. It seems that I started a bit late, but I will contribute to the best of my ability. Thanks!
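For reference, a hedged sketch of one way that TODO could be approached with the .sql file. It is not necessarily the approach the notebook intends. It assumes the dump holds mysqldump-style `INSERT INTO ... VALUES (...)` lines and that each row is ordered as (ctd_id, ctd_name, ctd_user_defined, ctd_count). The function names are made up, and `TAG_DESC_DUMP_FN` is assumed to be the path to a gzip-compressed dump.

```python
import gzip
import re

# One row of the form (id,'name',user_defined,count).
# Assumption: columns are ctd_id, ctd_name, ctd_user_defined, ctd_count.
ROW_RE = re.compile(r"\((\d+),'((?:[^'\\]|\\.)*)',(\d+),(\d+)\)")


def find_ctd_id(lines, tag_name="mobile edit"):
    """Return the ctd_id whose tag name equals tag_name, or None."""
    for line in lines:
        if not line.startswith("INSERT INTO"):
            continue
        for match in ROW_RE.finditer(line):
            if match.group(2) == tag_name:
                return int(match.group(1))
    return None


def find_ctd_id_in_file(path, tag_name="mobile edit"):
    """Same lookup, reading a .sql.gz dump such as TAG_DESC_DUMP_FN."""
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as dump:
        return find_ctd_id(dump, tag_name)
```

The match is exact, so "mobile web edit" and "mobile app edit" are not confused with "mobile edit".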