OlehOnyshchcak (Oleh Onyshchak)
User

Projects

User does not belong to any projects.

User Details

User Since
Oct 22 2019, 11:40 AM (235 w, 3 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
OlehOnyshchcak [ Global Accounts ]

Recent Activity

Feb 7 2020

OlehOnyshchcak added a comment to T236614: Page.title(as_filename=True) don't remove "\"" (quotes) forbidden character.

and also wasted me a lot of time in the process of realizing that I need to implement this and re-download data.

Obviously developers and maintainers are not happy when bugs slip through and users are annoyed. Anyhow, it does not seem right to me that you are complaining of having "wasted" your time, when people are donating their free time to carry on this project.

Feb 7 2020, 8:46 AM · Patch-For-Review, Pywikibot

Dec 4 2019

OlehOnyshchcak added a comment to T236614: Page.title(as_filename=True) don't remove "\"" (quotes) forbidden character.

I just want to give my perspective on an issue in order for it to be better triaged:

  1. pywikibot claims to support both Windows and Linux systems, so its functionality must have a common denominator of "what is acceptable on all systems". If pywikibot doesn't support Windows anymore, we will probably need to remove the installation instructions for Windows to avoid confusion.
  2. If we don't support Windows anymore, it's still beneficial to restrict filenames to the POSIX portable filename character set, which guarantees that a filename is portable between all POSIX-like systems. For example, Kaggle, which is arguably one of the most popular platforms for data storage and processing, relies on the above-mentioned restrictions. Just a few days ago it was explained to me that Kaggle does not support filenames that do not follow the POSIX portable filename character set. And there are probably a lot of other use cases where such filenames become unportable.
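
To illustrate the restriction described in point 2, here is a minimal sketch of mapping a page title onto the POSIX portable filename character set (A-Z, a-z, 0-9, period, underscore, hyphen). The helper name `to_portable_filename` and the choice of `_` as the replacement character are assumptions made for illustration; this is not pywikibot's implementation of `Page.title(as_filename=True)`.

```python
import re

# POSIX portable filename character set: ASCII letters, digits, '.', '_' and '-'.
_NON_PORTABLE = re.compile(r"[^A-Za-z0-9._-]")


def to_portable_filename(title: str, replacement: str = "_") -> str:
    """Hypothetical helper: replace every character outside the portable set."""
    name = _NON_PORTABLE.sub(replacement, title)
    # POSIX also recommends that a portable filename not start with a hyphen.
    if name.startswith("-"):
        name = replacement + name[1:]
    return name


if __name__ == "__main__":
    # Quotes, colons, slashes and backslashes are all replaced.
    print(to_portable_filename('Say "Hello": a/b\\c'))
```

Note that this approach also replaces every non-ASCII letter, so titles in non-Latin scripts collapse to runs of underscores and distinct titles can collide; a real implementation would need some way to keep the resulting names unique.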
Dec 4 2019, 4:43 PM · Patch-For-Review, Pywikibot

Dec 3 2019

OlehOnyshchcak added a comment to T236156: [ImgRec UCU] Implementation of the basic model for a project.
Dec 3 2019, 8:20 AM · Research
OlehOnyshchcak updated the task description for T236156: [ImgRec UCU] Implementation of the basic model for a project.
Dec 3 2019, 8:18 AM · Research
OlehOnyshchcak added a comment to T236146: [ImgRec UCU] Data Collection.

Full dataset - https://www.kaggle.com/jacksoncrow/extended-wikipedia-multimodal-dataset
Full dataset with raw images - https://drive.google.com/file/d/1l0Oyv2Y6LmPGN3lP9MB6i8WWCinqkYPk/view?usp=sharing

Dec 3 2019, 8:17 AM · Research

Nov 24 2019

OlehOnyshchcak updated the task description for T236156: [ImgRec UCU] Implementation of the basic model for a project.
Nov 24 2019, 12:33 PM · Research
OlehOnyshchcak added a comment to T236156: [ImgRec UCU] Implementation of the basic model for a project.

Filed a bug about missing functionality of pywikibot while collecting additional data T238992. Currently, I applied an inefficient workaround.

Nov 24 2019, 12:29 PM · Research

Nov 23 2019

OlehOnyshchcak created T238992: [Feature Request]: Querying image description with Pywikibot API.
Nov 23 2019, 6:26 PM · Pywikibot

Oct 29 2019

OlehOnyshchcak updated the task description for T236156: [ImgRec UCU] Implementation of the basic model for a project.
Oct 29 2019, 12:04 PM · Research
OlehOnyshchcak updated the task description for T236156: [ImgRec UCU] Implementation of the basic model for a project.
Oct 29 2019, 12:03 PM · Research

Oct 27 2019

OlehOnyshchcak added a comment to T236146: [ImgRec UCU] Data Collection.

Full dataset - https://drive.google.com/open?id=18i0D-N1J18UC1ebT9qbHZegKJQiKba5z
Will post a link to the full dataset on Kaggle.

Oct 27 2019, 10:34 PM · Research
OlehOnyshchcak closed T236146: [ImgRec UCU] Data Collection as Resolved.
Oct 27 2019, 10:33 PM · Research
OlehOnyshchcak closed T236146: [ImgRec UCU] Data Collection, a subtask of T236142: Evaluate existing multimodal learning methods for Wikipedia image recommendation [Master thesis], as Resolved.
Oct 27 2019, 10:33 PM · Research
OlehOnyshchcak updated the task description for T236146: [ImgRec UCU] Data Collection.
Oct 27 2019, 10:32 PM · Research
OlehOnyshchcak added a comment to T236146: [ImgRec UCU] Data Collection.

Filed another defect while fine-tuning image download: T236614

Oct 27 2019, 11:35 AM · Research
OlehOnyshchcak created T236614: Page.title(as_filename=True) don't remove "\"" (quotes) forbidden character.
Oct 27 2019, 11:32 AM · Patch-For-Review, Pywikibot

Oct 25 2019

OlehOnyshchcak updated the task description for T236146: [ImgRec UCU] Data Collection.
Oct 25 2019, 12:03 PM · Research
OlehOnyshchcak updated the task description for T236156: [ImgRec UCU] Implementation of the basic model for a project.
Oct 25 2019, 12:02 PM · Research
OlehOnyshchcak added a comment to T236405: pywikibot.page.BasePage.imagelinks() crashes.

@Dvorapa @Xqt, thanks for support and information!

Oct 25 2019, 11:55 AM · Pywikibot
OlehOnyshchcak added a comment to T236405: pywikibot.page.BasePage.imagelinks() crashes.

@Xqt, thank you for the resolution! While the PyPI version still has this bug, I will switch directly to the master branch.

Oct 25 2019, 9:06 AM · Pywikibot

Oct 24 2019

OlehOnyshchcak added a comment to T236146: [ImgRec UCU] Data Collection.

Created a truncated dataset for 500 articles (1.3 GB) - https://www.kaggle.com/jacksoncrow/wiki-articles-multimodal
The full dataset for 5638 articles (14.2 GB) is still uploading; will follow up with a link.

Oct 24 2019, 9:42 PM · Research
OlehOnyshchcak added a comment to T236405: pywikibot.page.BasePage.imagelinks() crashes.

I see, thanks. To make things simpler, I just reproduced the bug in a clean online environment with the latest pywikibot - https://jupyter.org/try
In other words, it allows you to reproduce it easily as well. You can see installation/demonstration of the bug in the attached file:

Oct 24 2019, 8:43 PM · Pywikibot
OlehOnyshchcak added a comment to T236405: pywikibot.page.BasePage.imagelinks() crashes.

@Dvorapa, thank you for such a fast response!

Oct 24 2019, 7:00 PM · Pywikibot
OlehOnyshchcak updated the task description for T236146: [ImgRec UCU] Data Collection.
Oct 24 2019, 4:24 PM · Research
OlehOnyshchcak updated the task description for T236405: pywikibot.page.BasePage.imagelinks() crashes.
Oct 24 2019, 4:22 PM · Pywikibot
OlehOnyshchcak created T236405: pywikibot.page.BasePage.imagelinks() crashes.
Oct 24 2019, 4:19 PM · Pywikibot
OlehOnyshchcak added a comment to T236146: [ImgRec UCU] Data Collection.

@Aklapper, thank you for the clarification! Will report it properly

Oct 24 2019, 2:23 PM · Research
OlehOnyshchcak added a comment to T236146: [ImgRec UCU] Data Collection.

Sent an email about a pywikibot bug found for some articles (it crashes and can't download some images).

Oct 24 2019, 12:41 PM · Research

Oct 22 2019

OlehOnyshchcak updated the task description for T236146: [ImgRec UCU] Data Collection.
Oct 22 2019, 5:28 PM · Research
OlehOnyshchcak renamed T236156: [ImgRec UCU] Implementation of the basic model for a project from [ImgRec UCU] Implementation of the first part of the project: basic model to [ImgRec UCU] Implementation of the basic model for a project.
Oct 22 2019, 1:08 PM · Research
OlehOnyshchcak updated the task description for T236142: Evaluate existing multimodal learning methods for Wikipedia image recommendation [Master thesis].
Oct 22 2019, 1:07 PM · Research
OlehOnyshchcak created T236161: [ImgRec] Thesis wrap up and submission.
Oct 22 2019, 1:06 PM · Research
OlehOnyshchcak created T236159: [ImgRec UCU] Evaluation of improvements.
Oct 22 2019, 1:05 PM · Research
OlehOnyshchcak created T236158: [ImgRec UCU] Planning and implementation of improvements.
Oct 22 2019, 1:04 PM · Research
OlehOnyshchcak created T236157: [ImgRec UCU] Evaluation of the basic model.
Oct 22 2019, 1:04 PM · Research
OlehOnyshchcak created T236156: [ImgRec UCU] Implementation of the basic model for a project.
Oct 22 2019, 1:02 PM · Research
OlehOnyshchcak triaged T236146: [ImgRec UCU] Data Collection as Medium priority.
Oct 22 2019, 12:14 PM · Research
OlehOnyshchcak added a parent task for T236146: [ImgRec UCU] Data Collection: T236142: Evaluate existing multimodal learning methods for Wikipedia image recommendation [Master thesis].
Oct 22 2019, 12:12 PM · Research
OlehOnyshchcak added a subtask for T236142: Evaluate existing multimodal learning methods for Wikipedia image recommendation [Master thesis]: T236146: [ImgRec UCU] Data Collection.
Oct 22 2019, 12:12 PM · Research
OlehOnyshchcak created T236146: [ImgRec UCU] Data Collection.
Oct 22 2019, 12:11 PM · Research
OlehOnyshchcak claimed T236142: Evaluate existing multimodal learning methods for Wikipedia image recommendation [Master thesis].
Oct 22 2019, 12:04 PM · Research