  • google calendar... email invite !
T129611: [GSoC 2016 Proposal] Port to pywikibot-core, notes:

  • about replacing package code with other packages: one main concern was to have an infrastructure that also allows using the newest experimental packages (e.g. the C code make and compile functionality) that are available through some specific interface (e.g. C) only - we should still support this in a Python module that allows doing that for any chosen code
  • the same holds for the package install and patch apply functionality
  • move meeting to fri
  • dig up my latest (most recent) etc. code:
  • figure out what's wrong with skype: the connection issue seems not to be on my side
  • test mumble/plumble (fdroid)
Finish by the end of this week:


  • decide last 50% of the gsoc - project plan !!! irc - T129611: [GSoC 2016 Proposal] Port to pywikibot-core
    • what from the original proposal do we need (must have) and is essential?
    • what from the original bot is still missing?
    • rest: free!
  • do beta-test (in vm - on labs?) - T136985: file-metadata testing log
  • think about training for specific database uploads
    • do not train a haarcascade from a set specific to an upload to be done
    • but create a set of e.g. persons that are likely to appear and use image comparison algorithms instead of feature detection (haarcascade, etc.)
    • what was this algorithm for the video copyright project called again?
  • think about training in general
    • Z441#5657: SVM for line drawing / graphics detection?
  • do facial landmarks give more info than just the face position? e.g. "landmarks"? yes
  • Z441#5705
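
A minimal sketch of the "image comparison instead of feature detection" idea from the training notes above. It uses a plain average hash, which is a technique swapped in here for illustration and not necessarily the algorithm from the video copyright project. It assumes the images are already converted to grayscale and downscaled to the same small size (e.g. 8x8) and passed as nested lists; the names `average_hash`, `hamming_distance` and `best_match` are hypothetical.

```python
def average_hash(pixels):
    """Return a tuple of bits: 1 where a pixel is brighter than the mean."""
    flat = [value for row in pixels for value in row]
    mean = sum(flat) / len(flat)
    return tuple(1 if value > mean else 0 for value in flat)


def hamming_distance(hash_a, hash_b):
    """Count the positions in which two equally long hashes differ."""
    if len(hash_a) != len(hash_b):
        raise ValueError("hashes must have the same length")
    return sum(a != b for a, b in zip(hash_a, hash_b))


def best_match(query, references):
    """Return (name, distance) of the reference image closest to the query.

    references maps a name (e.g. a person) to a grayscale pixel grid of
    the same size as the query (assumption made for this sketch).
    """
    query_hash = average_hash(query)
    return min(
        ((name, hamming_distance(query_hash, average_hash(image)))
         for name, image in references.items()),
        key=lambda item: item[1],
    )
```

A small distance means the two images have a similar coarse brightness layout; a threshold for "same person/motif" would have to be tuned on real data.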


  • video2copyright project meeting
    • 80
    • split videos
    • python wrapper vs .net ones
    • pydotnet or else ironpython idea still alive
  • afraid I do not have time today for "video2copyright project meeting" - could we find another date? else I'll try to make it

-> problem - today after all

  • next want to check scripts

(run it on eth-bib set!!)
(check and write code - fork and update README.rst in file-metadata)
(change usage syntax in docu string)
(check my todos!)

osboxes@osboxes:~/core$ python -search:'eth-bib' -logname:ETH-Bib -> move this and output to testing log

pywikibot search page generator should output number of matches before starting


  • does the bot script and/or write local logs, e.g. containing errors, warnings, etc. that occurred?
    • use pywikibot logging for the bot script
    • use tee for
  • retry downloading if errors occur (or check filesize)
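
A rough sketch of the "retry downloading / check filesize" item above. The actual download call is not shown in these notes, so it is passed in as `fetch`, a hypothetical zero-argument callable returning bytes; `download_with_retry`, `expected_size`, `retries` and `delay` are likewise made-up names for illustration.

```python
import time


def download_with_retry(fetch, expected_size=None, retries=3, delay=0.0):
    """Call fetch() until it returns data of the expected size.

    fetch is a hypothetical wrapper around the real download call.
    A size mismatch is treated like any other I/O error and retried.
    """
    if retries < 1:
        raise ValueError("retries must be at least 1")
    last_error = None
    for attempt in range(1, retries + 1):
        try:
            data = fetch()
            if expected_size is not None and len(data) != expected_size:
                raise IOError("size mismatch: got %d, expected %d"
                              % (len(data), expected_size))
            return data
        except (IOError, OSError) as error:
            last_error = error
            if attempt < retries:
                time.sleep(delay * attempt)  # wait a bit longer each time
    raise last_error
```

If the expected size is not known in advance, leaving `expected_size=None` reduces this to a plain retry on I/O errors.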

$ python -u -search:'eth-bib' -logname:ETH-Bib 2>&1 | tee bulk.log
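
As an alternative to piping through tee, the same "console plus file" effect can be had inside Python. This is only a sketch with the standard `logging` module (not pywikibot's own logging layer); the function name `make_tee_logger` and the logger name `bulk` are assumptions.

```python
import logging
import sys


def make_tee_logger(logfile, name="bulk"):
    """Return a logger that writes every record to stderr and to logfile."""
    logger = logging.getLogger(name)
    if logger.handlers:  # already configured, avoid duplicate output
        return logger
    logger.setLevel(logging.DEBUG)
    formatter = logging.Formatter("%(asctime)s %(levelname)s %(message)s")
    for handler in (logging.StreamHandler(sys.stderr),
                    logging.FileHandler(logfile, encoding="utf-8")):
        handler.setFormatter(formatter)
        logger.addHandler(handler)
    return logger
```

Usage would be something like `log = make_tee_logger("bulk.log")` followed by `log.warning("retrying download")`; unlike tee, this only captures what goes through the logger, not arbitrary prints.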

osboxes@osboxes:~/core$ ps -aux | grep "python"
root 1960 0.0 0.2 76308 4580 ? S 14:11 0:00 /usr/bin/python /usr/share/apt-xapian-index/update-apt-xapian-index-dbus
osboxes 21607 34.5 8.8 1045072 182028 pts/9 Sl+ 15:45 44:13 python -search:eth-bib -logname:ETH-Bib
osboxes 23479 0.0 0.1 11756 2188 pts/0 S+ 17:53 0:00 grep --color=auto python
osboxes@osboxes:~/core$ kill -STOP 21607
osboxes@osboxes:~/core$ kill -CONT 21607
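
The pause/resume done with kill above can also be scripted, e.g. if the bulk run is started from a small Python supervisor. A minimal sketch for POSIX systems only; `pause` and `resume` are hypothetical helper names.

```python
import os
import signal


def pause(pid):
    """Suspend the process, like `kill -STOP <pid>`."""
    os.kill(pid, signal.SIGSTOP)


def resume(pid):
    """Let a suspended process continue, like `kill -CONT <pid>`."""
    os.kill(pid, signal.SIGCONT)
```

The process keeps its memory and open files while stopped, so the run picks up where it left off after `resume`.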

  • need categories stats - discriminate the file type cats from others
  • graphics detection in the other direction: lots of shades, gradients/contours, etc. -> Category:Photographs_by_camera
  • enable conda opencv install on labs and re-run eth-bib job
  • adapt catimages-gsoc docker according to how the test is run on labs and to the renamed scripts
  • Questions for @AbdealiJK:
    • unittest: why and when are they skipped? what about skipping the ones for opencv?
    • implement haarcascade in scikit-learn; what about implementing T141633 (the wavelet paper) too?
    • do we have any kind of segmentation implemented in file-metadata?
    • what about writing a qt client for file-metadata? can that be combined with the web-client @AbdealiJK plans to write?


drtrigon@tools-bastion-03:~$ wget -O
drtrigon@tools-bastion-03:~$ bash -f -b -p $HOME/miniconda
$ conda list

see also

It looks like this task was about GSoC 2016, hence closing.