Change Details

- DrTrigon (1. Development Phase around 2013) [ ] https://en.wikipedia.org/wiki/User:DrTrigonBot/doc#.28generic.29 [ ] alternative to pHash (not developed since 2013): http://blockhash.io/ (or just create an icon by averaging over pixels, resp. reducing resolution/scale/zoom) [ ] wavlet decompositions for peak detection, color regions, fingerprinting/hashing, frequency decomp., denoise, compress, etc. - code/software: - http://www.pybytes.com/pywavelets/regression/wp2d.html (supports 2D data, mature, see WaveletPacket2D.get_leaf_nodes() and store as xml/json) - http://jseabold.net/blog/2012/02/23/wavelet-regression-in-python/ - literature/paper: - https://www.researchgate.net/post/How_wavelet_transform_coefficient_used_for_image_classification - http://www.cmapx.polytechnique.fr/~yu/publications/ICPR08Final.pdf <- **implement this as it supports object recognition, texture and satelite images classification, text/image language identification, sound classification** - patch transformation: http://people.csail.mit.edu/taegsang/Documents/CVPRPatch.pdf - http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.684.5988&rep=rep1&type=pdf - http://ac.els-cdn.com/S0377042706006431/1-s2.0-S0377042706006431-main.pdf?_tid=dca58f04-2708-11e6-9c23-00000aab0f27&acdnat=1464683149_3f76b534460f9c998182d86714c80597 (watermarking) - http://soundlab.cs.princeton.edu/publications/2001_amta_aadwt.pdf [ ] head pose estimation - see: http://rpg.ifi.uzh.ch/software_datasets.html (Perspective 3-Point (P3P) Algorithm) - http://rpg.ifi.uzh.ch/software/p3p_code_final.zip - http://rpg.ifi.uzh.ch/docs/CVPR11_kneip.pdf [ ] T137558: render error detection (see T136934) - using `convert` (ImageMagick) the commons default, allows to compare commons results against other libraries and e.g. find rendering errors, see https://github.com/AbdealiJK/file-metadata/issues/37 - possible categories to check for testing are: - https://commons.wikimedia.org/wiki/Category:PDF_files_affected_by_MediaWiki_restrictions - https://commons.wikimedia.org/wiki/Category:Images_without_thumbnails - https://commons.wikimedia.org/wiki/Category:Images_with_render_problem - AbdealiJK, jayvdb, DrTrigon (2. Development Phase GSoC 2016) [ ] T135836#2314683, T135836#2314835: face recognition (e.g. like facebook) as well as age and gender - needs kind of DB (e.g. commons) [ ] T135836#2314683: facial landmarks [ ] learning? how time consuming? (not to spend too much time on something that we cannot finish - though a actually should be quite easy to have a first theoretically working script) [ ] train the bot with images of persons we now in advance that they will appear in a dataset (e.g. generals or politicians during wars, etc.) [ ] train the bot on the dataset itself at least after humans have gone over it Categories to assign (see https://etherpad.wikimedia.org/p/Zl7V7KuK7J): - [[ https://commons.wikimedia.org/wiki/Category:Portraits | Category:Portraits ]] -> size of face (ration compared to picture size - kind of coverage) and orientation (head pose) -

- DrTrigon (1. Development Phase around 2013) [ ] https://en.wikipedia.org/wiki/User:DrTrigonBot/doc#.28generic.29 [ ] alternative to pHash (not developed since 2013): http://blockhash.io/ (or just create an icon by averaging over pixels, resp. reducing resolution/scale/zoom) [ ] wavlet decompositions for peak detection, color regions, fingerprinting/hashing, frequency decomp., denoise, compress, etc. - code/software: - http://www.pybytes.com/pywavelets/regression/wp2d.html (supports 2D data, mature, see WaveletPacket2D.get_leaf_nodes() and store as xml/json) - http://jseabold.net/blog/2012/02/23/wavelet-regression-in-python/ - literature/paper: - https://www.researchgate.net/post/How_wavelet_transform_coefficient_used_for_image_classification - http://www.cmapx.polytechnique.fr/~yu/publications/ICPR08Final.pdf <- **implement this as it supports object recognition, texture and satelite images classification, text/image language identification, sound classification** - patch transformation: http://people.csail.mit.edu/taegsang/Documents/CVPRPatch.pdf - http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.684.5988&rep=rep1&type=pdf - http://ac.els-cdn.com/S0377042706006431/1-s2.0-S0377042706006431-main.pdf?_tid=dca58f04-2708-11e6-9c23-00000aab0f27&acdnat=1464683149_3f76b534460f9c998182d86714c80597 (watermarking) - http://soundlab.cs.princeton.edu/publications/2001_amta_aadwt.pdf [ ] head pose estimation - see: http://rpg.ifi.uzh.ch/software_datasets.html (Perspective 3-Point (P3P) Algorithm) - http://rpg.ifi.uzh.ch/software/p3p_code_final.zip - http://rpg.ifi.uzh.ch/docs/CVPR11_kneip.pdf [ ] T137558: render error detection (see T136934) - using `convert` (ImageMagick) the commons default, allows to compare commons results against other libraries and e.g. find rendering errors, see https://github.com/AbdealiJK/file-metadata/issues/37 - possible categories to check for testing are: - https://commons.wikimedia.org/wiki/Category:PDF_files_affected_by_MediaWiki_restrictions - https://commons.wikimedia.org/wiki/Category:Images_without_thumbnails - https://commons.wikimedia.org/wiki/Category:Images_with_render_problem - AbdealiJK, jayvdb, DrTrigon (2. Development Phase GSoC 2016) [ ] T135836#2314683, T135836#2314835: face recognition (e.g. like facebook) as well as age and gender - needs kind of DB (e.g. commons) [ ] T135836#2314683: facial landmarks [ ] learning? how time consuming? (not to spend too much time on something that we cannot finish - though a actually should be quite easy to have a first theoretically working script) [ ] train the bot with images of persons we now in advance that they will appear in a dataset (e.g. generals or politicians during wars, etc.) [ ] train the bot on the dataset itself at least after humans have gone over it [ ] Z441#5618: what happens if you take an image flandmark cannot detect, and some amount of random noise, resize and rotate it a bit and re-try - as if you were sitting in front of the cam and move and tilt your head a bit untill it get the detection Categories to assign (see https://etherpad.wikimedia.org/p/Zl7V7KuK7J): - [[ https://commons.wikimedia.org/wiki/Category:Portraits | Category:Portraits ]] -> size of face (ration compared to picture size - kind of coverage) and orientation (head pose) -