@cscott proposed that it might be useful to collect the contribution of each stripped attribute to the Parsoid HTML, so that we can decide attribute by attribute whether to strip it or not.
So, the current code for stripping might need to be rewritten to strip attributes one at a time and collect stats.
Given this, it might also be useful to look at a larger set of attributes we could strip, beyond the initial set used to generate stats for T272331.
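
To make the one-attribute-at-a-time idea concrete, here is a rough Python sketch. It is not the existing stripping code. It assumes "contribution" means bytes saved in the serialized HTML, and it uses a regex over the HTML string purely for illustration (a real version would presumably work on the DOM). The function names `strip_attribute` and `per_attribute_savings` are hypothetical.

```python
import re


def strip_attribute(html: str, name: str) -> str:
    """Remove every occurrence of one attribute from an HTML string.

    Illustrative only: handles quoted attribute values and would also
    match attribute-like text outside of tags.
    """
    pattern = re.compile(r"\s+" + re.escape(name) + r"""=(?:"[^"]*"|'[^']*')""")
    return pattern.sub("", html)


def per_attribute_savings(html: str, names: list[str]) -> dict[str, int]:
    """Strip each attribute on its own and record the bytes saved.

    Assumption: size is measured as UTF-8 bytes of the serialized HTML.
    """
    original_size = len(html.encode("utf-8"))
    savings = {}
    for name in names:
        stripped = strip_attribute(html, name)
        savings[name] = original_size - len(stripped.encode("utf-8"))
    return savings
```

Calling `per_attribute_savings(html, ["data-parsoid", "data-mw", "id"])` would return one number per attribute; the candidate list here is only an example. Because each attribute is stripped from the original HTML independently, the numbers can be compared directly when deciding what to strip.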