Page MenuHomePhabricator
Paste P8634

Db table audit for dumps 2019
ActivePublic

Authored by ArielGlenn on Jun 20 2019, 12:27 PM.
Referenced Files
F29781798: raw.txt
Jul 16 2019, 10:15 AM
F29688810: raw.txt
Jul 6 2019, 7:24 AM
F29688808: raw.txt
Jul 6 2019, 7:21 AM
F29688755: raw.txt
Jul 6 2019, 6:06 AM
F29607131: raw.txt
Jun 20 2019, 12:27 PM
Subscribers
On a given not-to-special wiki (not Commons, not Wikidata), the below tables
are available. Note that some wikis have additional tables from optional extensions;
these should be added to the list later and checked.
left to review:
ores_classification -- not useful unless ores is set up on mirror install, would be auto-generated?
ores_model -- same?
--------
not yet dumped but could be
babel
change_tag_def
user_former_groups
--------
already dumped
categorylinks
category
change_tag
externallinks
geo_tags
imagelinks
image
iwlinks
langlinks
pagelinks
page_props
page_restrictions
page
protected_titles
redirect
sites
site_stats
templatelinks
user_groups
wbc_entity_usage
--------
known private data
abuse_filter
abuse_filter_action
abuse_filter_history
abuse_filter_log
archive
archive_save
cu_changes
cu_log
filearchive
global_block_whitelist (some fields in some entries may be private)
hidden
ipblocks
logging
oldimage
securepoll_cookie_match
securepoll_elections
securepoll_entity
securepoll_lists
securepoll_msgs
securepoll_options
securepoll_properties
securepoll_questions
securepoll_strike
securepoll_voters
securepoll_votes
spoofuser (also rebuildable)
user
user_properties (see T150679)
watchlist
---------
old private data
bv2009_edits
bv2011_edits
bv2013_edits
bv2015_edits
bv2017_edits
povwatch_log
povwatch_subscribers
---------
relevant bits dumped as xml (revision metadata/content)
actor
comment
content
content_models
revision
revision_actor_temp
revision_comment_temp
slot_roles
slots
text
----------
caches, temporary data, rebuildable data, data specific to WMF wiki state
__wmf_checksums (table checksums for internal dba use at WMF)
betafeatures_user_counts (not useful for mirror)
ip_changes (populateIpChanges.php)
ipblocks_restrictions (specific to ip blocks on a wiki, useless w/o ipblocks)
linter (captures errors found by Parsoid, not needed)
log_search (populateLogSearch.php)
mathoid
module_deps (emptied at each run of update.php, uses abs file paths)
objectcache
querycache
querycache_info
querycachetwo
recentchanges (rebuildrecentchanges.php)
searchindex (rebuildtextindex.php)
transcache
transcode (transcoded items aren't dumped, can be recreated)
uploadstash
updatelog (wiki-specific, should not be imported)
user_newtalk (specific to wiki state, should not be imported)
wikilove_log (tracks usage of the wikilove extension, actual data is edits to talk pages, already dumped)
----------
empty/unused on WMF
cur
edit_page_tracking
filejournal (also private)
interwiki
job
l10n_cache
math
pif_edits (would be private if it had content)
-----------
dumped in other format
site_identifiers (site matrix in json)