User Details
- User Since
- Mar 21 2015, 12:56 AM (484 w, 7 h)
- Availability
- Available
- LDAP User
- Unknown
- MediaWiki User
- Nitingupta910
Aug 24 2016
Nitingupta910 added a comment to T17017: Wikimedia static HTML dumps broken.
Plain HTML dumps would be much better than any cooked-up format. Plain HTML/XML gives a lot of flexibility and is easy to parse with parsers available in almost any language, or even from the command line.
Aug 25 2015
Nitingupta910 added a comment to T94457: Install nodejs, nginx and other dependencies on francium.
Are there any updates on making HTML dumps generally available?
Apr 21 2015
Nitingupta910 added a comment to T93396: Decide on format options for HTML and possibly other dumps.
FWIW, I would vote for a compressed SQLite database over custom formats like ZIM.
Is there any enwiki HTML dump ready for download?
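To make the SQLite suggestion concrete, here is a rough sketch of one possible reading: a single table keyed by title, with each page's HTML compressed per row using zlib. The table name `pages`, the column names, and the helper functions are all hypothetical; the comment could equally mean compressing the whole database file, and nothing here reflects an actual Wikimedia dump format.

```python
import sqlite3
import zlib


def open_dump(path=":memory:"):
    """Open (or create) a dump database. The schema is an assumption."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages ("
        "title TEXT PRIMARY KEY, html BLOB NOT NULL)"
    )
    return conn


def put_page(conn, title, html):
    """Store one page, compressing its HTML with zlib."""
    blob = zlib.compress(html.encode("utf-8"))
    conn.execute(
        "INSERT OR REPLACE INTO pages (title, html) VALUES (?, ?)",
        (title, blob),
    )


def get_page(conn, title):
    """Return the decompressed HTML for a title, or None if absent."""
    row = conn.execute(
        "SELECT html FROM pages WHERE title = ?", (title,)
    ).fetchone()
    if row is None:
        return None
    return zlib.decompress(row[0]).decode("utf-8")
```

A reader would then fetch a page with `get_page(conn, "Main Page")`, using only the standard library, which is part of the appeal over a custom container format.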
Mar 21 2015
Nitingupta910 added a comment to T93396: Decide on format options for HTML and possibly other dumps.
I think a NoSQL solution like a key-value store is ideal for providing HTML dumps: it has a natural mapping, key:title => value:HTML. Any additional metadata can be easily modeled in the key (e.g. title\0timestamp\0revision => HTML).
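The key layout above could be sketched roughly as follows. This is only an illustration: a plain Python dict stands in for a real key-value store, and `make_key`, `parse_key`, `revisions_of`, the timestamp format, and the revision number are hypothetical names and values chosen for the example.

```python
SEP = "\0"  # NUL separator, as in title\0timestamp\0revision


def make_key(title, timestamp, revision):
    """Build a composite key: title, timestamp, revision joined by NUL."""
    if SEP in title or SEP in timestamp:
        raise ValueError("key fields must not contain NUL")
    return SEP.join((title, timestamp, str(revision))).encode("utf-8")


def parse_key(key):
    """Split a composite key back into (title, timestamp, revision)."""
    title, timestamp, revision = key.decode("utf-8").split(SEP)
    return title, timestamp, int(revision)


def revisions_of(store, title):
    """Return all keys for a title, found by prefix match and sorted."""
    prefix = title.encode("utf-8") + SEP.encode("utf-8")
    return sorted(k for k in store if k.startswith(prefix))


# A dict stands in for the key-value store (assumption for illustration).
store = {}
store[make_key("Main Page", "20150321T005600Z", 12345)] = "<html>...</html>"
```

Because the title comes first and NUL sorts before every other byte, an ordered store would keep all revisions of a page adjacent, so a prefix scan on `title\0` retrieves them together.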