use wmf; SELECT * FROM webrequest WHERE webrequest_source='maps' AND year=2015 and month=9 and content_type like 'text/html%' limit 50;
Running that produces a flood of these:
Sep 18, 2015 3:06:17 AM INFO: parquet.hadoop.InternalParquetRecordReader: RecordReader initialized will read a total of 46 records. Sep 18, 2015 3:06:17 AM INFO: parquet.hadoop.InternalParquetRecordReader: at row 0. reading next block Sep 18, 2015 3:06:17 AM INFO: parquet.hadoop.InternalParquetRecordReader: block read in memory in 3 ms. row count = 46 Sep 18, 2015 3:06:17 AM WARNING: parquet.hadoop.ParquetRecordReader: Can not initialize counter due to context is not a instance of TaskInputOutputContext, but is org.apache.hadoop.mapreduce.task.TaskAttemptContextImpl