Refine failed for event.mediawiki_cirrussearch_request
Closed, ResolvedPublic
Actions

Assigned To

Authored By

	EBernhardson
	Feb 10 2020, 5:31 PM

Description

Looking at the jobs that were supposed to run over the weekend, one set is stalled waiting for two missing partitions. Looking in hdfs I see input data available in /wmf/data/raw/..., but the refined output locations instead contain _REFINE_FAILED files.

Could these be re-run? Alternatively, if refine will simply not run on these hours, could we place _SUCCESS files in the directories so downstream oozie tasks continue?

$ hdfs dfs -ls /wmf/data/event/mediawiki_cirrussearch_request/datacenter=eqiad/year=2020/month=2/day=*/hour=*/_REFINE_FAILED
Picked up JAVA_TOOL_OPTIONS: -Dfile.encoding=UTF-8
-rw-r--r--   3 analytics analytics         26 2020-02-03 10:21 /wmf/data/event/mediawiki_cirrussearch_request/datacenter=eqiad/year=2020/month=2/day=3/hour=6/_REFINE_FAILED
-rw-r--r--   3 analytics analytics         26 2020-02-03 10:22 /wmf/data/event/mediawiki_cirrussearch_request/datacenter=eqiad/year=2020/month=2/day=3/hour=7/_REFINE_FAILED

Related Objects

Mentioned In: T240230: Run a script to check REFINE_FAILED flags daily

Event Timeline

EBernhardson created this task.Feb 10 2020, 5:31 PM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptFeb 10 2020, 5:31 PM

EBernhardson updated the task description. (Show Details)Feb 10 2020, 5:31 PM

Done!

elukey@an-coord1001:~$ sudo -u analytics /usr/local/bin/refine_mediawiki_events --ignore_failure_flag=true --since=2020-02-03T05:00:00 --until=2020-02-03T08:00:00 table_whitelist_regex=mediawiki_cirrussearch_request --verbose refine_mediawiki_event

elukey@an-coord1001:~$ hdfs dfs -ls /wmf/data/event/mediawiki_cirrussearch_request/datacenter=eqiad/year=2020/month=2/day=*/hour=*/_REFINE_FAILED
Picked up JAVA_TOOL_OPTIONS: -Dfile.encoding=UTF-8
ls: `/wmf/data/event/mediawiki_cirrussearch_request/datacenter=eqiad/year=2020/month=2/day=*/hour=*/_REFINE_FAILED': No such file or directory

Closing, re-open if anything is still missing :)

elukey mentioned this in T240230: Run a script to check REFINE_FAILED flags daily.Feb 11 2020, 5:29 PM

Aklapper removed a project: Analytics.Jul 4 2020, 7:59 AM

Refine failed for event.mediawiki_cirrussearch_requestClosed, ResolvedPublicActions

Description

Related Objects

Event Timeline

Refine failed for event.mediawiki_cirrussearch_request
Closed, ResolvedPublic
Actions