Skip to content

Instantly share code, notes, and snippets.

@zvada
Last active May 23, 2019 19:57
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save zvada/edd35af3b8496641bda9bf4971109f2f to your computer and use it in GitHub Desktop.
Save zvada/edd35af3b8496641bda9bf4971109f2f to your computer and use it in GitHub Desktop.
Factory HeldReason
[1249] gfactory@gfactory-2 ~$ condor_q -g -const 'JobStatus=?=5 && owner=?="feosgflock"' -af HoldReason | sort | uniq -c
38 CE job in status 1 put on hold by SYSTEM_PERIODIC_HOLD due to non-existent route in JOB_ROUTER_ENTRIES or route job limit.
45 Error connecting to schedd ce01.cmsaf.mit.edu: SECMAN:2010:Received "DENIED" from server for user flock.opensciencegrid.org@daemon.opensciencegrid.org using method GSI.|AUTHENTICATE:1004:Failed to authenticate using FS
104 Error connecting to schedd crane-gw1.unl.edu: AUTHENTICATE:1003:Failed to authenticate with any method|AUTHENTICATE:1004:Failed to authenticate using GSI|GSI:5002:Failed to authenticate because the remote (server) side was not able to acquire its credentials.|AUTHENTICATE:1004:Failed to authenticate using FS
82 Error connecting to schedd hadoop-osg.rcac.purdue.edu: SECMAN:2010:Received "DENIED" from server for user flock.opensciencegrid.org@daemon.opensciencegrid.org using method GSI.|AUTHENTICATE:1004:Failed to authenticate using FS
43 Error connecting to schedd hpcosgce.fiu.edu: SECMAN:2010:Received "DENIED" from server for user flock.opensciencegrid.org@daemon.opensciencegrid.org using method GSI.
207 Error connecting to schedd osggrid01.hep.wisc.edu: SECMAN:2007:Failed to received post-auth ClassAd|AUTHENTICATE:1004:Failed to authenticate using FS
42 Error connecting to schedd umiss001.hep.olemiss.edu: SECMAN:2007:Failed to received post-auth ClassAd|AUTHENTICATE:1004:Failed to authenticate using FS
143 HTCondor-CE held job due to no matching routes, route job limit, or route failure threshold; see 'HTCondor-CE Troubleshooting Guide'
409 Job not found
[1249] gfactory@gfactory-1 /etc/osg-gfactory$ condor_q -g -const 'JobStatus=?=5 && owner=?="feosgflock"' -af HoldReason | sort | uniq -c
82 CE job in status 1 put on hold by SYSTEM_PERIODIC_HOLD due to non-existent route in JOB_ROUTER_ENTRIES or route job limit.
56 Error connecting to schedd ce01.cmsaf.mit.edu: SECMAN:2010:Received "DENIED" from server for user flock.opensciencegrid.org@daemon.opensciencegrid.org using method GSI.|AUTHENTICATE:1004:Failed to authenticate using FS
465 Error connecting to schedd crane-gw1.unl.edu: AUTHENTICATE:1003:Failed to authenticate with any method|AUTHENTICATE:1004:Failed to authenticate using GSI|GSI:5002:Failed to authenticate because the remote (server) side was not able to acquire its credentials.|AUTHENTICATE:1004:Failed to authenticate using FS
8 Error connecting to schedd crane-gw1.unl.edu: AUTHENTICATE:1003:Failed to authenticate with any method|AUTHENTICATE:1004:Failed to authenticate using GSI|GSI:5003:Failed to authenticate. Globus is reporting error (851968:20160). There is probably a problem with your credentials. (Did you run grid-proxy-init?)|AUTHENTICATE:1004:Failed to authenticate using FS
1 Error connecting to schedd crane-gw1.unl.edu: AUTHENTICATE:1003:Failed to authenticate with any method|AUTHENTICATE:1004:Failed to authenticate using GSI|GSI:5003:Failed to authenticate. Globus is reporting error (851968:239). There is probably a problem with your credentials. (Did you run grid-proxy-init?)|AUTHENTICATE:1004:Failed to authenticate using FS
2 Error connecting to schedd crane-gw1.unl.edu: AUTHENTICATE:1003:Failed to authenticate with any method|AUTHENTICATE:1004:Failed to authenticate using GSI|GSI:5003:Failed to authenticate. Globus is reporting error (851968:254). There is probably a problem with your credentials. (Did you run grid-proxy-init?)|AUTHENTICATE:1004:Failed to authenticate using FS
5 Error connecting to schedd crane-gw1.unl.edu: AUTHENTICATE:1003:Failed to authenticate with any method|AUTHENTICATE:1004:Failed to authenticate using GSI|GSI:5003:Failed to authenticate. Globus is reporting error (851968:269). There is probably a problem with your credentials. (Did you run grid-proxy-init?)|AUTHENTICATE:1004:Failed to authenticate using FS
1 Error connecting to schedd gridgk01.racf.bnl.gov: AUTHENTICATE:1003:Failed to authenticate with any method|AUTHENTICATE:1004:Failed to authenticate using GSI|GSI:5003:Failed to authenticate. Globus is reporting error (851968:149). There is probably a problem with your credentials. (Did you run grid-proxy-init?)|AUTHENTICATE:1004:Failed to authenticate using FS
11 Error connecting to schedd gridgk01.racf.bnl.gov: SECMAN:2010:Received "DENIED" from server for user flock.opensciencegrid.org@daemon.opensciencegrid.org using method GSI.|AUTHENTICATE:1004:Failed to authenticate using FS
39 Error connecting to schedd hadoop-osg.rcac.purdue.edu: SECMAN:2010:Received "DENIED" from server for user flock.opensciencegrid.org@daemon.opensciencegrid.org using method GSI.|AUTHENTICATE:1004:Failed to authenticate using FS
54 Error connecting to schedd hpcosgce.fiu.edu: SECMAN:2010:Received "DENIED" from server for user flock.opensciencegrid.org@daemon.opensciencegrid.org using method GSI.
6 Error connecting to schedd iitce1.iit.edu: AUTHENTICATE:1003:Failed to authenticate with any method|AUTHENTICATE:1004:Failed to authenticate using GSI|GSI:5003:Failed to authenticate. Globus is reporting error (851968:329). There is probably a problem with your credentials. (Did you run grid-proxy-init?)|AUTHENTICATE:1004:Failed to authenticate using FS
1 Error connecting to schedd iitce1.iit.edu: SECMAN:2010:Received "DENIED" from server for user flock.opensciencegrid.org@daemon.opensciencegrid.org using method GSI.|AUTHENTICATE:1004:Failed to authenticate using FS
3 Error connecting to schedd iitce2.iit.edu: SECMAN:2010:Received "DENIED" from server for user flock.opensciencegrid.org@daemon.opensciencegrid.org using method GSI.|AUTHENTICATE:1004:Failed to authenticate using FS
2 Error connecting to schedd iut2-gk.mwt2.org: SECMAN:2010:Received "DENIED" from server for user flock.opensciencegrid.org@daemon.opensciencegrid.org using method GSI.|AUTHENTICATE:1004:Failed to authenticate using FS
26 Error connecting to schedd osg-gk.mwt2.org: SECMAN:2010:Received "DENIED" from server for user flock.opensciencegrid.org@daemon.opensciencegrid.org using method GSI.|AUTHENTICATE:1004:Failed to authenticate using FS
216 Error connecting to schedd osggrid01.hep.wisc.edu: SECMAN:2007:Failed to received post-auth ClassAd|AUTHENTICATE:1004:Failed to authenticate using FS
20 Error connecting to schedd uct2-gk.mwt2.org: SECMAN:2010:Received "DENIED" from server for user flock.opensciencegrid.org@daemon.opensciencegrid.org using method GSI.|AUTHENTICATE:1004:Failed to authenticate using FS
46 Error connecting to schedd umiss001.hep.olemiss.edu: SECMAN:2007:Failed to received post-auth ClassAd|AUTHENTICATE:1004:Failed to authenticate using FS
6 Error connecting to schedd umiss001.hep.olemiss.edu: SECMAN:2010:Received "DENIED" from server for user flock.opensciencegrid.org@daemon.opensciencegrid.org using method GSI.|AUTHENTICATE:1004:Failed to authenticate using FS
1 Error receiving files from schedd its-condor-ce2.syr.edu: DCSchedd::receiveJobSandbox:7003:File transfer failed for target job 4867168.0: SCHEDD at 128.230.171.120 failed to send file(s) to <169.228.38.36:8175>: error reading from /var/lib/condor-ce/spool/7168/0/cluster4867168.proc0.subproc0/_condor_stderr: (errno 2) No such file or directory; C_GAHP_WORKER_THREAD failed to receive file(s) from <128.230.171.120:9619>
48 Failed to initialize GAHP
1 Failed to start GAHP: Agent pid 404968\nssh: Could not resolve hostname commodore.grid.wayne.edu: Temporary failure in name resolution\nAgent pid 404968 killed\n
20 Globus error 12: the connection to the server failed (check host and port)
30 HTCondor-CE held job due to expired user proxy.
352 HTCondor-CE held job due to no matching routes, route job limit, or route failure threshold; see 'HTCondor-CE Troubleshooting Guide'
545 Job not found
5 submission command failed (exit code = -15) (stdout:echo "bls_opt_run_dir = $bls_opt_run_dir"-echo "bls_tmp_name = $bls_tmp_name"-echo "blah_wn_temporary_home_dir = $blah_wn_temporary_home_dir"-echo "run_dir = $run_dir"-) (stderr: <blah> execute_cmd: 30 seconds timeout expired, killing child process.- <blah> killed by signal 15.-)
5 submission command failed (exit code = -15) (stdout:) (stderr: <blah> execute_cmd: 30 seconds timeout expired, killing child process.- <blah> killed by signal 15.-)
3 Unspecified gridmanager error
164 via condor_hold (by user condor)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment