---+ OSG-RSV troubleshooting

---++Description

Our osg-rsv was not reporting to the central collector. We restarted the services as follows:
<pre>
/etc/init.d/osg-rsv stop
/etc/init.d/condor-cron stop
/etc/init.d/condor-cron start
/etc/init.d/osg-rsv start
</pre>
The services restarted cleanly, as confirmed by listing the jobs:
<pre>
condor_cron_q
</pre>
However, the logs showed a lot of errors:
<pre>
tail -f /OSG/osg-rsv/logs/consumers/gratia-script-consumer.err
sh: line 1: -osg-ce.sprace.org.br-org.osg.general.osg-version.18528.py: command not found
sh: /opt/osg-1.0.0/osg-rsv/output/gratia/2008-10-11T22:46:52Z: No such file or directory
sh: line 1: -osg-ce.sprace.org.br-org.osg.general.osg-version.13681.py: command not found
</pre>
In this case, the solution was to remove the old files from the Gratia output directory:
<pre>
rm -f $VDT_LOCATION/osg-rsv/output/gratia/*
</pre>
We then restarted all the agents again and restarted Apache:
<pre>
/etc/init.d/apache restart
</pre>
It seems fine now:
<pre>
tail -f /opt/osg-1.0.0/osg-rsv/logs/consumers/gratia-script-consumer.out
2008-10-19 05:54:37 BRST Gratia: handshake records sent successfuly: 1
2008-10-19 05:54:37 BRST Gratia: handshake records failed: 0
</pre>
You can double-check the dates of the probes at https://osg-ce.sprace.org.br:8443/rsv/ against https://lcg-sam.cern.ch:8443/sam/sam.py?sensors=OSGCE&regions=OpenScienceGrid&vo=ops&order=SiteName&funct=ShowSensorTests

As a by-product of this, we found an error in condor-cron. Again, it restarted cleanly, but its log showed:
<pre>
tail -f /OSG/condor-cron/local.osg-ce/log/MasterLog
10/17 08:43:05 attempt to connect to <192.168.1.150:9619> failed: Connection refused (connect errno = 111).
10/17 08:43:05 ERROR: SECMAN:2003:TCP connection to <192.168.1.150:9619> failed
</pre>
It was an error in our port setup. We edited the collector setting:
<pre>
vim /OSG/condor-cron/etc/condor_config
COLLECTOR_HOST = $(CONDOR_HOST):9618
</pre>
Then restart condor-cron again.

---++Updates

---+++Marco on 21/10/2008

We changed our condor-cron configuration so that condor-cron no longer publishes its schedd to our production Condor. The change leaves =COLLECTOR_HOST= empty:
<pre>
/etc/init.d/osg-rsv stop
/etc/init.d/condor-cron stop
vim /OSG/condor-cron/etc/condor_config
COLLECTOR_HOST =
/etc/init.d/condor-cron start
/etc/init.d/osg-rsv start
</pre>
With this setting, the following error appears when you start condor-cron. It is harmless:
<pre>
tail -f /OSG/condor-cron/local.osg-ce/log/MasterLog
ERROR: Unable to find collector info in configuration file!!!
</pre>

---+++So-and-so on dd/mm/yyyy

More comments

-- Main.MarcoAndreFerreiraDias - 19 Oct 2008
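
The stop/start sequence used in both sections above can be wrapped in a small shell helper. This is only a sketch under assumptions: the function names =run= and =restart_rsv= and the =DRY_RUN= switch are hypothetical and not part of OSG or RSV. Only the four init script calls come from this page.

```shell
#!/bin/sh
# Sketch of the restart order used on this page.
# DRY_RUN is a hypothetical switch (not an OSG feature): when it is set
# to 1, each command is printed instead of being executed.

run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

restart_rsv() {
    # Stop RSV first, then the condor-cron instance it submits to.
    run /etc/init.d/osg-rsv stop || return 1
    run /etc/init.d/condor-cron stop || return 1
    # Start in the reverse order: condor-cron first, then RSV.
    run /etc/init.d/condor-cron start || return 1
    run /etc/init.d/osg-rsv start
}
```

Setting =DRY_RUN=1= before calling =restart_rsv= prints the four commands without touching the services, which is a cheap way to check the order before running it for real as root.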
Topic revision: r2 - 2008-10-21 - MarcoAndreFerreiraDias
Copyright © 2008-2025 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.