Analysis and Resolution of the CRS-2409 Alert Reported by CRS (Part 1)

2014-11-24 08:55:59
Environment: Oracle 11.2.0.3
1. Error message
The CRS alert log on node 1 contains the following abnormal entries:
2013-0X-XX 19:27:17.609
[ctssd(18809056)]CRS-2409:The clock on host XXXdb1 is not synchronous with the mean cluster time. No action has been taken as the Cluster Time Synchronization Service is running in observer mode.
2013-0X-XX 19:59:42.312
[ctssd(18809056)]CRS-2409:The clock on host XXXdb1 is not synchronous with the mean cluster time. No action has been taken as the Cluster Time Synchronization Service is running in observer mode.
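The message means that Oracle's CTSSD service has detected that the clock on this host is not synchronized with the mean cluster time, and that it took no corrective action because it is running in observer mode.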
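To see how often the alert recurs, the log can be scanned for the message code. The following is a minimal sketch, not part of the original investigation; the function names `count_crs2409` and `list_crs2409_times` are hypothetical, and it assumes the timestamp sits on the line immediately before each message, as in the excerpt above.

```shell
# Count CRS-2409 entries in a Clusterware alert log.
# $1 is the path to the alert log (supplied by the caller).
count_crs2409() {
    # grep -c prints the number of matching lines; it exits non-zero when
    # nothing matches, so "|| true" keeps the function's status at 0.
    grep -c 'CRS-2409' "$1" || true
}

# Print the timestamp line that precedes each CRS-2409 entry.
# Assumes the layout shown above: timestamp on one line, message on the next.
list_crs2409_times() {
    awk '/CRS-2409/ { print prev } { prev = $0 }' "$1"
}
```

Both functions take the path of the node's alert log as their only argument. Regular intervals between the printed timestamps would suggest a persistent offset rather than a one-off jump.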
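2. Analysis of the error
2.1. When the operating system's NTP daemon is running, CTSSD does not actively synchronize the clocks; as long as the CTSSD service is started, it should normally stay in observer mode.
2.2. The OS NTP daemon is currently running and CTSSD is in observer mode, yet CTSSD still reports that the local clock deviates from the mean cluster time.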
3. Troubleshooting steps
3.1. Check whether the time differs between the two nodes
XXXdb1:/u01/app/11.2.0.3/grid/log/XXXdb1$ssh XXXdb2 date
Mon Jul 15 20:30:17 GMT+08:00 2013

XXXdb1:/u01/app/11.2.0.3/grid/log/XXXdb1$date
Mon Jul 15 20:30:18 GMT+08:00 2013

The check shows no significant time difference between the two nodes.
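Comparing the two `date` outputs by eye works for a quick check. A numeric comparison could look like the sketch below; the helper name `clock_offset_seconds` is hypothetical, and it only computes the absolute difference between two epoch timestamps supplied by the caller.

```shell
# Print the absolute difference, in seconds, between two epoch timestamps.
# $1 and $2 are integer seconds since the epoch.
clock_offset_seconds() {
    diff=$(( $1 - $2 ))
    if [ "$diff" -lt 0 ]; then
        diff=$(( 0 - diff ))
    fi
    echo "$diff"
}
```

One possible invocation is `clock_offset_seconds "$(date +%s)" "$(ssh XXXdb2 date +%s)"`, assuming the local `date` accepts the `%s` format, which is not guaranteed on every platform. The ssh round trip adds latency of its own, so a difference of about one second, as in the output above, does not by itself indicate clock drift.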
3.2. Check the OS NTP daemon
XXXdb1:/# lssrc -ls xntpd
 Program name:    /usr/sbin/xntpd
 Version:         3
 Leap indicator:  00 (No leap second today.)
 Sys peer:        10.XXX.XXX.71
 Sys stratum:     2
 Sys precision:   -18
 Debug/Tracing:   DISABLED
 Root distance:   0.000397
 Root dispersion: 0.013458
 Reference ID:    10.XXX.XXX.71
 Reference time:  d58e6e41.d0fca000  Mon, Jul 15 2013 20:49:05.816
 Broadcast delay: 0.003906 (sec)
 Auth delay:      0.000122 (sec)
 System flags:    bclient auth pll monitor filegen 
 System uptime:   30149695 (sec)
 Clock stability: 0.047607 (sec)
 Clock frequency: 0.000000 (sec)
 Peer: 10.XXX.XXX.71
      flags: (configured)(sys peer)
      stratum:  1, version: 3
      our mode: client, his mode: server
Subsystem         Group            PID          Status
 xntpd            tcpip            4128900      active

On both nodes, the OS-level NTP daemon (xntpd) is running and is able to synchronize time.
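When the same check has to be repeated on several nodes, the status column of the `lssrc` subsystem table can be extracted automatically. This is a minimal sketch with a hypothetical helper name, `xntpd_status`; it assumes the row layout shown above, with the subsystem name in the first column and the status in the last.

```shell
# Read lssrc output on stdin and print the status of the xntpd subsystem.
# Prints "unknown" if no row for xntpd is found.
xntpd_status() {
    awk '$1 == "xntpd" { print $NF; found = 1 }
         END { if (!found) print "unknown" }'
}
```

For example, `lssrc -ls xntpd | xntpd_status` would print `active` for the output shown above.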
3.3. Check the state of ctssd
XXXdb1:/#su - grid
XXXdb1:/home/grid$ crsctl stat res ora.ctssd -init
NAME=ora.ctssd
TYPE=ora.ctss.type
TARGET=ONLINE
STATE=ONLINE on XXXdb1

On both nodes, the CTSSD service is started.
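The resource state can also be checked in a script by reading the `STATE=` line of the output. The sketch below uses hypothetical helper names, `ctssd_state` and `ctssd_is_online`, and assumes the `KEY=value` output format shown above.

```shell
# Read "crsctl stat res ora.ctssd -init" output on stdin and print the
# value of the STATE= line, for example "ONLINE on XXXdb1".
ctssd_state() {
    sed -n 's/^STATE=//p'
}

# Return 0 if the STATE value read from stdin starts with ONLINE, else 1.
ctssd_is_online() {
    case "$(ctssd_state)" in
        ONLINE*) return 0 ;;
        *)       return 1 ;;
    esac
}
```

For example, `crsctl stat res ora.ctssd -init | ctssd_is_online && echo "ctssd is online"`. Note that this only confirms the resource is online; it does not tell whether CTSS is in observer or active mode.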
3.4. Use the CRS cluvfy utility to diagnose the cause of the CTSS error
XXXdb1:/home/grid$cluvfy comp clocksync -n all -verbose
 Verifying Clock Synchronization across the cluster nodes
 Checking if Clusterware is installed on all nodes...
Check of Clusterware install passed
 Checking if CTSS Resource is running on all nodes...
Check: CTSS Resource running on all nodes
  Node Name                             Status                 
  ------------------------------------  ------------------------
  XXXdb2                                passed                 
  XXXdb1                                passed                 
Result: CTSS resource check passed
 Querying CTSS for time offset on all nodes...
Result: Query of CTSS for time offset passed
 Check CTSS state started...
Check: CTSS state
  Node Name                             State                  
  ------------------------------------  ------------------------
  XXXdb2                                Observer               
  XXXdb1                                Observer               
CTSS is in Observer state. Switching over to clock synchronization checks using NTP
 Starting Clock synchronization checks using Network Time Protocol(NTP)...
 NTP Configuration file check started...
The NTP configuration file "/etc/ntp.conf" is available on all nodes
NTP Configuration file check passed
……
Checking NTP daemon command line for slewing option "-x"
Check: NTP daemon command line
  Node Name                             Slewing Option Set     
  ------------------------------------  ------------------------
  XXXdb2                                no                     
  XXXdb1                                no                     
Result:
NTP daemon slewing option check failed on some nodes
PRVF-5436 : The NTP daemon running on one or more n