pps...
2015-08-26 15:36:57: DHCP_flag=0
2015-08-26 15:36:57: nodes_to_start=linuxidc1
2015-08-26 15:37:18: exit value of start nodeapps/vip is 1
2015-08-26 15:37:18: output for start nodeapps is PRCR-1013 : Failed to start resource ora.ons PRCR-1064 : Failed to start resource ora.ons on node linuxidc1 CRS-5016: Process "/u01/app/product/11.2.0/crs/opmn/bin/onsctli" spawned by agent "/u01/app/product/11.2.0/crs/bin/oraagent.bin" for action "start" failed: details at "(:CLSN00010:)" in "/u01/app/product/11.2.0/crs/log/linuxidc1/agent/crsd/oraagent_grid/oraagent_grid.log" CRS-2674: Start of 'ora.ons' on 'linuxidc1' failed
2015-08-26 15:37:18: output of startnodeapp after removing already started mesgs is PRCR-1013 : Failed to start resource ora.ons PRCR-1064 : Failed to start resource ora.ons on node linuxidc1 CRS-5016: Process "/u01/app/product/11.2.0/crs/opmn/bin/onsctli" spawned by agent "/u01/app/product/11.2.0/crs/bin/oraagent.bin" for action "start" failed: details at "(:CLSN00010:)" in "/u01/app/product/11.2.0/crs/log/linuxidc1/agent/crsd/oraagent_grid/oraagent_grid.log" CRS-2674: Start of 'ora.ons' on 'linuxidc1' failed
2015-08-26 15:37:18: /u01/app/product/11.2.0/crs/bin/srvctl start nodeapps -n linuxidc1 ... failed
Check the $GRID_HOME/opmn/logs/ons.log.* files for any of the following errors:
1.
[grid@linuxidc1 oraagent_grid]$ cd $ORACLE_HOME/opmn/logs/
[grid@linuxidc1 logs]$ ls -lrt
total 8
-rw-r--r-- 1 grid oinstall 576 Aug 26 15:48 ons.log.linuxidc1
-rw-r--r-- 1 grid oinstall 267 Aug 26 15:48 ons.out
[grid@linuxidc1 logs]$ cat ons.log.linuxidc1
[2015-08-26T15:37:02+08:00] [internal] getaddrinfo(::0, 6200, 1) failed (Hostname and service name not provided or found): Connection timed out
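If the error above is present, the cause is that localhost does not map to the IP address 127.0.0.1 in /etc/hosts. The fix is to make sure that both DNS and /etc/hosts resolve localhost correctly. Whether DNS or /etc/hosts is consulted depends on the name resolution settings in /etc/nsswitch.conf or /etc/netsvc.conf (depending on the platform). For details, see MOS notes ID 942166.1 or ID 969254.1.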
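The localhost check described above can be sketched in shell. This is a minimal sketch, not an Oracle-provided tool: the function names `localhost_ips` and `check_localhost` are hypothetical, and it only inspects a hosts-format file, so it does not cover DNS.

```shell
# Hypothetical helper: print every IPv4 address that a hosts-format file
# maps to the name "localhost". Defaults to /etc/hosts.
localhost_ips() {
    hosts_file="${1:-/etc/hosts}"
    awk '
        { sub(/#.*/, "") }                 # drop comments
        $1 ~ /^[0-9]+\./ {                 # IPv4 entries only
            for (i = 2; i <= NF; i++)
                if ($i == "localhost") print $1
        }
    ' "$hosts_file"
}

# Hypothetical helper: succeed only if localhost maps to exactly 127.0.0.1.
check_localhost() {
    ips=$(localhost_ips "$1")
    if [ "$ips" = "127.0.0.1" ]; then
        echo "OK: localhost -> 127.0.0.1"
    else
        echo "WARN: localhost -> ${ips:-<no IPv4 entry>}"
        return 1
    fi
}
```

Running `check_localhost` with no argument inspects /etc/hosts. On Linux, `getent hosts localhost` additionally shows what the resolver returns after applying the order configured in /etc/nsswitch.conf.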
2.
[grid@linuxidc1 oraagent_grid]$ cd $ORACLE_HOME/opmn/logs/
[grid@linuxidc1 logs]$ ls -lrt
total 8
-rw-r--r-- 1 grid oinstall 576 Aug 26 15:48 ons.log.linuxidc1
-rw-r--r-- 1 grid oinstall 267 Aug 26 15:48 ons.out
[grid@linuxidc1 logs]$ cat ons.log.linuxidc1
[2015-08-26T15:37:02+08:00] [ons] [NOTIFICATION:1] [104] [ons-internal] ONS server initiated
[2015-08-26T15:37:02+08:00] [ons] [ERROR:1] [17] [ons-listener] any: BIND (Address already in use)
[2015-08-26T15:39:42+08:00] [ons] [NOTIFICATION:1] [104] [ons-internal] ONS server initiated
[2015-08-26T15:39:42+08:00] [ons] [ERROR:1] [17] [ons-listener] any: BIND (Address already in use)
[2015-08-26T15:48:40+08:00] [ons] [NOTIFICATION:1] [104] [ons-internal] ONS server initiated
[2015-08-26T15:48:40+08:00] [ons] [ERROR:1] [17] [ons-listener] any: BIND (Address already in use)
The cause is that another process is already using the ONS service port.
[grid@linuxidc1 logs]$ grep port $ORACLE_HOME/opmn/conf/ons.config
localport=6100          # line added by Agent
remoteport=6200         # line added by Agent
[root@linuxidc1 /]# lsof | grep 6200 | grep LISTEN
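ons       16413      grid    6u    IPv6    162533                  TCP *:6200 (LISTEN)
The output shows that the ons process with PID 16413 is holding port 6200. The fix is to make sure that no other process is using this port. If the port is already occupied before the rootupgrade.sh script is run for an upgrade, the likely cause is that the ons process from the old version is still running.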
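The two manual steps above (reading the ports from ons.config, then looking for a listener on each) can be combined. This is a rough sketch under the assumption that ons.config uses the `localport=` / `remoteport=` layout shown earlier; the function names `ons_ports` and `show_listeners` are hypothetical.

```shell
# Hypothetical helper: print the port numbers configured as localport= or
# remoteport= in an ons.config-style file, one per line.
ons_ports() {
    awk -F= '/^[ \t]*(localport|remoteport)=/ {
        sub(/[^0-9].*/, "", $2)    # strip trailing spaces and "# ..." comments
        print $2
    }' "$1"
}

# Hypothetical helper: for each configured port, list the process listening on it.
# Run as root so that processes owned by other users are visible to lsof.
show_listeners() {
    for port in $(ons_ports "$1"); do
        echo "== port $port =="
        lsof -nP -iTCP:"$port" -sTCP:LISTEN || echo "no listener found"
    done
}
```

For example, `show_listeners $ORACLE_HOME/opmn/conf/ons.config` prints one section per configured port. Matching on `-iTCP:<port>` avoids the false hits that `lsof | grep 6200` can produce when 6200 appears in a PID or device number.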
3.
[grid@linuxidc1 oraagent_grid]$ cd $ORACLE_HOME/opmn/logs/
[grid@linuxidc1 logs]$ ls -lrt
total 8
-rw-r--r-- 1 grid oinstall 576 Aug 26 15:48 ons.log.linuxidc