Invalid parameters, or failed to bring up VIP (host=node2)
2014-11-03 17:29:29.895: [ RACG][3067782848] [5942][3067782848][ora.node2.vip]: clsrcexecut: env ORACLE_CONFIG_HOME=/u01/app/oracle/product/crs
2014-11-03 17:29:29.895: [ RACG][3067782848] [5942][3067782848][ora.node2.vip]: clsrcexecut: cmd = /u01/app/oracle/product/crs/bin/racgeut -e _USR_ORA_DEBUG=0 54 /u01/app/oracle/product/crs/bin/racgvip check node2
2014-11-03 17:29:29.895: [ RACG][3067782848] [5942][3067782848][ora.node2.vip]: clsrcexecut: rc = 1, time = 6.370s
2014-11-03 17:29:29.895: [ RACG][3067782848] [5942][3067782848][ora.node2.vip]: end for resource = ora.node2.vip, action = check, status = 1, time = 6.390s
2014-11-03 17:38:01.381: [ RACG][3067832000] [18181][3067832000][ora.node2.vip]: Interface eth0 checked failed (host=node2)
Invalid parameters, or failed to bring up VIP (host=node2)
2014-11-03 17:38:01.381: [ RACG][3067832000] [18181][3067832000][ora.node2.vip]: clsrcexecut: env ORACLE_CONFIG_HOME=/u01/app/oracle/product/crs
2014-11-03 17:38:01.381: [ RACG][3067832000] [18181][3067832000][ora.node2.vip]: clsrcexecut: cmd = /u01/app/oracle/product/crs/bin/racgeut -e _USR_ORA_DEBUG=0 54 /u01/app/oracle/product/crs/bin/racgvip check node2
2014-11-03 17:38:01.381: [ RACG][3067832000] [18181][3067832000][ora.node2.vip]: clsrcexecut: rc = 1, time = 6.210s
2014-11-03 17:38:01.381: [ RACG][3067832000] [18181][3067832000][ora.node2.vip]: end for resource = ora.node2.vip, action = check, status = 1, time = 6.260s
发现eth0 ping的网关不正常。
将eth1上的网关设置删除后,还是出现相同的报错,怀疑是vip的默认网关配置出了问题。
将所有的服务都停止。
对节点2的vip进行debug,最后的5是_USR_ORA_DEBUG flag ,表示级别。
[root@rac10g2 ~]# crsctl debug log res "ora.node2.vip:5"
Set Resource Debug Module: ora.node2.vip Level: 5
启动ora.node2.vip。
[root@rac10g2 ~]# crs_start ora.node2.vip
查看trace日志。trace日志在$ORA_CRS_HOME/log/
果然发现默认网关设置不正常,设置为192.168.94.1,并不是预计的192.168.1.1。
2014-11-03 15:53:15.830: [ RACG][3086924000] [2200][3086924000][ora.rac10g2.vip]: Fri Nov 03 15:53:09 CST 2014 [ 2204 ] /sbin/mii-tool eth0 error
Fri Nov 18 15:53:09 CST 2014 [ 2204 ] defaultgw: started
Fri Nov 03 15:53:09 CST 2014 [ 2204 ] defaultgw: completed with 192.168.94.1
2014-11-03 15:53:15.830: [ RACG][3086924000] [2200][3086924000][ora.node2.vip]: ping to 192.168.94.1 via eth0 failed, rc = 1 (host=node2)
ping to 192.168.94.1 via eth0 failed, rc = 1 (host=rac10g2)
Fri Nov 03 15:53:15 CST 2014 [ 2204 ] checkIf: ping and RX packets checked if=eth0 failed
Interface eth0 checked failed (host=node2)
2014-11-03 15:53:15.830: [ RACG][3086924000] [2200][3086924000][ora.node2.vip]: Fri Dec 18 15:53:15 CST 2014 [ 2204 ] checkIf: end for if=eth0
Invalid parameters, or failed to bring up VIP (host=node2)
修改默认网关:
修改文件$ORA_CRS_HOME/bin/racgvip,将以下内容修改为想要指派的新的网关。
DEFAULTGW原本是没有值的,现在修改为192.168.1.1。
# hard code default gateway here if needed
DEFAULTGW=192.168.1.1
修改之后,在trace看到该部分已经检测ok。
2014-11-03 16:11:40.984: [ RACG][3086924000] [7808][3086924000][ora.rac10g2.vip]: Fri Nov 03 16:11:40 CST 2014 [ 7812 ] /sbin/mii-tool eth0 error
Fri Nov 03 16:11:40 CST 2014 [ 7812 ] checkIf: ping checked if=eth0 ok
Fri Nov 03 16:11:40 CST 2014 [ 7812 ] checkIf: end for if=eth0
该修改需要在节点1和节点2上同时修改,观察很长一段时间之后,没有再发现之前的问题。
其实该问题的出现,主要的原因是在eth1上配置了默认网关,用于私有网络的网卡不建议配置网关。所以该问题中