CRS appears healthy
EVM appears healthy
检查OCR磁盘状态,没有问题
[oracle@rac2 ~]$ ocrcheck
Status of Oracle Cluster Registry is as follows :
Version : 2
Total space (kbytes) : 104344
Used space (kbytes) : 4344
Available space (kbytes) : 100000
ID : 1752469369
Device/File Name : /dev/raw/raw1
Device/File integrity check succeeded
Device/File not configured
Cluster registry integrity check succeeded
检查vote disk状态
[oracle@rac2 ~]$ crsctl query css votedisk
0. 0 /dev/raw/raw2 显示2号裸设备为表决磁盘
located 1 votedisk(s). 只定位1个表决磁盘
2.手工禁用一个私有网卡
[oracle@rac2 ~]$ cat /etc/hosts
127.0.0.1 localhost.localdomain localhost
::1 localhost6.localdomain6 localhost6
##Public Network - (eth0)
##Private Interconnect - (eth1)
##Public Virtual IP (VIP) addresses - (eth0)
192.168.1.101 rac1 这是RAC的共有网卡
192.168.1.102 rac2
192.168.2.101 rac1-priv 这是RAC的私有网卡
192.168.2.102 rac2-priv
192.168.1.201 rac1-vip 这是RAC虚拟网卡
192.168.1.202 rac2-vip
看一下IP地址和网卡的对应关系
[oracle@rac2 ~]$ ifconfig
eth0 Link encap:Ethernet HWaddr 00:0C:29:8F:F1:87
inet addr:192.168.1.102 Bcast:192.168.1.255 Mask:255.255.255.0
inet6 addr: fe80::20c:29ff:fe8f:f187/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:360 errors:0 dropped:0 overruns:0 frame:0
TX packets:593 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:46046 (44.9 KiB) TX bytes:62812 (61.3 KiB)
Interrupt:185 Base address:0x14a4
eth0:1 Link encap:Ethernet HWaddr 00:0C:29:8F:F1:87
inet addr:192.168.1.202 Bcast:192.168.1.255 Mask:255.255.255.0
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
Interrupt:185 Base address:0x14a4
eth1 Link encap:Ethernet HWaddr 00:0C:29:8F:F1:91
inet addr:192.168.2.102 Bcast:192.168.2.255 Mask:255.255.255.0
inet6 addr: fe80::20c:29ff:fe8f:f191/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:76588 errors:0 dropped:0 overruns:0 frame:0
TX packets:58002 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:65185420 (62.1 MiB) TX bytes:37988820 (36.2 MiB)
Interrupt:193 Base address:0x1824
eth2 Link encap:Ethernet HWaddr 00:0C:29:8F:F1:9B
inet addr:192.168.203.129 Bcast:192.168.203.255 Mask:255.255.255.0
inet6 addr: fe80::20c:29ff:fe8f:f19b/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:339 errors:0 dropped:0 overruns:0 frame:0
TX packets:83 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:42206 (41.2 KiB) TX bytes:10199 (9.9 KiB)
Interrupt:169 Base address:0x18a4
lo Link encap:Local Loopback
inet addr:127.0.0.1 Mask:255.0.0.0
inet6 addr: ::1/128 Scope:Host
UP LOOPBACK RUNNING MTU:16436 Metric:1
RX packets:99403 errors:0 dropped:0 overruns:0 frame:0
TX packets:99403 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:18134658 (17.2 MiB) TX bytes:18134658 (17.2 MiB)
eth 0 对应RAC的共有网卡
eth 1 对应RAC的私有网卡
eth0:1对应RAC的虚拟网卡
我们现在禁止eth1私有网卡来完成内联网网络不通现象,方法很简单
ifdown eth1 禁用网卡
ifup eth1 激活网卡
[oracle@rac2 ~]$ su – root 需要使用root用户哦,否则提示Users cannot control this device.
Password:
[root@rac2 ~]# ifdown eth1
我从17:18:51敲入这个命令,4分钟之后节点2重启,大家知道发生了什么现象嘛?
Good 这就是传说中RAC脑裂brain split问题,当节点间的内联网不通时,无法信息共享,就会出现脑裂现象,RAC必须驱逐其中一部分节点来保护数据的一致性,被驱逐的节点被强制重启,这不节点2自动重启了么。又说回来,那为什么节点2重启,其他节点不重启呢。
这里有个驱逐原则:(1)子集群中少节点的被驱逐
(2)节点号大的被驱逐
(3)负载高的节点被驱逐
我们中