Oracle RAC Health-Check Walkthrough (Part 2)

2014-11-24 16:22:20
CSS appears healthy

CRS appears healthy

EVM appears healthy

Check the OCR disk status. No problems found.

[oracle@rac2 ~]$ ocrcheck

Status of Oracle Cluster Registry is as follows :

Version : 2

Total space (kbytes) : 104344

Used space (kbytes) : 4344

Available space (kbytes) : 100000

ID : 1752469369

Device/File Name : /dev/raw/raw1

Device/File integrity check succeeded

Device/File not configured

Cluster registry integrity check succeeded
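The figures in the ocrcheck output above can also be checked by a script during a routine inspection. The following is a minimal sketch, assuming the output has been captured as text in the 10g-style format shown here; the function name `parse_ocrcheck` and the `SAMPLE` string are illustrative and not part of any Oracle tool.

```python
import re

# Captured ocrcheck output, copied from the session above (assumed format).
SAMPLE = """\
Status of Oracle Cluster Registry is as follows :
Version : 2
Total space (kbytes) : 104344
Used space (kbytes) : 4344
Available space (kbytes) : 100000
ID : 1752469369
Device/File Name : /dev/raw/raw1
Device/File integrity check succeeded
Device/File not configured
Cluster registry integrity check succeeded
"""

def parse_ocrcheck(text):
    """Extract the space figures (in KB) and the overall integrity result."""
    fields = {}
    for key in ("Total", "Used", "Available"):
        m = re.search(rf"{key} space \(kbytes\)\s*:\s*(\d+)", text)
        if m:
            fields[key.lower()] = int(m.group(1))
    # The last line of a healthy report states that the registry check succeeded.
    fields["integrity_ok"] = "Cluster registry integrity check succeeded" in text
    return fields

info = parse_ocrcheck(SAMPLE)
used_pct = 100.0 * info["used"] / info["total"]
print(info)
print(f"OCR space used: {used_pct:.1f}%")
```

In this session the used and available figures add up to the total, and the "Device/File not configured" line indicates that no second OCR location is set up.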

Check the voting disk status

[oracle@rac2 ~]$ crsctl query css votedisk

0. 0 /dev/raw/raw2 (raw device 2 is the voting disk)

located 1 votedisk(s). (only one voting disk was located)
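A check like this is easy to automate. The sketch below assumes the raw command output (without the annotations added above) looks like the two lines shown in this session; the helper name `voting_disks` is hypothetical. It also flags the single-disk layout, since one voting disk leaves no redundancy.

```python
import re

# Raw output of `crsctl query css votedisk`, as seen in this session (assumed format).
SAMPLE = """\
 0.     0    /dev/raw/raw2
located 1 votedisk(s).
"""

def voting_disks(text):
    """Return the device paths listed in the voting disk report."""
    # Each disk line starts with an index such as "0.", then a numeric column,
    # then the device path.
    return re.findall(r"^\s*\d+\.\s+\d+\s+(\S+)", text, flags=re.MULTILINE)

disks = voting_disks(SAMPLE)
print(disks)
if len(disks) < 3:
    print("warning: fewer than three voting disks, so there is no redundancy")
```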

2. Manually disable a private network interface

[oracle@rac2 ~]$ cat /etc/hosts

127.0.0.1 localhost.localdomain localhost

::1 localhost6.localdomain6 localhost6

##Public Network - (eth0)

##Private Interconnect - (eth1)

##Public Virtual IP (VIP) addresses - (eth0)

192.168.1.101 rac1 (RAC public network address)

192.168.1.102 rac2

192.168.2.101 rac1-priv (RAC private interconnect address)

192.168.2.102 rac2-priv

192.168.1.201 rac1-vip (RAC virtual IP address)

192.168.1.202 rac2-vip
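The naming convention in this hosts file (plain name for public, -priv for the interconnect, -vip for the virtual IP) makes the roles easy to pick out with a script. A minimal sketch, assuming that suffix convention holds; the `classify` function is illustrative only, and other sites may name their hosts differently.

```python
# Host entries copied from the /etc/hosts listing above, without annotations.
HOSTS = """\
127.0.0.1 localhost.localdomain localhost
192.168.1.101 rac1
192.168.1.102 rac2
192.168.2.101 rac1-priv
192.168.2.102 rac2-priv
192.168.1.201 rac1-vip
192.168.1.202 rac2-vip
"""

def classify(hosts_text):
    """Map each RAC host name to (ip, role) based on its name suffix."""
    roles = {}
    for line in hosts_text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        ip, *names = line.split()
        if not names or names[0].startswith("localhost"):
            continue
        name = names[0]
        if name.endswith("-priv"):
            role = "private"
        elif name.endswith("-vip"):
            role = "vip"
        else:
            role = "public"
        roles[name] = (ip, role)
    return roles

roles = classify(HOSTS)
for name, (ip, role) in sorted(roles.items()):
    print(f"{name:10s} {ip:15s} {role}")
```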

Check how the IP addresses map to the network interfaces

[oracle@rac2 ~]$ ifconfig

eth0 Link encap:Ethernet HWaddr 00:0C:29:8F:F1:87

inet addr:192.168.1.102 Bcast:192.168.1.255 Mask:255.255.255.0

inet6 addr: fe80::20c:29ff:fe8f:f187/64 Scope:Link

UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1

RX packets:360 errors:0 dropped:0 overruns:0 frame:0

TX packets:593 errors:0 dropped:0 overruns:0 carrier:0

collisions:0 txqueuelen:1000

RX bytes:46046 (44.9 KiB) TX bytes:62812 (61.3 KiB)

Interrupt:185 Base address:0x14a4

eth0:1 Link encap:Ethernet HWaddr 00:0C:29:8F:F1:87

inet addr:192.168.1.202 Bcast:192.168.1.255 Mask:255.255.255.0

UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1

Interrupt:185 Base address:0x14a4

eth1 Link encap:Ethernet HWaddr 00:0C:29:8F:F1:91

inet addr:192.168.2.102 Bcast:192.168.2.255 Mask:255.255.255.0

inet6 addr: fe80::20c:29ff:fe8f:f191/64 Scope:Link

UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1

RX packets:76588 errors:0 dropped:0 overruns:0 frame:0

TX packets:58002 errors:0 dropped:0 overruns:0 carrier:0

collisions:0 txqueuelen:1000

RX bytes:65185420 (62.1 MiB) TX bytes:37988820 (36.2 MiB)

Interrupt:193 Base address:0x1824

eth2 Link encap:Ethernet HWaddr 00:0C:29:8F:F1:9B

inet addr:192.168.203.129 Bcast:192.168.203.255 Mask:255.255.255.0

inet6 addr: fe80::20c:29ff:fe8f:f19b/64 Scope:Link

UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1

RX packets:339 errors:0 dropped:0 overruns:0 frame:0

TX packets:83 errors:0 dropped:0 overruns:0 carrier:0

collisions:0 txqueuelen:1000

RX bytes:42206 (41.2 KiB) TX bytes:10199 (9.9 KiB)

Interrupt:169 Base address:0x18a4

lo Link encap:Local Loopback

inet addr:127.0.0.1 Mask:255.0.0.0

inet6 addr: ::1/128 Scope:Host

UP LOOPBACK RUNNING MTU:16436 Metric:1

RX packets:99403 errors:0 dropped:0 overruns:0 frame:0

TX packets:99403 errors:0 dropped:0 overruns:0 carrier:0

collisions:0 txqueuelen:0

RX bytes:18134658 (17.2 MiB) TX bytes:18134658 (17.2 MiB)

eth0 is the RAC public interface

eth1 is the RAC private interface

eth0:1 carries the RAC virtual IP (VIP)
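The same interface-to-address mapping can be extracted from the ifconfig output with a few lines of code. This is a sketch under the assumption that the output uses the older net-tools layout shown above ("inet addr:" lines indented under each interface name); `interface_addresses` is a hypothetical helper, and the sample is an abbreviated copy of the listing.

```python
import re

# Abbreviated ifconfig output from node rac2 (old net-tools format, assumed).
IFCONFIG = """\
eth0      Link encap:Ethernet  HWaddr 00:0C:29:8F:F1:87
          inet addr:192.168.1.102  Bcast:192.168.1.255  Mask:255.255.255.0
eth0:1    Link encap:Ethernet  HWaddr 00:0C:29:8F:F1:87
          inet addr:192.168.1.202  Bcast:192.168.1.255  Mask:255.255.255.0
eth1      Link encap:Ethernet  HWaddr 00:0C:29:8F:F1:91
          inet addr:192.168.2.102  Bcast:192.168.2.255  Mask:255.255.255.0
"""

def interface_addresses(text):
    """Return {interface name: IPv4 address} from ifconfig-style output."""
    addrs = {}
    current = None
    for line in text.splitlines():
        # A line that starts in column 0 begins a new interface stanza.
        if line and not line[0].isspace():
            current = line.split()[0]
        m = re.search(r"inet addr:(\S+)", line)
        if m and current:
            addrs[current] = m.group(1)
    return addrs

addrs = interface_addresses(IFCONFIG)
print(addrs)
```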

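We now disable the private interface eth1 to simulate an interconnect failure. The method is simple:

ifdown eth1 (disable the interface)

ifup eth1 (enable the interface)

[oracle@rac2 ~]$ su - root (root is required; otherwise you get "Users cannot control this device.")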

Password:

[root@rac2 ~]# ifdown eth1

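I entered this command at 17:18:51, and four minutes later node 2 rebooted. Do you know what happened?

This is the well-known RAC split-brain problem. When the interconnect between the nodes goes down, the nodes can no longer exchange information, and a split brain occurs. RAC must evict some of the nodes to protect data consistency, and an evicted node is forcibly rebooted, which is why node 2 rebooted on its own. That raises the question: why did node 2 reboot and not the other node?

There are eviction rules: (1) the sub-cluster with fewer nodes is evicted

(2) the node with the higher node number is evicted

(3) the node with the higher load is evicted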
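To make the first two rules concrete, here is a small sketch that applies them as ordered tie-breakers. The ordering is an assumption, as is the function name `choose_survivors`. Rule (3) is left out because the text does not say how load is measured or how it ranks against rule (2). This illustrates the rules as listed; it is not Oracle's actual eviction code.

```python
def choose_survivors(subclusters):
    """Pick the surviving sub-cluster after an interconnect split.

    subclusters: list of lists of node numbers, one list per sub-cluster.
    Assumed ordering: the larger sub-cluster wins (rule 1); on a tie,
    the sub-cluster holding the lowest node number wins (rule 2).
    """
    def rank(group):
        # Smaller tuples rank first: more nodes, then lower node number.
        return (-len(group), min(group))
    return min(subclusters, key=rank)

# Two-node cluster from this article: each node ends up alone after the split.
split = [[1], [2]]
survivors = choose_survivors(split)
evicted = [n for group in split if group is not survivors for n in group]
print("survivors:", survivors, "evicted:", evicted)
```

With two single-node sub-clusters, rule (1) is a tie and rule (2) decides, which matches node 2 being the one that rebooted.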

我们中