Original blog post: How do you rerun root.sh on 10gR2 RAC?
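A few days ago I ran into an LMD process IPC SEND TIMEOUT problem on a customer's 10.2.0.5 RAC. I planned to dig deeper into Oracle RAC's
LMON, LMD, and LMS processes, only to find that my own VM RAC would no longer start. On closer inspection, the partitions on one of the nodes had disappeared.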
++++Node2
[root@rac2 raw]# ls -ltr /dev/sdf*
brw-r----- 1 root disk 8, 84 Dec 4 2013 /dev/sdf4
brw-r----- 1 root disk 8, 83 Dec 4 2013 /dev/sdf3
brw-r----- 1 root disk 8, 82 Dec 4 2013 /dev/sdf2
brw-r----- 1 root disk 8, 81 Dec 4 2013 /dev/sdf1
brwxrwxr-x 1 oracle oinstall 8, 80 Dec 4 2013 /dev/sdf

[root@rac2 bin]# cat /etc/rc.d/rc.local
#!/bin/sh
#
# This script will be executed *after* all the other init scripts.
# You can put your own initialization stuff in here if you don't
# want to do the full Sys V style init stuff.

touch /var/lock/subsys/local
chown -R oracle:oinstall /dev/sdf
chown -R oracle:oinstall /dev/sde
chown -R oracle:oinstall /dev/sdb
chown -R oracle:oinstall /dev/sdd
chown -R oracle:oinstall /dev/sdc
chmod -R 775 /dev/sdf
chmod -R 775 /dev/sde
chmod -R 775 /dev/sdb
chmod -R 775 /dev/sdd
chmod -R 775 /dev/sdc
raw /dev/raw/raw1 /dev/sdf1
raw /dev/raw/raw2 /dev/sdf2
raw /dev/raw/raw3 /dev/sdf3
raw /dev/raw/raw4 /dev/sdf4
chown -R oracle:dba /dev/raw
++++Node1
[root@rac1 bin]# partprobe
[root@rac1 bin]# ls -ltr /dev/sdf*
brwxrwxr-x 1 oracle oinstall 8, 80 Jun 29 01:37 /dev/sdf
[root@rac1 bin]#
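Here I had partitioned one of the shared disks and bound the partitions to raw devices. First the partitions went missing on one node; then, after I
rebooted node 2, they were no longer visible on either node. I was speechless.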
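The rc.local shown earlier runs its `raw` bindings unconditionally, so a missing partition only shows up later when the cluster fails to start. As a minimal sketch (not what the original setup did), the binding step could be guarded by a check that every expected partition node exists. The function names `missing_partitions`, `raw_binding_commands`, and `guarded_raw_bindings` are hypothetical, and the sketch only prints the `raw` commands rather than running them.

```shell
#!/bin/sh
# Hypothetical helpers; names and structure are illustrative assumptions.

# Print every expected partition node that is NOT a block device.
# $1 = whole-disk path (e.g. /dev/sdf), $2 = number of partitions expected.
missing_partitions() {
    disk="$1"; count="$2"
    i=1
    while [ "$i" -le "$count" ]; do
        [ -b "${disk}${i}" ] || echo "${disk}${i}"
        i=$((i + 1))
    done
}

# Print (do not execute) one raw binding per partition,
# in the same form as the lines in rc.local.
raw_binding_commands() {
    disk="$1"; count="$2"
    i=1
    while [ "$i" -le "$count" ]; do
        echo "raw /dev/raw/raw${i} ${disk}${i}"
        i=$((i + 1))
    done
}

# Emit the bindings only when all partitions are present;
# otherwise report what is missing on stderr and return 1.
guarded_raw_bindings() {
    disk="$1"; count="$2"
    missing=$(missing_partitions "$disk" "$count")
    if [ -n "$missing" ]; then
        echo "missing partitions: $(echo $missing)" >&2
        return 1
    fi
    raw_binding_commands "$disk" "$count"
}

# Example (as root, on a RAC node):
#   guarded_raw_bindings /dev/sdf 4
```

Printing the commands instead of executing them keeps the check safe to run on a live node; piping the output to `sh` would be a separate, deliberate step.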
I recalled that Linux has some ways to recover partitions, so I tried gpart. Here is my attempt:
[root@rac1 repodata]# gpart /dev/sdf

Begin scan...
End scan.

Checking partitions...
Ok.

Guessed primary partition table:
Primary partition(1)
   type: 000(0x00)(unused)
   size: 0mb #s(0) s(0-0)
   chs: (0/0/0)-(0/0/0)d (0/0/0)-(0/0/0)r

Primary partition(2)
   type: 000(0x00)(unused)
   size: 0mb #s(0) s(0-0)
   chs: (0/0/0)-(0/0/0)d (0/0/0)-(0/0/0)r

Primary partition(3)
   type: 000(0x00)(unused)
   size: 0mb #s(0) s(0-0)
   chs: (0/0/0)-(0/0/0)d (0/0/0)-(0/0/0)r

Primary partition(4)
   type: 000(0x00)(unused)
   size: 0mb #s(0) s(0-0)
   chs: (0/0/0)-(0/0/0)d (0/0/0)-(0/0/0)r

[root@rac1 repodata]# gpart -W /dev/sdf /dev/sdf

Begin scan...
End scan.

Checking partitions...
Ok.

Guessed primary partition table:
Primary partition(1)
   type: 000(0x00)(unused)
   size: 0mb #s(0) s(0-0)
   chs: (0/0/0)-(0/0/0)d (0/0/0)-(0/0/0)r

Primary partition(2)
   type: 000(0x00)(unused)
   size: 0mb #s(0) s(0-0)
   chs: (0/0/0)-(0/0/0)d (0/0/0)-(0/0/0)r

Primary partition(3)
   type: 000(0x00)(unused)
   size: 0mb #s(0) s(0-0)
   chs: (0/0/0)-(0/0/0)d (0/0/0)-(0/0/0)r

Primary partition(4)
   type: 000(0x00)(unused)
   size: 0mb #s(0) s(0-0)
   chs: (0/0/0)-(0/0/0)d (0/0/0)-(0/0/0)r

Edit this table (y,n) : y
Edit which partition (1..4, q to quit) : q
Activate which partition (1..4, q to quit) : 1
Write this partition table (y,n) : y

* Warning: partition table written, you should reboot now.

[root@rac1 repodata]# ls -ltr /dev/sdf*
brwxrwxr-x 1 oracle oinstall 8, 80 Jun 29 03:57 /dev/sdf
As you can see, gpart's scan does list four partitions, but every one of them is empty. I don't know why; it is very strange.
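One way to cross-check what gpart reports is to read the partition type bytes straight from the MBR. In the classic MBR layout the partition table starts at byte 446 of sector 0 and holds four 16-byte entries, with the type byte at offset 4 of each entry; a type of 0x00 marks an unused entry, which is what gpart prints above. The following is only a sketch of such a check, not something run in the original troubleshooting, and the function name `mbr_partition_types` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: print the type byte of each of the four MBR entries.
# $1 = disk device or disk image (e.g. /dev/sdf; reading a device needs root).
mbr_partition_types() {
    dev="$1"
    n=0
    while [ "$n" -lt 4 ]; do
        # Entry n starts at 446 + 16*n; its type byte sits 4 bytes in.
        off=$((446 + 16 * n + 4))
        t=$(dd if="$dev" bs=1 skip="$off" count=1 2>/dev/null \
            | od -An -v -tx1 | tr -d ' \n')
        # "??" is printed when the byte could not be read.
        echo "partition $((n + 1)): type 0x${t:-??}"
        n=$((n + 1))
    done
}

# Example (as root):
#   mbr_partition_types /dev/sdf
```

Because it only reads single bytes with `dd`, the check does not modify the disk, unlike `gpart -W`, which writes the guessed table back.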
[root@rac1 ~]# fdisk -l

Disk /dev/sda: 21.4 GB, 21474836480 bytes
255 heads, 63 sectors/track, 2610 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

   Device Boot      Start         End      Blocks   Id  System
/dev/sda1   *           1           6       48163+  8