下面开始在CentOS 6.5上安装并配置drbd,环境的话继续使用之前安装完heartbeat的两台主机,同时也是为后面实现heartbeat存储和数据库高可用做准备,所以如果需要单独操作,请按照之前安装heartbeat的准备工作进行配置,然后根据下面的步骤进行也可以。

1、环境准备

这里我以其中一台为例,介绍准备工作包括哪些点:

#检查防火墙是否关闭(或者开启7788端口)

[root@heartbeat01 ~]# iptables -L -n

Chain INPUT (policy ACCEPT)

target     prot opt source               destination         

Chain FORWARD (policy ACCEPT)

target     prot opt source               destination         

Chain OUTPUT (policy ACCEPT)

target     prot opt source               destination  

#检查SELinux是否禁用       

[root@heartbeat01 ~]# getenforce

Disabled

#检查是否已经添加了时间同步定时任务

[root@heartbeat01 ~]# crontab -l

0 * * * * /usr/sbin/ntpdate   210.72.145.44 64.147.116.229 time.nist.gov

#检查hosts文件中是否有两个节点的记录

[root@heartbeat01 ~]# cat /etc/hosts

127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4

::1         localhost localhost.localdomain localhost6 localhost6.localdomain6

192.168.49.133  heartbeat01.contoso.com   heartbeat01

192.168.49.134  heartbeat02.contoso.com   heartbeat02

#检查是否更换国内yum源

[root@heartbeat01 ~]# ll /etc/yum.repos.d/

total 32

-rw-r--r--  1 root root 2006 Sep 18  2014 CentOS-Base.repo

-rw-r--r--. 1 root root 1926 Nov 27  2013 CentOS-Base.repo.bak_2016-07-28

-rw-r--r--. 1 root root  638 Nov 27  2013 CentOS-Debuginfo.repo.bak_2016-07-28

-rw-r--r--. 1 root root  630 Nov 27  2013 CentOS-Media.repo.bak_2016-07-28

-rw-r--r--. 1 root root 3664 Nov 27  2013 CentOS-Vault.repo.bak_2016-07-28

-rw-r--r--. 1 root root  120 Jul 25 18:24 cobbler-config.repo.bak_2016-07-28

另外,跟heartbeat不同的是,drbd需要两块硬盘,所以这里我们还需要添加一块硬盘,我是在虚拟机操作的,直接添加一块2G的硬盘。

[root@heartbeat01 ~]# fdisk -l

Disk /dev/sda: 21.5 GB, 21474836480 bytes

255 heads, 63 sectors/track, 2610 cylinders

Units = cylinders of 16065 * 512 = 8225280 bytes

Sector size (logical/physical): 512 bytes / 512 bytes

I/O size (minimum/optimal): 512 bytes / 512 bytes

Disk identifier: 0x000693e3

   Device Boot      Start         End      Blocks   Id  System

/dev/sda1   *           1          64      512000   83  Linux

Partition 1 does not end on cylinder boundary.

/dev/sda2              64        2611    20458496   8e  Linux LVM

Disk /dev/sdb: 2147 MB, 2147483648 bytes

255 heads, 63 sectors/track, 261 cylinders

Units = cylinders of 16065 * 512 = 8225280 bytes

Sector size (logical/physical): 512 bytes / 512 bytes

I/O size (minimum/optimal): 512 bytes / 512 bytes

Disk identifier: 0x00000000

2、安装drbd

drbd的安装有多种方式,可以源码编译安装,也可以使用rpm包进行安装,当然在centos上也可以使用yum安装,这里我采用yum安装的方式。

rpm -Uvhhttp://www.elrepo.org/elrepo-release-6-6.el6.elrepo.noarch.rpm yum update -yyum -y install kernel*yum -y install drbd83-utils kmod-drbd83modprobe drbd

因为更新了内核,所以需要重启才能加载drbd模块,所以使用yum安装完kernel之后需要重启两台服务器,如果不重启则会出现以下问题:

[root@heartbeat01 ~]# uname -r

2.6.32-431.el6.x86_64

[root@heartbeat01 ~]# lsmod|grep drbd

[root@heartbeat01 ~]# modprobe drbd

FATAL: Module drbd not found.

[root@heartbeat01 ~]# echo $?

1

重启之后重试:

[root@heartbeat01 ~]# uname -r

2.6.32-642.4.2.el6.x86_64

[root@heartbeat01 ~]# lsmod |grep drbd

drbd                  332493  0 

[root@heartbeat01 ~]# /sbin/modprobe drbd

[root@heartbeat01 ~]# echo $?

0

3、准备drbd设备

先对新添加的磁盘进行分区操作,这里仍然以其中一台为例。

[root@heartbeat01 ~]# fdisk -cu /dev/sdb

Device contains neither a valid DOS partition table, nor Sun, SGI or OSF disklabel

Building a new DOS disklabel with disk identifier 0x198bc436.

Changes will remain in memory only, until you decide to write them.

After that, of course, the previous content won't be recoverable.

Warning: invalid flag 0x0000 of partition table 4 will be corrected by w(rite)

Command (m for help): p

Disk /dev/sdb: 2147 MB, 2147483648 bytes

255 heads, 63 sectors/track, 261 cylinders, total 4194304 sectors

Units = sectors of 1 * 512 = 512 bytes

Sector size (logical/physical): 512 bytes / 512 bytes

I/O size (minimum/optimal): 512 bytes / 512 bytes

Disk identifier: 0x198bc436

   Device Boot      Start         End      Blocks   Id  System

Command (m for help): n

Command action

   e   extended

   p   primary partition (1-4)

p

Partition number (1-4): 1

First sector (2048-4194303, default 2048): 

Using default value 2048

Last sector, +sectors or +size{K,M,G} (2048-4194303, default 4194303): 

Using default value 4194303

Command (m for help): w

The partition table has been altered!

Calling ioctl() to re-read partition table.

Syncing disks.

4、编辑drbd配置文件

根据drbd的官方文档,drbd的配置文件是/etc/drbd.conf,但是这个文件中只有简单的2行内容,它引用了/etc/drbd.d/下的global_common.conf和*.res文件,这点和nginx、apache的配置文件类似,我们可以把多个resource相同的定义放到global中去,然后添加不同的res文件,从而更有效和方便的管理drbd的配置文件。

下面是/etc/drbd.conf的默认配置:

[root@heartbeat01 ~]# egrep -v "#|^$" /etc/drbd.conf 

include "drbd.d/global_common.conf";

include "drbd.d/*.res";

下面是/etc/drbd.d/global_common.conf的默认配置:

[root@heartbeat01 ~]# egrep -v "#|^$" /etc/drbd.d/global_common.conf 

global {

usage-count yes;

}

common {

protocol C;

handlers {

}

startup {

}

disk {

}

net {

}

syncer {

}

}

下面是我新建的一个配置文件test.res的内容:

resource test

 {

 startup {

 wfc-timeout 30;

 outdated-wfc-timeout 20;

 degr-wfc-timeout 30;

 }

net {

 cram-hmac-alg sha1;

 shared-secret sync_disk;

 }

syncer {

 rate 10M;

 al-extents 257;

 on-no-data-accessible io-error;

 }

 on heartbeat01.contoso.com {

 device /dev/drbd0;

 disk /dev/sdb1;

 address 192.168.49.133:7788;

 flexible-meta-disk internal;

 }

 on heartbeat02.contoso.com {

 device /dev/drbd0;

 disk /dev/sdb1;

 address 192.168.49.134:7788;

 meta-disk internal;

 }

 }

然后可以将此配置文件使用scp拷贝到节点2上去,两边的配置文件需要保持一致。

[root@heartbeat01 drbd.d]# scp test.res heartbeat02:/etc/drbd.d/

The authenticity of host 'heartbeat02 (192.168.49.134)' can't be established.

RSA key fingerprint is f9:ce:14:5d:cd:bb:3c:b4:0d:0b:fc:21:3a:92:43:6b.

Are you sure you want to continue connecting (yes/no)? yes

Warning: Permanently added 'heartbeat02,192.168.49.134' (RSA) to the list of known hosts.

root@heartbeat02's password: 

test.res                                                   100%  444     0.4KB/s   00:00    

5、启动drbd

drbdadm create-md clusterdb   #初始化metadata数据存储service drbd start   #启动drbd服务

上面的操作需要到两台server上运行,这里以其中一台为例演示结果:

[root@heartbeat01 drbd.d]# drbdadm create-md test

Writing meta data...

initializing activity log

NOT initialized bitmap

New drbd meta data block successfully created.

success

[root@heartbeat01 drbd.d]# service drbd start

Starting DRBD resources: [ d(test) s(test) n(test) ]..........

***************************************************************

 DRBD's startup script waits for the peer node(s) to appear.

 - In case this node was already a degraded cluster before the

   reboot the timeout is 30 seconds. [degr-wfc-timeout]

 - If the peer was available before the reboot the timeout will

   expire after 30 seconds. [wfc-timeout]

   (These values are for resource 'test'; 0 sec -> wait forever)

 To abort waiting enter 'yes' [  11]:

.

这里给出很多信息,大多是因为对端节点并没有启动,所以配置文件中的等待时间生效,但是在节点1启动后,我们再启动节点2时的信息就没有这么多了,因为节点1已经启动,所以无须等待。

[root@heartbeat02 drbd.d]# service drbd start

Starting DRBD resources: [ d(test) s(test) n(test) ].

此时,两个节点上都是Secondary的状态,所以我们需要手动的指定哪个节点为primary。

[root@heartbeat01 drbd.d]# cat /proc/drbd

version: 8.3.16 (api:88/proto:86-97)

GIT-hash: a798fa7e274428a357657fb52f0ecf40192c1985 build by phil@Build64R6, 2014-11-24 14:51:37

 0: cs:Connected ro:Secondary/Secondary ds:Inconsistent/Inconsistent C r-----

    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:2096028

[root@heartbeat02 drbd.d]# cat /proc/drbd

version: 8.3.16 (api:88/proto:86-97)

GIT-hash: a798fa7e274428a357657fb52f0ecf40192c1985 build by phil@Build64R6, 2014-11-24 14:51:37

 0: cs:Connected ro:Secondary/Secondary ds:Inconsistent/Inconsistent C r-----

    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:2096028

在要作为主节点的server上执行如下命令:

drbdadm -- --overwrite-data-of-peer primary all(是两个--,不是打错)

这里我在节点1上执行该命令。

[root@heartbeat01 drbd.d]# drbdadm -- --overwrite-data-of-peer primary all

再次看一下两个节点的状态:

[root@heartbeat01 drbd.d]# cat /proc/drbd

version: 8.3.16 (api:88/proto:86-97)

GIT-hash: a798fa7e274428a357657fb52f0ecf40192c1985 build by phil@Build64R6, 2014-11-24 14:51:37

 0: cs:SyncSource ro:Primary/Secondary ds:UpToDate/Inconsistent C r-----

    ns:217088 nr:0 dw:0 dr:217752 al:0 bm:13 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:1878940

[=>..................] sync'ed: 10.6% (1878940/2096028)K

finish: 0:03:01 speed: 10,344 (10,336) K/sec

[root@heartbeat02 drbd.d]#  cat /proc/drbd

version: 8.3.16 (api:88/proto:86-97)

GIT-hash: a798fa7e274428a357657fb52f0ecf40192c1985 build by phil@Build64R6, 2014-11-24 14:51:37

 0: cs:SyncTarget ro:Secondary/Primary ds:Inconsistent/UpToDate C r-----

    ns:0 nr:485248 dw:485248 dr:0 al:0 bm:29 lo:1 pe:1 ua:0 ap:0 ep:1 wo:f oos:1610780

[===>................] sync'ed: 23.3% (1610780/2096028)K

finish: 0:02:34 speed: 10,424 (10,324) want: 10,240 K/sec

可以看到现在heartbeat01是Primary的状态,而heartbeat02是Secondary的状态,而且已经开始从主上面同步数据到从节点上了。

[root@heartbeat01 drbd.d]#  cat /proc/drbd

version: 8.3.16 (api:88/proto:86-97)

GIT-hash: a798fa7e274428a357657fb52f0ecf40192c1985 build by phil@Build64R6, 2014-11-24 14:51:37

 0: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----

    ns:2096028 nr:0 dw:0 dr:2096692 al:0 bm:128 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0

同步完成,两个节点都是UpToDate的状态。

6、创建文件系统并挂载

下面在主节点上为drbd设置创建文件系统

[root@heartbeat01 drbd.d]# mkfs.ext4 /dev/drbd0

mke2fs 1.41.12 (17-May-2010)

Filesystem label=

OS type: Linux

Block size=4096 (log=2)

Fragment size=4096 (log=2)

Stride=0 blocks, Stripe width=0 blocks

131072 inodes, 524007 blocks

26200 blocks (5.00%) reserved for the super user

First data block=0

Maximum filesystem blocks=536870912

16 block groups

32768 blocks per group, 32768 fragments per group

8192 inodes per group

Superblock backups stored on blocks: 

32768, 98304, 163840, 229376, 294912

Writing inode tables: done                            

Creating journal (8192 blocks): done

Writing superblocks and filesystem accounting information: done

This filesystem will be automatically checked every 22 mounts or

180 days, whichever comes first.  Use tune2fs -c or -i to override.

然后创建挂载点,将drbd设备挂载到挂载点从而进行访问。

注意:从节点不需要为drbd创建文件系统,也无须挂载,在主节点挂掉之后,可以直接转换角色,然后将文件系统挂载到挂载点进行使用。

[root@heartbeat01 drbd.d]# mkdir /data

[root@heartbeat01 drbd.d]# mount /dev/drbd0 /data

[root@heartbeat01 drbd.d]# df -h

Filesystem            Size  Used Avail Use% Mounted on

/dev/mapper/VolGroup-lv_root

                       19G  1.6G   16G  10% /

tmpfs                 238M     0  238M   0% /dev/shm

/dev/sda1             477M   83M  369M  19% /boot

/dev/drbd0            2.0G  3.0M  1.9G   1% /data

7、切换Primary和Secondary节点测试

这里,我创建了99个txt文件在/data目录下,等下看切换了主从节点之后,能否在从节点上看到这些文件。

下面在主节点(heartbeat01)上操作:

首先要取消挂载:

[root@heartbeat01 drbd.d]# umount /data

[root@heartbeat01 drbd.d]# mount -n

/dev/mapper/VolGroup-lv_root on / type ext4 (rw)

proc on /proc type proc (rw)

sysfs on /sys type sysfs (rw)

devpts on /dev/pts type devpts (rw,gid=5,mode=620)

tmpfs on /dev/shm type tmpfs (rw)

/dev/sda1 on /boot type ext4 (rw)

none on /proc/sys/fs/binfmt_misc type binfmt_misc (rw)

然后执行下面的命令切换角色:

[root@heartbeat01 drbd.d]# drbdadm secondary test

此时看到主节点已经是Secondary的角色了,而从节点暂时没有处理,也还是Secondary的角色。

[root@heartbeat01 drbd.d]# cat /proc/drbd

version: 8.3.16 (api:88/proto:86-97)

GIT-hash: a798fa7e274428a357657fb52f0ecf40192c1985 build by phil@Build64R6, 2014-11-24 14:51:37

 0: cs:Connected ro:Secondary/Secondary ds:UpToDate/UpToDate C r-----

    ns:2162432 nr:0 dw:66404 dr:2097405 al:23 bm:128 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0

下面到从节点(heartbeat02)上操作:

[root@heartbeat02 drbd.d]# drbdadm primary test

或者

[root@heartbeat02 drbd.d]# drbdadm -- --overwrite-data-of-peer primary all

[root@heartbeat02 drbd.d]# cat /proc/drbd

version: 8.3.16 (api:88/proto:86-97)

GIT-hash: a798fa7e274428a357657fb52f0ecf40192c1985 build by phil@Build64R6, 2014-11-24 14:51:37

 0: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----

    ns:0 nr:2162432 dw:2162432 dr:664 al:0 bm:128 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0

此时,从节点(heartbeat02)已经是primary状态了。

[root@heartbeat02 drbd.d]# mkdir /data

[root@heartbeat02 drbd.d]# mount /dev/drbd0 /data

创建目录并挂载,已经在heartbeat02的/data目录看到之前创建的99个txt文件了,说明drbd已经将数据同步成功了。