Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
x for UNIX:
Fundamentals
Lesson 7
Resolving Hardware Problems
Lesson Introduction
Lesson 1: Virtual Objects
Lesson 2: Installation and Interfaces
Lesson 3: Creating a Volume and File
System
Lesson 4: Selecting Volume Layouts
Lesson 5: Making Basic Configuration
Changes
Lesson 6: Administering File Systems
Lesson 7: Resolving Hardware Problems
Lesson Topics and Objectives
Topic After completing this lesson,
you will be able to:
Topic 1: How Does VxVM Interpret failures in hardware.
Interpret Failures in Hardware
Topic 2: Recovering Disabled Recover disabled disk groups.
Disk Groups
Topic 3:Resolving Disk Resolve disk failures.
Failures
Topic 4: Managing Hot Manage hot relocation at the host
Relocation at the Host Level level.
Topic 1: How Does VxVM Interpret
Failures in Hardware
Can be Permanent
or Temporary
I/O Error Handling
The
TheOS
OSdetects
detectsthe
theI/O
I/Oerror
errorand
andinforms
informsVxVM.
VxVM.
IfIfthe
theLUN/disk
LUN/diskcannot
cannotbe beaccessed
accessedatatall,
all,dynamic
dynamicmultipathing
multipathing(DMP)
(DMP)
disables
disablesthe
thepath.
path.IfIfthere
thereis
isonly
onlyone
onepath,
path,the
theDMP
DMPnode
nodeisisdisabled.
disabled.
Treat
Treatthe
thefailure
failureas
asaa Disable
disk/LUN Disablethe
thedisk
diskgroup.
group.
disk/LUNfailure.
failure.
IfIfthe
thefailure
failureimpacts
impactsany
anyfile
filesystem,
system,disable
disablethe
thefile
filesystem.
system.
Identifying Disabled Disk Groups
VxVM disk and disk group records before the failure:
vxdisk list
DEVICE TYPE DISK GROUP STATUS
…
disk0_1 auto:cdsdisk datadg01 datadg online
disk0_2 auto:cdsdisk datadg02 datadg online
disk0_3 auto:none - - online invalid
…
vxdg list
NAME STATE ID
datadg enabled,cds 1150193039.58.train1
vxconfigbackup diskgroup
vxconfigrestore -p diskgroup
vxconfigrestore -c diskgroup
Disk
Topic 3: Resolving Disk Failures
1 Physical Replacement
Replace corrupt disk with
a new disk.
2 Logical Replacement
Volume
Replace the disk in VxVM.
Start disabled volumes. Volume
Resynchronize redundant volumes.
Physically Replacing a Disk
1. Connect the new disk.
2. Ensure that the operating system recognizes the disk.
3. Get VxVM to recognize the disk:
vxdctl enable
4. Verify that VxVM recognizes the disk:
vxdisk –o alldgs list
Disks marked as failing are not used for any new volume space.
To resolve intermittent disk failure problems:
– If any volumes on the failing disk are not redundant, attempt to
mirror those volumes:
If you can mirror the volumes, continue with the procedure for
redundant volumes.
If you cannot mirror the volume, prepare for backup and restore.
– If the volume is mirrored:
Prevent read I/O from accessing the failing disk by changing the
volume read policy.
Remove the failing disk.
Replace the disk.
Set the volume read policy back to the original policy.
Forced Removal
To forcibly remove a disk and not evacuate the data:
1. Use the vxdiskadm option, “Remove a disk for
replacement.” VxVM handles the drive as if it has
already failed.
2. Use the vxdiskadm option, “Replace a failed or
removed disk.”
Using the command line:
vxdg –k –g diskgroup rmdisk [disk_name]
vxdisksetup –i [new_device_name]
vxdg –k –g diskgroup adddisk \
[disk_name]=[new_device_name]
Topic 4: Managing Hot Relocation at the
Host Level
After completing this topic, you
will be able manage hot
relocation at the host level.
What Is Hot Relocation?
Hot Relocation: The system automatically reacts to I/O
failures on redundant VxVM objects and restores
redundancy to those objects by relocating affected
subdisks.
Spare Disks
VM Disks
Volumes
Spare Disks
1
VM Disks
3
1. vxrelocddetects
1.vxrelocd detectsdisk
diskfailure.
failure.
2.
2.Administrator
Administratoris
isnotified
notifiedbybye-mail.
e-mail.
3.
3.Subdisks
Subdisksare
arerelocated
relocatedtotoaaspare.
spare. 2
4.
4.Volume
Volumerecovery
recoveryisisattempted.
attempted.
Administrator
How Is Space Selected?
Hot relocation attempts to move all subdisks from a
failing drive to a single spare destination disk.
If no disks have been designated as spares, VxVM
uses any available free space in the disk group in
which the failure occurs.
If there is not enough spare disk space, a
combination of spare disk space and free space is
used.
Free space that you exclude from hot relocation is
not used.
Managing Spare Disks
VEA:
Actions—>Set Disk Usage
vxdiskadm:
“Mark a disk as a spare for a disk group”
“Turn off the spare flag on a disk”
“Exclude a disk from hot-relocation use”
“Make a disk available for hot-relocation use”
CLI:
To designate a disk as a spare:
vxedit -g diskgroup set spare=on|off disk_name
To exclude/include a disk for hot relocation:
vxedit -g diskgroup set nohotuse=on|off disk_name
To force hot relocation to only use spare disks:
Add spare=only to /etc/default/vxassist
Lesson Summary
Key Points
This lesson described how to interpret failures in
hardware, recover disabled disk groups, resolve
disk failures, and manage hot relocation at the host
level.
Reference Materials
– VERITAS Volume Manager Administrator’s Guide
– VERITAS Storage Foundation Release Notes
Lab 7
Lab 7: Resolving Hardware Problems
In this lab, you practice recovering from a
variety of hardware failure scenarios, resulting
in disabled disk groups and failed disks.
First you recover a temporarily disabled disk
group, and then you use a set of interactive
lab scripts to investigate and practice
recovery techniques.
Answer
Volume Manager defines a disk as FAILED if:
Next >>
Permanent disk failures:
A. Are failures in which the data on the drive can
no longer be accessed for any reason
B. Are disk devices that have failures that are
repaired some time later
C. Are hardware failures localized to a part of the
disk
D. Are failures that occur off and on and that
involve problems that cannot be consistently
reproduced
Answer
Permanent disk failures:
A. Are failures in which the data on the drive can
no longer be accessed for any reason
B. Are disk devices that have failures that are
repaired some time later
C. Are hardware failures localized to a part of the
disk
D. Are failures that occur off and on and that
involve problems that cannot be consistently
reproduced
The correct answer is A.
Next >>
To recover from the permanent failure, the first
step after physically replacing the disk is to:
A. Recover the redundant volumes.
B. Start any nonredundant volumes.
C. Initialize the new drive.
D. Attach the disk media name to the new drive.
Answer
To recover from the permanent failure, the first
step after physically replacing the disk is to:
A. Recover the redundant volumes.
B. Start any nonredundant volumes.
C. Initialize the new drive.
D. Attach the disk media name to the new drive.
Next >>
Select the command that causes VxVM to
immediately recognize a newly added
replacement disk that is recognized by the
operating system.
A. vxdctl enable
B. vxiod -f set 0
C. vxscandisks
D. vxconfigd restart
Answer
Select the command that causes VxVM to
immediately recognize a newly added
replacement disk that is recognized by the
operating system.
A. vxdctl enable
B. vxiod -f set 0
C. vxscandisks
D. vxconfigd restart
Next >>
After a detached disk is reattached, which plex
state indicates that VERITAS Volume Manager
believes that the data in that plex needs to be
recovered?
A. STALE
B. IOFAIL
C. DISABLED
D. RECOVER
Answer
After a detached disk is reattached, which plex
state indicates that VERITAS Volume Manager
believes that the data in that plex needs to be
recovered?
A. STALE
B. IOFAIL
C. DISABLED
D. RECOVER
Next >>
When relocating subdisks, where is the first
location that VxVM attempts to select a
destination disk?
A. Scatter the subdisks to different disks
B. The same controller, target, and device as the
failed drive
C. The same controller and target, but to a
different device
D. A different controller
Answer
When relocating subdisks, where is the first
location that VxVM attempts to select a
destination disk?
A. Scatter the subdisks to different disks
B. The same controller, target, and device as the
failed drive
C. The same controller and target, but to a
different device
D. A different controller
The correct answer is B.
Next >>
Select the appropriate command to set up a
disk disk01 as a spare. The disk is in the disk
group acctdg.
A. vxedit -g acctdg set spare=on disk01
B. vxassist -g acctdg set spare=on
disk01
C. vxedit -g acctdg set hotuse=on
disk01
D. vxedit -g acctdg set hotrel=on
disk01
Answer
Select the appropriate command to set up a
disk disk01 as a spare. The disk is in the disk
group acctdg.
A. vxedit -g acctdg set spare=on disk01
B. vxassist -g acctdg set spare=on
disk01
C. vxedit -g acctdg set hotuse=on
disk01
D. vxedit -g acctdg set hotrel=on
disk01
The correct answer is A.
Next >>
Select the appropriate command to get VxVM to
recognize that a failed disk is now working
again.
A. vxdctl
B. vxdisk list
C. vxdisk adddisk
D. vxdctl enable
Answer
Select the appropriate command to get VxVM to
recognize that a failed disk is now working
again.
A. vxdctl
B. vxdisk list
C. vxdisk adddisk
D. vxdctl enable
Next >>
The command that attempts to find the name of
a drive in the private region and match it to a
disk media record that is missing a disk access
record is _________.
A. vxreattach
B. vxunreloc
C. vxattach
D. vxrecover
Answer
The command that attempts to find the name of
a drive in the private region and match it to a
disk media record that is missing a disk access
record is _________.
A. vxreattach
B. vxunreloc
C. vxattach
D. vxrecover
Next >>
End of Presentation