
Hazly Thambiraj

Explain the boot process?


The boot process is divided into 4 phases.
1. POST
2. OBPROM
3. KERNEL INITIALIZATION
4. INIT PHASE
POST: Power-on self-test. It tests and detects the hardware. The banner then reports the machine host ID, serial number, architecture type, memory and Ethernet address.
OBP: OpenBoot PROM.
1. Diagnoses all the system hardware and memory.
2. Initializes the boot parameters.
3. Creates the device tree and loads the primary boot program, bootblk, from sectors 1-15 of the boot device. bootblk then loads the secondary boot program, ufsboot.
KERNEL INITIALIZATION: ufsboot loads the kernel (unix and genunix). The kernel loads the device modules it needs to mount the root partition and continue the boot process.
INIT PHASE: The kernel starts the /sbin/init program (/etc/init is a link to it), which reads /etc/inittab and starts the other processes as directed by the entries in that file.
What is the difference between init 1 and init s?
If you switch from multi-user mode to init s and then back to multi-user mode, remote users reconnect to the system automatically,
whereas in the case of init 1 they have to reconnect manually, that is, they have to log in again.
Explain the init phases
There are 8 run levels.
init 0: Shuts down the system and brings it to the OBP prompt (ok)
init 1: Single-user mode for administration
init 2: Multi-user mode without resource sharing
init 3: Multi-user mode with resource sharing
init 4: Not in use
init 5: Shuts down and powers off the system
init 6: Reboots the system to the default run level
init s: Single-user mode; user logins are disabled
Explain rc scripts and run levels
rc scripts: Check and mount the file systems, and start and stop the various processes.
Run levels: The 8 run levels described under the init phases.
Which file is read while booting?
# /etc/inittab
Explain the /etc/inittab file
Each entry has 4 fields (id:rstate:action:process). Common values of the action field:
respawn: Restart the process if it is killed
off: Do not run the process; if it is already running, stop it
once: Run the process once
wait: Wait for the process to complete before processing the next line in /etc/inittab
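As a rough illustration of the four-field layout, the following sketch splits one inittab-style line on its colons. The sample entry is a hypothetical example written for this note, not a line copied from a particular system.

```shell
#!/bin/sh
# Hypothetical sample entry in the id:rstate:action:process layout.
entry='s3:3:wait:/sbin/rc3 >/dev/msglog 2<>/dev/msglog </dev/console'

# IFS=: makes read split on colons; the last variable keeps the rest of the line.
IFS=: read -r id rstate action process <<EOF
$entry
EOF

echo "id=$id rstate=$rstate action=$action"
echo "process=$process"
```

Here the action field is wait, so init would run the process for run level 3 and wait for it to finish before moving to the next entry.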
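Explain the /etc/system file
It controls the loading of kernel modules and the kernel parameters, and it is divided into 5 sections:
moddir: Default search path for loadable kernel modules
rootdev and rootfs (root device and root file system configuration): Physical path name of the root device and the type of the root file system
exclude: Does not load the named kernel module during kernel initialization
forceload: Forces the named kernel module to load
set: Sets a kernel parameter, e.g. set maxusers=40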

Is it possible to edit the crontab using vi?


It is possible, but not recommended, by editing the file directly:
# vi /var/spool/cron/crontabs/root
The command to edit the crontab is
# crontab -e
Is it possible to create swap on a new hard disk without formatting it?
No. Without labeling the drive, you can't do anything.
How will you comment out an error line in the /etc/system file?
# vi /etc/system
(To comment out the error line in /etc/system, put * at the start of the line.)
Explain inode
It contains the information about a file or directory,
such as the file type, permissions, owner, group, size, link count, access/modification/change times and the pointers to the data blocks.
How many files must be modified to change the host name without rebooting the system?
There are 6 files.
# vi /etc/hosts
# vi /etc/nodename
# vi /etc/hostname.hme0
# vi /etc/net/ticlts/hosts
# vi /etc/net/ticots/hosts
# vi /etc/net/ticotsord/hosts
Where is the IP address stored?
# /etc/hosts
# /etc/hostname.hme0
Which file maintains the kernel?
# /etc/path_to_inst
This is a tricky question, because the kernel is the core of the operating system, whereas /etc/path_to_inst
records the physical device paths, instance numbers and driver names of the hardware that has been configured.

How will you see the kernel version?


ok banner --> This will not show the kernel version
ok .version --> This shows the OpenBoot PROM version, in other words the FIRMWARE
# prtconf -V
# showrev --> showrev -p will show the installed patches
# uname -X
How will you see the hostname?
# uname -n
# hostname

How will you compare the patches of 2 hosts?


Save the showrev -p output on each host, copy one file to the other host through ftp,
and compare the two files with the command called "diff".

How to find the hardware configuration?


ok banner --> from the OpenBoot prompt
# prtconf
# sysdef
# /usr/platform/sun4u/sbin/prtdiag
Difference between probe-scsi-all and probe-scsi
ok probe-scsi-all: shows all the SCSI devices connected, internal and external
ok probe-scsi: shows only the internal SCSI devices connected

How will you find the number of hard disks?


# format
# dmesg --> this will not give the correct answer
# sysdef
# iostat -En --> it won't show Fibre Channel disks
ok show-devs --> shows the device tree, including the SCSI bus channels
ok probe-scsi-all --> detects the devices on all the SCSI bus channels; it shows CD-ROM/DVD-ROM drives as well
How to create core dump and crash dump
# coreadm
     global core file pattern:
       init core file pattern: core
            global core dumps: disabled
       per-process core dumps: enabled
      global setid core dumps: disabled
 per-process setid core dumps: disabled
     global core dump logging: disabled
To disable per-process core file creation, use:
# coreadm -d process
This modifies /etc/coreadm.conf, which is read at boot when /etc/init.d/coreadm is executed from
a run control script. To make permanent changes to the coreadm settings, do not edit the /etc/coreadm.conf
file; use the coreadm command.

Procedure for Firmware upgrade


Shut down root cron on the csslu377 servers
# /etc/init.d/cron stop
Reset the system controller from the console.
# resetsc -y
Upgrade the firmware on the system controller
> cd /114527-04
> lom -G ./sgrtos.flash
> lom -G ./sgsc.flash

Escape to lom> and reset the SC:


lom> resetsc -y
Upgrade the firmware on the system boards:
> lom -G ./lw8cpu.flash
> lom -G ./lw8pci.flash
Shut down the operating system.
# shutdown -i0 -g0 -y
Power cycle the server. It may take a while for the server to come up.
> poweroff
> poweron
Which file configures the log messages (like /var/adm/messages etc.)?
# /etc/syslog.conf
How will you see the version of the patches?
# showrev -p
# patchadd -p
How will you check the patches installed?
# showrev -p --> This is the correct one
# patchadd -p
# patchrm (patch no): to remove the patch
# patchadd (patch no): to install the patch

How will you view the packages?


#pkginfo | more
Which files are read when a user logs in?
/etc/profile, followed by the .profile file in the user's home directory
Default path for patches
# /var/sadm/patch
How many fields are there in the passwd, shadow and group files?
7 in passwd: username:passwd:uid:gid:comment:homedirectory:loginshell
9 in shadow: username:passwd:lastchange:min:max:warn:inactive:expire:flag
4 in group: groupname:passwd:gid:userlist
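The field counts can be checked by splitting an entry on its colons. The sketch below uses hypothetical sample entries (made up for illustration, not read from a real system) and a small helper named count_fields.

```shell
#!/bin/sh
# Hypothetical sample entries; the values are placeholders.
passwd_line='ragu:x:101:1:revenue:/export/home/ragu:/bin/ksh'
shadow_line='ragu:NP:13514:7:90:14:::'
group_line='staff::10:ragu,haz'

count_fields() {
    # -F: splits on colons; NF is the number of fields, empty ones included.
    printf '%s\n' "$1" | awk -F: '{ print NF }'
}

echo "passwd: $(count_fields "$passwd_line") fields"
echo "shadow: $(count_fields "$shadow_line") fields"
echo "group:  $(count_fields "$group_line") fields"
```

Empty fields still count, which is why the shadow entry with three trailing colons has 9 fields.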
Syntax for the useradd command
# useradd -u 101 -g other -d /export/home/ragu -m -s /bin/ksh -c revenue ragu
How can an ordinary user change his password, which needs root privilege?
The setuid bit is set on the /usr/bin/passwd command, so it runs with the privileges of its owner, root.
What is default permission of /etc/passwd , /etc/shadow files
/etc/passwd: 444
/etc/shadow: 400

What is the default permission of files and directories?


file: 644
directory: 755
What is umask?
umask is the file mode creation mask of a process (set with the umask shell command). Its bits are removed from the permissions of newly created files and directories.
Default value for umask
The default umask value is 022
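A minimal sketch of how the 022 mask leads to the 644 and 755 defaults: the mask bits are cleared from 666 for files and 777 for directories. The scratch directory and variable names are assumptions made for this example.

```shell
#!/bin/sh
# Arithmetic view: clear the mask bits from the base modes.
file_calc=$(printf '%o' $((0666 & ~0022)))
dir_calc=$(printf '%o' $((0777 & ~0022)))
echo "file: $file_calc  directory: $dir_calc"

# Practical view: create a file and a directory under umask 022.
umask 022
workdir=$(mktemp -d)
touch "$workdir/newfile"
mkdir "$workdir/newdir"
file_mode=$(ls -ld "$workdir/newfile" | cut -c1-10)
dir_mode=$(ls -ld "$workdir/newdir" | cut -c1-10)
echo "newfile: $file_mode"
echo "newdir:  $dir_mode"
rm -r "$workdir"
```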
Explain setuid, setgid and sticky bit
Setuid: When the setuid permission is set on an executable file, a user who runs the file is
granted the access permissions of the owner of the file.
# find / -perm -4000
Setgid: Similar to setuid, but the process runs with the group of the file.
# find / -perm -2000
Sticky bit: A special permission that protects the files within a publicly writable directory.
When the sticky bit is set on a shared directory, any user can create files or directories in it,
but only the owner of the file, the owner of the directory or root can delete or rename them.
# find / -perm -1000
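The find searches can be tried safely in a scratch directory instead of /. This is a sketch with hypothetical file names; the leading dash in -perm -4000 means "at least these bits are set".

```shell
#!/bin/sh
workdir=$(mktemp -d)
touch "$workdir/suid_file" "$workdir/sgid_file" "$workdir/plain_file"
mkdir "$workdir/shared"

chmod 4755 "$workdir/suid_file"   # setuid bit
chmod 2755 "$workdir/sgid_file"   # setgid bit
chmod 1777 "$workdir/shared"      # sticky bit on a world-writable directory

suid_hits=$(find "$workdir" -perm -4000 -print)
sgid_hits=$(find "$workdir" -perm -2000 -print)
sticky_hits=$(find "$workdir" -perm -1000 -print)

echo "setuid: $suid_hits"
echo "setgid: $sgid_hits"
echo "sticky: $sticky_hits"
rm -r "$workdir"
```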
Explain soft link and hard link
Soft link: can link across different file systems, and the inode number will be different
(eg) /U3 - /U4
# ln -s /U3/file1 /U4/file2
Hard link: links within the same file system only, and the inode number will be the same
(eg) /U3 - /U3
# ln /U3/file1 /U3/file2
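The inode behaviour can be observed with ls -i. The following sketch works in a scratch directory with hypothetical file names and also shows what happens to each link when the original name is removed.

```shell
#!/bin/sh
workdir=$(mktemp -d)
echo "data" > "$workdir/file1"
ln "$workdir/file1" "$workdir/hard"      # hard link: another name for the same inode
ln -s "$workdir/file1" "$workdir/soft"   # soft link: a separate inode holding a path

inode_of() { ls -di "$1" | awk '{ print $1 }'; }
orig_inode=$(inode_of "$workdir/file1")
hard_inode=$(inode_of "$workdir/hard")
soft_inode=$(inode_of "$workdir/soft")
echo "file1=$orig_inode hard=$hard_inode soft=$soft_inode"

# Remove the original name: the hard link still reaches the data,
# the soft link is left pointing at nothing.
rm "$workdir/file1"
hard_content=$(cat "$workdir/hard")
if [ -e "$workdir/soft" ]; then soft_state=ok; else soft_state=dangling; fi
echo "hard link content: $hard_content, soft link: $soft_state"
rm -r "$workdir"
```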
How to set the password to never expire for a user
# passwd -x -1 (username)
To lock an account simply type:
# passwd -l user_login_name
Now check the status of the account:
# passwd -s user_login_name
To unlock the user account:
# passwd -u user_login_name
# passwd -s user_login_name
How to find the kernel bit version or architecture


# isainfo -kv
How to set the time zone
# vi /etc/default/init
TZ=

How to enable and disable the dtlogin


# /usr/dt/bin/dtconfig -e (enable)
# /usr/dt/bin/dtconfig -d (disable)

How to change the hostname and Ethernet address in a single command


# /usr/sbin/sys-unconfig

How do you add a disk without rebooting the server?


# devfsadm -c disk
How to delete abc files using the find command
# find / -name abc -exec rm {} \;
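Since running this against / is destructive, a sketch like the following can be used to try the -exec form in a scratch directory first. The directory layout and file names are made up for the example, and -type f is added here so that only regular files are matched.

```shell
#!/bin/sh
workdir=$(mktemp -d)
mkdir -p "$workdir/a/b"
touch "$workdir/abc" "$workdir/a/abc" "$workdir/a/b/abc" "$workdir/a/keep"

# {} is replaced by each matching path; \; ends the -exec command.
find "$workdir" -type f -name abc -exec rm {} \;

remaining=$(find "$workdir" -type f | wc -l | tr -d ' ')
echo "files left: $remaining"
rm -r "$workdir"
```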
What is the difference between -g and -G in useradd?
Small -g is for the primary group
Capital -G is for the secondary groups
If we edit the /etc/passwd file to modify a user and forget to give the shell, will the user be able to
log in?
If passwd is run with the -f option, which file will it update?
How many fields are there in /etc/vfstab?
7 fields
After creating a swap file, what fstype is used when adding it to /etc/vfstab?
swap
How will you clear a soft error on a disk?
# format --> analyze --> verify / test
How will you verify the disk and how many fields will be there?
# format --> verify
How to rename the user from ragu to haz
# usermod -d /export/home/haz -m -l haz ragu
How will you find out whether there is enough memory?
# /usr/platform/sun4u/sbin/prtdiag
# prtconf | grep -i mem
What command can be used to run a background process?
# nohup (command) &
How will you recover the root password in Solaris?
Boot from CD 1 of 3 in single-user mode.
ok boot cdrom -s
# mount -F ufs /dev/dsk/c0t0d0s0 /mnt
# TERM=vt100
# export TERM --> If you don't set this, the vi editor won't start properly
# vi /mnt/etc/shadow (remove the password entry for root and save the file)
# umount /mnt
# reboot
(When the system comes up it will ask for the new root password.)

If the system panics or does not boot
Boot from CD 1 of 3 in single-user mode.
ok boot cdrom -s
# mount -F ufs /dev/dsk/c0t0d0s0 /mnt
# installboot /usr/platform/`uname -i`/lib/fs/ufs/bootblk /dev/rdsk/c0t0d0s0
# umount /mnt
ok boot -r
How will you add a new alias name?
# vi /etc/mail/aliases (enter the alias, e.g. service:servicedesk@hotmail.com)
# newaliases
# /usr/lib/sendmail -v -bv (alias name)
To test sendmail:
root on LIVE cathnidp # echo "This is a test msg" | mailx -s test thambirajh@inbangalore.mail.saic.com
root on LIVE cathnidp #
What are the printer daemons?
# lpsched
# in.lpd --> this daemon is started as a service through /etc/inetd.conf
What is the protocol supported by NFS
UDP and TCP
To find the OBP environment variable
OK printenv
Explain fsck
A utility for checking and repairing file system inconsistencies due to abnormal shutdown.
It has 5 phases:
Phase 1: Check blocks and sizes
Phase 2: Check pathnames
Phase 3: Check connectivity
Phase 4: Check reference counts
Phase 5: Check cylinder groups

If the superblock is corrupted, how do you recover?


# newfs -N /dev/rdsk/c0t0d0s6 (lists the alternate superblocks without creating a file system)

Then run the fsck utility using an alternate superblock


# fsck -F ufs -o b=32 /dev/rdsk/c0t0d0s6

How to create a swap file


# mkfile 20m /filename
# swap -a /filename
# swap -l (to list the swap files)
# swap -d /filename (to delete the swap file)
What is the difference between hard and soft mount?
Hard mount: The default for NFS mounts. If the server does not respond, the client keeps
retrying the request until the server answers, so processes using the file system wait
until it comes back.
Soft mount: If the server does not respond within the timeout and retry limit, the client gives up
and returns an error to the calling process. It is selected with the soft mount option,
for example in the mount command, /etc/vfstab or the autofs maps.
Which command locks a user account?
# passwd -l username
How to boot the Solaris system with the 64-bit or 32-bit kernel
For 64 bit
# eeprom boot-file=kernel/sparcv9/unix
or
ok printenv boot-file
ok setenv boot-file kernel/sparcv9/unix
For 32 bit
# eeprom boot-file=kernel/unix
or
ok printenv boot-file
ok setenv boot-file kernel/unix
What is the difference between dsk and rdsk
DSK: Block level devices
RDSK: Raw level device or character level device
How to find the boot path in Solaris
# prtconf -vp | grep bootpath
or
# eeprom
How to bring a processor offline or online
# psradm -f (processor ID) --> offline
# psradm -n (processor ID) --> online (psradm -a -n brings all processors online)
How to check the number of processors available
# psrinfo -v

How will you check the processes of a particular user?


# ps -U oracle
or
# ps -fu oracle | grep smon
How do you check the number of users logged into the system?
# who
How do you check the run level?
# who -r
How to enable the telnet or ftp session for one host
Edit /etc/hosts.allow (the list of hosts that are allowed access).
Edit /etc/hosts.deny (the list of hosts that are denied access).
Check the /etc/services file.
If telnet is not working, what are the things to be checked?
Ping test, whether the service is enabled in /etc/inetd.conf, NIS problems.
If telnet is not working you will not be able to connect, so log in through console access and then
check with ps -ef | grep telnetd, or make inetd reread its configuration with kill -1 <pid_inetd>
In which file are the port numbers defined?
# /etc/services
What do the nice and renice commands do?
nice starts a process with a given scheduling priority; renice changes the priority of a running process.
-20 highest priority
+19 lowest priority
Eg: # nice -n 19 (command)
Tell me the ports for telnet, ftp, nis, ssh, nfs
telnet = 23
ftp = 21
ssh = 22
nfs = 2049
nis = no fixed port; assigned dynamically through rpcbind (port 111)

How will you restart the inetd process?


# pkill -HUP inetd
What are the things you must ensure to secure the system?

1. Latest patches
2. Access to the system:
   /etc/default/login
   sshd_config
   Limited su access
3. Stop unnecessary services at run level
   /etc/inetd.conf: finger, discard, daytime, chargen, tftp, sprayd, etc.
How to configure the network card
# ifconfig hme0 plumb
# ifconfig hme0 192.9.200.10 netmask 255.255.255.0 up
# vi /etc/hosts (update the IP address)
# vi /etc/hostname.hme0 (update the IP address) --> to bring up the interface with the IP address
at boot time.
How to configure a virtual IP address
Multiple virtual IP addresses can be created on one interface.
# ifconfig hme0:1 plumb
# ifconfig hme0:1 192.9.200.10 netmask 255.255.255.0 up
# vi /etc/hosts (update the IP address)
# vi /etc/hostname.hme0:1 (update the IP address) --> to bring up the interface with the IP address
at boot time.
How to find the network card speed
# ndd -get /dev/hme link_speed
1 = 100 Mbps
0 = 10 Mbps
How to modify the network card speed
# ndd -set /dev/hme instance 0
# ndd -get /dev/hme link_status
# ndd -get /dev/hme link_mode
To modify
# ndd -set /dev/eri instance 0
# ndd -set /dev/eri adv_100T4_cap 0
# ndd -set /dev/eri adv_100fdx_cap 1
# ndd -set /dev/eri adv_100hdx_cap 0
# ndd -set /dev/eri adv_10fdx_cap 0
# ndd -set /dev/eri adv_10hdx_cap 0
# ndd -set /dev/eri adv_autoneg_cap 0
100=full duplex
10=half duplex
0=
1= autoneg
2
root on BUILD kirkbiz06 # ndd -set /dev/bge3 adv_autoneg_cap 0
root on BUILD kirkbiz06 # ndd -get /dev/bge3 link_speed
100
root on BUILD kirkbiz06 # ndd -get /dev/bge3 link_status
1
root on BUILD kirkbiz06 # ndd -get /dev/bge3 link_duplex
2
root on BUILD kirkbiz06 # ndd -get /dev/bge3 link_autoneg
0
root on BUILD kirkbiz06 # ndd -set /dev/bge3 adv_autoneg_cap 1
root on BUILD kirkbiz06 # ndd -get /dev/bge3 link_duplex
2
root on BUILD kirkbiz06 # ndd -get /dev/bge3 link_autoneg
1
root on BUILD kirkbiz06 #
What is nslookup
To find the hostname and ip address
To resolve the hostname into ip and ip into hostname
How will you check the disk performance?
# iostat -xctd 4 5
If a disk shows consistently high reads/writes, the percentage busy (%b) of the disk is
greater than 5 percent, and the average service time (svc_t) is greater than 30 milliseconds, then
the disk is overloaded and corrective action needs to be taken.
What performance tools are used?
iostat, vmstat, prstat, sar, netstat, top
Add swap space
# mkfile 500M /swap1/swap1
# swap -a /swap1/swap1
# swap -s
Which service assigns port numbers dynamically?
RPC services (/etc/rpc)
Port mapping: rpcbind maps an RPC program number to the port number the service is listening on
What are the NFS daemons on the server and the client?
Server daemons
1. mountd
2. nfsd
3. nfslogd
4. statd
5. lockd

Client daemons
statd
lockd

The following daemons play a critical role in NFS service:

1. mountd: Handles file system mount requests from remote systems, and provides access control (server)
2. nfsd: Handles client file system requests (server)
3. statd: Works with the lockd daemon to provide crash recovery functions for the lock manager (client and server)
4. lockd: Supports record locking operations on NFS files (client and server)
5. nfslogd: Provides file system logging. Runs only if one or more file systems are shared with the log attribute (server)

biod: On the client end, handles asynchronous I/O for blocks of NFS files.

How to start / stop the NFS server


# /etc/init.d/nfs.server start
# /etc/init.d/nfs.server stop

How to find out the shared file systems from the server and the client
Server: # share and # dfmounts
Client: # showmount -e (hostname) and # dfshares (hostname)
What are the daemons in nis server.
rpc.yppasswdd
Ypxfrd
rpc.ypupdated
What are processing daemon in nis
Ypserv
Ypbind
SDS
Explain RAID 0, RAID 1, RAID 5
RAID 0: Concatenation/Striping
RAID 1: Mirroring
RAID 5: Striped array with rotating parity.
Concatenation: Concatenation is the joining of two or more disk slices to add up the disk space.
Concatenation is serial in nature, i.e. sequential data operations are performed serially on the first disk, then
the second disk and so on. Because of this serial nature, new slices can be added without having to
back up the entire concatenated volume, add the slice and restore the backup.

Striping: Spreading of data over multiple disk drives, mainly to enhance performance, by
distributing data in alternating chunks (16 KB interleave) across the stripes. Sequential data operations
are performed in parallel on all the stripes by reading/writing 16 KB data blocks alternately from the
disk stripes.
Mirroring: Mirroring provides data redundancy by simultaneously writing data to the submirrors of
a mirrored device. A submirror can be a stripe or a concatenated volume, and a mirror can have up to three
submirrors. The main concern is that a mirror needs as much additional disk space as the volume being mirrored.
RAID 5: RAID 5 provides data redundancy and the advantage of striping, and uses less space than
mirroring. A RAID 5 volume is made up of at least three disks, which are striped with parity information written
alternately on all the disks. In case of a single disk failure the data can be rebuilt using the parity
information from the remaining disks.

How many replicas should there be for RAID 5 in SDS if I have 5 disks?

No. of hard drives      No. of state database replicas to be created
One                     Three, all on one slice
Two-four                Two on each drive
Five or more            One on each drive
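The guideline can be written as a small lookup. This is only a sketch of the rule as listed; the function name replicas_for is a hypothetical helper, not an SDS command.

```shell
#!/bin/sh
# replicas_for N: print the replica guideline for a system with N drives.
replicas_for() {
    drives=$1
    if [ "$drives" -eq 1 ]; then
        echo "3 replicas, all on one slice"
    elif [ "$drives" -le 4 ]; then
        echo "2 replicas on each drive"
    else
        echo "1 replica on each drive"
    fi
}

for n in 1 3 5; do
    echo "$n drive(s): $(replicas_for "$n")"
done
```

For the 5-disk case in the question this gives one replica on each drive, five in total.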

Where is the configuration for the metadevices?


# /etc/lvm/md.tab
or
# /etc/opt/SUNWmd/md.tab
How to grow the disk size in SDS
Identify the free disks, the volume size and the metadevice name
# df -h
/dev/md/dsk/d19 27G 1.5G 25G 6% /rpbkup
Increase the file system by 10 GB
# metattach d102 10G
# growfs -M /agtmgt/ora1data /dev/md/rdsk/d102
To find the free space on soft partition
#metarecover -v -n d40 -p|grep -i free
Creating New FS in LUNs and new mount point to the Oracle filesystem
# metainit d111 -p d200 20G
d111: Soft Partition is setup
# newfs /dev/md/rdsk/d111
newfs: construct a new file system /dev/md/rdsk/d111: (y/n)? y
# mkdir /ora13data
# chown oracle:dba /ora13data
# ls -la /ora13data
# mount /dev/md/dsk/d111 /ora13data
# df -k
What is luxadm probe used
#luxadm probe
Found Enclosure(s):
SUNWGS INT FCBPL Name:FCloop
Logical Path:/dev/es/ses0
Logical Path:/dev/es/ses1

Node WWN:50800200001bcf28

Found Fibre Channel device(s):


Node WWN:50020f200000d250 Device Type:Disk device
Logical Path:/dev/rdsk/c8t60020F200000D28D3E7768C80008F620d0s2
Node WWN:50020f200000d28d Device Type:Disk device
Logical Path:/dev/rdsk/c8t60020F200000D28D3E77593A00008EA9d0s2
SMS800 Failed Disk Notes
This information was obtained from Sunsolve
http://sunsolve8.sun.com/search/document.do?assetkey=1-9-76438-1
This example is for replacing c1t0d0 on SMSLU104
Detach the failed submirrors:
# metadetach -f d0 d30
# metadetach -f d1 d31
# metadetach -f d2 d32
# metadetach -f d3 d33
# metadetach -f d100 d101
Delete the disk metadevices:
# metaclear d30
# metaclear d31
# metaclear d32
# metaclear d33
# metaclear d101
Verify there are no existing metadevices:
# metastat -p | grep c1t0d0
Remove the metadb replicas on the failed disk:
# metadb -i
# metadb -d c1t0d0s7
Verify there are no replicas:
# metadb | grep c1t0d0
Save the disk partition table from the surviving disk:
# prtvtoc /dev/rdsk/c1t1d0s2 > /var/adm/mmddyyc1t0d0.vtoc
Required for v880, not sure about other HW:
Run luxadm to remove the failed disk
# luxadm remove_device -F /dev/rdsk/c1t0d0s2
At the prompt, physically remove the device and continue
Run devfsadm cleanup
# /usr/sbin/devfsadm -C -c disk
This should remove the device paths for this disk. Verify via:
# ls -ld /dev/dsk/c1t0d*
This should return no devices.
The new disk should now be inserted.
Refresh device names:
# devfsadm
or
# /usr/sbin/luxadm insert_device <enclosure_name,sx>
luxadm insert_device /dev/rdsk/c1t49d0s2
where sx is the slot number
or
# /usr/sbin/luxadm insert_device (if enclosure name is not known)
Note: In many cases, luxadm insert_device does not require the enclosure
name and slot number.
Use the following to find the slot number:
# luxadm display <enclosure_name>
To find the <enclosure_name> use:
# luxadm probe
Run "ls -ld /dev/dsk/c1t1d*" to verify that the new device paths have
been created.
Update hardware device numbers:
At the end of the metastat command output are the hardware device numbers. After the replacement, the
metadevadm command should be run to update the new device number.
# metadevadm -u c1t0d0
Write the vtoc to the replacement disk:
# fmthard -s /var/adm/mmddyyc1t0d0.vtoc /dev/rdsk/c1t0d0s2
Or use format to copy the partition table.
Create the new metadevices:
# metainit d30 1 1 c1t0d0s0
# metainit d31 1 1 c1t0d0s1
# metainit d32 1 1 c1t0d0s3
# metainit d33 1 1 c1t0d0s4
# metainit d101 1 1 c1t0d0s6

Attach mirrors:
#metattach d0 d30
#metattach d1 d31
#metattach d2 d32
#metattach d3 d33
#metattach d100 d101
Add metadbs to the replacement disk:
# metadb -a -c 3 c1t0d0s7
Check that the metadbs are correct:
The lower case lettered flags may not appear until the server is rebooted. For sms800 there need to
be six total metadbs, three on each of the root mirrored disks.
# metadb
        flags           first blk       block count
     a m  p  luo        16              8192            /dev/dsk/c1t0d0s7
     a    p  luo        8208            8192            /dev/dsk/c1t0d0s7
     a    p  luo        16400           8192            /dev/dsk/c1t0d0s7
     a    p  luo        16              8192            /dev/dsk/c1t1d0s7
     a    p  luo        8208            8192            /dev/dsk/c1t1d0s7
     a    p  luo        16400           8192            /dev/dsk/c1t1d0s7

Check the progress of the sync:
# metastat | grep -i sync
Re-enable (uncomment) chk-vols in root's crontab if needed.
Receiving the following I/O error on /scratch:
# ls /scratch
/scratch: I/O error
************************************************************************
Observation
Found that trans device d5 has a problem.
The following file systems cannot be opened; df -k shows an I/O error for them.
[root drcs1] ksh$ df -k | grep -i /dev/md/meter
/dev/md/meter/dsk/d14 1001382 117016 874353 12% /appl/TEST
/dev/md/meter/dsk/d6 11329080 681210 9514970 7% /ora1data/METR
/dev/md/meter/dsk/d10 5664168 1651 5096107 1% /ora1index/METR
/dev/md/meter/dsk/d22 2002021 10 1981991 1% /oraredo/METR
/dev/md/meter/dsk/d26 1887813 20 1699013 1% /redoarch/METR
Step 1
[root drcs1] ksh$
[root drcs1] ksh$ metastat -s meter d18
meter/d18: Trans
State: Hard Error
Size: 4087280 blocks
Master Device: meter/d17
Logging Device: meter/d5
meter/d17: Mirror
Submirror 0: meter/d15
State: Okay

Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 4087280 blocks
meter/d15: Submirror of meter/d17
State: Okay
Size: 4087280 blocks
Stripe 0:
Device      Start Block  Dbase  State  Hot Spare
c2t5d1s0    0            No     Okay

meter/d5: Logging device for meter/d6 meter/d10 meter/d14 meter/d18 meter/d22 meter/d26
State: Hard Error
Size: 131990 blocks
meter/d5: Mirror
Submirror 0: meter/d3
State: Okay
Submirror 1: meter/d1
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 132240 blocks
meter/d3: Submirror of meter/d5
State: Okay
Size: 132240 blocks
Stripe 0:
Device      Start Block  Dbase  State  Hot Spare
c1t3d4s6    0            No     Okay
meter/d1: Submirror of meter/d5
State: Okay
Size: 132240 blocks
Stripe 0:
Device      Start Block  Dbase  State  Hot Spare
c1t4d0s6    0            No     Okay

Step 2:- Analyzed both disks; no errors were found and the disks are okay.

21. c1t3d4 <SUN2.1G cyl 2733 alt 2 hd 19 sec 80>


22. c1t4d0 <SUN2.1G cyl 2733 alt 2 hd 19 sec 80>
analyze> test
Ready to analyze (won't harm data). This takes a long time,
but is interruptable with CTRL-C. Continue? yes
pass 0 - pattern = 0xc6dec6de
2732/18/14
pass 1 - pattern = 0x6db6db6d
2732/18/14
Total of 0 defective blocks repaired.
Step 3:- Take the output of metadevice configuration for meter

[root drcs1] ksh$ metastat -s meter -p


meter/d6 -t meter/d2 meter/d5
meter/d2 -m meter/d0 1
meter/d0 1 6 c1t4d0s0 c1t4d1s0 c1t4d2s0 c1t4d3s0 c1t4d4s0 c1t5d0s0 -i 256b
meter/d10 -t meter/d9 meter/d5
meter/d9 -m meter/d7 1
meter/d7 1 3 c1t3d1s0 c1t3d2s0 c1t3d3s0 -i 256b
meter/d14 -t meter/d13 meter/d5
meter/d13 -m meter/d11 1
meter/d11 1 1 c1t5d1s0
meter/d18 -t meter/d17 meter/d5
meter/d17 -m meter/d15 1
meter/d15 1 1 c2t5d1s0
meter/d22 -t meter/d21 meter/d5
meter/d21 -m meter/d19 1
meter/d19 1 1 c1t5d4s0
meter/d26 -t meter/d25 meter/d5
meter/d25 -m meter/d23 1
meter/d23 1 1 c1t5d2s0
meter/d5 -m meter/d3 meter/d1 1
meter/d3 1 1 c1t3d4s6
meter/d1 1 1 c1t4d0s6
meter/d4 1 1 c1t5d3s0
[root drcs1] ksh$
Step 4: Find out whether the disks are in use in some other metaset
[root drcs1] ksh$ metastat -s drcs1 -p | grep -i c1t3d4
[root drcs1] ksh$ metastat -s drcs1 -p | grep -i c1t4d0
[root drcs1] ksh$ metastat -s tdcc -p | grep -i c1t3d4
[root drcs1] ksh$ metastat -s tdcc -p | grep -i c1t4d0
[root drcs1] ksh$ metastat -s ssd -p | grep -i c1t3d4
[root drcs1] ksh$ metastat -s ssd -p | grep -i c1t4d0
[root drcs1] ksh$ metastat -s msp_cd -p | grep -i c1t3d4
[root drcs1] ksh$ metastat -s msp_cd -p | grep -i c1t4d0
[root drcs1] ksh$ metastat -s shared_disks -p | grep -i c1t3d4
[root drcs1] ksh$ metastat -s shared_disks -p | grep -i c1t4d0
[root drcs1] ksh$ metastat -s custdir -p | grep -i c1t3d4
[root drcs1] ksh$ metastat -s custdir -p | grep -i c1t4d0
[root drcs1] ksh$ metastat -s meter -p | grep -i c1t3d4
meter/d3 1 1 c1t3d4s6
[root drcs1] ksh$ metastat -s meter -p | grep -i c1t4d0
meter/d0 1 6 c1t4d0s0 c1t4d1s0 c1t4d2s0 c1t4d3s0 c1t4d4s0 c1t5d0s0 -i 256b
meter/d1 1 1 c1t4d0s6
[root drcs1] ksh$
Step 5: Find the entries in /etc/vfstab for meter
/dev/md/meter/dsk/d14 /dev/md/meter/rdsk/d14 /adev ufs 1 no
/dev/md/meter/dsk/d6 /dev/md/meter/rdsk/d6 /ora1data/METR ufs 1 no
/dev/md/meter/dsk/d10 /dev/md/meter/rdsk/d10 /ora1index/METR ufs 1 no
/dev/md/meter/dsk/d22 /dev/md/meter/rdsk/d22 /oraredo/METR ufs 1 no
/dev/md/meter/dsk/d26 /dev/md/meter/rdsk/d26 /redoarch/METR ufs 1 no
/dev/md/meter/dsk/d4 /dev/md/meter/rdsk/d4 /ora1data/HUB_TEST ufs 1 no
/dev/md/meter/dsk/d18 /dev/md/meter/rdsk/d18 /scratch ufs 1 no
Step 6: Unmount all the following file systems
# umount /appl/TEST
# umount /ora1data/METR
# umount /ora1index/METR
# umount /oraredo/METR
# umount /redoarch/METR
Check with df -k whether the file systems are unmounted
Step 7:- Clear all the trans devices that use d5
meter/d5: Logging device for meter/d6 meter/d10 meter/d14 meter/d18 meter/d22 meter/d26
[root drcs1] ksh$ metaclear -s meter d10
meter/d10: Trans is cleared
[root drcs1] ksh$ metaclear -s meter d14
meter/d14: Trans is cleared
[root drcs1] ksh$ metaclear -s meter d18
meter/d18: Trans is cleared
[root drcs1] ksh$ metaclear -s meter d22
meter/d22: Trans is cleared
[root drcs1] ksh$ metaclear -s meter d26
meter/d26: Trans is cleared
[root drcs1] ksh$ metaclear -s meter d5
meter/d5: Mirror is cleared

Step 8:- Find all the Trans device configuration has cleared
[root drcs1] ksh$ metastat -s meter -p
meter/d2 -m meter/d0 1
meter/d0 1 6 c1t4d0s0 c1t4d1s0 c1t4d2s0 c1t4d3s0 c1t4d4s0 c1t5d0s0 -i 256b
meter/d9 -m meter/d7 1
meter/d7 1 3 c1t3d1s0 c1t3d2s0 c1t3d3s0 -i 256b
meter/d13 -m meter/d11 1
meter/d11 1 1 c1t5d1s0
meter/d17 -m meter/d15 1
meter/d15 1 1 c2t5d1s0
meter/d21 -m meter/d19 1
meter/d19 1 1 c1t5d4s0
meter/d25 -m meter/d23 1
meter/d23 1 1 c1t5d2s0
meter/d1 1 1 c1t4d0s6
meter/d3 1 1 c1t3d4s6
meter/d4 1 1 c1t5d3s0
[root drcs1] ksh$
Step 9:- Mirror meter/d5 with meter/d3
[root drcs1] ksh$ metainit meter/d5 -m meter/d3
meter/d5: Mirror is setup
Step 10:- Attach the submirror meter/d1 to the mirror device meter/d5
[root drcs1] ksh$ metattach meter/d5 meter/d1
meter/d5: submirror meter/d1 is attached
Step 11:- Recreate all the trans devices that use d5
meter/d5: Logging device for meter/d6 meter/d10 meter/d14 meter/d18 meter/d22 meter/d26
# metainit meter/d6 -t meter/d2 meter/d5
# metainit meter/d10 -t meter/d9 meter/d5
# metainit meter/d14 -t meter/d13 meter/d5
# metainit meter/d18 -t meter/d17 meter/d5
# metainit meter/d22 -t meter/d21 meter/d5
# metainit meter/d26 -t meter/d25 meter/d5

Step 12:- Mount all the following file systems
# mount /dev/md/meter/dsk/d6
# mount /dev/md/meter/dsk/d10
# mount /dev/md/meter/dsk/d14
# mount /dev/md/meter/dsk/d18
# mount /dev/md/meter/dsk/d22
# mount /dev/md/meter/dsk/d26

Most Important:- Verification and confirmation
Check with df -k whether the file systems are mounted
Check that all the trans devices are present: metastat -s meter -p
Check whether any error exists: metastat -s meter (look for any hard error)
Check all the file systems specifically using ls -lrt
Confirm with the user and close the call.
How to do disk cloning on Solaris
Here is the procedure.
Install the disk.
You can do this in a few ways. Let the scenario be that the disk is already attached and has been labeled through
format.
The primary disk is c1t0d0s2.
# dd if=/dev/dsk/c1t0d0s2 of=/dev/dsk/c1t1d0s2 bs=256k
This will take time, depending on the size of the primary disk.
Verify that the clone disk has a clean file system:
# fsck -y /dev/rdsk/c1t1d0s0
To verify further, mount the clone disk:
# mount /dev/dsk/c1t1d0s0 /mnt
Change /etc/vfstab on the clone to point to the clone device:
# vi /mnt/etc/vfstab
After making the changes, boot from the clone disk.
-----Done
Backups
How will you run ufsdump and ufsrestore in a single command line?
# ufsdump 0f - /dev/rdsk/c0t0d0s6 | (cd /mnt/haz; ufsrestore xf -)
To check the status of the media inserted in the tape drive
# mt -f /dev/rmt/0 status
Syntax to execute a ufsdump
# ufsdump 0uf /dev/rmt/1 (file system)
Difference between the ufsdump and tar commands
ufsdump:
1. Used for complete file system backup.
2. It copies everything in a file system, from regular files to special character and block
device files.
3. It can work on mounted or unmounted file systems.
tar:
1. Used for single or multiple file backup.
2. Can't back up special character and block device files.
3. Works only on a mounted file system.
What is the difference between the crontab and at commands?
crontab: A job can be scheduled to run repeatedly
at: A job is run once only
What is the difference between incremental backup and differential backup?
Incremental: Only those files are included which have changed since the last backup of any kind.
Differential: Only those files are included which have changed since the last full backup.

How many ufsdump levels are there?


Levels 0-9
0 = Full backup
1-9 = Incremental backup of the files that have changed since the last lower-level backup.
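To make the "last lower-level backup" rule concrete, the sketch below takes a history of earlier dump levels and reports which one a new dump of a given level would be relative to. The function name base_dump and the sample history are assumptions for illustration; ufsdump itself makes this decision from /etc/dumpdates.

```shell
#!/bin/sh
# base_dump LEVEL HISTORY...: print the most recent earlier dump whose
# level is lower than LEVEL, or "none" when there is no such dump.
base_dump() {
    new_level=$1
    shift
    pos=0
    base=""
    for lvl in "$@"; do
        pos=$((pos + 1))
        if [ "$lvl" -lt "$new_level" ]; then
            base="dump $pos (level $lvl)"
        fi
    done
    echo "${base:-none}"
}

# History of earlier dumps, oldest first: levels 0, 5, 2, 7.
echo "level 3 is relative to: $(base_dump 3 0 5 2 7)"
echo "level 0 is relative to: $(base_dump 0 0 5 2 7)"
```

A level 3 dump here picks up changes since the level 2 dump, skipping the later level 7 one, while level 0 has no base and copies everything.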
Options in ufsdump
S = size estimate: the amount of space needed on tape
l = autoload
o = offline: take the drive offline once the backup is completed and, if possible, eject the media
u = update the /etc/dumpdates file (it records the name of the file system, the level of the backup 0-9 and the date)
f = specifies the tape device name
Options in ufsrestore
t = list the contents of the media
r = restore the entire file system
x = restore only the files named on the command line
i = interactive mode
v = verbose mode
f = specifies the tape device name
Disaster recovery steps if the OS is corrupted
ok boot cdrom -s
# newfs /dev/rdsk/c0t0d0s0
# mkdir /a
# mount /dev/dsk/c0t0d0s0 /a
# cd /a
# ufsrestore rf /dev/rmt/0
# rm restoresymtable
# cd /usr/platform/`uname -m`/lib/fs/ufs
# installboot bootblk /dev/rdsk/c0t0d0s0
# cd /
# umount /a
# fsck /dev/rdsk/c0t0d0s0
# init 6
From the ok prompt, execute
ok boot -r
How will you comment out an error line in the /etc/system file?
# vi /etc/system
(To comment out the error line in /etc/system, start the line with *)
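As a rough sketch of the same edit done non-interactively, the filter below prefixes a matching line with "* ". The function name comment_system_line and the sample tunable bogus:tunable are made up for illustration, and the example works on sample text rather than the live /etc/system. On Solaris, nawk may be needed in place of awk for the -v option.

```shell
# Hypothetical filter: reads /etc/system-style text on stdin and comments out
# every line that starts with the literal text given as $1.
comment_system_line() {
    awk -v pat="$1" '
        index($0, pat) == 1 { print "* " $0; next }   # comment out matching lines
        { print }                                      # pass other lines through
    '
}

# Sample input only; "bogus:tunable" is not a real tunable.
printf 'set maxusers=512\nset bogus:tunable=1\n' | comment_system_line 'set bogus:tunable'
```

Writing the result to a new file and comparing it with the original before copying it into place keeps a bad edit from making the system unbootable.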
How will you know whether to configure hme or eri for the network card?
It is based on the Ethernet card.
How will you find the process ID and disk utilization?
# pr
"Cannot open '/etc/path_to_inst'"
The system cannot find the /etc/path_to_inst file. It might be missing or corrupted and needs to be
rebuilt.
To rebuild this file, boot the system with the -ar option:
ok boot -ar
Press Enter to accept the default values for the questions asked during booting, and answer yes to rebuild
/etc/path_to_inst:

The /etc/path_to_inst on your system does not exist or is empty. Do you want to rebuild this file [n]? y
The system will continue booting after rebuilding the file.
Is it possible to edit the crontab using vi?
It is not recommended, but it is possible by editing the file directly:
# vi /var/spool/cron/crontabs/root
The command to edit the crontab is
# crontab -e
How will you restrict the users from login when system in heavy load

Is it possible to create swap on a new hard disk without formatting it?
No. Without labeling the drive, you can't do anything.
Veritas
How to find the daemons?
# ps -ef | grep vx
How to find the versions?
pkginfo -l VRTSvxvm
What are the daemons in VERITAS?
Vxconfigd
Vxiod
vxrelocd
How will you create a rootdg?
# vxdg -g rootdg adddisk newroot=c2t2d0
How to mirror a rootdg
Initialize a new disk
# devfsadm
# format (to label the disk)
# vxdctl enable
# vxdisksetup -i diskname (c0t1d0s2)
# vxdg -g rootdg adddisk rootnew=c0t1d0s2
#vxprint list
# vxrootmir rootnew
How to remove the root mirror
First disassociate the plex
#vxunroot
Reboot the system ( it will remove the entry from /etc/system and /etc/vfstab)
#vxplex dis rootvol-02

Now, the slice information for disk newroot (c2t2d0) is:

# prtvtoc -s /dev/rdsk/c2t2d0s2

Delete the rootalt volume from the new root disk.


# cd /
# umount /mnt
# vxvol stop rootalt
# vxplex dis rootvol-02 (Disassociate the plex from the volume and remove the volume.)
# vxedit rm rootalt
# vxsd dis newroot-01
(Disassociate the sub-disk from the plex and remove the plex.)
# vxedit rm rootvol-02
7. Mirror all the other volumes from the current root disk to the new root disk.Do not mirror swap volumes. Swap slices
will be created on the new disk manually. In this example, the volumes to mirror are var and opt.
# vxassist -g rootdg mirror var newroot
# vxassist -g rootdg mirror opt newroot

What is an alternative to the top command on Solaris?


# prstat -a

To find out the number of the disk available on veritas


vxdisk -o alldgs list | more
What steps should be followed to add a disk in Veritas, and what should be done before adding
the disk?
Before adding the disk:
Save the output of the format command.
Save the output of vxdisk list.
After the disk is added, do the following:
# devfsadm
# format (label the disk)
# vxdctl enable
# vxdiskadm (choose option 1; it will ask for the disk group, and once the disk has been added it will
ask about encapsulation: say no. It will then ask for the device name: assign the name.)
# vxdisk list (this shows the status of the newly added disk as online)
How will you create a subdisk and a plex?
# vxmake sd disk02-02 disk02,0,8000
This creates a subdisk called disk02-02 at the start of disk02
that is 8000 blocks (4000k) long.
# vxmake plex hazly-02 sd=disk03-01
This creates a plex called hazly-02 using subdisk disk03-01.
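The "8000 blocks (4000k)" figure for the subdisk follows from the sector size. A minimal sketch of the arithmetic, assuming 512-byte sectors (the helper name sectors_to_kb is made up for illustration):

```shell
# Convert a length in 512-byte sectors to kilobytes.
# Assumption: one VxVM block/sector is 512 bytes.
sectors_to_kb() {
    echo $(( $1 * 512 / 1024 ))
}

sectors_to_kb 8000   # prints 4000
```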
How will you remove a subdisk and a plex?
To disassociate a subdisk:
# vxsd dis disk##-##
To remove a subdisk:
# vxedit rm disk##-##
How will you grow the disk space without disruption?
# vxresize
How to find the VERITAS license?
After installing VERITAS, confirm the VERITAS license keys:
# vxlicense -p or #vxlicreg -g
How to create a new volume
Detect new disks
# devfsadm
# vxdiskadm (select option 1 to add new disks: initialize the new disks and add them to disk group appdg)
# vxassist -g appdg make snetprd01 55g (create the volume snetprd01 in appdg)
# mkfs -F vxfs -o largefiles /dev/vx/rdsk/appdg/snetprd01 (create a file system on the volume
snetprd01)
# cd /
(create a directory for snetprd01)
# mkdir snetprd01
# chown oracle /snetprd01 (change the ownership of the snetprd01 directory to oracle)
# vi /etc/vfstab
# mount /snetprd01
# df -k
# mount -p
# vxdisk list
# vxprint -g appdg

How to remove the volume


Restore the previous /etc/vfstab
# cp /etc/vfstab.0702 /etc/vfstab
# umount /snetprd01 (unmount the file system)
# rmdir /snetprd01 (remove the mount point)
# vxvol -g appdg stop snetprd01 (stop the volume snetprd01 in appdg)
# vxedit -g appdg -v rm snetprd01 (remove the volume from the disk group)
# df -k
# vxprint -g appdg
What is shrinking and how to shrink a volume?
# vxassist shrinkto vol_name 1000
This shrinks the volume to a length of 1000 sectors (use shrinkby to shrink it by a given amount).
Make sure you don't shrink a volume below the current
size of the file system.

Explain shrinking the disk

Shrinking the disk is nothing but reducing the allocated volume
size.
Approach 2 Veritas Snap Procedure

This approach can be used for both first time complete refresh and ongoing mirroring process

Step 1: Create a VERITAS snap in the same server at source system

Step 3: NFS mount the Mirror volume to the remote server


Step 4: Make a copy of snap in the remote server

Procedure for Snapshot Volume

1. You should know the volume name.
2. Give a new temporary snapshot volume name.
3. Find the available disk space to copy the snapshot volume.
Commands to execute
Take a copy of the output of:
# vxprint -Aht | more
#vxprint list
To display information about DB1, DB2 and DB3:
# vxprint -t DB1
# vxprint -t DB2
# vxprint -t DB3

Create a snapshot volume (execute the commands below to start the snapshot):

# vxassist -g <give the dg name> -b snapstart DB1 snapdb1
# vxassist -g <give the dg name> -b snapstart DB2 snapdb2
# vxassist -g <give the dg name> -b snapstart DB3 snapdb3
Verify that the snapshot is completed (it will show 2 plexes for the volume):
# vxprint -g <give the dg name> snapdb1
# vxprint -g <give the dg name> snapdb2
# vxprint -g <give the dg name> snapdb3

Detach the temporary snapshot and associate it with the new volume name

# vxassist -g <give the dg name> snapshot snapdb1-02 snap-db1
# vxassist -g <give the dg name> snapshot snapdb2-02 snap-db2
# vxassist -g <give the dg name> snapshot snapdb3-02 snap-db3
Check the file systems using the fsck command
# fsck -y /dev/vx/rdsk/<give the dg name>/snap-db1
# fsck -y /dev/vx/rdsk/<give the dg name>/snap-db2
# fsck -y /dev/vx/rdsk/<give the dg name>/snap-db3

Make a temporary directory and mount the file system

# mkdir /hazly
# mount /dev/vx/dsk/<give the dg name>/snap-db1 /hazly
Edit the /etc/dfs/dfstab file and add these entries:
# vi /etc/dfs/dfstab
share -F nfs -o rw=<server name> /snap-db1
share -F nfs -o rw=<server name> /snap-db2
share -F nfs -o rw=<server name> /snap-db3
Mount the file systems on the client, or put the entries in /etc/vfstab on the client side.
Create the mount points
# mkdir /hazly-db1
# mkdir /hazly-db2
# mkdir /hazly-db3
# mount -F nfs servername:/snap-db1 /hazly-db1
# mount -F nfs servername:/snap-db2 /hazly-db2
# mount -F nfs servername:/snap-db3 /hazly-db3
If the entries are in /etc/vfstab, mount all the file systems
# mount /hazly-db1
# mount /hazly-db2
# mount /hazly-db3
----------------End----------------
If you want to back up the snapshot files, follow the procedure below.
Back up the files
# tar cvf /dev/rmt/0 /hazly
or
# ufsdump 0uf /dev/rmt/0 /dev/vx/rdsk/<give the dg name>/snap-db1
The /ora1data/CUSMARP2 file system could not be mounted on Veritas
Solution: the Veritas volume's plex was marked stale and then clean
root on LIVE dqdmart02 # history -200
6978 find / -name vxva
6987 mount /ora1data/CUSMARP2
6994 vxprint -Ath | more
6995 mount /ora1data/CUSMARP2
6996 grep -i ora1data /etc/vfstab
6997 mount -F vxfs /dev/vx/dsk/cusmarp2_dg/vol_ora1data /ora1data/CUSMARP2
6998 vxdisk -list
6999 vxdisk list
7000 grep -i ora1data /etc/vfstab
7001 vxvea
7002 vea
7003 man vxrecover
7004 vxrecover -s -g cusmarp2_dg vol_ora1data
7005 vxrecover -v -g cusmarp2_dg vol_ora1data
7006 vxprint -Ath | more
7007 format
7008 datapath
7009 datapath query device | more
7010 vxprint -Ath | more
7011 mount -F vxfs /dev/vx/dsk/cusmarp2_dg/vol_ora1data /ora1data/CUSMARP2
7012 vxdiskadm

7014 vxprint | more
7015 vxdisk list
7016 vxprint -Ath | more
7017 mount -F vxfs /dev/vx/dsk/cusmarp2_dg/vol_ora1data /ora1data/CUSMARP2
7021 ./vxse &
7027 vxdiskadm
7028 ps -aef | grep -i pmon
7035 mount -F vxfs /dev/vx/dsk/cusmarp2_dg/vol_ora1data /ora1data/CUSMARP2
7037 mount -F vxfs /dev/vx/dsk/cusmarp2_dg/vol_ora1data /ora1data/CUSMARP2
7042 mount -F vxfs /dev/vx/dsk/cusmarp2_dg/vol_ora1data /ora1data/CUSMARP2
7043 vxprint -Ath
7044 mount -F vxfs /dev/vx/dsk/cusmarp2_dg/vol_ora1data /ora1data/CUSMARP2
7048 vxprint -Ath
7049 mount -F vxfs /dev/vx/dsk/cusmarp2_dg/vol_ora1data /ora1data/CUSMARP2
7050 vxprint -Ath
7051 vxmend -g cusmarp2_dg fix stale vol_ora1data-01
7052 vxprint -Ath
7053 vxmend -g cusmarp2_dg fix clean vol_ora1data-01
7054 vxprint -Ath
7055 vxvol -g cusmarp2_dg start vol_ora1data
7056 vxprint -Ath
7057 mount -F vxfs /dev/vx/dsk/cusmarp2_dg/vol_ora1data /ora1data/CUSMARP2
7058 fsck -F vxfs /dev/vx/rdsk/cusmarp2_dg/vol_ora1data
7059 mount -F vxfs /dev/vx/dsk/cusmarp2_dg/vol_ora1data /ora1data/CUSMARP2
7060 df -k
root on LIVE dqdmart02 #

How to find the plex, subdisk, volume group, disk status, free space, disk controller and
volume controller?
Display information about plexes
# vxprint -lp
# vxprint -l plex_name
Display information about subdisks
# vxprint -st
# vxprint -l disk##-##
Show disk IOPS over 10 seconds
# ssaadm display -p c#
Trace all I/O on a volume
# vxtrace vol
Report disk statistics
# vxstat -d
Display the free space on the disks
# vxdg free
Display the disk controller
#vxprint list
# vxprint -Aht
Show how much a volume can grow by
# vxassist maxgrow vol
In VERITAS How to recover the mirror disk with data.

How to bring an existing data disk under VERITAS Volume Manager control?
Through the encapsulation method
How to change the mirror status from 0:1 to 0:5?
# vxassist -g <dgname> relayout <volume name>
Where is the Veritas disk information stored?
# /kernel/drv/sd.conf
How to increase the size of the file system?
# df -k (identify the VERITAS disk group and volume name of /myhr on EXU407)
# vxassist -g appdg maxsize (check the free space available in the disk group appdg on EXU407)
# vxprint -thA -g appdg (check whether /myhr is mirrored)
# mount -p (check the file system type of /myhr)
# cp -p /etc/vfstab /etc/vfstab.070223 (take a backup of the /etc/vfstab file)
You can increase the size of the file system by using Veritas Volume Manager:
# vxresize -F vxfs -g rootdg myapps +5g (increase the size of /myapps by 5 GB)
How to decrease the size of the file system?
# vxresize -F vxfs -g rootdg myapps -5g (decrease the size of /myapps by 5 GB)
# df -k
# vxresize -b -F vxfs ora03vol +70g
What is encapsulation?
Encapsulation brings disks that already exist in the system with data, but outside Volume Manager,
under Volume Manager control. The data on these disks is not disturbed, and if the disks meet
certain Volume Manager requirements they are added under Volume Manager.
How to add a root dg?
#vxdg -g <dg name> adddisk <media name>=c2t0d0
What is the difference between VERITAS 3.0 and VERITAS 4.0?
In VERITAS 3.0, the rootdg is present by default.
In VERITAS 4.0, the rootdg has to be created manually.
In VERITAS 4.0, the cdsdisk format was introduced, which means a disk can be exported to any OS.
How to mirror a root dg? With eg
#vxrootmir
How to rename the old root disk. In this example, rootdisk is being renamed as rootold.
# vxedit -g rootdg rename rootdisk rootold
How to Encapsulate the new root disk (c2t2d0).
# /etc/vx/bin/vxencap rootdisk=c2t2d0
How to mirror and unmirror a disk?
# vxassist mirror hazly disk05 disk06
Uses disks 5 and 6 to make a mirror of the volume called hazly.
# vxassist make hazly 50m layout=mirror
Makes a 50 MB mirrored volume using any 2 disks.
How to remove a disk group?
#vxdg destroy <disk group>

How to display the volume configuration information?


#vxprint -l rootdg
What is the command to identify master node on vcs
#vxdctl -c mode
mode: enabled: cluster active - MASTER
What is the nature of VCS?
Active-Active: load balancing and high availability
Active-Passive: no load balancing
What is the physical device to check that the 2 nodes are connected in the cluster?
The physical fiber cable that connects the 2 nodes is called the HEARTBEAT.
What is HA?
It is a daemon of a cluster in Active-Passive form, i.e. no load balancing.
HA (Highly Available) means that two or more systems are connected with the same configuration; if one
fails, the other takes over the resources.
How far does VCS scale?
VCS supports up to 32 nodes.
What are the protocols in VCS?
LLT (Low Latency Transport, package VRTSllt) and GAB (Group Membership Services/Atomic Broadcast, package VRTSgab)
How to clear the failing flag on a Veritas Volume Manager disk:
Use the vxdisk list command to find out which disks are failing:
# vxdisk list
DEVICE       TYPE         DISK     GROUP    STATUS
c1t1d0s2     auto:sliced  mydg01   mydg     online
c1t2d0s2     auto:sliced  mydg02   mydg     online failing
c1t3d0s2     auto:sliced  mydg03   mydg     online
# vxedit set failing=off mydg02
# vxdisk list
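Picking the failing disks out of a long vxdisk list can be sketched with a small awk filter. The function name failing_disks is made up for illustration, and the sample input reuses the three-disk listing from this example; on a live system the function would be fed from vxdisk list instead.

```shell
# Hypothetical filter: reads "vxdisk list" style output on stdin and prints the
# disk media name (third column) of every disk whose status includes "failing".
failing_disks() {
    awk 'NR > 1 {                       # skip the header line
        for (i = 5; i <= NF; i++)       # status words start at column 5
            if ($i == "failing") { print $3; break }
    }'
}

# Sample input taken from the listing in this section.
failing_disks <<'EOF'
DEVICE       TYPE         DISK     GROUP    STATUS
c1t1d0s2     auto:sliced  mydg01   mydg     online
c1t2d0s2     auto:sliced  mydg02   mydg     online failing
c1t3d0s2     auto:sliced  mydg03   mydg     online
EOF
```

The names it prints are the ones to pass to vxedit set failing=off once the cause of the I/O errors has been dealt with.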

How to create a snapshot mirror

snapstart starts creating an online snapshot mirror of the volume using the available disk space. The
snapshot is completed with the vxassist snapshot command, which creates an offline snapshot volume with a
user-defined name.
Command syntax: vxassist snapstart volume_name
To create a snapshot mirror of a volume called vol8, type

# vxassist snapstart vol8

How to bring a disk into a disk group on Veritas

Force a LIP event on the HBAs so that they log in to the storage again
# luxadm -e forcelip /dev/cfg/c5
# luxadm -e forcelip /dev/cfg/c7
Enable the disk devices
# vxdctl enable
# vxdisk -o alldgs list
Initialize the new disk
# vxdisksetup -i <diskname>
Add the disk to the disk group
# vxdg -g oradg adddisk oradg05=<diskname>
# vxdg -g oradg adddisk oradg06=<diskname>

Verify the free disk space

# vxassist -g oradg maxsize
How to Create a snapshot volume
Command Syntax
vxassist snapshot volume_name new_volume_name
To create a snapshot volume of vol8, type:
#vxassist snapshot vol8 snapvol8
You can now back up the snapshot volume by whatever means you prefer. To avoid wasting space, you
can then remove the snapshot volume, which occupies as much space as the original volume
What are the types of cluster configuration?
1. Horizontal configuration
2. Vertical configuration
Command to find the VCS status
# hastatus -sum
What are the minimum hardware requirements for cluster configuration?
Storage boxes: 1
Cluster nodes: 2
Minimum 2 nodes, 2 Ethernet addresses, a shared disk and an HA application (e.g. Oracle)
What is the purpose of ha daemons in VCS servers?
Ha daemons are used to start/stop services in VCS servers
How to check the communication between 2 nodes?
Heart beat checks the communication between 2 nodes.
What is a heart-beat?
It is a script that checks the communication between nodes.
The heartbeat is a communication link that is set up when a system is added to a cluster; it
sends and receives signals through the designated port.
To check the heartbeat, use the command gabconfig -a
What are the two types of service groups?
1. Parallel Service group
2. Fail over service group.
How to unconfigure LLT and GAB
# hastop -local
# gabconfig -U
# lltconfig -U
How to start LLT and GAB
# lltconfig -c
# gabconfig -c -x
# hastart
How to start a one-node cluster
ok boot -x
How to stop one cluster node exclusively
# hastop -local -force (this brings down only VCS; the application stays alive)
Where are the VCS logs?
# /var/VRTSvcs/log/engine_A.log

What are the configuration files, and how are they configured in VCS?

Configuration files:
Note: Before configuring VCS, make sure local-mac-address? is set to true.
# eeprom local-mac-address?=true (on both nodes)
# /etc/llthosts (specify the node names)
(e.g.) 0 sun1
1 sun2
# /etc/llttab (specify the node ID, the links and the cluster ID)
set-node 0
link qfe0 /dev/qfe:0 - ether - -
link qfe1 /dev/qfe:1 - ether - -
set-cluster 10
start
# /etc/gabtab (specify the number of nodes)
/sbin/gabconfig -c -n 2
Path to be set in /etc/profile
# PATH=$PATH:/opt/VRTS/bin:/sbin:/opt/VRTSllt
# export PATH
# /etc/VRTSvcs/conf/config/main.cf
# /etc/VRTSvcs/conf/config/sysname
How to bring a service group online and offline
# /opt/VRTSvcs/bin/hagrp -online (service_group) -sys (system_name)
# /opt/VRTSvcs/bin/hagrp -offline (service_group) -sys (system_name)
How to switch a service group between nodes
# /opt/VRTSvcs/bin/hagrp -switch (service_group) -to (system_name)
How to freeze a service group (disable onlining and offlining)
# /opt/VRTSvcs/bin/hagrp -freeze (service_group) [-persistent]
How to unfreeze a service group (enable onlining and offlining)
# /opt/VRTSvcs/bin/hagrp -unfreeze (service_group) [-persistent]
What is the command to check the connectivity between 2 nodes?
Get the MAC address from both nodes
# getmac /dev/qfe:0
-sv on the server side
-cv on the client side
# ./dlpiping -sv /dev/qfe:0 macaddress
# ./dlpiping -cv /dev/qfe:0 macaddress
How to stop VCS
# hastop -local
# hastop -local -evacuate (moves the service groups to another node, then stops VCS on this node)
# hastop -local -force (only VCS, the HA daemon, goes down; the application is not shut down)
What are the service group dependencies?
4 types
Online local
Online remote
Online global
Offline local

How to delete a service group

1. Bring all the resources offline
2. Disable the resources
3. Delete the resources
E.g.: # hares -delete mysun
How to add a group
# haconf -makerw
# hagrp -add groupname
# hagrp -modify groupname SystemList -add node1 0 node2 1
# haconf -dump -makero
If the main.cf file is corrupted, how will you rectify it?
# hastop -all
Create a config file
#dtpad /etc/VRTSvcs/bin/sysname
#mkdir /etc/VRTSvcs/bin/sysname/config
#cp types.cf config
#cd config
#dtpad main.cf
#vi main.cf
include "types.cf"
cluster mycluster
system node1
system node2
snmp mycluster
# hacf -verify .
# hacf -cftocmd .
# hastart
# hastatus -sum

SUN CLUSTER offline cluster resources on smslu131


verify resources are online on tgui1-svc
# scstat
Take all resources offline
# scswitch -F -g smsweb-rg
verify resources are offline on both tgui1 & tgui2
# scstat
verify resources are offline on tgui1-svc & tgui2-svc
# scstat
switch resources online on smslu131
# scswitch -Z -g smsweb-rg
verify resources are online on tgui1-svc
#scstat
verify resources are online on tgui1-svc
# scstat
i) Communicate to OPS to ignore the alerts on these servers: phys-hhdc1 & phys-hhdc2.
ii) Switch the resource group "hhda-rg" back from phys-hhdc1 to phys-hhdc2 using the command shown below:

scswitch -z -g hhda-rg -h phys-hhdc2

iii) Check that the resource group is available on phys-hhdc2.
iv) Communicate to OPS to start monitoring the alerts on these servers: phys-hhdc1 & phys-hhdc2.
Switch resources from smslu131 to smslu132
# scswitch -z -g smsweb-rg -h tgui2-svc
Verify that the resources are online on tgui2-svc
# scstat
Switch resources from smslu132 back to smslu131
# scswitch -z -g smsweb-rg -h tgui1-svc
Verify that the resources are online on tgui1-svc
# scstat
Disk Replacement on phys-mw1 and phys-mw2
Symptoms:
DID devices d18, d30, d39 are in Needs Maintenance state. (phys-mw2 on SENA-B)

format command was showing drive type unknown for these DID devices.

Resolution:
Step-I
The disks were hot-swappable & were replaced online.
We need to reboot the servers phys-mw1 & phys-mw2 for the disks to come under OS/Cluster control.
Here is the procedure for rebooting of phys-mw1 and phys-mw2:

1. Login to console of phys-mw1 & phys-mw2.


telnet 10.100.175.6 using ports 2 & 3

2. Login to phys-mw1 and switch the resource group gisdbmw to phys-mw2.


Phys-mw1# scadmin switch mwgis phys-mw2 gisdbmw

on phys-mw1

(Monitor on both console sessions for any errors/warning during switchover. Wait for 5-10 mins for the
switchover)

3. Confirm that cluster is stable & both gisdbmw & gisfsmw are owned by phys-mw2.
Phys-mw2# hastat

.on phys-mw2

4. Stop the cluster services on phys-mw1


Phys-mw1# scadmin stopnode

.on phys-mw1

(Monitor on both console sessions for any errors/warning during stopping for cluster services. Wait for 5-10
mins for this activity)

5. Confirm that cluster is stable & both gisdbmw & gisfsmw are owned by phys-mw2. phys-mw1 will NOT be a
cluster member now.
Phys-mw2# hastat

.on phys-mw2

6. Reboot phys-mw1 server


Phys-mw1# shutdown -y -g0 -i6

.on phys-mw1

7. Waiting for phys-mw1 to return.. Monitoring phys-mw1 from console for any issues.
8. Login to phys-mw1 & start the cluster services.
Phys-mw1# scadmin startnode

.on phys-mw1

(Monitor on both console sessions for any errors/warning during starting for cluster services. Wait for 5-10 mins
for this activity)

9. Confirm that cluster is stable & both gisdbmw & gisfsmw are owned by phys-mw2. phys-mw1 will be a cluster
member now.
Phys-mw2# hastat

.on phys-mw2

10. From phys-mw2, Switch back gisdbmw back to phys-mw1.


Phys-mw2# scadmin switch mwgis phys-mw1 gisdbmw

..on phys-mw2

(Monitor on both console sessions for any errors/warning during switchover. Wait for 5-10 mins for the
switchover)

11. Confirm that cluster is stable & gisdbmw is owned by phys-mw1 & gisfsmw is owned by phys-mw2.
Phys-mw2# hastat

on phys-mw2

12. Switch the resource group gisfsmw from phys-mw2 to phys-mw1


Phys-mw2# scadmin switch mwgis phys-mw1 gisfsmw

on phys-mw2

(Monitor on both console sessions for any errors/warning during starting for cluster services. Wait for 5-10 mins
for this activity)

13. Confirm that cluster is stable & both gisdbmw & gisfsmw are owned by phys-mw1.
Phys-mw2# hastat
Phys-mw1# hastat

14. Stop the cluster services on phys-mw2


Phys-mw2# scadmin stopnode

.on phys-mw2

(Monitor on both console sessions for any errors/warning during stopping for cluster services. Wait for 5-10
mins for this activity)

15. Confirm that cluster is stable & both gisdbmw & gisfsmw are owned by phys-mw1. phys-mw2 will NOT be a
cluster member now.
Phys-mw1# hastat
Phys-mw2# hastat

16. Reboot phys-mw2 server


Phys-mw2# shutdown -y -g0 -i6

..on phys-mw2

17. Wait for phys-mw2 to return. Monitor phys-mw2 from the console for any issues.
18. Login to phys-mw2 & start the cluster services.
Phys-mw2# scadmin startnode

.on phys-mw2

(Monitor on both console sessions for any errors/warning during starting for cluster services. Wait for 5-10 mins
for this activity)

19. Confirm that cluster is stable & both gisdbmw & gisfsmw are owned by phys-mw1. phys-mw2 will be a cluster
member now.
Phys-mw2# hastat

.on phys-mw2

20. Switch the resource group gisfsmw from phys-mw1 to phys-mw2


Phys-mw1# scadmin switch mwgis phys-mw2 gisfsmw

on phys-mw1

(Monitor on both console sessions for any errors/warning during starting for cluster services. Wait for 5-10 mins
for this activity)

21. Check if everything is fine on both the server

Phys-mw1# hastat

.Check if resource groups are owned by correct servers

Phys-mw1# df -k

.Check if all the filesystem are mounted

Phys-mw2# hastat

.Check if resource groups are owned by correct servers

Phys-mw2# df -k

.Check if all the filesystem are mounted

22. Now that the disks are replaced, check whether they are visible from the format command.
If they are not visible, run these commands: drvconfig, devlinks, disks
Suppose, the disk is not viewable from phys-mw1, then run these commands.
Phys-mw1# drvconfig
Phys-mw1# devlinks
Phys-mw1# disks
Phys-mw1# format

Check if the disk is viewable.

23. The disks need to be added as DID devices at the kernel level.

To add DID devices d18, d30 and d39, run scdidadm -R <DID> from the primary node (phys-mw1)
Phys-mw1# scdidadm -R d18
Phys-mw1# scdidadm -R d30
Phys-mw1# scdidadm -R d39

24. hastat output is as shown in Appendix A.


25. Partition the disks and mirror them as described in Appendix B.
Note: If any state database replicas exist on the failed disk (DID device), delete the state database
replicas and then reboot the server. On phys-mw1, DID device d18 contained state database replicas. These
were removed before the reboot.
phys-mw1# metadb -s gisdbmw
        flags           first blk       block count
                        16              1034            /dev/did/dsk/d18s7
                        1050            1034            /dev/did/dsk/d18s7

phys-mw1# metadb -d /dev/did/dsk/d18s7

After the reboot and after scdidadm -R d18:
Phys-mw1# metadb -a -c 2 /dev/did/dsk/d18s7
Appendix A : hastat output
phys-mw1# hastat
Getting Information from all the nodes ......
HIGH AVAILABILITY CONFIGURATION AND STATUS
------------------------------------------
LIST OF NODES CONFIGURED IN <mwgis> CLUSTER
phys-mw1 phys-mw2
CURRENT MEMBERS OF THE CLUSTER
phys-mw1 is a cluster member
phys-mw2 is a cluster member
CONFIGURATION STATE OF THE CLUSTER
Configuration State on phys-mw1: Stable

Configuration State on phys-mw2: Stable


UPTIME OF NODES IN THE CLUSTER
uptime of phys-mw1:

3:56pm up 34 day(s), 22:13, 2 users, load average: 0.97, 1.03, 1.01

uptime of phys-mw2:

3:56pm up 34 day(s), 19:43, 6 users, load average: 1.29, 1.24, 1.38

LOGICAL HOSTS MASTERED BY THE CLUSTER MEMBERS


Logical Hosts Mastered on phys-mw1:
gisdbmw
Logical Hosts for which phys-mw1 is Backup Node:
gisfsmw
Logical Hosts Mastered on phys-mw2:
gisfsmw
Logical Hosts for which phys-mw2 is Backup Node:
gisdbmw
LOGICAL HOSTS IN MAINTENANCE STATE
None
STATUS OF PRIVATE NETS IN THE CLUSTER
Status of Interconnects on phys-mw1:
interconnect0: selected
interconnect1: up
Status of private nets on phys-mw1:
To phys-mw1 - UP
To phys-mw2 - UP
Status of Interconnects on phys-mw2:
interconnect0: selected
interconnect1: up
Status of private nets on phys-mw2:
To phys-mw1 - UP
To phys-mw2 - UP
STATUS OF PUBLIC NETS IN THE CLUSTER
Status of Public Network On phys-mw1:
bkggrp  r_adp      status  fo_time  live_adp
nafo0   hme0:hme3  OK      NEVER    hme0
Status of Public Network On phys-mw2:
bkggrp  r_adp      status  fo_time  live_adp
nafo0   hme0:hme3  OK      NEVER    hme0
STATUS OF DATA SERVICES RUNNING IN THE CLUSTER
Status Of Registered Data Services
mwgisapi:
On
sybase:
On
nfs:
On
Status Of Data Services Running On phys-mw1
Data Service "mwgisapi":
Not being managed on this system
Data Service "sybase":
Database Status on phys-mw1:

gisdbmw - running;
Data Service "nfs":
Not being managed on this system
Status Of Data Services Running On phys-mw2
No Status Method for Data Service "mwgisapi"
Data Service "sybase":
Not being managed on this system
Data Service "nfs":
On Logical Host gisfsmw:
Ok
RECENT ERROR MESSAGES FROM THE CLUSTER
Recent Error Messages on phys-mw1
Aug 9 14:27:00 phys-mw1 snmpdx: agent_process() : bad community from 10.100.202.1
Aug 9 14:27:00 phys-mw1 snmpdx: local pdu process error
Aug 9 14:27:00 phys-mw1 snmpdx: session_send_loopback_request() failed
Aug 9 14:27:00 phys-mw1 snmpdx: session_open() failed for a pdu received from 10.100.202.1.50038
Aug 9 14:35:05 phys-mw1 ID[SUNWcluster.fdl_enum_probe_disks.1105]: Accessed majority of diskset gisfsmw
replicas
Aug 9 14:35:08 phys-mw1 ID[SUNWcluster.fdl_enum_probe_disks.3036]: Cannot access disk c4t19d0
(/dev/did/rdsk/d30) in diskset gisfsmw, possibly due to a bad cable or controller.
Aug 9 14:35:08 phys-mw1 ID[SUNWcluster.fdl_enum_probe_disks.3036]: Cannot access disk c4t22d0
(/dev/did/rdsk/d39) in diskset gisfsmw, possibly due to a bad cable or controller.
Recent Error Messages on phys-mw2
Aug 9 14:26:58 phys-mw2 snmpdx: local pdu process error
Aug 9 14:26:58 phys-mw2 snmpdx: session_send_loopback_request() failed
Aug 9 14:26:58 phys-mw2 snmpdx: session_open() failed for a pdu received from 10.100.202.1.50039
Aug 9 14:40:17 phys-mw2 majmddb[7394]: open of '/dev/did/rdsk/d18s7' failed - No such device or address
Aug 9 14:40:17 phys-mw2 last message repeated 1 time
Aug 9 14:40:17 phys-mw2 majmddb[7394]: Finished all listed devices: succeeded=24 toolong=0 bad=2
Aug 9 14:40:17 phys-mw2 ID[SUNWcluster.fdl_enum_probe_disks.1105]: Accessed majority of diskset gisdbmw
replicas
phys-mw1#
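Several steps of the procedure ask you to confirm which node owns gisdbmw and gisfsmw. As a rough sketch, the "Logical Hosts Mastered on" part of the hastat output can be reduced to one "logical-host node" pair per line. The function name logical_host_owners is made up for illustration, and the sample input is copied from the output above; on a live system the function would read from hastat instead.

```shell
# Hypothetical filter: reads hastat output on stdin and prints
# "<logical host> <node that masters it>" for each mastered logical host.
logical_host_owners() {
    awk '
        /Logical Hosts Mastered on / { node = $NF; sub(/:$/, "", node); grab = 1; next }
        /Logical Hosts for which /   { grab = 0; next }   # backup-node lists are skipped
        /LOGICAL HOSTS IN /          { grab = 0 }         # next section of the report
        grab && NF == 1              { print $1, node }
    '
}

# Sample input taken from Appendix A.
logical_host_owners <<'EOF'
Logical Hosts Mastered on phys-mw1:
gisdbmw
Logical Hosts for which phys-mw1 is Backup Node:
gisfsmw
Logical Hosts Mastered on phys-mw2:
gisfsmw
Logical Hosts for which phys-mw2 is Backup Node:
gisdbmw
LOGICAL HOSTS IN MAINTENANCE STATE
None
EOF
```

For the sample above this reports gisdbmw on phys-mw1 and gisfsmw on phys-mw2, which matches the state expected at the end of the procedure.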
Appendix B: Create partitions & mirroring of the disks

1. To check which disks are mirrored.


phys-mw1# metastat -s gisdbmw d20
gisdbmw/d20: Trans
State: Okay
Size: 17107524 blocks
Master Device: gisdbmw/d21
Logging Device: gisdbmw/d24
gisdbmw/d21: Mirror
Submirror 0: gisdbmw/d22
State: Needs maintenance
Submirror 1: gisdbmw/d23
State: Okay
Pass: 1
Read option: roundrobin (default)
Write option: parallel (default)
Size: 17107524 blocks
gisdbmw/d22: Submirror of gisdbmw/d21

State: Needs maintenance


Invoke: metareplace gisdbmw/d21 /dev/did/dsk/d18s0 <new device>
Size: 17107524 blocks
Stripe 0:
Device               Start Block  Dbase  State        Hot Spare
/dev/did/dsk/d18s0          0     No     Maintenance
gisdbmw/d23: Submirror of gisdbmw/d21
State: Okay
Size: 17107524 blocks
Stripe 0:
Device               Start Block  Dbase  State        Hot Spare
/dev/did/dsk/d25s0          0     No     Okay

2. Check what disks represent these did devices.


phys-mw1# scdidadm -L | grep d25
25       phys-mw2:/dev/rdsk/c4t1d0      /dev/did/rdsk/d25
25       phys-mw1:/dev/rdsk/c4t1d0      /dev/did/rdsk/d25
phys-mw1# scdidadm -L | grep d18
18       phys-mw2:/dev/rdsk/c2t1d0      /dev/did/rdsk/d18
18       phys-mw1:/dev/rdsk/c2t1d0      /dev/did/rdsk/d18

3. Partition the disks as follows:


phys-mw1# format c4t1d0 c2t1d0
AVAILABLE DISK SELECTIONS:
0. c2t1d0 <SUN9.0G cyl 4924 alt 2 hd 27 sec 133>
/sbus@3,0/SUNW,socal@0,0/sf@0,0/ssd@w21000020374fe013,0
1. c4t1d0 <SUN9.0G cyl 4924 alt 2 hd 27 sec 133>
/sbus@b,0/SUNW,socal@0,0/sf@0,0/ssd@w2100002037222d57,0
Specify disk (enter its number): 1
selecting c4t1d0
[disk formatted]
format> p
PARTITION MENU:
0      - change `0' partition
1      - change `1' partition
2      - change `2' partition
3      - change `3' partition
4      - change `4' partition
5      - change `5' partition
6      - change `6' partition
7      - change `7' partition
select - select a predefined table
modify - modify a predefined partition table
name - name the current table
print - display the current table
label - write partition map and label to the disk
!<cmd> - execute <cmd>, then return
quit
partition> p
Current partition table (original):
Total disk cylinders available: 4924 + 2 (reserved cylinders)
Part      Tag    Flag     Cylinders        Size            Blocks
  0 unassigned    wm     117 - 4880        8.16GB    (4764/0/0) 17107524
  1 unassigned    wm       0               0         (0/0/0)           0
  2 unassigned    wm       0               0         (0/0/0)           0
  3 unassigned    wm       0               0         (0/0/0)           0
  4 unassigned    wm       2 -  116      201.64MB    (115/0/0)    412965
  5 unassigned    wm    4881 - 4886       10.52MB    (6/0/0)       21546
  6 unassigned    wm    4887 - 4923       64.88MB    (37/0/0)     132867
  7 unassigned    wu       0 -    1        3.51MB    (2/0/0)        7182

partition> q
FORMAT MENU:
disk      - select a disk
type      - select (define) a disk type
partition - select (define) a partition table
current   - describe the current disk
format    - format and analyze the disk
repair    - repair a defective sector
label     - write label to the disk
analyze   - surface analysis
defect    - defect list management
backup    - search for backup labels
verify    - read and display labels
save      - save new disk/partition definitions
inquiry - show vendor, product and revision
volname - set 8-character volume name
!<cmd> - execute <cmd>, then return
quit
format> di
AVAILABLE DISK SELECTIONS:
0. c2t1d0 <SUN9.0G cyl 4924 alt 2 hd 27 sec 133>
/sbus@3,0/SUNW,socal@0,0/sf@0,0/ssd@w21000020374fe013,0
1. c4t1d0 <SUN9.0G cyl 4924 alt 2 hd 27 sec 133>
/sbus@b,0/SUNW,socal@0,0/sf@0,0/ssd@w2100002037222d57,0
Specify disk (enter its number)[1]: 0
selecting c2t1d0
[disk formatted]
format> p
PARTITION MENU:
0      - change `0' partition
1      - change `1' partition
2      - change `2' partition
3      - change `3' partition
4      - change `4' partition
5      - change `5' partition
6      - change `6' partition
7      - change `7' partition
select - select a predefined table
modify - modify a predefined partition table
name - name the current table
print - display the current table
label - write partition map and label to the disk
!<cmd> - execute <cmd>, then return
quit
partition> p
Current partition table (original):
Total disk cylinders available: 4924 + 2 (reserved cylinders)
Part      Tag    Flag     Cylinders        Size            Blocks
  0       root    wm       0 -   73      129.75MB    (74/0/0)     265734
  1       swap    wu      74 -  147      129.75MB    (74/0/0)     265734
  2     backup    wu       0 - 4923        8.43GB    (4924/0/0) 17682084
  3 unassigned    wm       0               0         (0/0/0)           0
  4 unassigned    wm       0               0         (0/0/0)           0
  5 unassigned    wm       0               0         (0/0/0)           0
  6        usr    wm     148 - 4923        8.18GB    (4776/0/0) 17150616
  7 unassigned    wm       0               0         (0/0/0)           0
partition> 0
Part      Tag    Flag     Cylinders        Size            Blocks
  0       root    wm       0 -   73      129.75MB    (74/0/0)     265734
Enter partition id tag[root]:
Enter partition permission flags[wm]:
Enter new starting cyl[0]:
Enter partition size[265734b, 74c, 129.75mb, 0.13gb]: 0
partition> 1
Part      Tag    Flag     Cylinders        Size            Blocks
  1       swap    wu      74 -  147      129.75MB    (74/0/0)     265734
Enter partition id tag[swap]:
Enter partition permission flags[wu]:
Enter new starting cyl[74]:
Enter partition size[265734b, 74c, 129.75mb, 0.13gb]: 0
partition> 6
Part      Tag    Flag     Cylinders        Size            Blocks
  6        usr    wm     148 - 4923        8.18GB    (4776/0/0) 17150616
Enter partition id tag[usr]:
Enter partition permission flags[wm]:
Enter new starting cyl[148]:
Enter partition size[17150616b, 4776c, 8374.32mb, 8.18gb]: 0
partition> p
Current partition table (unnamed):
Total disk cylinders available: 4924 + 2 (reserved cylinders)
Part      Tag    Flag     Cylinders        Size            Blocks
  0 unassigned    wm       0               0         (0/0/0)           0
  1 unassigned    wu       0               0         (0/0/0)           0
  2     backup    wu       0 - 4923        8.43GB    (4924/0/0) 17682084
  3 unassigned    wm       0               0         (0/0/0)           0
  4 unassigned    wm       0               0         (0/0/0)           0
  5 unassigned    wm       0               0         (0/0/0)           0
  6 unassigned    wm       0               0         (0/0/0)           0
  7 unassigned    wm       0               0         (0/0/0)           0

partition> 7
Part      Tag    Flag     Cylinders        Size            Blocks
  7 unassigned    wm       0               0         (0/0/0)           0
Enter partition id tag[unassigned]:
Enter partition permission flags[wm]: wu
Enter new starting cyl[0]:
Enter partition size[0b, 0c, 0.00mb, 0.00gb]: 2c
partition> 4
Part      Tag    Flag     Cylinders        Size            Blocks
  4 unassigned    wm       0               0         (0/0/0)           0
Enter partition id tag[unassigned]:
Enter partition permission flags[wm]:
Enter new starting cyl[0]: 2
Enter partition size[0b, 0c, 0.00mb, 0.00gb]: 115c
partition> p
Current partition table (unnamed):
Total disk cylinders available: 4924 + 2 (reserved cylinders)
Part      Tag    Flag     Cylinders        Size            Blocks
  0 unassigned    wm       0               0         (0/0/0)           0
  1 unassigned    wu       0               0         (0/0/0)           0
  2     backup    wu       0 - 4923        8.43GB    (4924/0/0) 17682084
  3 unassigned    wm       0               0         (0/0/0)           0
  4 unassigned    wm       2 -  116      201.64MB    (115/0/0)    412965
  5 unassigned    wm       0               0         (0/0/0)           0
  6 unassigned    wm       0               0         (0/0/0)           0
  7 unassigned    wu       0 -    1        3.51MB    (2/0/0)        7182
partition> 0
Part      Tag    Flag     Cylinders        Size            Blocks
  0 unassigned    wm       0               0         (0/0/0)           0
Enter partition id tag[unassigned]:
Enter partition permission flags[wm]:
Enter new starting cyl[0]: 117
Enter partition size[0b, 0c, 0.00mb, 0.00gb]: 4764c
partition> p
Current partition table (unnamed):
Total disk cylinders available: 4924 + 2 (reserved cylinders)
Part      Tag    Flag     Cylinders        Size            Blocks
  0 unassigned    wm     117 - 4880        8.16GB    (4764/0/0) 17107524
  1 unassigned    wu       0               0         (0/0/0)           0
  2     backup    wu       0 - 4923        8.43GB    (4924/0/0) 17682084
  3 unassigned    wm       0               0         (0/0/0)           0
  4 unassigned    wm       2 -  116      201.64MB    (115/0/0)    412965
  5 unassigned    wm       0               0         (0/0/0)           0
  6 unassigned    wm       0               0         (0/0/0)           0
  7 unassigned    wu       0 -    1        3.51MB    (2/0/0)        7182
partition> 5
Part      Tag    Flag     Cylinders        Size            Blocks
  5 unassigned    wm       0               0         (0/0/0)           0
Enter partition id tag[unassigned]:
Enter partition permission flags[wm]:
Enter new starting cyl[0]: 4881
Enter partition size[0b, 0c, 0.00mb, 0.00gb]: 6
Please specify units in either b(blocks), c(cylinders), m(megabytes) or g(gigabytes)
Enter partition size[0b, 0c, 0.00mb, 0.00gb]: 6c
partition> p
Current partition table (unnamed):
Total disk cylinders available: 4924 + 2 (reserved cylinders)
Part      Tag    Flag     Cylinders        Size            Blocks
  0 unassigned    wm     117 - 4880        8.16GB    (4764/0/0) 17107524
  1 unassigned    wu       0               0         (0/0/0)           0
  2     backup    wu       0 - 4923        8.43GB    (4924/0/0) 17682084
  3 unassigned    wm       0               0         (0/0/0)           0
  4 unassigned    wm       2 -  116      201.64MB    (115/0/0)    412965
  5 unassigned    wm    4881 - 4886       10.52MB    (6/0/0)       21546
  6 unassigned    wm       0               0         (0/0/0)           0
  7 unassigned    wu       0 -    1        3.51MB    (2/0/0)        7182
partition> 6
Part      Tag    Flag     Cylinders        Size            Blocks
  6 unassigned    wm       0               0         (0/0/0)           0
Enter partition id tag[unassigned]:
Enter partition permission flags[wm]:
Enter new starting cyl[0]: 4887
Enter partition size[0b, 0c, 0.00mb, 0.00gb]: $
partition> p
Current partition table (unnamed):
Total disk cylinders available: 4924 + 2 (reserved cylinders)
Part      Tag    Flag     Cylinders        Size            Blocks
  0 unassigned    wm     117 - 4880        8.16GB    (4764/0/0) 17107524
  1 unassigned    wu       0               0         (0/0/0)           0
  2     backup    wu       0 - 4923        8.43GB    (4924/0/0) 17682084
  3 unassigned    wm       0               0         (0/0/0)           0
  4 unassigned    wm       2 -  116      201.64MB    (115/0/0)    412965
  5 unassigned    wm    4881 - 4886       10.52MB    (6/0/0)       21546
  6 unassigned    wm    4887 - 4923       64.88MB    (37/0/0)     132867
  7 unassigned    wu       0 -    1        3.51MB    (2/0/0)        7182
partition> l
Ready to label disk, continue? y
partition> q
FORMAT MENU:
disk - select a disk
type - select (define) a disk type
partition - select (define) a partition table
current - describe the current disk
format - format and analyze the disk
repair - repair a defective sector
label - write label to the disk
analyze - surface analysis
defect - defect list management
backup - search for backup labels
verify - read and display labels
save - save new disk/partition definitions
inquiry - show vendor, product and revision
volname - set 8-character volume name
!<cmd> - execute <cmd>, then return
quit
format> q
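The block counts that format prints follow directly from the disk geometry in the banner (hd 27, sec 133). The sketch below is only an illustration of that arithmetic and of a basic overlap check for the slices entered above; the helper names are hypothetical and the 512-byte block size is an assumption.

```python
# Geometry from the format banner: <SUN9.0G cyl 4924 alt 2 hd 27 sec 133>
HEADS = 27
SECTORS_PER_TRACK = 133
BLOCK_BYTES = 512  # assumption: 512-byte disk blocks

BLOCKS_PER_CYL = HEADS * SECTORS_PER_TRACK  # blocks in one cylinder

# (slice, first cylinder, last cylinder) as entered in the session above.
# Slice 2 (backup, whole disk) is left out on purpose.
LAYOUT = [(7, 0, 1), (4, 2, 116), (0, 117, 4880), (5, 4881, 4886), (6, 4887, 4923)]


def blocks(first_cyl, last_cyl):
    """Number of blocks covered by an inclusive cylinder range."""
    return (last_cyl - first_cyl + 1) * BLOCKS_PER_CYL


def adjacent_overlaps(layout):
    """Return neighbouring slices (ordered by start cylinder) that intersect.

    An empty list means that no two slices overlap at all.
    """
    ordered = sorted(layout, key=lambda s: s[1])
    clashes = []
    for (a, _, a_last), (b, b_first, _) in zip(ordered, ordered[1:]):
        if b_first <= a_last:
            clashes.append((a, b))
    return clashes


if __name__ == "__main__":
    for part, first, last in LAYOUT:
        n = blocks(first, last)
        mb = n * BLOCK_BYTES / 2**20
        print(f"slice {part}: cyl {first}-{last} -> {n} blocks ({mb:.2f} MB)")
    print("overlapping slices:", adjacent_overlaps(LAYOUT))
```

Running a check like this before typing the values into format helps catch a slice that starts inside its neighbour.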
4. Check which mirrors need to be resynced.
phys-mw1# metastat -s gisdbmw |grep d18
Invoke: metareplace gisdbmw/d21 /dev/did/dsk/d18s0 <new device>
/dev/did/dsk/d18s0
0 No Maintenance
Invoke: metareplace gisdbmw/d121 /dev/did/dsk/d18s4 <new device>
/dev/did/dsk/d18s4
0 No Maintenance
Invoke: metareplace gisdbmw/d124 /dev/did/dsk/d18s6 <new device>
/dev/did/dsk/d18s6
0 No Maintenance
Invoke: metareplace gisdbmw/d0 /dev/did/dsk/d18s5 <new device>
/dev/did/dsk/d18s5
0 No Maintenance
5. Resync the mirrors.
phys-mw1# metareplace -e gisdbmw/d21 /dev/did/dsk/d18s0
gisdbmw/d21: device /dev/did/dsk/d18s0 is enabled
phys-mw1# metareplace -e gisdbmw/d121 /dev/did/dsk/d18s4
gisdbmw/d121: device /dev/did/dsk/d18s4 is enabled
phys-mw1# metareplace -e gisdbmw/d124 /dev/did/dsk/d18s6
gisdbmw/d124: device /dev/did/dsk/d18s6 is enabled
phys-mw1# metareplace -e gisdbmw/d0 /dev/did/dsk/d18s5
gisdbmw/d0: device /dev/did/dsk/d18s5 is enabled
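Steps 4 and 5 can be tied together: each "Invoke:" hint printed by metastat names the metadevice and the component to re-enable. The following is a minimal sketch, assuming the hint lines look exactly like the ones shown above; the function name is hypothetical and the script only prints the commands, it does not run them.

```python
import re

# Sample lines copied from the metastat output in step 4.
METASTAT_LINES = """\
Invoke: metareplace gisdbmw/d21 /dev/did/dsk/d18s0 <new device>
/dev/did/dsk/d18s0
0 No Maintenance
Invoke: metareplace gisdbmw/d121 /dev/did/dsk/d18s4 <new device>
/dev/did/dsk/d18s4
0 No Maintenance
""".splitlines()

# Captures the metadevice and the component named in an "Invoke:" hint.
INVOKE = re.compile(r"^Invoke:\s+metareplace\s+(\S+)\s+(\S+)")


def enable_commands(lines):
    """Build one 'metareplace -e' command per 'Invoke:' hint line."""
    commands = []
    for line in lines:
        match = INVOKE.match(line.strip())
        if match:
            commands.append(f"metareplace -e {match.group(1)} {match.group(2)}")
    return commands


if __name__ == "__main__":
    for command in enable_commands(METASTAT_LINES):
        print(command)
```

Reviewing the printed commands before pasting them into the root shell keeps the operator in control of what gets re-enabled.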
6. Check that they are resyncing.
phys-mw1# metastat -s gisdbmw |grep %
Resync in progress: 2 % done
Resync in progress: 35 % done
Resync in progress: 54 % done
phys-mw1# metastat -s gisdbmw |grep Main
phys-mw1# metastat -s gisdbmw |grep -i main
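The same check can be scripted. This is a small sketch, assuming the progress lines have the "Resync in progress: N % done" form shown above; the function names are made up for the example.

```python
import re

# Progress lines as printed by "metastat -s gisdbmw | grep %" above.
SAMPLE = """\
Resync in progress: 2 % done
Resync in progress: 35 % done
Resync in progress: 54 % done
"""


def resync_percentages(text):
    """Return the completion percentage of every resync still running."""
    return [int(p) for p in re.findall(r"Resync in progress:\s*(\d+)\s*% done", text)]


def resync_finished(text):
    """True when no submirror reports a resync in progress."""
    return not resync_percentages(text)


if __name__ == "__main__":
    print("still resyncing:", resync_percentages(SAMPLE))
    print("finished:", resync_finished(SAMPLE))
```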

Update from Rajiba Swain


INC000000290763 & RFC: 16070
Disks c2t0d0 (d17) and c2t22d0 (d9) and a fan (SENA B: 1-in rear) failed within the SENA B disk array attached to phys-mw1 and phys-mw2. The disks and the fan will be hot-swapped by an HP.CDS engineer.
Step-1

1. Replace the faulty disk(s).


phys-mw2# luxadm remove B,r6
>> Issue the above command, then ask the Synstar engineer to remove the faulty disk and insert the new disk (in this case r6).
>> After getting confirmation from Synstar, issue the following command:
phys-mw2# luxadm insert B,r6
>> Verify the disk geometry:
phys-mw2# format c2t22d0
>> Create the partition table on the disk.
Step-2
Follow the steps described in the original document.
Normally the disks are detected automatically. Problem we faced during the RFC:
After rebooting the server, one disk (d17) was detected in the cluster DID list, but the other disk (d9) was not configured in the DID list.
phys-mw2# metareplace -e gisfsmw/d21 /dev/did/dsk/d9s0
metareplace: phys-mw2: /dev/did/dsk/d9s0: No such device or address --- error message
So we followed the steps below on both the phys-mw1 and phys-mw2 servers.
We ran:
phys-mw1# drvconfig
phys-mw1# devlinks
phys-mw1# disks
phys-mw2# drvconfig
phys-mw2# devlinks
phys-mw2# disks
Then try to attach the disks to the metadevice. If the same problem occurs again, repeat these steps:
phys-mw1# drvconfig
phys-mw1# devlinks
phys-mw1# disks
phys-mw2# drvconfig
phys-mw2# devlinks
phys-mw2# disks
Step-3

Things to check on the phys-mw1 and phys-mw2 servers before updating the worklog to say that the RFC has completed successfully.

1. Only on the phys-mw1 server

phys-mw1# su sa
Sun Microsystems Inc. SunOS 5.6
Generic August 1997
You have new mail.
phys-mw1:/home/sa> showserver ----------------------------- Sybase database
UID PID PPID C STIME TTY TIME CMD
sybase 6007 6001 0 Oct 17 ? 1550:30 /opt/sybase/bin/dataserver -d/gisdbmw/data3/master -e/gisdbmw/syblog/gisdbmw.lo
sybase 6517 6007 0 Oct 17 ? 2002:43 /opt/sybase/bin/dataserver -ONLINE:1,0,0x66006099,0xc3800000,0x2b4b6000,0x1159
sybase 6017 6015 0 Oct 17 ? 2:05 /opt/sybase/bin/backupserver -e/gisdbmw/syblog/gisdbmw_back.log -N25 -C20 -Sgis
sybase 6527 6521 0 Oct 17 ? 1519:07 /opt/sybase/bin/dataserver -ONLINE:3,0,0x66006099,0xc3800000,0x2b4b6000,0x1159
sybase 6521 6517 0 Oct 17 ? 1633:38 /opt/sybase/bin/dataserver -ONLINE:2,0,0x66006099,0xc3800000,0x2b4b6000,0x1159
phys-mw1:/home/sa>
Note: If you find any problem with the Sybase database, please contact the 1-WAMPAPP team.
Please refer to Remedy Incident: 292913
2. Verify the web services.
phys-mw2:/home/sa> ps -ef | grep httpd
nobody 4260 4259 0 Oct 17 ? 0:03 /usr/apache/bin/httpd -f /etc/apache/httpd.external.conf
root 4259 1 0 Oct 17 ? 0:00 /usr/apache/bin/httpd -f /etc/apache/httpd.external.conf
nobody 4403 4350 0 Oct 17 ? 0:00 /usr/apache/bin/httpd -f /etc/apache/httpd.contestable.conf
nobody 4223 4221 0 Oct 17 ? 0:03 /usr/apache/bin/httpd -f /etc/apache/httpd.internal.conf
root 4350 1 0 Oct 17 ? 0:00 /usr/apache/bin/httpd -f /etc/apache/httpd.contestable.conf
nobody 4261 4259 0 Oct 17 ? 0:02 /usr/apache/bin/httpd -f /etc/apache/httpd.external.conf
nobody 19175 4259 0 Oct 18 ? 0:03 /usr/apache/bin/httpd -f /etc/apache/httpd.external.conf
nobody 8669 4259 0 Oct 25 ? 0:01 /usr/apache/bin/httpd -f /etc/apache/httpd.external.conf
nobody 4402 4350 0 Oct 17 ? 0:00 /usr/apache/bin/httpd -f /etc/apache/httpd.contestable.conf
nobody 16054 4221 0 Oct 17 ? 0:03 /usr/apache/bin/httpd -f /etc/apache/httpd.internal.conf
nobody 7241 4259 0 Oct 18 ? 0:02 /usr/apache/bin/httpd -f /etc/apache/httpd.external.conf
nobody 4222 4221 0 Oct 17 ? 0:03 /usr/apache/bin/httpd -f /etc/apache/httpd.internal.conf
root 4221 1 0 Oct 17 ? 0:00 /usr/apache/bin/httpd -f /etc/apache/httpd.internal.conf
nobody 16055 4221 0 Oct 17 ? 0:03 /usr/apache/bin/httpd -f /etc/apache/httpd.internal.conf
nobody 16056 4221 0 Oct 17 ? 0:03 /usr/apache/bin/httpd -f /etc/apache/httpd.internal.conf
nobody 9497 4221 0 Oct 18 ? 0:03 /usr/apache/bin/httpd -f /etc/apache/httpd.internal.conf
nobody 19233 4259 0 Oct 18 ? 0:03 /usr/apache/bin/httpd -f /etc/apache/httpd.external.conf
nobody 18539 4350 0 15:34:26 ? 0:00 /usr/apache/bin/httpd -f /etc/apache/httpd.contestable.conf
nobody 7810 4259 0 Oct 18 ? 0:02 /usr/apache/bin/httpd -f /etc/apache/httpd.external.conf
nobody 8667 4259 0 Oct 25 ? 0:01 /usr/apache/bin/httpd -f /etc/apache/httpd.external.conf
nobody 24921 4221 0 Oct 19 ? 0:03 /usr/apache/bin/httpd -f /etc/apache/httpd.internal.conf
sa 23614 23547 0 07:30:26 pts/0 0:00 grep httpd
nobody 18449 4350 0 15:34:24 ? 0:00 /usr/apache/bin/httpd -f /etc/apache/httpd.contestable.conf
nobody 8664 4259 0 Oct 25 ? 0:01 /usr/apache/bin/httpd -f /etc/apache/httpd.external.conf
nobody 20439 4221 0 Oct 25 ? 0:01 /usr/apache/bin/httpd -f /etc/apache/httpd.internal.conf
phys-mw2:/home/sa>
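The web-service check amounts to confirming that httpd is running for each of the three configuration files (external, internal, contestable). Below is a rough sketch of that check, assuming ps output with one process per line; the sample is a shortened copy of the listing above and the function name is hypothetical.

```python
from collections import Counter

# A few lines from the "ps -ef | grep httpd" listing, one process per line.
SAMPLE = """\
nobody 4260 4259 0 Oct 17 ? 0:03 /usr/apache/bin/httpd -f /etc/apache/httpd.external.conf
root 4259 1 0 Oct 17 ? 0:00 /usr/apache/bin/httpd -f /etc/apache/httpd.external.conf
nobody 4403 4350 0 Oct 17 ? 0:00 /usr/apache/bin/httpd -f /etc/apache/httpd.contestable.conf
nobody 4223 4221 0 Oct 17 ? 0:03 /usr/apache/bin/httpd -f /etc/apache/httpd.internal.conf
sa 23614 23547 0 07:30:26 pts/0 0:00 grep httpd
"""

EXPECTED_CONFIGS = {
    "/etc/apache/httpd.external.conf",
    "/etc/apache/httpd.internal.conf",
    "/etc/apache/httpd.contestable.conf",
}


def httpd_by_config(ps_output):
    """Count httpd processes per '-f <config>' argument, ignoring the grep itself."""
    counts = Counter()
    for line in ps_output.splitlines():
        fields = line.split()
        if "-f" not in fields or not any(f.endswith("/httpd") for f in fields):
            continue
        position = fields.index("-f")
        if position + 1 < len(fields):
            counts[fields[position + 1]] += 1
    return counts


if __name__ == "__main__":
    counts = httpd_by_config(SAMPLE)
    for config in sorted(EXPECTED_CONFIGS):
        print(config, counts.get(config, 0))
    print("missing:", sorted(EXPECTED_CONFIGS - set(counts)))
```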
T3 Storage
1) vol add volname data undn raid n standby undn
2) vol stat
3) vol init volname data
4) vol mount volname
5) vol list
6) mkdir /dev/es
7) luxadm insert
8) On releases above Solaris 7, skip steps 6 and 7.
9) Format and partition the disk.


How many controllers are in 3510 storage?
4 channels
2 controllers
What is WWN on storage?
World Wide Name
_______________continues_____________________

To view the LUNs on a Solaris host, use the cfgadm command.

For example, to remove the SB3 board on a 6800 server in order to replace faulty memory or a faulty CPU:
cfgadm -c unconfigure N0.SB3 ---- unconfigure the entire CPU board (SB3 only)
root@kbl-db-02 # cfgadm -c disconnect N0.SB3 ---- disconnect it from the physical path
root@kbl-db-02 # cfgadm -al |more ---- confirm whether it has been removed
/N0/SB3/P2/B1/d2
cfgadm -c configure N0.SB3 ---- after the replacement, configure the same board again
SSAADM: The ssaadm command is now a link to the luxadm command.
LUXADM: The luxadm program is an administrative command that manages both the Sun Storage A5000 and SPARCstorage Array disk arrays. luxadm performs a variety of control and query tasks, depending on the command-line arguments and options used.

1. List the files in the current directory sorted by size? - ls -l | grep '^-' | sort -nr -k 5
2. List the hidden files in the current directory? - ls -a1 | grep "^\."
3. Delete blank lines in a file? - cat sample.txt | grep -v '^$' > new_sample.txt
4. Search for a sample string in particular files? - grep Debug *.conf
Here grep searches for the string Debug in all files with the extension .conf under the current directory.
5. Display the newly appended lines of a file while some process is appending data to it? - tail -f Debug.log
Here tail shows the data newly appended to Debug.log by some process or user.
6. Display the disk usage of the files under each directory in the current directory? - du -k * | sort -nr (or) du -k . | sort -nr
7. Change to a directory that has a very long name? - cd CDMA_3X_GEN*
Here the original directory name is CDMA_3X_GENERATION_DATA.
8. Display all files recursively with their path under the current directory? - find . -depth -print
9. Set the display automatically for the current new user? - export DISPLAY=`eval who am i | cut -d"(" -f2 | cut -d")" -f1`
Note carefully how the double quotes and the grave accents (backquotes) are used in the command above.
10. Display the processes that are running under your username? - ps -aef | grep Maheshvj
Here Maheshvj is the username.
11. List some hot keys for the bash shell? - Ctrl+l clears the screen. Ctrl+r searches the commands previously given in the shell. Ctrl+u clears the text typed before the cursor. Ctrl+a places the cursor at the beginning of the command line. Ctrl+e places the cursor at the end of the command line. Ctrl+d sends end-of-file, which exits the shell. Ctrl+z suspends the currently running process (it can then be resumed in the background with bg).
12. Display the files in the directory by file size? - ls -ltr | sort -nr -k 5
13. How to save man pages to a file? - man <command> | col -b > <output-file> Example: man top | col -b > top_help.txt
14. How to know the date and time when a script is executed? - Add the following line to the shell script: eval echo "Script is executed at `date`" >> timeinfo.inf
Here timeinfo.inf contains the date and time details, i.e. when the script was executed and the history of its executions.
15. How do you find out drive statistics? - iostat -E
16. Display disk usage in kilobytes? - du -k
17. Display the top ten largest files/directories? - du -sk * | sort -nr | head
18. How much space is used by users, in kilobytes? - quot -af
19. How to create a null file? - cat /dev/null > filename1
20. Access common commands quicker? - ps -ef | grep -i $@
21. Display the page size of memory? - pagesize -a
22. Display the Ethernet address ARP table? - arp -a
23. Display the number of active established connections to localhost? - netstat -a | grep EST
24. Display the state of the interfaces used for TCP/IP traffic? - netstat -i
25. Display the parent/child tree of a process? - ptree <pid> Example: ptree 1267
26. Show the working directory of a process? - pwdx <pid> Example: pwdx 1267
27. Display the files a process currently has open? - pfiles <pid> Example: pfiles 1267
28. Display the inter-process communication facility status? - ipcs
29. Display the process using the most CPU? - top -b 1
30. Alternative to the top command? - prstat -a

To enable large files on Veritas (VxFS)

The -o largefiles and -o nolargefiles options can be used to change the largefile compatibility flag. When invoked without options, fsadm prints the current state of the compatibility flags.
#fsadm (prints the current state of the compatibility flags)
#fsadm -F vxfs -o largefiles /hazly

fsadm [-F vxfs] [-o specific_options] mount_point|special


(eg)

The fsadm command works as shown here.

Create a file system without largefile support:
bash-3.00# mkfs -F vxfs -o nolargefiles /dev/vx/rdsk/mydg/oravol
version 6 layout
204800 sectors, 102400 blocks of size 1024, log size 1024 blocks
largefiles not supported
bash-3.00# mount -F vxfs /dev/vx/dsk/mydg/oravol /mnt
bash-3.00# mount -p | grep mnt
mnttab - /etc/mnttab mntfs - no
/dev/vx/dsk/mydg/oravol - /mnt vxfs - no rw,suid,delaylog,nolargefiles,ioerror=mwdisable

Modify the property through fsadm to enable largefiles:


bash-3.00# /opt/VRTS/bin/fsadm -F vxfs -o largefiles /mnt
bash-3.00# mount -p | grep mnt
mnttab - /etc/mnttab mntfs - no
/dev/vx/dsk/mydg/oravol - /mnt vxfs - no rw,suid,delaylog,largefiles,ioerror=mwdisable
How will you identify which disk group (DG) versions a particular VxVM version supports?
root on BUILD kirkcmis3 # vxdctl support
Support information:
  vxconfigd_vrsn:   21
  dg_minimum:       10
  dg_maximum:       120
  kernel:           15
  protocol_minimum: 40
  protocol_maximum: 60
  protocol_current: 0
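The dg_minimum and dg_maximum values can be read as the range of disk group versions this VxVM release accepts. A minimal sketch of that reading follows, assuming the output is laid out one "key: value" pair per line; the function names are hypothetical and the interpretation of the two fields is an assumption drawn from their names.

```python
# "vxdctl support" output from above, one key/value pair per line.
SAMPLE = """\
Support information:
  vxconfigd_vrsn:   21
  dg_minimum:       10
  dg_maximum:       120
  kernel:           15
  protocol_minimum: 40
  protocol_maximum: 60
  protocol_current: 0
"""


def parse_support(text):
    """Turn 'key: number' lines into a dict, skipping headings."""
    info = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        value = value.strip()
        if sep and value.isdigit():
            info[key.strip()] = int(value)
    return info


def dg_version_supported(info, version):
    """Assumed rule: a DG version is usable if it lies within [dg_minimum, dg_maximum]."""
    return info["dg_minimum"] <= version <= info["dg_maximum"]


if __name__ == "__main__":
    support = parse_support(SAMPLE)
    print(support)
    print("version 110 supported:", dg_version_supported(support, 110))
```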
How will you recover a destroyed DG (devdg)?
root on BUILD kirkcmis3 # vxdisk list VPATH_SANVC0_62
Device:    VPATH_SANVC0_62
devicetag: VPATH_SANVC0_62
type:      auto
hostid:    kirkcmis3
disk:      name=devdg84 id=1134042519.185.kirkcmis3
group:     name=devdg id=1157983965.194.kirkcmis3
info:      format=cdsdisk,privoffset=256,pubslice=2,privslice=2
flags:     online ready private autoconfig autoimport imported
pubpaths:  block=/dev/vx/dmp/VPATH_SANVC0_62s2 char=/dev/vx/rdmp/VPATH_SANVC0_62s2
version:   3.1
iosize:    min=512 (bytes) max=2048 (blocks)
public:    slice=2 offset=2304 len=85948160 disk_offset=0
private:   slice=2 offset=256 len=2048 disk_offset=0
update:    time=1193065091 seqno=0.51
ssb:       actual_seqno=0.0
headers:   0 240
configs:   count=1 len=1280
logs:      count=1 len=192
Defined regions:
 config   priv 000048-000239[000192]: copy=01 offset=000000 disabled
 config   priv 000256-001343[001088]: copy=01 offset=000192 disabled
 log      priv 001344-001535[000192]: copy=01 offset=000000 disabled
 lockrgn  priv 001536-001679[000144]: part=00 offset=000000
Multipathing information:
numpaths:  1
vpath128c  state=enabled
root on BUILD kirkcmis3 # cd /etc/vx/cbr/bk/ (the default repository for DG information)
root on BUILD kirkcmis3 # ls -ltr
total 20
drwxr-x---   2 root     root         512 Feb  2  2006 testdg.1138894812.396.kirkcmis3
drwxr-x---   2 root     root        1024 Feb  3  2006 dr_sdmp3dg.1138715376.392.kirkcmis3
drwxr-x---   2 root     root        1536 Sep  8  2006 test1.1157725540.188.kirkcmis3
drwxr-x---   2 root     root        1536 Oct 22 15:59 infodg.1134131590.274.kirkcmis3
drwxr-x---   2 root     root        1536 Oct 22 15:59 everestdg.1135092719.278.kirkcmis3
drwxr-x---   2 root     root        1536 Dec  3 15:59 devdg.1157983965.194.kirkcmis3

# /usr/lib/vxvm/bin/vxconfigrestore -l /etc/vx/cbr/bk/ devdg


Diskgroup devdg configuration restoration started ......
Installing volume manager disk header for c1t1d0s2 ...
Installing volume manager disk header for c1t2d0s2 ...
Installing volume manager disk header for c1t3d0s2 ...
Installing volume manager disk header for c1t4d0s2 ...
Installing volume manager disk header for c1t5d0s2 ...
Installing volume manager disk header for c1t6d0s2 ...
devdg's diskgroup configuration is restored (in a precommitted state).
Diskgroup can be accessed in read only and can be examined using
vxprint(1m) in this state.
Run:
vxconfigrestore -l /etc/vx/cbr/bk/ -c devdg ==> to commit the restoration.
vxconfigrestore -l /etc/vx/cbr/bk/ -d devdg ==> to abort the restoration.
# /usr/lib/vxvm/bin/vxconfigrestore -l /etc/vx/cbr/bk -c devdg
Committing configuration restoration for diskgroup devdg ....
devdg's diskgroup configuration restoration is committed.
Once the configuration is restored, the volumes can be started, and the file systems that reside on
those volumes can be mounted.
How will you check the free space in the DG?

root on BUILD kirkcmis3 # vxassist -g devdg maxsize


Maximum volume size: 26624 (13Mb)
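The two figures on that line are the same quantity in different units: 26624 sectors of 512 bytes each is 13 MB. The conversion can be sketched as below, assuming 512-byte sectors; the function name is hypothetical.

```python
import re

SECTOR_BYTES = 512  # assumption: sizes are reported in 512-byte sectors


def maxsize_mb(line):
    """Convert the sector count in a 'Maximum volume size' line to megabytes."""
    match = re.search(r"Maximum volume size:\s*(\d+)", line)
    if match is None:
        raise ValueError("not a 'Maximum volume size' line")
    return int(match.group(1)) * SECTOR_BYTES / 2**20


if __name__ == "__main__":
    print(maxsize_mb("Maximum volume size: 26624 (13Mb)"), "MB")
```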
How will you check the Master node in VCS?
root on BUILD kirkcmis3 # vxdctl -c mode
mode: enabled: clustered_state (Cluster inactive, Cluster active - role not set, cluster active - MASTER, cluster active - SLAVE)
How will you create a shared disk group?
root on BUILD kirkcmis3# vxdg -s init sharedg share01=c3t0d4s2
I am unable to create the shared DG.
What error message does the command display?
A: Disk group sharedg: cannot create: Disk group version doesn't support feature; see the vxdg upgrade command
Check the protocol version:
# vxdctl protocolversion
Cluster running at protocol 40
To upgrade the protocol version to the latest available version run vxdctl upgrade on the master
node:
# vxdctl -c mode
mode: enabled: cluster active - MASTER
master: bits
# vxdctl upgrade
NOTE: All nodes need to be joined in the Cluster Volume Manager cluster before running the above
command.
To confirm that the protocol version has been updated, the following can be run:
# vxdctl protocolversion
Cluster running at protocol 50
How will you check the status of LLT?
#lltstat -nvv
How will you check the status of GAB?
#gabconfig -a
Split brain: If the private network fails, there is no connectivity between the nodes. The quorum device comes into play in this scenario; the quorum device holds the information of both nodes. It destroys one node's information and makes the other node the owner of the service group.
To solve this problem:
# vi llttab
Node
Seeding: It is used to protect the cluster from a pre-existing network partition. Only a seeded system can run VCS.
Automatic seeding
#gabconfig -c -n <no of nodes>
Manual seeding
#gabconfig -c -x

How to mirror a root disk

Initialize a new disk

#devfsadm
#format (to label the disk)
#vxdctl enable
#vxencap -c -g rootdg -f sliced rootdisk=c0t0d0
#vxdisksetup -i diskname (c0t1d0)
#vxdg -g rootdg adddisk rootnew=c1t0d0
#vxprint list
#vxrootmir rootnew
How to remove the root mirror
#vxunroot
Reboot the system (this removes the VxVM entries from /etc/system and the file system entries from /etc/vfstab)
Remove the plexes of rootnew
#vxplex dis rootvol-02 (remove the plexes of rootnew)
Difference between versions 3.0 and 4.0
In 3.x the root disk group had to be named rootdg; from 4.0 onward you can name it anything you want.
To find the file system version
#modinfo | grep vx
Jeopardy Defined
The design of VCS requires that a minimum of two heartbeat-capable channels be available between nodes to protect
against network failure. When a node is down to a single heartbeat connection, VCS can no longer discriminate
between a system loss and a loss of the last network connection. It must then handle loss of communications on a
single network differently from loss on multiple networks. This procedure is called "jeopardy." As mentioned previously,
low latency transport (LLT) provides notification of reliable versus unreliable network communications to global atomic
broadcast (GAB). GAB uses this information, with or without a functional disk heartbeat, to delegate cluster
membership. If the system heartbeats are lost simultaneously across all channels, VCS determines the system has
failed. The services running on that system are then restarted on another. However, if the node was running with one
heartbeat only (in jeopardy) prior to the loss of a heartbeat, VCS does not restart the applications on a new node. This
action of disabling failover is a safety mechanism that prevents data corruption.
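The jeopardy rule above can be summarised as a small decision function. This is only a sketch of the rule as stated in the paragraph, not VCS code; the function name and the state labels are made up for illustration.

```python
def membership_action(links_before, links_after):
    """Decide how the cluster treats a peer, given its heartbeat link counts.

    links_before: working heartbeat links just before the event
    links_after:  working heartbeat links after the event
    """
    if links_after >= 2:
        return "normal"       # redundancy intact
    if links_after == 1:
        return "jeopardy"     # last link: keep running, but failover is at risk
    if links_before >= 2:
        return "failover"     # all links lost at once: treat the system as failed
    return "no-failover"      # was already in jeopardy: could be a network split


if __name__ == "__main__":
    for before, after in [(2, 2), (2, 1), (2, 0), (1, 0)]:
        print(f"{before} -> {after} links: {membership_action(before, after)}")
```

The last case is the safety mechanism: when the only remaining heartbeat disappears, the peer may still be alive, so restarting its applications elsewhere could corrupt data.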

I/O Fencing SCSI III Reservations - I/O Fencing (VxFEN) is scheduled to be included in the VCS 4.0 version. VCS
can have parallel or failover service groups with disk group resources in them. If the cluster has a split-brain, VxFEN
should force one of the subclusters to commit suicide in order to prevent data corruption. The subcluster which
commits suicide should never gain access to the disk groups without joining the cluster again. In parallel service
groups, it is necessary to prevent any active processes from writing to the disks. In failover groups, however, access to
the disk only needs to be prevented when VCS fails over the service group to another node. Some multipathing
products will be supported with I/O Fencing.
