Module Objectives
By the end of this module, you should be able to:
Use the sysstat, stats, statit, and options commands
Describe the factors that affect RAID performance
Execute commands to collect data about write throughput
Execute commands to verify the operation of hardware, software, and network components
Identify commands and options used to obtain configuration and status
2009 NetApp. All rights reserved.
System Health
Performance problems can originate from multiple sources. To avoid some of these problems, check or monitor the following:
Disk configuration
Disk status
Write performance
Read performance
Disk Status
Monitor disks:
shelfchk
led_on diskid and led_off diskid (priv set advanced commands)
Syslog Messages
shm: disk has reported a predicted failure (PFA) event: disk XX, serial_number XXXX
shm: link failure detected, upstream from disk: id XX, serial_number XXXXX
shm: disk I/O completion times too long: disk XX, serial number XXXXX
shm: possible link errors on disk: id XX, serial number XXXXX
shm: disk returns excessive recovered errors: disk XX, serial number XXXXX
shm: intermittent instability on the loop that is attached to Fibre Channel adapter: id XXX, name XXXXX
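The messages above can be counted by type when reviewing a saved copy of the syslog. The following is a minimal sketch in Python, not a NetApp tool: the category names and both function names are hypothetical, and the matching fragments are copied from the example messages listed above.

```python
# Message fragments copied from the syslog examples above. The category
# names on the left are hypothetical labels chosen for this sketch.
SHM_PATTERNS = {
    "predicted_failure": "predicted failure (PFA) event",
    "link_failure": "link failure detected",
    "slow_io": "disk I/O completion times too long",
    "link_errors": "possible link errors on disk",
    "recovered_errors": "excessive recovered errors",
    "loop_instability": "intermittent instability on the loop",
}

def classify_shm_line(line):
    """Return the category of an shm syslog line, or None if nothing matches."""
    if "shm:" not in line:
        return None
    for category, fragment in SHM_PATTERNS.items():
        if fragment in line:
            return category
    return None

def count_shm_events(lines):
    """Count how many lines fall into each shm category."""
    counts = {}
    for line in lines:
        category = classify_shm_line(line)
        if category is not None:
            counts[category] = counts.get(category, 0) + 1
    return counts
```

A rising count for one disk in any category is a reason to look at that disk more closely with the commands listed under Monitor disks.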
Write Performance
sysstat
statit
stats
Examples of instances:
/vol/vol0, /vol/nfstree, 0b.18, /vol/flex1/lun_test
Examples of counters:
cifs_ops, cifs_latency, cifs_read_ops
Period: Counter values are gathered over a single period of time and then displayed.
stats start, then stats stop
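As a rough illustration of what gathering a counter over a single period means, the sketch below turns two samples of a cumulative counter into a per-second rate. It is written in Python and assumes the counter only increases. The function name and the sample values are hypothetical, and this is not a description of how the stats command computes its output.

```python
def counter_rate(start_value, stop_value, elapsed_seconds):
    """Average per-second rate of a cumulative counter over one period.

    start_value and stop_value are the counter readings at the start and
    the end of the period (hypothetical inputs for this sketch).
    """
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return (stop_value - start_value) / elapsed_seconds

# Hypothetical example: a counter that moved from 120000 to 123000
# during a 30-second collection period.
print(counter_rate(120000, 123000, 30))  # 100.0
```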
Read Performance
Data ONTAP is optimized for write performance.
Read performance could decrease over time.
NOTE: Efficient use of cache can offset some disk performance issues.
Optimization:
To measure optimization:
reallocate measure [vol | file]
To resolve optimization:
reallocate start <pathname>
RAID Configuration
RAID Groups
/vol0 rg0
/vol1 rg0
/vol2 rg0
rg1
Monitoring Connectivity
Connectivity
Use the following to monitor connectivity:
MAC: ifconfig, ifstat, arp
TCP/IP: ifconfig, /etc/rc and /etc/hosts, ping, netstat -r
Protocols: nfsstat, cifs stat, nbtstat
Performance Measures
Round trip response times for specific NFS operations are displayed.
Analyzing smb_hist output
CIFS request time processing: (46457) - milliseconds units
0ms     1ms     2ms     3ms     4ms     5ms     6ms     7ms
13175   17752   5111    664     451     478     570     568
<16ms   <24ms   <32ms   <40ms   <48ms   <56ms   <64ms   unused
4039    2309    569     165     61      21      10      0
Every other row displays the number of operations that took place in the interval in the row above it. In this example, 13,175 operations happened in less than 0.5 ms.
The time interval window lies halfway between the values for adjacent columns. In this example, 165 operations occurred in the 36-ms to 44-ms window.
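The halfway rule described above can be sketched in Python as follows. The labels and counts are copied from the smb_hist example. The function names are hypothetical, and applying the same halfway rule across the jump from the 7ms column to the <16ms column is an assumption of this sketch.

```python
# Column labels (ms) and counts copied from the smb_hist example above.
LABELS_MS = [0, 1, 2, 3, 4, 5, 6, 7, 16, 24, 32, 40, 48, 56, 64]
COUNTS = [13175, 17752, 5111, 664, 451, 478, 570, 568,
          4039, 2309, 569, 165, 61, 21, 10]

def bucket_windows(labels):
    """Return (lower, upper) bounds in ms for each column.

    Each bound lies halfway between adjacent labels. The first window
    starts at 0 and the last has no upper bound (None).
    """
    windows = []
    for i, label in enumerate(labels):
        lower = 0.0 if i == 0 else (labels[i - 1] + label) / 2
        upper = None if i == len(labels) - 1 else (label + labels[i + 1]) / 2
        windows.append((lower, upper))
    return windows

def share_of_listed_ops(count, counts):
    """Percentage of the listed operations that fall in one bucket.

    The listed buckets sum to 45943, slightly below the header total of
    46457, so the share is relative to the listed buckets only.
    """
    return 100.0 * count / sum(counts)

WINDOWS = dict(zip(LABELS_MS, bucket_windows(LABELS_MS)))
print(WINDOWS[0])    # (0.0, 0.5)
print(WINDOWS[40])   # (36.0, 44.0)
```

Reading the histogram this way makes it easy to see what share of requests completed within a given response time, for example by summing the shares of the first few columns.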
Obtaining Statistics
The statit command:
Is an advanced-mode command used for more detailed analysis of system performance
Gathers per-second statistics averaged over the length of time it is running in the background
Shows statistics representing all physical and some logical objects on the storage system
Most of the data collected represents rates at which things are happening.
3. After 30 seconds (or as necessary to end statistics collection and include NFS statistics), enter:
statit -e -n
Obtaining Statistics
The report generated is divided into the following statistics sections:
CPU
Multiprocessor
CSMP domain switches
Miscellaneous
WAFL
RAID
Network interface
Disk
Aggregate
Spares and other disks
FCP
iSCSI
Tape
CPU Statistics
CPU Statistics
506.934263 time (seconds)          100 %
275.044317 system time              54 %
 23.412966 rupt time                 5 %  (7022 rupts x 0 usec/rupt)
251.466451 non-rupt system time     50 %
271.837944 idle time                44 %
439.543653 time in CP               92 %  100 %
 21.837230 rupt time in CP                  5 %  (132 rupts x 0 usec/rupt)
Multiprocessor Statistics
Multiprocessor Statistics (per second)
                   cpu0         cpu1         total
sk switches        1378.09      46.82        1424.91
hard switches      1175.27      29.15        1204.42
domain switches    103.89       16.08        119.96
CP rupts           0.00         0.00         0.00
nonCP rupts        100.00       0.00         100.00
nonCP rupt usec    0.00         0.00         0.00
Idle               1000000.00   1000000.00   2000000.00
kahuna             0.00         0.00         0.00
network            0.00         0.00         0.00
storage            0.00         0.00         0.00
exempt             0.00         0.00         0.00
raid               0.00         0.00         0.00
target             0.00         0.00         0.00
netcache           0.00         0.00         0.00
netcache2          0.00         0.00         0.00
Miscellaneous Statistics
Miscellaneous Statistics (per second)
1893.73 hard context switches
   0.00 NFS operations
   0.00 CIFS operations
   0.00 HTTP operations
   0.00 NetCache URLs
   0.00 streaming packets
   0.00 network KB received
   0.00 network KB transmitted
  18.16 disk KB read
  61.30 disk KB written
   0.28 NVRAM KB written
   0.00 nolog KB written
   0.00 WAFL bufs given to clients
   0.00 checksum cache hits ( 0%)
   0.00 no checksum - partial buffer
   0.00 DAFS operations
   0.00 FCP operations
   0.00 iSCSI operations
WAFL Rates
WAFL Statistics (per second)
  5.96 name cache hits ( 62%)
  3.69 name cache misses ( 38%)
 19.30 inode cache hits ( 100%)
  0.00 inode cache misses ( 0%)
 55.06 buf cache hits ( 100%)
  0.00 buf cache misses ( 0%)
  0.00 blocks read
  0.00 blocks read-ahead
  0.00 chains read-ahead
  0.00 blocks speculative read-ahead
  5.11 blocks written
  0.57 stripes written
  0.00 blocks over-written
  0.28 wafl_timer generated CP
  0.00 snapshot generated CP
  0.00 wafl_avail_bufs generated CP
  0.00 dirty_blk_cnt generated CP
  0.00 full NV-log generated CP
  0.00 back-to-back CP
  0.00 flush generated CP
  0.00 sync generated CP
  0.00 wafl_avail_vbufs generated CP
 55.06 non-restart messages
  0.00 IOWAIT suspends
604852 buffers
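The percentages shown next to the cache hit and miss rates can be reproduced from the rates themselves. The sketch below, in Python, assumes each percentage is hits (or misses) divided by the sum of hits and misses. The function name is hypothetical, and the input values are copied from the WAFL statistics above.

```python
def cache_percent(part, other):
    """Percentage that `part` represents of `part + other`.

    Returns 0.0 when both rates are zero, to avoid dividing by zero.
    """
    total = part + other
    if total == 0:
        return 0.0
    return 100.0 * part / total

# Name cache: 5.96 hits and 3.69 misses per second.
print(round(cache_percent(5.96, 3.69)))   # 62
print(round(cache_percent(3.69, 5.96)))   # 38
# Inode cache: 19.30 hits and 0.00 misses per second.
print(round(cache_percent(19.30, 0.00)))  # 100
```

A name cache hit rate of about 62% next to inode and buffer cache hit rates of 100% is the kind of contrast this section of the report is meant to show.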
Disk Statistics
Disk Statistics (per second)
ut% is the percent of time the disk was busy.
xfers is the number of data transfer commands issued per second.
xfers = ureads + writes + cpreads + greads + gwrites
chain is the average number of 4K blocks per command.
usecs is the average disk round trip time per 4K block.

disk   ut%  xfers  ureads--chain-usecs  writes--chain-usecs  cpreads-chain-usecs
/vol0/plex0/rg0:
8a.16    5   3.69   0.57  1.00  94500  ...
8a.21    4   3.12   0.57  1.00  39500  ...

Spares and other disks:
8b.16    2   1.70   1.70  1.00  10167  0.00 .... .  0.00 .... .  0.00 .... .  0.00 ..
8b.17    0   0.00   0.00  .... .       0.00 .... .  0.00 .... .  0.00 .... .  0.00 .... .
8b.18    0   0.00   0.00  .... .       0.00 .... .  0.00 .... .  0.00 .... .  0.00 .... .
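The xfers relationship in the legend can be checked against a row of the report. The Python sketch below uses the 8b.16 row, where only ureads is nonzero. Both function names are hypothetical, and the per-command time estimate (chain multiplied by usecs) is an assumption derived from the legend definitions, not a value that statit reports.

```python
def total_xfers(ureads, writes, cpreads, greads, gwrites):
    """Sum the per-second transfer types, following the legend formula."""
    return ureads + writes + cpreads + greads + gwrites

def usecs_per_command(chain, usecs_per_block):
    """Rough average time per command, in microseconds.

    Assumption of this sketch: chain is the average number of 4K blocks
    per command and usecs is the average time per 4K block, so their
    product approximates the time for one command.
    """
    return chain * usecs_per_block

# Row 8b.16 from the report above: ureads 1.70, all other types 0.00.
print(total_xfers(1.70, 0.00, 0.00, 0.00, 0.00))  # 1.7
print(usecs_per_command(1.00, 10167))             # 10167.0
```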
Other Resources
For more information about data collection and performance, see the Fundamentals of Performance Analysis course. This advanced course shows you how to:
Analyze data using recommended methodology to correlate performance data into performance analysis information
Monitor performance using performance tools and establish a baseline of expected throughput and response times for storage systems under planned and increasing workloads
Perform capacity planning by monitoring performance and comparing baseline information over time to determine when a storage system will reach maximum capacity
Perform tuning for optimal performance for protocols such as CIFS, NFS, and SAN (including locating resources with tuning guidelines for database scenarios)
Perform bottleneck analysis
Module Summary
In this module, you should have learned to:
Use the sysstat, stats, statit, and options commands
Describe the factors that affect RAID performance
Execute commands to collect data about write throughput
Execute commands to verify the operation of hardware, software, and network components
Identify commands and options used to obtain configuration and status
Exercise
Module 17: Data Collection Tools
Estimated Time: 60 minutes
What command(s) would you use to help detect impending disk problems before they occur?
disk shm_stats