Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Solutions
ProtecTIER version 3.2.2
GA32-0923-06
Note
Before using this information and the product it supports, read the information in “Safety and Environmental notices” on
page xi and “Notices” on page 327.
This edition applies to ProtecTIER version 3.2.2 of the TS7650 Appliance and the TS7650G and to all subsequent
releases and modifications until otherwise indicated in new editions.
This edition replaces GA32-0923-05
© Copyright IBM Corporation 2011, 2012.
US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract
with IBM Corp.
Contents
Figures . . . . . . . . . . . . . . vii Accessing Problem Manager from the command
line . . . . . . . . . . . . . . . . 19
Tables . . . . . . . . . . . . . . . ix System health monitoring . . . . . . . . . 19
System health monitoring command line tools. . 19
Call Home. . . . . . . . . . . . . . . 23
Safety and Environmental notices . . . xi Troubleshooting the server-to-TSSC connection 24
Safety notices . . . . . . . . . . . . . . xi Call Home command line tools . . . . . . . 24
Power cords . . . . . . . . . . . . . . xii User-initiated Call Home . . . . . . . . . 25
Environmental notices . . . . . . . . . . xiii Using SNMP traps . . . . . . . . . . . . 33
Contents v
vi TS7650 and TS7650G Problem Determination and Service Guide
Figures
1. ProtecTIER Service menu . . . . . . . . 16 44. Stand-alone TS7650G (3958 DD1/3958 DD3)
2. ProtecTIER system Health Monitoring menu 18 attachment to the TSSC . . . . . . . . 123
3. Canceling a problem . . . . . . . . . 18 45. Clustered TS7650G (3958 DD1/3958 DD3)
4. Displaying status of a hard drive . . . . . 22 attachment to the TSSC . . . . . . . . 124
5. Displaying status of a power supply and fan 22 46. Disk controller hot-swap drive bays . . . . 131
6. Listing all possible checks . . . . . . . . 23 47. Disk controller front indicators. . . . . . 132
7. Listing status of all health point checks 23 48. DS4700 disk controller rear indicators 133
8. Performing a specific check immediately 23 49. DS3950 disk controller rear indicators 135
9. SNMP trap report . . . . . . . . . . 36 50. Battery unit LEDs . . . . . . . . . . 136
10. Server front view . . . . . . . . . . 38 51. Numeric display LEDs . . . . . . . . 137
11. Server operator information panel . . . . . 39 52. Disk controller seven-segment enclosure ID 139
12. Server rear view . . . . . . . . . . . 40 53. DS4700 disk controller Fibre Channel ports 139
13. RSA controls, connectors, and indicators 42 54. DS3950 disk controller Fibre Channel ports 140
14. Power supply rear view . . . . . . . . 43 55. Storage Manager: Running controller
15. Server rear view . . . . . . . . . . . 51 diagnostics . . . . . . . . . . . . 141
16. RSA controls, connectors, and indicators 52 56. Storage Manager: Downloading firmware 141
17. eServer logo screen for basic input/output 57. Storage Manager: Executing scripts . . . . 142
system (BIOS) setup. . . . . . . . . . 52 58. Unlocking the SFP module latch - plastic
18. Operator information panel . . . . . . . 55 variety . . . . . . . . . . . . . . 143
19. Server light path diagnostics panel . . . . . 55 59. Disk expansion unit front LEDs . . . . . 146
20. Ethernet controller LEDs . . . . . . . . 64 60. Disk expansion unit rear LEDs . . . . . . 147
21. Jumpers on I/O board (3958 DD3) . . . . . 66 61. Power supply and fan unit LEDs . . . . . 148
22. Stopping cluster services . . . . . . . . 70 62. Numeric display LEDs . . . . . . . . 149
23. Alerts Log . . . . . . . . . . . . . 72 63. Example output of versions check. . . . . 173
24. Server front view . . . . . . . . . . 74 64. WWNN main menu . . . . . . . . . 176
25. Operator information panel . . . . . . . 75 65. Display Names By Port menu . . . . . . 176
26. Server rear view . . . . . . . . . . . 77 66. Change Name By Port menu . . . . . . 176
27. Server LEDs rear view . . . . . . . . . 78 67. Change Name By Port menu (updated) 177
28. Power supply rear view . . . . . . . . 80 68. Main Menu for Qlogic adapter command line
29. Server rear view . . . . . . . . . . . 89 interface . . . . . . . . . . . . . 177
30. eServer logo screen for basic input/output 69. HBA Information Menu . . . . . . . . 178
system (BIOS) setup. . . . . . . . . . 89 70. HBA Instance submenu . . . . . . . . 178
31. Operator information panel . . . . . . . 92 71. HBA information display for a port . . . . 179
32. Server light path diagnostics panel . . . . . 92 72. Removing the top cover . . . . . . . . 183
33. Ethernet controller LEDs . . . . . . . . 104 73. Removing the top cover bracket . . . . . 184
34. Jumpers on I/O board . . . . . . . . 106 74. Extracting the RAID adapter carrier . . . . 185
35. Alerts Log . . . . . . . . . . . . 112 75. Removing the RAID adapter from the carrier 186
36. Stand-alone TS7650G (3958 DD4/3958 DD5) 76. Removing the battery carrier card from the
or TS7650 Appliance (3958 DD4) attachment adapter . . . . . . . . . . . . . 186
to the TSSC (x3250 M3 MT4252) . . . . . 114 77. SAS ports on the ServeRAID M5015 controller 187
37. Clustered TS7650G (3958 DD4/3958 DD5) or 78. SAS ports on the ServeRAID BR10i controller 187
TS7650 Appliance (3958 DD4) attachment to 79. Inserting the RAID adapter carrier . . . . 188
the TSSC (x3250 M3 MT4252) . . . . . . 115 80. Removing the top cover . . . . . . . . 190
| 38. Stand-alone TS7650 Appliance (3958 DD4) 81. Removing the top cover bracket . . . . . 191
| attachment to the TSSC (x3250 M4 MT 2538) . 116 82. Extracting the RAID adapter carrier . . . . 192
| 39. Clustered TS7650 Appliance (3958 DD4) 83. Removing the RAID adapter from the carrier 193
| attachment to the TSSC (x3250 M4 MT 2538) . 117 84. Removing the battery carrier card from the
| 40. Stand-alone TS7650G (3958 DD5) attachment adapter . . . . . . . . . . . . . 194
| to the TSSC (x3250 M4 MT 2538) . . . . . 118 85. SAS ports on the ServeRAID M5015 controller 194
| 41. Clustered TS7650G (3958 DD5) attachment to 86. Inserting the RAID adapter carrier . . . . 195
| the TSSC (x3250 M4 MT 2538) . . . . . . 119 87. Product label on server . . . . . . . . 200
42. Stand-alone TS7650 Appliance (3958 DD3) 88. RSA II settings . . . . . . . . . . . 204
attachment to the TSSC . . . . . . . . 121 89. Login Profiles window . . . . . . . . 206
43. Clustered TS7650 Appliance (3958 DD3) 90. Login Profile dialog box . . . . . . . . 207
attachment to the TSSC . . . . . . . . 122 91. RSA event log . . . . . . . . . . . 207
Safety notices
Observe the safety notices when using this product. These safety notices contain
danger and caution notices. These notices are sometimes accompanied by symbols
that represent the severity of the safety condition.
Most danger or caution notices contain a reference number (Dxxx or Cxxx). Use
the reference number to check the translation in the IBM Systems Safety Notices,
G229-9054 manual.
The sections that follow define each type of safety notice and give examples.
Danger notice
Caution notice
Power cords
For your safety, IBM provides a power cord with a grounded attachment plug to
use with this IBM product. To avoid electrical shock, always use the power cord
and plug with a properly grounded outlet.
IBM power cords used in the United States and Canada are listed by
Underwriter’s Laboratories (UL) and certified by the Canadian Standards
Association (CSA).
For units intended to be operated at 115 volts: Use a UL-listed and CSA-certified
cord set consisting of a minimum 18 AWG, Type SVT or SJT, three-conductor cord,
a maximum of 15 feet in length and a parallel blade, grounding-type attachment
plug rated 15 amperes, 125 volts.
For units intended to be operated at 230 volts (U.S. use): Use a UL-listed and
CSA-certified cord set consisting of a minimum 18 AWG, Type SVT or SJT,
For units intended to be operated at 230 volts (outside the U.S.): Use a cord set
with a grounding-type attachment plug. The cord set should have the appropriate
safety approvals for the country in which the equipment will be installed.
IBM power cords for a specific country or region are usually available only in that
country or region.
Environmental notices
The environmental notices that apply to this product are provided in the
Environmental Notices and User Guide, Z125-5823-xx manual. A copy of this manual
is located on the publications CD.
Note: Cache modules and cache controllers are supported separately within the
TS7650G. If the problem is known to be in the IBM attached storage component,
select the hardware option and enter the appropriate Machine Type and serial
number for the component. If the attached storage is not IBM branded, contact the
appropriate service provider for the component. For the TS7650 Appliance, all
components inside the frame are supported under the 3958 AP1 MTM.
Terminology
IBM offers two virtualization solutions:
TS7650
When used alone, this term signifies IBM's family of virtualization
solutions that operate on the ProtecTIER platform.
TS7650 Appliance or appliance
These are terms for IBM's self-contained virtualization solution from the
TS7650 family that includes a disk storage repository. The TS7650
Appliance consists of the following:
Server The 3958 AP1 server is based on the IBM System x3850 X5 Type
7145-PBR at the ProtecTIER version 3.2 release. When used as a
server in the TS7650 Appliance, its machine type and model are
3958 AP1. Use this machine type and model for service purposes.
System console
The system console is a TS3000 System Console (TSSC). This
document uses the terms system console and TSSC interchangeably.
Disk controller
The disk controller for the TS7650 Appliance is an IBM Feature
Code 3708: 4.8 TB Fibre Channel Disk Controller. Use this feature
code for service purposes.
Disk expansion unit
The disk expansion unit for the TS7650 Appliance is an IBM
Feature Code 3707: 4.8 TB Fibre Channel Disk Expansion Unit. Use
this feature code for service purposes.
IBM Tivoli Assist On-site (AOS)
IBM Tivoli Assist On-site (AOS) is a web-based tool that enables a
remote support representative in IBM to view or control the
management node desktop. More information is located at the
Tivoli AOS website.
TS7650G or Gateway
These are terms for IBM's virtualization solution from the TS7650 family
that does not include a disk storage repository, allowing the customer to
choose from a variety of storage options. IBM does support two TS7650
Gateway servers in a single frame (a two-node cluster). The TS7650G
consists of the following:
Under IBM best practices, the TS7650G also contains the following:
Disk controller
The customer must choose the disk controller for use with the
TS7650G. A list of compatible controllers is located at the IBM Tape
Systems Resource Library website in the TS7650/TS7650G ISV and
interoperability matrix document.
Disk expansion unit
The customer must choose the disk expansion unit for use with the
TS7650G. A list of compatible expansion units is located at the IBM
Tape Systems Resource Library website in the TS7650/TS7650G ISV
and interoperability matrix document.
IBM Tivoli Assist On-site (AOS)
IBM Tivoli Assist On-site (AOS) is a web-based tool that enables a
remote support representative in IBM to view or control the
management node desktop. More information is located at the
Tivoli AOS website.
OpenStorage
OpenStorage allows ProtecTIER to be integrated with NetBackup to
provide the means for backup-to-disk without using a virtual tape library
(VTL) emulation. Using a plug-in that is installed on an
OpenStorage-enabled media server, ProtecTIER can implement a
communication protocol that supports data transfer and control between
Websites
IBM maintains pages on the World Wide Web where you can get the most
up-to-date information about your product, including documentation and the most
recent downloads.
Be sure to visit the support page for the TS7650 Appliance or the TS7650G,
complete with FAQs, parts information, technical hints and tips, technical
publications, and downloadable files, if applicable. The support pages are available
at IBM Tape systems.
The translated publications for this product are included with the product. These
documents and product specification sheets are also available from the IBM
Support Portal.
Note: This product is equipped with a Software Call Home feature. When enabled,
it notifies IBM Service of software error events. Not all countries currently support
this feature. Contact your next level of support for more information. The Software
Call Home feature is supported in all EMEA/CEEMEA countries.
Choose the software option if you are uncertain if the problem involves TS7650
software or TS7650 hardware. Choose the hardware option only if you are certain
the problem solely involves the TS7650 hardware.
When calling IBM for service regarding the TS7650, follow these guidelines for the
software and hardware options:
Software option
Identify the TS7650 as your product and supply your customer number as
In the U.S. and Canada, these services are available 24 hours a day, 7 days a week.
In the U.K., these services are available Monday through Friday, from 9:00 a.m. to
6:00 p.m. In all other countries, contact your IBM reseller or IBM marketing
representative.
When calling IBM for support for the TS7650 Appliance or TS7650G, follow these
guidelines:
v If you are certain the problem involves the TS7650 Appliance or TS7650G
software, or if you are uncertain whether the problem involves the software or
Note: US or Canadian Customers calling 1 (800) IBM SERV are asked to select a
hardware or software option. Unless you are certain the problem involves the
hardware, choose the software option.
Related publications
The following documents provide information about the TS7650 Appliance and
TS7650G components and related hardware.
TS7650G-only publications
v IBM System Storage TS7650G ProtecTIER Deduplication Gateway Installation
Roadmap Guide, GA32-0921
To submit any comments about this book or any other IBM System Storage TS7600
with ProtecTIER documentation:
v Send your comments by email to starpubs@us.ibm.com. Be sure to include the
following information:
– Exact publication title and version
– Publication form number (for example, GC53-1196-03)
– Page, table, or illustration numbers that you are commenting on with a
detailed description of any information that should be changed
ProtecTIER Manager provides the Ethernet connection to the server Ethernet and
RSA ports. It also has the client/server application software for managing the
TS7650 Appliance and the TS7650G. Refer to the IBM System Storage ProtecTIER
User's Guide for Enterprise Edition and Appliance Edition, GA32-0922, for installation
and setup information. The TSSC can run ProtecTIER Manager for the IBM Service
Representative.
Note: The optimal screen resolution for the ProtecTIER Manager is 1280 x 1024.
Available configurations
Both the TS7650 Appliance and the TS7650G are available in a stand-alone
configuration (with one server) and a clustered configuration (with two servers).
For more information regarding the setup of these configurations, see the section
titled “Configurations” in the IBM System Storage TS7650 ProtecTIER Deduplication
Appliance Installation Roadmap Guide, GA32-0920, or the IBM System Storage TS7650G
ProtecTIER Deduplication Gateway Installation Roadmap Guide, GA32-0921.
Disk configuration
The server supports three RAID configurations.
Configuration Application
RAID 1 Server internal hard disk drives
RAID 5 User data on attached disk storage
RAID 10 Metadata on attached disk storage
Problem resolution
This section provides problem resolution procedures.
Important: For any disaster recovery situations, contact your next level of support.
If there is a problem with the RSA, see Appendix A, “Logging into the 3958
DD1/3958 DD3 server through the RSA connection,” on page 255 and Chapter 2,
“System troubleshooting tools,” on page 15.
Note: To eliminate confusion, systems are considered identical only if they are
exactly alike in all the following criteria:
v Machine type and models
v BIOS level
v Adapters/attachments in the same locations
v Address jumpers/terminators/cabling
v Software versions and levels
v Diagnostics code (version)
v Configuration options set in the system
v Setup for the operation system control files
LC wrap plugs are required to run the loopback test at the host bus adapter or at
the end of cables. The part numbers for the wrap plugs are as follows:
v 24P0950 (wrap connector and coupler kit)
– 11P3847 (wrap connector)
– 05N6766 (coupler)
In all cases, refer to the manufacturer's documentation provided with the disk
storage devices.
Attention: Before you proceed, become familiar with the server by reviewing the
following topics.
v For 3958 DD3 or 3958 DD1:
– “Power, controls, and indicators” on page 38
– “Front view” on page 38
– “Rear view of the 3958 DD3/DD1 server” on page 40
v For 3958 DD5 or 3958 DD4:
– “Power, controls, and indicators” on page 73
– “Front view” on page 74
– “Rear view” on page 77
You can also use IBM System x3850 M2 and x3950 M2 Types 7141, 7233, 7144, and
7234 Problem Determination and Service Guide or IBM System x3850 X5 and x3950 X5
Types 7145 and 7146 Problem Determination and Service Guide as a guide for errors
and topics related to X System.
Procedure
Attention: Before you proceed, become familiar with the disk controller by
reviewing Figure 46 on page 131, Figure 47 on page 132, Figure 50 on page 136,
“Disk controller rear indicators” on page 133 and Figure 53 on page 139.
v If you have the DS4700 disk controller, you can also use the IBM System Storage
DS4700 Express Storage Subsystem Installation, User's and Maintenance Guide as a
guide in diagnosis.
v If you have the DS3950 disk controller, you can also use the IBM System Storage
DS3950 Storage Subsystem Installation, User's, and Maintenance Guide as a guide in
diagnosis.
Procedure
To troubleshoot problems with the disk controller, answer the following questions.
1. Are all disk controller LEDs solid green or flashing green?
v Yes, return the TS7650 Appliance or TS7650G to the customer.
v No, go to step 2.
2. Are any amber LEDs lit on the disk controller?
v No, return the TS7650 Appliance or TS7650G to the customer.
v Yes, go to “Disk controller components” on page 131 and continue with the
diagnosis and any required removal and replacement procedures, then return
here.
v If the diagnostics do not detect any hardware errors but the problem remains
during server operations, use the Problem Manager to check for any
problems listed as open. See “Problem Manager” on page 17 for information
about how to display an open problem, which can aid in resolution of the
problem.
3. Verify that you have completed all the repairs. Are all disk controller LEDs
solid green or flashing green?
v Yes, return the TS7650 Appliance or TS7650G to the customer. If you found a
Problem Manager open problem related to the error, see “Problem Manager”
on page 17 for information about how to close the open problem.
v No, see “User-initiated Call Home” on page 25 and contact the next level of
support.
Attention: If you are not already familiar with the disk expansion unit, begin by
reviewing Figure 46 on page 131 (the hot-swap drive bays for the disk controller
and the disk expansion unit are identical), “Disk expansion unit front LEDs” on
page 146, and “Disk expansion unit rear LEDs” on page 147.
You can also use the IBM System Storage DS4700 Express Storage Subsystem
Installation, User's and Maintenance Guide and the IBM TotalStorage DS4000 EXP810
Storage Expansion Unit Installation, User's and Maintenance Guide as guides in
diagnosis.
Procedure
1. Are all disk expansion unit LEDs solid green or flashing green?
v Yes, return the TS7650 Appliance or TS7650G to the customer.
v No, go to step 2.
2. Are any amber LEDs lit on the disk expansion unit?
v No, return the TS7650 Appliance or TS7650G to the customer.
v Yes, go to Chapter 7, “Disk expansion units,” on page 145 and continue with
the diagnosis and any required removal and replacement procedures, then
return here.
v If the diagnostics do not detect any hardware errors but the problem remains
during server operations, use the Problem Manager to check for any
problems listed as open. See “Problem Manager” on page 17 for information
about how to display an open problem, which can aid in resolution of the
problem.
3. Verify that you have completed all the repairs. Are all disk expansion unit
LEDs solid green or flashing green?
v Yes, return the TS7650 Appliance or TS7650G to the customer. If you found a
Problem Manager open problem related to the error, see “Problem Manager”
on page 17 for information about how to close the open problem.
v No, see “User-initiated Call Home” on page 25 and contact the next level of
support.
Several normal activities can generate a preferred path critical event on the disk
controllers in a TS7650 Appliance or TS7650G.
v Restarting a server in a clustered configuration
v Failover between servers in a clustered configuration
Each scan event causes the host bus adapter (HBA) initiators to query both the A
controller and the B controller of the disk controllers. This results in the Automatic
Volume Transfer/Automatic Data Transfer (AVT/ADT) being signaled and the
logical units (LUNs) being moved to those controllers for a brief moment. In a
properly configured environment, after the devices are actually used by the server,
the logical drives move back to the preferred path. In a configuration that does not
access every LUN (idle node, archive data, and so on) those LUNs might not be
accessed on a sufficiently regular basis.
Procedure
To resolve a preferred path critical event, try the following corrective actions in
order until one of them is successful.
1. Within Storage Manager, select Advanced > Recovery > Redistribute logical
drives.
Tip: To enable Call Home, go to the TSSC General Settings page. The Call Home
option allows you to select either a Modem or Ethernet interface. Set the Call
Home option to use the Ethernet interface for the most reliable error notification.
The TS7650G and the TS7650 Appliance provide four Call Home capabilities:
Problem Call Home, Heartbeat Call Home, Test Call Home, and User-Initiated Call
Home; descriptions follow. RAS sends data files that may be helpful to IBM
Support Center personnel for all four types of Call Home. These data files include
error logs and configuration information, such as the Machine Reported Product
Data (MRPD) log.
Test Call Home
The IBM Service Representative sends a Test Call Home signal after
enabling the Call Home feature during initial installation. You can also
send a Test Call Home to ensure that the setup is correct and that the
appliance or gateway can successfully open a Problem Management
Record (PMR) in the IBM Remote Technical Assistance Information
Network (RETAIN).
Problem Call Home
When RAS detects a problem, RAS initiates a Call Home operation to
create a PMR in RETAIN. The PMR is a single page of text data that
enables the Support Center or the Service Representative to access an
action plan and a list of applicable FRU components.
Heartbeat Call Home
To ensure proper ongoing Call Home functionality, the system sends a
Heartbeat Call Home on a regularly-scheduled basis. The heartbeat interval
is user-defined.
User-Initiated Call Home
You can manually initiate Call Home from the TSSC GUI to collect a
product engineering (PE) package.
For instructions on how to install, configure or troubleshoot Call Home, see the
IBM System Storage TS3000 System Console (TSSC) Maintenance Information.
For more instructions on using Call Home to aid in system troubleshooting, see
Chapter 2, “System troubleshooting tools,” on page 15.
For instructions on how to install, configure or troubleshoot Call Home, see the
IBM System Storage TS3000 System Console (TSSC) Maintenance Information.
To ensure that Call Home is enabled on the server, log into the server with user ID
root and password admin (or an ID and password provided by the customer).
Then you can either run the Call Home enablement command from the command
line, or enable Call Home using the ProtecTIER Service menu.
v To enable Call Home from the command line, enter the following command:
/opt/ras/bin/rsCerCHFunction -e
v To enable Call Home from the ProtecTIER Service menu (see “ProtecTIER
Service menu” on page 16), type the option numbers to select the appropriate
options:
– For ProtecTIER v.2.5 or earlier, select Call Home Commands > Enable Call
Home.
– For ProtecTIER v.3.1 or later, select Problem Alerting > Enable/Disable.
If the return message is Call Home is already enabled, then the problem might be
a communication failure between the server and the TSSC. Verify that the server is
listed within the Attached Systems list of the TSSC, and under the Call Home
Switched column, there should be a 1 (enabled) listed in the row for the specific
server.
If the above criteria are correct, run a test Call Home (see “Running a test Call
Home”).
To verify that Call Home is installed and enabled, you can run a test Call Home.
v To run a test Call Home from the command line, enter the following command:
/opt/ras/bin/rsCerCHTest
v To run a test Call Home from the ProtecTIER Service menu (see “ProtecTIER
Service menu” on page 16), type the option numbers to initiate a test Call Home.
– For ProtecTIER V2.5 or earlier, select Call Home Commands > Test Call
Home.
If the return message is Test Call Home sent successfully, then the system has
completed the test. Contact your next level of support if you encounter any of the
following problems:
v The test Call Home fails.
v The test Call Home does not show in the Call Home Queue.
v The Call Home Switched column retains an NA after enabling.
The troubleshooting tools included with the TS7650 Appliance and the TS7650G
can perform the following tasks:
v Collect system information to aid in problem determination (see “Using
Dynamic System Analysis” on page 16)
v Manage problems (see “Problem Manager” on page 17)
v Monitor system health (see “System health monitoring” on page 19)
v Capture data and report errors automatically using Call Home (see “Call Home”
on page 23)
v Collect system logs and deliver them to the Call Home database (see
“User-initiated Call Home” on page 25)
These are discussed individually in the following sections after instructions on how
to run any of the system troubleshooting command line tools.
Procedure
To run the command line interface tools, complete the following steps:
1. From the TSSC login screen, log in with user ID service and password
service.
2. Right-click the TSSC desktop and select Console Actions > Console
Configuration Utility.
3. If prompted for a user ID and password, enter service and service,
respectively.
4. Click Attached Systems.
5. In the Attached Systems list:
a. Verify that the server is listed.
b. Note or record the IP address of the server. You will need the IP address
when you open the SSH session.
c. Click Close this window as necessary to return to the TSSC desktop.
6. If the customer system administrator has changed the password for the
ptadmin account from the default value ptadmin, obtain the new password.
7. Open an SSH session with the server:
a. Right-click the TSSC desktop and then click Terminal.
b. At the Terminal command prompt type the command tsys to display a list
of attached systems and type the number of the server to which you want
to connect or type the following:
ssh ptadmin@172.31.1.xxx
where xxx is the last octet of the server's IP address in the TSSC's Attached
Systems list, and then press the Enter key.
The ProtecTIER Service menu eliminates the need for you to remember commands
or look them up in the documentation. By navigating the menus and sub-menus,
you can easily understand and choose the possible actions or queries to perform.
To access the ProtecTIER Service menu, log in with username root and password
admin. Then type the appropriate command from any directory on the server.
v For ProtecTIER V2.5 or earlier, type rasMenu.
v For ProtecTIER V3.1 or later, type menu.
The script resides in the /opt/ras/bin directory. It displays a menu. Type the
number of the operation you want to use and press Enter. The top-level menu is
similar to the example shown in Figure 1.
----------------------------------------------------------------------
ProtecTIER Service Menu running on rasgen
----------------------------------------------------------------------
1) ProtecTIER Configuration (...)
2) Manage ProtecTIER services (...)
3) Health Monitoring (...)
4) Problem Alerting (...)
5) Version Information (...)
6) Generate a service report
7) Generate a system view
8) Update ProtecTIER code
E) Exit
----------------------------------------------------------------------
>>> Your choice?
Note: Some of the items lead to sub-menus, which are denoted by (...). See
“Problem Manager” on page 17 for the Health Monitoring menu structure.
Procedure
1. To access DSA from the ProtecTIER Service menu, select Generate a service
report. The following options display:
Problem Manager
Problem Manager is the Reliability, Availability, and Serviceability (RAS)
component that processes problems detected by system health monitoring.
Problem Manager consolidates the detected problems, filters out duplicate and
sympathetic problems (secondary problems that are generated by primary
problems), and maintains a problem log of the results (all open primary problems)
for use in service actions and information.
Problem Manager tools are available from the ProtecTIER Service menu (see
“ProtecTIER Service menu” on page 16) and from the command line. The preferred
way to access the tools is through the ProtecTIER Service menu.
To access Problem Manager tools from the ProtecTIER Service menu, type the
number of the operation you want to use and press Enter.
v To get a list of open problems, type the numbers to select the appropriate
options.
– For ProtecTIER V2.5 or earlier, select Problem management > List open
problems.
– For ProtecTIER V3.1 or later, select Health Monitoring > List open problems.
B) Back
E) Exit
----------------------------------------------------------------------
>>> Your choice?
v To close or cancel an open problem once it has been fixed, choose the option for
Problem management and then follow the on-screen instructions to either run
the full refresh process or skip it, and respond Y or N to close out the problem
records. Read the screen carefully to determine the correct choice of action in
your responses.
Service Mode is used to inhibit Call Home, SNMP, and email alerts. Service Mode
is available with ProtecTIER V3.1 or later. It should be enabled at the start of a
service call and disabled at the End-of-call. A health check shows degraded when
Service Mode is enabled. Service Mode is automatically disabled after 48 hours.
To enable Service Mode, select Health Monitoring > Service Mode. Then type yes
and press Enter.
If the ProtecTIER Service menu is not available, run the Problem Manager
commands from a server command line.
Get open problems: The following command retrieves and displays all open
problems from Problem Manager. Each problem is identified by a unique 12-digit
problem ID.
rsCerPMGetOpenProblems
Arguments
This command takes no arguments.
Cancel problem: This command closes an open problem in the problem log after it
is fixed.
rsCerPMCancelProblem problemID
Arguments
Other than the problem ID, this command takes no arguments.
problemID: The unique 12-digit number (displayed by
rsCerPMGetOpenProblems) that identifies the problem.
The preferred method for monitoring system health is using the ProtecTIER Service
menu. See “Accessing Problem Manager from the ProtecTIER Service menu” on
page 17. If the ProtecTIER Service menu is not available you can monitor system
health using the command line tools.
Note: The ProtecTIER server only calls home for the Disk Controller and
expansion units when attached to the TS7650 Appliance.
v Point identifier, which together with the path uniquely identifies the
health point.
v Path format. This includes the node identifier (Node 0 or Node 1). If a
disk repository related element, the path includes the disk array
identifier and the disk tray identifier.
Command Description
rsCerHMDisplay System health monitoring display. This tool
is used to view the results of the latest check
that was performed on the system. This
utility is also available through the
ProtecTIER Service menu (see “ProtecTIER
Service menu” on page 16). Select System
Health Monitoring > Display All System
Health Points.
rsCerHMStatusCtl System health monitoring status control.
This tool is used to immediately invoke one
or more checks on the system. To view the
results, type the display command.
rsCerHMDisplay -i id -p path
-v Server -x -n
DiskArray
Networking
Software
-vall
--all
--gall
-L
-Lv
-?
-H
-i id -p path
Display the status of the specified healthpoint (ID and path).
-v Display the status of the specified virtual object.
-vall Display the status of all virtual objects.
--all Display the status of all health points.
--gall Display the status of all health points in groups.
-L Return the status of the system:
0 All OK or unconfigured
1 Any number of OFFLINEs
2 Any number of WARNINGs
3 Any number of DEGRADEDs
4 Any number of FAILEDs
-Lv Display the status of the system in words.
-x Convert any output to XML.
-n Do not check for the system state before trying to run.
rsCerHMStatusCtl -r
-s
-l
-b -e check_id
-all -b
Some examples of arguments are shown in Figure 6 on page 23, Figure 7 on page
23, and Figure 8 on page 23.
> rsCerHMStatusCtl -s
There are currently 7 checks registered
Call Home
Call Home is the IBM means to automatically capture data and report errors in the
TS7650 Appliance and the TS7650G. This minimizes the need for human
intervention in gathering and offloading data to IBM support.
Procedure
1. Access the ProtecTIER Service menu (see “ProtecTIER Service menu” on page
16).
2. To ensure that Call Home is enabled on the server, type the option numbers to
select the appropriate options.
v For ProtecTIER V.2.5 or earlier, select Call Home Commands > Enable Call
Home.
v For ProtecTIER V.3.1 or later, select Problem Alerting > Enable/Disable.
3. If the return message is Call Home is already enabled, then the problem
might be a communication failure between the server and the TSSC. Verify that
the server is listed within the Attached Systems list of the TSSC, and under the
Call Home Switched column, there should be a 1 (enabled) listed in the row
for the specific server.
4. If the above criteria is correct, type the option numbers to initiate a test Call
Home.
v For ProtecTIER V2.5 or earlier, select Call Home Commands > Test Call
Home.
v For ProtecTIER V3.1 or later, select Problem Alerting > Send a Test Call
Home.
Results
If the return message is Test Call Home sent successfully, then the system has
completed the test.
What to do next
Contact your next level of support if any of the following problems occur:
v Failures are encountered.
v The test Call Home does not show in the Call Home Queue.
v The Call Home Switched column retains an NA after enabling.
Command Description
rsCerCHTest Call Home test
rsCerCHSetHBInterval Heartbeat interval modification
rsCerCHFunction Call Home enabling or disabling
This command invokes the rsCerCHDispatch command, which generates a test Call
Home signal to the TSSC and to IBM RETAIN.
rsCerCHTest
Arguments
This command takes no arguments.
This command sets the interval between Heartbeat/MRPD call home events. In a
clustered environment, running this command for one node changes the settings
for both nodes.
rsCerCHSetHBInterval <intervalInDays (1 to 14)>
Arguments
intervalInDays: how often (in days) the Heartbeat/MRPD Call Home
events are sent. Valid values are 1 to 14.
Other than the interval, this command takes no arguments.
When you manually initiate Call Home, you can customize the information to be
collected by specifying combinations of the following sources:
Chapter 2. System troubleshooting tools 25
Date The server collects information from log files modified after the specified
date and time.
Component
The server collects information from log files defined in component-specific
log definition files.
File name
The server collects information from all log files in a specified directory.
You can collect logs manually at the TSSC by the following two methods:
From the TS3000 System Console
This is the preferred method. See “Using the TS3000 System Console
graphical user interface.”
From the TS7650 Terminal command line
See “Using the TS3000 System Console command line” on page 27.
Procedure
1. If not already logged into the TSSC, log in using user ID service and
password service.
2. Right-click the TSSC desktop and then click Console Actions > Console
Configuration Utility.
3. If prompted for a user ID and password, enter service and service,
respectively.
4. Click Attached Systems.
5. Verify that the server is listed and then click Close this window.
6. Click PE Packages.
7. Click Subsystem Log Retrieval.
8. On the menu in the Select Logs area, click one of the following:
v 3958DD1
v 3958DD3
v 3958DD4
v 3958DD5
v 3958AP1
9. On the Select an Attached System menu, click the server you want.
10. In the Start Time area, enter or select values for the year, month, day, and
hour corresponding to some time before the occurrence of the problem.
11. (Optional) From the Component list, select one or more components. The
components are as follows:
Command syntax
Issue the log collection command in a secure shell (SSH) command line or by using
the ProtecTIER Service menu.
v To issue the log collection command in an SSH command line, type the
following command:
rsCerLogCollect [-d x] [-c x] [-b/-s] [-f "x"] [-o x] [-h/-?]
Parameters
The parameters for the log collection command in a secure shell (SSH) command
line are defined as follows:
-d date
This optional parameter defines the oldest modification date of files to be
collected. If no date is specified, the default is 24 hours before the
command is issued. You can use any date format supported by the generic
Linux date parameter -d. The following are examples of supported date
formats:
v yyyy-mm-dd (for example, 2010–10–24). This defaults to midnight at the
start of the specified date.
v yyyy/mm/dd (for example, 2010/10/24). This defaults to midnight at the
start of the specified date.
v dd-MMM (for example, 24-OCT). This defaults to midnight at the start of
the specified date of the current year.
v hh:mm:ss (for example, 02:34:00). This defaults to the time specified of the
current date and year.
v hh:mm:ss dd MMM yyyy (for example, 02:34:00 24 OCTOBER 2010).
v h:mm dd MMM (for example, 2:34 24 OCT). This defaults to the current
year.
-c component name
This parameter specifies one or more component-specific log definition file
to use for log collection. The components are as follows:
COMMON
This log definition file specifies items such as Linux settings and
command line history.
SERVER
This log definition file specifies various logs generated by the
server.
ProtecTIER (PT)
This log definition file specifies various ProtecTIER management
system logs.
DISK This log definition file specifies various logs generated by the disk
controller for the attached disk repository.
RAS This log definition file specifies Reliability, Availability, and
Serviceability (RAS) code logs used for RAS code analysis.
all This log definition file specifies that the logs in all log definition
files listed above be collected.
Examples
rsCerLogCollect -c all -d Mar 06 12:36
This command collects logs defined in all component-specific log definition
files modified after 12:36, March 6, of the current year, including a subset
of the logs in the PT log definition file.
rsCerLogCollect -c all -d 2010-12-05 -s
This command collects logs defined in all component-specific log definition
files modified after 00:00, December 5, 2010. Only a subset of the logs in
the PT log definition file are collected.
rsCerLogCollect -f "/dir1/log"
This command collects the following files:
v All files in the /dir1/log directory
v All files defined in the COMMON log definition file that were modified
in the last 24 hours
rsCerLogCollect -o -c PT -b /usr/package.tar.gz
This command collects all files defined in the COMMON log definition file
that were modified in the last 24 hours, including all logs in the PT log
definition file, and creates the package file /usr/package.tar.gz.
rsCerLogCollect -c DISK RAS
This command collects logs defined in the COMMON, DISK, and RAS log
definition files that were modified in the last 24 hours.
rsCerLogCollect -c SERVER -f "/directory/logname*" -d 2010-04-10 07:30:55
This command collects the following files:
Procedure
where xxx is the last octet of the server's IP address in the TSSC Attached
Systems list, and then press the Enter key.
c. At the password prompt, type the password for the ptadmin account you
obtained in step 5 (or, if the password has not changed from the default
value, type ptadmin) and then press the Enter key.
7. Do one of the following:
v From the ProtecTIER Service menu (see “ProtecTIER Service menu” on page
16), type the appropriate number for the sub-menu and action to be taken.
v At the SSH command prompt, type the following command:
rsCerLogCollect [-d x] [-c x] [-b/-s] [-f "x"] [-o x] [-h/-?]
with the parameters defined as desired, and then press the Enter key.
Define the parameters using the following guidelines:
– Define the -c parameter to best fit the situation; for example, use SERVER
with the -c parameter for server issues and use DISK with the -c
parameter for disk repository system issues. If the location of the issue is
uncertain, use all with the -c parameter.
– Specify a time with the -d parameter corresponding to the time of the
occurrence of the issue.
8. Allow the process to finish collecting logs and create the package file. (This
can take from several minutes to more than an hour.) When the process is
complete, the message “Exiting Successfully” is displayed and the SSH
command prompt returns.
9. At the SSH command prompt, type the following:
where
v xxx is the last octet of the server's IP address in the TSSC Attached Systems
list.
v /serverFilePath/fileName is the full path and file name of the log package file
on the server. If you do not specify an alternate with the -o parameter, the
default is /opt/ras/pepackage/out/latestPepackage.tar.gz. If you did
specify an alternate with the -o parameter, use that value.
Note: The machine type selections are limited to the machine types that
are listed in the Attached Systems list. (For example, if there are only
3958DD3s in the Attached Systems list, that is the only machine type
option in the list.)
d. In the Action area, click Click here.
e. If you specified a target directory different from the default in step 9 on
page 30:
1) In the Specify a different offload directory field, type the file path of
the directory you specified in step 9 on page 30.
2) Click Refresh.
Procedure
1. Run the SNMP trap catcher program on the SNMP server.
2. Open the SNMP report and review the information.
3. Make note of the defective component's name (or type) and FRU ID.
Note: The FRU ID (part number) for both CRU and FRU components, is
expressed as a FRU ID.
4. Verify the alert received was not caused by an easily-resolved condition, such
as a loose power cord or a defective cable.
For detailed information about the 3958 DD4, 3958 DD5, and 3958 AP1 ProtecTIER
servers, see Chapter 4, “ProtecTIER server (3958 DD4, DD5, and AP1),” on page 73.
Component labeling
The colors of component labels indicate different types of components.
Note: Orange can also indicate the touch points on hot-swap components.
Use the IBM System x3850 M2 and x3950 M2 Types 7141, 7233, 7144, and 7234
Problem Determination and Service Guide for removing or installing a specific
hot-swap component for any additional procedures that you might have to
perform before you remove or install the component.
Note: Adapters in the TS7650 (including the hot-swap adapter ports) are not
hot-swap capable due to the limitations of the Linux operating system. The system
must be shut down when an adapter needs to be replaced.
Front view
This topic describes the front view of the 3958 DD3/3958 DD1 server.
1 2
3 4
ts760024
Scalability LED
Electrostatic-discharge connector
Hard disk drive activity light-emitting diode (LED): When this LED is flashing, it
indicates that the drive is in use.
Hard disk drive status LED: When this LED is lit continuously, that individual
drive is faulty. When the drive is connected to the integrated serial attached SCSI
(SAS) controller with RAID capabilities, a flashing status LED indicates that the
drive is a secondary drive in a mirrored pair and the drive is being synchronized.
DVD-eject button: Press this button to release a CD from the DVD-ROM drive.
Note: The drive does not eject the CD until issued a umount /mnt command.
DVD drive activity LED: When this LED is lit, it indicates that the DVD-ROM
drive is in use.
Operator information panel: This panel contains controls, LEDs, and connectors.
Figure 11 on page 39 shows the controls, LEDs, and connectors on the operator
information panel.
1 2
ts760025
Ethernet port activity LEDs
Power-control button cover Locator button/locator LED
The following controls and LEDs are on the operator information panel:
v Power-control button cover: Slide this cover over the power-control button to
prevent the server from being turned off accidentally.
v Power-control button: Press this button to turn the server on and off manually.
v Power-on LED: When this LED is lit and not flashing, it indicates that the server
is turned on. When this LED is flashing, it indicates that the server is turned off
and still connected to an ac power source. When this LED is off, it indicates that
ac power is not present or the power supply or the LED itself has failed.
Note: If this LED is off, it does not mean that there is no electrical power in the
server. The LED might be burned out. To remove all electrical power from the
server, you must disconnect the power cords from the electrical outlets.
v Ethernet-icon LED: This LED lights the Ethernet icon.
v Ethernet activity LEDs: When these LEDs flash, they indicate that there is
activity between the server and the network on the indicated port.
v Locator LED: Use this LED to visually locate the server among other servers.
You can use the System Locator LED button under System Status in a remote
supervisor adapter (RSA) session to light this LED remotely; see Appendix A,
“Logging into the 3958 DD1/3958 DD3 server through the RSA connection,” on
page 255 for the procedure to open a remote supervisor adapter (RSA) session.
To light the LED manually, press the locator button. This LED is also lit during
startup.
In clustered configurations, when this LED flashes, it indicates that the server is
the primary server. When this LED is lit continuously, it indicates that the server
is the secondary server.
v Locator button: Press this button to turn the locator LED on and off manually.
v Information LED: When this LED is lit, it indicates that a suboptimal condition
exists in the server. Light path diagnostics lights an additional LED to help
isolate the condition. This LED and LEDs on the light path diagnostics panel
remain lit until you resolve the condition or you press the remind button.
v System-error LED: When this LED is lit, it indicates that a system error has
occurred. An LED on the light path diagnostics panel is also lit to help isolate
the error.
Gigabit Gigabit
Ethernet 2 LED Ethernet 1 LED
Gigabit Ethernet 2
Gigabit Ethernet 1
System-error LED
Power-on LED Locator LED Remote Supervisor Adapter II
Power
supply 1
AC
Power
DC
Power
Power-
supply
USB error
SAS
Power-
supply 2
System serial
SMP Expansion Port 1
link LED SMP Expansion Port 3
SMP Expansion Port 1
SMP Expansion Port 3
ts760026
SMP Expansion Port 2
link LED
link LED
SMP Expansion Port 2
Note: An LED that is off can also indicate a burned out LED. To remove
all power from the server, you must disconnect the power cords from the
electrical outlets.
Locator LED
Use this LED to visually locate the server among other servers. You can
use IBM Director to light this LED remotely or press the locator button to
light the LED manually. This LED is also lit during startup.
System-error LED
When this LED is lit, it indicates that a system error has occurred. An LED
on the light path diagnostics panel is also lit to help isolate the error.
Gigabit Ethernet 2 LED
When this LED flashes, it indicates activity between the server and the
network. When this LED is lit continuously, it indicates that there is an
active connection on the Ethernet port.
Gigabit Ethernet 2 connector
Use this connector to connect the server to a network.
External power
supply connector
Ethernet connector
(RJ-45)
Power LED
When this LED is lit, the RSA is receiving power from the server or from
an external power supply.
Reset button
Trained service technician only: To manually reset the RSA, insert the open
end of a paper clip (or similar object) into the recessed reset button and
press.
Mini-USB connector
This connector is not supported.
Video connector
Use this connector to connect the server monitor.
When the amber LED on the rear panel of a server power supply (Figure 14) is lit,
it indicates an error condition within the power supply.
1
2
3
ts760673
Procedure
The server is designed so that any LEDs that are lit remain lit when the server
shuts down, as long as the ac power source is operable, and the power supply can
provide +5 V dc. See “Power supply error LEDs” on page 43 for troubleshooting
with the power supply and power-on LEDs.
If the system-error LED on the front of the server is on, one or more LEDs inside
the server or on the power supply are on. The server has LEDs to help you
identify problems with some server components. These LEDs are part of the light
path diagnostic feature built into the server. By following the light path, you can
quickly identify the type of system error that occurred. See “Light path
diagnostics” on page 54.
Diagnostics
The topics in this section provide basic troubleshooting information to help you
resolve some common problems that might occur with the server.
Important:
Complete the following steps to perform the checkout procedure to identify system
problems.
Procedure
1. Check all cables and power cords.
2. Record any power-on self test (POST) error messages that are displayed on the
screen. If an error is displayed, look up the first error in the “POST error
codes” section of the IBM System x3850 M2 and x3950 M2 Types 7141, 7233,
7144, and 7234 Problem Determination and Service Guide.
3. Check the system-error light-emitting diode (LED) on the operator information
panel. If it is on, see “Light path diagnostic light-emitting diodes” on page 56.
4. Check the remote supervisor adapter (RSA) event log. If any errors or warnings
or any relevant informational messages are recorded, see the appropriate
section of the IBM System x3850 M2 and x3950 M2 Types 7141, 7233, 7144, and
7234 Problem Determination and Service Guide. Access the RSA via the TS3000
System Console (TSSC) browser using the RSA IP address 172.31.1.xxx. For an
alternate procedure to log in to the RSA, see Appendix A, “Logging into the
3958 DD1/3958 DD3 server through the RSA connection,” on page 255.
5. Check for any “open” problems posted by the Problem Manager See “Problem
Manager” on page 17 and use the rsCerPMGetOpenProblems command to
display any open problems.
6. Start the diagnostic programs. Refer to “Starting the diagnostic programs” on
page 50.
7. Check for the following responses:
v One beep
v Readable instructions or the main menu
What to do next
The following tools are available to help you diagnose and solve hardware-related
problems:
Power-on self test (POST) beep codes, error messages, and error logs
v The POST generates beep codes and messages to indicate successful test
completion or the detection of a problem.
v A beep code is a combination of short or long beeps or a series of short
beeps separated by pauses. For example, a 1-2-3 beep code is one beep, a
pause, two beeps, a pause, and three beeps.
– When the POST is completed, one beep is emitted to indicate that the
server is working correctly. If the POST detects a problem during
startup, other beep codes might occur.
– Error messages also appear during startup if the POST detects a
hardware configuration problem.
Problem isolation tables
Use these tables to help you diagnose various symptoms.
Light path diagnostics
Use the light path diagnostics to diagnose system errors quickly.
Diagnostic programs and error messages
The diagnostic programs are stored in memory on the microprocessor tray.
These programs are the primary method of testing the major components
of the server.
For detailed information, refer to the IBM System x3850 M2 and System x3950 M2
Type 7141 and 7233 User's Guide.
If the POST finishes without detecting any problems, a single beep sounds, and the
first screen of your operating system or application program is displayed.
If the POST detects a problem, more than one beep sounds, and an error message
displays on your screen. See the IBM System x3850 M2 and x3950 M2 Types 7141,
7233, 7144, and 7234 Problem Determination and Service Guide for detailed failure
information.
Notes:
v Before the POST continues, if you have a power-on password or an
administrator password set, you must type the password and press Enter when
prompted. (If you do not have the password, obtain it from the customer.)
v A single problem can cause several error messages. If this occurs, work to
correct the cause of the first error message and then run the test again. In many
cases, the other error messages are also eliminated.
The POST error log contains the three most recent error codes and messages that
the system generated during the POST. The system-error log or baseboard
management controller (BMC) log contains all messages issued during the POST
and all system status messages from the service processor.
You can view the contents of the system-error log and BMC log from the
Configuration/Setup Utility program or from the diagnostic programs.
Procedure
1. Start the Configuration/Setup Utility program and then select Error Logs from
the main menu.
2. Select Error Logs from the main menu.
What to do next
Procedure
Diagnostic error messages are not intended to be used to identify a failing part.
Troubleshooting and servicing of complex problems that are indicated by error
messages should be performed by trained service personnel.
Sometimes the first error to occur causes additional errors. In this case, the server
displays more than one error message. Always follow the suggested action
instructions for the first error message that appears.
Error codes that might be displayed are listed in the “Diagnostic error codes”
section in the IBM System x3850 M2 and x3950 M2 Types 7141, 7233, 7144, and 7234
Problem Determination and Service Guide.
Notes:
v Depending on the server configuration, some of the error codes might not
appear when you run the diagnostic programs.
v If diagnostic error codes appear that are not listed in the tables, make sure that
the server has the latest levels of basic input output system (BIOS), service
processor, and ServeRAID code installed. Check the latest IBM System Storage
RAS/BIOS and Firmware Update DVD to update to the latest levels.
Procedure
Gigabit Gigabit
Ethernet 2 LED Ethernet 1 LED
Gigabit Ethernet 2
Gigabit Ethernet 1
System-error LED
Power-on LED Locator LED Remote Supervisor Adapter II
Power
supply 1
AC
Power
DC
Power
Power-
supply
USB error
SAS
Power-
supply 2
System serial
SMP Expansion Port 1
link LED SMP Expansion Port 3
SMP Expansion Port 1
SMP Expansion Port 3
ts760026
SMP Expansion Port 2
link LED
link LED
SMP Expansion Port 2
External power
supply connector
Ethernet connector
(RJ-45)
Video connector
ts760027
cvt00029
Figure 17. eServer™ logo screen for basic input/output system (BIOS) setup
What to do next
If the diagnostic programs do not detect any hardware errors but the problem
persists during normal server operations, a software error might be the cause. If
you suspect a software problem, refer to the information that comes with the
software package.
Procedure
Complete the following steps in order to view the test log after tests are
completed.
1. Select Utility from the top of the screen and then select View Test Log.
2. View the test log or save the log to a file on a diskette or on the hard disk so
that you can view it later by performing the following steps:
a. Select Save Log on the diagnostic programs screen.
b. Specify a location and name for the saved log file.
Note: The test log data is maintained only while the diagnostic programs are
active. When you exit from the diagnostic programs, the test log is cleared
(saved test logs are not affected).
What to do next
For a detailed listing of diagnostic error messages, see “Running DSA Preboot” and
“DSA messages” in the IBM System x3850 X5 and x3950 X5 Types 7145 and 7146
Problem Determination and Service Guide.
You can view the system error log or BMC log from the diagnostic programs. See
the instructions in the “Viewing error logs from the Configuration/Setup Utility
program” and “Viewing the BMC log from the diagnostic programs” sections in
the IBM System x3850 X5 and x3950 X5 Types 7145 and 7146 Problem Determination
and Service Guide for an explanation of the error codes.
The server is designed so that LEDs remain lit when the server is connected to an
ac power source but is not turned on, provided the power supply is operating
correctly. This feature helps you to isolate the problem when the operating system
is shut down.
Any memory card LED can be lit while the memory card is removed from the
server so that you can isolate a problem. After ac power is removed from the
server, power remains available to these LEDs for up to 24 hours.
To view the memory card LEDs, press and hold the light path diagnostics button
on the memory card to light the error LEDs. The LEDs that were lit while the
server was turned on are lit again while the button is pressed.
Many errors are first indicated by a lit information LED or system-error LED on
the operator information panel on the front of the server. If one or both of these
LEDs are lit, one or more LEDs elsewhere in the server might also be lit and can
direct you to the source of the error.
1 2
ts760025
Ethernet port activity LEDs
Power-control button cover Locator button/locator LED
2. To view the light path diagnostics panel, press the release latch on the front of
the operator information panel to the left, then slide it forward. This reveals the
light path diagnostics panel. Lit LEDs on this panel indicate the type of error
that occurred.
REMIND
OVER SPEC LOG LINK PS PCI SP
NMI button
FAN TEMP MEM NMI (trained service
technician only)
CNFG CPU VRM DASD RAID BRD
ts760028
Light Path Diagnostics
Note: (Trained IBM service technician only) The non-maskable interrupt (NMI)
button is used for operating system debug purposes and causes the server to
reset if pressed.
Look at the system service label on the top of the server, which gives an
overview of internal components that correspond to the LEDs on the light path
diagnostics panel. This information and the information in “Troubleshooting
with light path diagnostics” can often provide enough information to correct
the error.
Light-emitting diodes (LEDs) in two locations on the server are available to help
you diagnose problems:
v Light path diagnostics panel
v LEDs on the system board
Procedure
1. Check the light path diagnostics panel first. If a system error has occurred, the
system-error LED on the front of the light path diagnostics drawer is lit. Slide
the latch to the left on the front of the light path diagnostics drawer to access
the light path diagnostics panel. Note any LEDs that are lit, and then close the
drawer.
2. Next, look at the LEDs on the system board to identify the component that is
causing the error. Note the lit LED on or beside the component.
What to do next
Refer to the IBM System x3850 M2 and x3950 M2 Types 7141, 7233, 7144, and 7234
Problem Determination and Service Guide for additional detailed information.
Table 6 on page 57 describes the LEDs on the light path diagnostics panel and on
the boards inside the server, and lists the suggested actions to correct the detected
problems.
v Follow the suggested actions in the order in which they are listed in the Action
column until the problem is solved.
v If an Action step is preceded by “Trained service technician only”, that step
must be performed only by a trained service technician.
Attention: Use the master power switch to turn cabinet power on and off
completely.
v When the server is turned off, the operating system does not run, and all core
logic except for the service processor is shut down.
v However, if the switched-off server is connected to an alternating current (ac)
power source, it can still respond to requests from the service processor (also
called the baseboard management controller). The power-on light-emitting diode
(LED) flashes to indicate that the server is connected to ac power but is not
turned on.
Note: Use this procedure only along with your next level of support.
Procedure
A general procedure for troubleshooting power problems is as follows:
1. Turn off the server, and disconnect all alternate current (ac) power cords.
2. Check for loose cables in the power subsystem.
3. Check for short circuit causes. For example, check to see whether a loose screw
might have caused a short circuit on a circuit board.
4. Remove adapters and disconnect the cables and power connectors to all
internal and external devices until the server is at the minimum configuration
required to start. See the “Solving undetermined problems” section in the IBM
System x3850 M2 and System x3950 M2 Type 7141 and 7233 User's Guide for the
minimum configuration.
5. Reconnect all ac power cords and turn on the server.
v If the server starts up successfully, replace adapters and devices one at a time
until the problem is isolated.
v If the server does not start up from the minimal configuration, replace field
replaceable units (FRUs) of minimal configuration one at a time until the
problem is isolated.
What to do next
v For specific power-supply problems, see “Power supply error LEDs” on page 43.
v For light panel information, see “Light path diagnostics” on page 54.
– The Ethernet link status LED is lit when the Ethernet controller receives a link
pulse from the hub. If the LED is off, there might be a defective connector or
cable or a problem with the hub.
– The Ethernet transmit/receive activity LED is lit when the Ethernet controller
sends or receives data over the Ethernet network. If the Ethernet
transmit/receive activity light is off, make sure that the hub and network are
operating and that the correct device drivers are installed.
v Check the local area network (LAN) activity LED on the rear of the server. The
LAN activity LED is lit when data is active on the Ethernet network. If the LAN
activity LED is off, make sure that the hub and network are operating and that
the correct device drivers are installed.
v Check for operating-system-specific causes of the problem.
v Make sure that the device drivers on the client and server are using the same
protocol.
If the Ethernet controller still cannot connect to the network but the hardware
appears to be working, the network administrator must investigate other possible
causes of the error. If the Ethernet adapter is defective, see to “Replacing a
Dual-Port or Quad-Port Ethernet card and recovering Ethernet settings” on page
196.
Power-on
password
(J33)
1
2
3
1 1 ts760030
Passwords
From the User Security menu choice, you can set, change, and delete a power-on
password and an administrator password. The User Security choice is on the full
Setup utility menu only.
From the User Security menu choice, you can set, change, and delete a power-on
password and an administrator password. The User Security choice is on the full
Setup utility menu only.
If you set only a power-on password, you must type the power-on password to
complete the system startup and to have access to the full Setup utility menu.
If you set a power-on password for a user and an administrator password for a
system administrator, you can type either password to complete the system
startup. A system administrator who types the administrator password has access
to the full Setup utility menu; the system administrator can give the user authority
to set, change, and delete the power-on password. A user who types the power-on
System recovery
Complete this task to recover the server to an operational state.
Procedure
1. Reinstall Red Hat Linux.
2. Configure the system settings to match those recorded at initial installation.
The validation tests perform checks on the Red Hat Linux cluster, including
fencing the other node, and waits for the system to reboot. This test cannot be
performed on a stand-alone configuration.
Important: Extensive discrepancies in the system time set on each server in the
cluster can result in unnecessary fencing between the nodes during system
validation testing. To avoid this, it is recommended that you synchronize the
servers' system times before you start the test. Refer to “Syncing the system times”
for instructions.
Procedure
1. If you are not already connected to a node, plug a USB keyboard and monitor
into the back of the server.
2. Log in to the server with the username ptadmin and password ptadmin.
3. Access the /install directory by typing the command:
cd /opt/dtc/install
4. Perform either of the following actions to begin the set clock configuration:
v From the command line, type ./ptconfig -setClock
v From the ProtecTIER Service menu (see “ProtecTIER Service menu” on page
16), select ProtecTIER Configuration > Update Time, Date, Timezone and
Timeserver(s)
The following menu is displayed:
Please Choose:
5. Synchronize the date and time by typing 1 in the Please Choose: field.
The following is displayed:
Please Choose:1
Please specify the date in DD/MM/YYYY format [08/11/2010]:
6. Type the date in the specified format and press Enter. If you do not enter a
date, the default (displayed in brackets) is used.
The following is displayed:
Please specify the date in DD/MM/YYYY format [08/11/2010]: 08/11/2010
Please specify the time in HH:MM:SS format [11:56:16]:
7. Type the time in the specified format and press Enter. If you do not enter a
time, the default (displayed in brackets) is used.
The following is displayed:
Date, Time, Timezone & Timeserver(s) configuration
==================================================
1. Set date & time
2. Set Timezone
3. Set Timeserver(s)
Please Choose:
Note: If you typed in new information, an asterisk (*) appears at the end of
the c. Commit changes and exit menu option to show that there are
modifications to be saved.
You can choose either c to save the changes and exit the setClock
configuration, choose q to exit the configuration without saving the changes,
or continue synchronizing the local timezone (step 5) or adding the
timeservers to the system (step 12 on page 69).
8. Synchronize the local timezone by typing 2 in the Please Choose: field. The
following is displayed:
Please Choose:2
Enter a 2 letter country code (or type ’m’ to enter the timezone manually):
9. Select a country.
v Type a two-letter country code (for example, US for the United States)
10. Type a number corresponding to the timezone you want to synchronize from
the list under the country code entered. The following is displayed:
Date, Time, Timezone & Timeserver(s) configuration
==================================================
1. Set date & time
2. Set Timezone
3. Set Timeserver(s)
Please Choose:
11. You can choose either c to save the changes and exit the setClock
configuration, choose q to exit the configuration without saving the changes,
or continue synchronizing the local timezone or adding the timeservers to the
system.
12. Add timeservers to the system by typing 3 in the Please Choose: field.
The following is displayed:
a. Please specify the timeserver's IP Address:
Type the timeserver's IP address. For example: 192.168.10.15
b. Would you like to set a secondary timeserver? (yes|no)
Type y to set a secondary timeserver.
c. Please specify the secondary timeserver's IP Address:
Type the secondary timeserver's IP address. For example: 192.168.12.15
Please Choose:
13. Confirm and save your changes to the configuration by typing c in the Please
Choose: field. A list of the configuration changes is displayed for review.
Results
Note: If only the date was modified, and not the timezones or timeservers, only
the date is displayed in the contents.
Example
If you modify the date, add a primary and secondary timeserver, and specify
America/Chicago as the time zone, when you choose c to commit the changes and
exit, the file contents are displayed as follows:
Please review the following information:
========================================
Date: Mon Nov 8 14:51:34 2010
Primary time server: 192.168.10.11
Secondary time server: 192.168.15.11
Timezone: America/Chicago
Type y to stop and restart the cluster and VTFD services on all the nodes.
When both nodes finish rebooting, use ProtecTIER Manager to verify that fencing
completed successfully and both nodes are online: To do so:
1. On the ProtecTIER Manager workstation or the TSSC, open ProtecTIER
Manager.
2. Confirm that both nodes display as Online.
v If both nodes are online, proceed to “Performing the system validation test.”
v If Node 1 is offline, manually reboot Node 1:
a. If necessary, connect the USB keyboard and monitor to Node 1 and log in
to the server with the username root and password admin.
b. At the command line, type: reboot
c. If both nodes display as online, proceed to “Performing the system
validation test.”
d. If one (or both) nodes displays as offline, contact your next level of
support.
Procedure
7. After vtfd is stopped and the test finishes on Node 1, disconnect the keyboard
and monitor from Node 1. Connect them to Node 2.
8. Log in to Node 2 with username root and password admin.
9. Repeat steps 3 through 6 on Node B.
ts760101
Figure 23. Alerts Log
When you are finished reviewing alerts, click Clear Alerts in the lower-right
corner of the Alerts Log screen.
10. If the test succeeds on both nodes, TS7650G installation and setup is complete.
Turn the system over to the customer and refer them to the IBM System
Storage ProtecTIER User's Guide for Enterprise Edition and Appliance Edition,
GA32-0922 for information on customizing, using, and maintaining
ProtecTIER; and creating, configuring, and managing replication grids, if
applicable.
11. If the test fails on either node, contact your next level of support to resolve the
failure conditions.
For detailed information about the 3958 DD1 and 3958 DD3 ProtecTIER servers,
see Chapter 3, “ProtecTIER server (3958 DD1 and 3958 DD3),” on page 37.
Component labeling
The colors of component labels indicate different types of components.
Note: Orange can also indicate the touch points on hot-swap components.
Use the IBM System x3850 X5 and x3950 X5 Types 7145 and 7146 Problem
Determination and Service Guide for removing or installing a specific hot-swap
component for any additional procedures that you might have to perform before
you remove or install the component.
Note: Adapters in the TS7650 (including the hot-swap adapter ports) are not
hot-swap capable due to the limitations of the Linux operating system. The system
must be shut down when an adapter needs to be replaced.
The following illustration shows the controls, LEDs, and connectors on the front of
the server. The DVD drive is optional in some models.
Electrostatic-discharge connector
ts760797
Hard disk drive
DVD drive panel activity LED
activity LED
DVD-eject button: Press this button to release a CD or DVD from the DVD drive.
DVD drive activity LED: When this LED is lit, it indicates that the DVD drive is
in use.
Scalability LED: This LED is lit and remains on during POST on the primary
server when the UEFI and the IMM detect more than four microprocessors. This
LED is lit and remains on after POST on the secondary server.
Hard disk drive activity LED: When this LED is flashing, it indicates that the
drive is in use.
Hard disk drive status LED: On some server models, each hot-swap hard disk
drive has a status LED. When this LED is lit, it indicates that the drive has failed.
If an optional IBM ServeRAID controller is installed in the server, when this LED is
flashing slowly (one flash per second), it indicates that the drive is being rebuilt.
When the LED is flashing rapidly (three flashes per second), it indicates that the
controller is identifying the drive.
Figure 25 shows the controls and LEDs on the operator information panel.
1 2
ts760798
Power-control button cover Ethernet port activity LEDs Locator button/locator LED
The following controls and LEDs are on the operator information panel:
Power-control button and power-on LED
Press this button to turn the server on and off manually or to wake the
server from a reduced-power state. The states of the power-on LED are as
follows:
Off ac power is not present, or the power supply or the LED itself has
failed.
Flashing rapidly (4 times per second)
The server is turned off and is not ready to be turned on. The
power-control button is disabled. In a fully configured server, it
could take up to 8 minutes after the server is connected to ac
power, before the power-control button becomes active.
Flashing slowly (once per second)
The server is turned off and is ready to be turned on. You can
press the power-control button to turn on the server.
Lit The server is turned on.
Fading on and off
The server is in a reduced-power state. To wake the server, press
the power-control button or use the IMM web interface (see
“Logging on to the Integrated Management Module web interface”
on page 76).
Ethernet icon LED
This LED lights the Ethernet icon.
Information LED
When this LED is lit, it indicates that a noncritical event has occurred. An
LED on the light path diagnostics panel is also lit to help isolate the error.
System-error LED
When this LED is lit, it indicates that a system error has occurred. An LED
on the light path diagnostics panel is also lit to help isolate the error.
Locator button and locator LED
Use this LED to visually locate the server among other servers. It is also
used as the physical presence for Trusted Platform Module (TPM). Press
Procedure
Note: If you are logging on to the IMM for the first time after installation, the
IMM defaults to DHCP. If a DHCP host is not available, the IMM assigns a
static IP address of 192.168.70.125.
2. On the Login page, type the user name and password. If you are using the
IMM for the first time, obtain the user name and password from your system
administrator. All login attempts are documented in the event log.
Note: The IMM is set initially with a user name of USERID and password of
PASSW0RD (with a zero, not the letter O). You have read/write access. You must
change this default password the first time you log on.
3. On the Welcome page, type a timeout value (in minutes) in the field that is
provided. You are logged off the web interface if your browser is inactive for
the number of minutes that you entered for the timeout value.
4. Click Continue to start the session. The System Status page provides a quick
view of the server status.
DC DC
! !
ts760799
ports Ethernet 1 - 2 serial ports
1-2 3-4
Power-cord connector
Connect the power cord to this connector.
System-management Ethernet connector
Use this connector to connect the server to a network for
systems-management information control. This connector is used only by
the Integrated Management Module (IMM).
QPI ports 1 - 4
In a single-node configuration, use these connectors to insert either a QPI
wrap card or a filler panel. The QPI wrap cards enable increased
performance in certain models. In a two-node configuration, insert the QPI
cables in these ports to connect another server or a MAX5 memory
expansion module to your server. See the documentation that came with
your QPI cable kit for detailed cabling instructions.
USB connectors
Connect USB devices, such as a USB mouse or keyboard, to these
connectors.
Video connector
Connect a monitor to this connector. In a two-node configuration, connect
the monitor to the primary server to see standard system output.
Serial connector
Connect a 9-pin serial device to this connector. The serial port is shared
with the IMM. The IMM can take control of the shared serial port to
perform text console redirection and to redirect serial traffic, using Serial
over LAN (SOL).
Rear view LEDs of the 3958 DD4, DD5, and AP1 servers
This topic describes the LEDs located on the rear of the 3958 DD4, DD5, and AP1
servers.
AC power LED
(green)
AC
DC power LED
(green)
DC
Error LED
(amber)
!
Thumbscrew is unpopulated in this position
except when 2-node lock bracket is installed
(2-node Scalability Kit Option)
AC
DC
AC AC
!
DC DC
! !
Power on
LED
System Locator
ts760800
ac power LED
Each hot-swap power supply has an ac power LED and a dc power LED.
When the ac power LED is lit, it indicates that sufficient power is coming
into the power supply through the power cord. During typical operation,
both the ac and dc power LEDs are lit. For any other combination of LEDs,
see “Power supply LEDs” on page 79.
dc power LED
Each hot-swap power supply has a dc power LED and an ac power LED.
When the dc power LED is lit, it indicates that the power supply is
See Figure 26 on page 77 for the location of the system management Ethernet port.
When the amber LED on the rear panel of a server power supply is lit, it indicates
an error condition within the power supply.
The following minimum configuration is required for the dc LED on the power
supply to be lit:
AC power LED
AC (green)
DC DC power LED
! (green)
Error LED
(amber)
ts760801
Note: For detailed analysis of other power problems, see “Power problems” in the
IBM System x3850 X5 and x3950 X5 Types 7145 and 7146 Problem Determination and
Service Guide.
Procedure
The server is designed so that any LEDs that are lit remain lit when the server
shuts down, as long as the server is connected to ac power, and is able to remain
in standby mode (that is, dc power is provided to the system board).
See “Power supply LEDs” on page 79 for troubleshooting with the power supply
LEDs and power-on LED.
If the system-error LED on the front of the server is on, one or more LEDs inside
the server or on the power supply are on. The server has LEDs to help you
identify problems with some server components. These LEDs are part of the light
path diagnostic feature built into the server. By following the light path, you can
quickly identify the type of system error that occurred. See “Light path
diagnostics” on page 91.
Diagnostics
The topics in this section provide basic troubleshooting information to help you
resolve some common problems that might occur with the server.
Follow the checkout procedure for diagnosing hardware problems. Review the
following information before performing the checkout procedure.
The following diagnostic tools are available to you for diagnosis and solving of
hardware-related problems:
Important:
v If the server is hung and a power-on self-test (POST) error code is displayed, see
“POST error codes” in the IBM System x3850 X5 and x3950 X5 Types 7145 and
7146 Problem Determination and Service Guide for suggested action to solve the
problem.
v If the server is hung, perform the following steps.
1. Do one of the following:
– Restart the server and press F2 to start DSA Preboot and view the event
logs.
– Restart the server. Press F1 to start the Setup utility. View the POST event
log or system-event log for the error indication. (Select POST Event
Viewer to view the POST event log. Select System Event Log to view the
system-event log.)
2. If you still have insufficient information to solve the problem, go to the
“Undetermined problems” section in IBM System x3850 X5 and x3950 X5
Types 7145 and 7146 Problem Determination and Service Guide
v For information about power-supply problems, see “ProtecTIER 3958 DD4, DD5,
and AP1 server power checkout” on page 102.
v For intermittent problems, check the error logs; see “Diagnostic programs and
error messages” on page 87.
Complete the following steps to perform the checkout procedure to identify system
problems:
1. Check the power supply LEDs (see “Power supply LEDs” on page 79).
2. Turn off the server and all external devices.
a. Undo any recent system changes,such as new settings or newly installed
devices.
b. Remove all hardware that is not listed on the ServerProven® website.
3. Check all cables and power cords.
4. Turn on all external devices.
5. Restart the server. If the server does not start, see “Troubleshooting tables” in
the IBM System x3850 X5 and x3950 X5 Types 7145 and 7146 Problem
Determination and Service Guide.
If the diagnostic programs were completed successfully and you still suspect a
problem, see the “Solving undetermined problems” section in the IBM System
x3850 X5 and x3950 X5 Types 7145 and 7146 Problem Determination and Service Guide.
The following tools are available to help you diagnose and solve hardware-related
problems:
Power-on self test (POST) error codes, error messages, and error logs
The POST generates error codes and messages to indicate detection of a
problem.
Problem isolation tables
Use these tables to help you diagnose various symptoms.
Light path diagnostics
Use the light path diagnostics to diagnose system errors quickly.
Diagnostic programs and error messages
The diagnostic programs are stored in memory on the system board. These
programs are the primary method of testing the major components of the
server.
Event logs, error logs
Error codes and messages are displayed in the following logs:
If the POST finishes without detecting any problems, the first screen of your
operating system or application program is displayed.
If the POST detects a problem, an error message is displayed. See the “POST error
codes” section in the IBM System x3850 X5 and x3950 X5 Types 7145 and 7146
Problem Determination and Service Guide for detailed failure information.
Notes:
v Before the POST continues, if you have a power-on password or an
administrator password set, you must type the password and press Enter when
prompted. (If you do not have the password, ask the customer for it.)
v A single problem might cause several error messages. When this occurs, work to
correct the cause of the first error message. After you correct the cause of the
first error message, the other error messages usually do not occur the next time
you run the test.
The POST error log contains the three most recent error codes and messages that
the system generated during the POST. The System Event log contains POST and
system management interrupt (SMI) events, and all events generated by the
baseboard management controller (BMC) which is embedded in IMM. Also logged
are system status messages from the service processor.
You can view the contents of the POST error log and System Event log from the
Setup Utility program or from the diagnostic programs.
Procedure
To view the POST event log or System event log, perform the following steps.
1. Turn on the server.
2. When the prompt <F1> Setup is displayed, press F1.
3. If a password and administrator password have been set, type the
administrator password to view the event logs.
4. Select the log you want to view.
v To view the POST event log, select POST Event Viewer.
v To view the system event log, select System Event Log.
To view error logs from diagnostic programs, complete the following steps.
1. Start the diagnostic programs.
2. Select Hardware Info from the top of the diagnostic programs screen. A list of
logs is displayed.
3. Select one of the following logs:
v System event log
v IMM event log
v Operating-system event log
v Merged DSA log
4. Follow the instructions on the screen. See “Starting the diagnostic programs”
on page 88 for more information.
The server diagnostic programs are stored in upgradeable read only memory
(ROM) on the system board. These programs are the primary method of testing the
major components of your server.
Diagnostic error messages indicate that a problem exists; they are not intended to
be used to identify a failing part. Troubleshooting and servicing of complex
problems that are indicated by error messages should be performed by trained
service personnel.
Sometimes the first error to occur causes additional errors. In this case, the server
displays more than one error message. Always follow the suggested action
instructions for the first error message that appears.
Diagnostic error messages that might be displayed are listed in the “DSA
messages” section in the IBM System x3850 X5 and x3950 X5 Types 7145 and 7146
Problem Determination and Service Guide.
Notes:
v Depending on the server configuration, some of the error codes might not
appear when you run the diagnostic programs.
v If diagnostic error codes appear that are not listed in the tables, make sure that
the server has the latest levels of firmware. To display the currently flashed
firmware versions, do the following:
– For ProtecTIER software V3.1 or later, from the ProtecTIER Service menu,
select Version Information > Display Firmware Levels (see “ProtecTIER
Service menu” on page 16).
– For ProtecTIER software V2.4 or V2.5, from the ProtecTIER Service menu,
select Display Firmware Versions.
– For ProtecTIER software V2.3 or earlier, from a command line type versions.
If the command is not found, type the following commands to mount the
IBM System Storage RAS/BIOS and Firmware Update DVD and display the
firmware versions:
mount /dev/hda /mnt/cdrom
cd /mnt/cdrom
./versions
Procedure
DC DC
! !
ts760799
ports Ethernet 1 - 2 serial ports
1-2 3-4
Note: QPI ports 1 and 2 are wrapped by QPI Wrap card 1. QPI ports 3 and 4
are wrapped by QPI Wrap card 2. QPI ports are not used as Expansion ports.
2. Turn on the server and watch the screen.
ts760802
Figure 30. eServer logo screen for basic input/output system (BIOS) setup
Note: You can run the USB interface test only if no USB devices are attached.
v You can view server configuration information (such as system configuration,
memory contents, interrupt request (IRQ) use, direct memory access (DMA)
use, device drivers, and so on) by selecting Hardware Info from the top of
the screen.
What to do next
If the diagnostic programs do not detect any hardware errors but the problem
persists during normal server operations, a software error might be the cause. If
you suspect a software problem, refer to the information that comes with the
software package. Otherwise, see “Undetermined problems” in the IBM System
x3850 X5 and x3950 X5 Types 7145 and 7146 Problem Determination and Service Guide.
Procedure
Complete the following steps in order to view the test log after tests are
completed.
1. Select Utility from the top of the screen and then select View Test Log.
2. View the test log or save the log to a file on a diskette or on the hard disk so
that you can view it later by performing the following steps:
a. Select Save Log on the diagnostic programs screen.
b. Specify a location and name for the saved log file.
Note: The test log data is maintained only while the diagnostic programs are
active. When you exit from the diagnostic programs, the test log is cleared
(saved test logs are not affected).
What to do next
For a detailed listing of diagnostic error messages, see “Running DSA Preboot” and
“DSA messages” in the IBM System x3850 X5 and x3950 X5 Types 7145 and 7146
Problem Determination and Service Guide.
To view the system error log, perform one of the following actions.
v Refer to “Viewing error logs from the Setup Utility program” on page 86.
v View the IMM event log online through the IMM Web interface.
For a list of error messages, refer to the “System-event log” section of the IBM
System x3850 X5 and x3950 X5 Types 7145 and 7146 Problem Determination and Service
Guide.
The server is designed so that LEDs remain lit when the server is connected to an
ac power source but is not turned on, provided the power supply is operating
correctly. This feature helps you to isolate the problem when the operating system
is shut down.
Any memory card LED can be lit while the memory card is removed from the
server so that you can isolate a problem. After ac power is removed from the
server, power remains available to these LEDs for up to 24 hours.
To view the memory card LEDs, press and hold the light path diagnostics button
on the memory card to light the error LEDs. The LEDs that were lit while the
server was turned on are lit again while the button is pressed.
Many errors are first indicated by a lit information LED or system-error LED on
the operator information panel on the front of the server. If one or both of these
LEDs are lit, one or more LEDs elsewhere in the server might also be lit and can
direct you to the source of the error.
1 2
ts760798
Power-control button cover Ethernet port activity LEDs Locator button/locator LED
2. To view the light path diagnostics panel, press the release latch on the front of
the operator information panel to the left, then slide it forward. This reveals the
light path diagnostics panel. Lit LEDs on this panel indicate the type of error
that occurred.
REMIND
OVER SPEC LOG LINK PS PCI SP
NMI button
FAN TEMP MEM NMI (trained service
technician only)
CNFG CPU VRM DASD RAID BRD
ts760028
Light Path Diagnostics
NMI button
The NMI button is reserved for trained IBM service technicians, who
use it to debug the operating system. Pressing the button sends a
non-maskable interrupt to the microprocessor, causing the server to
reset.
Reset button
The reset button is in the lower-right corner of the light path
diagnostics panel. Press it to reset the server and run the power-on
self-test (POST). You might have to use a pen or the end of a
straightened paper clip to press the button.
Remind button
You can use the remind button on the light path diagnostics panel to
put the system-error LED on the operator information panel into
Remind mode. When you press the remind button, you acknowledge
the error but indicate that you are not taking immediate action. The
system-error LED flashes while it is in Remind mode and stays in
Remind mode until one of the following conditions occurs:
v All known errors or suboptimal conditions are corrected.
v The server is powered back on.
v A new error or suboptimal condition occurs, causing the system-error
LED to be lit again.
Light-emitting diodes (LEDs) in two locations on the server are available to help
you diagnose problems that might occur during installation. Use them in the
following order:
Procedure
1. Light path diagnostics panel – Look at this panel first. If a system error has
occurred, the system-error LED on the front of the light path diagnostics
drawer is lit. Slide the latch to the left on the front of the light path diagnostics
drawer to access the light path diagnostics panel. Note any LEDs that are lit,
and then close the drawer.
2. LEDs on the system board – To identify the component that is causing the
error, note the lit LED on or beside the component.
What to do next
See the IBM System x3850 X5 and x3950 X5 Types 7145 and 7146 Problem
Determination and Service Guide for additional detailed information.
Table 10 on page 94 describes the LEDs on the light path diagnostics panel and on
the boards inside the server and suggested actions to correct the detected
problems.
Chapter 4. ProtecTIER server (3958 DD4, 3958 DD5, and AP1) 101
Table 10. LEDs on the light path diagnostics panel (continued)
Lit light path
diagnostics LED
with the
Description Action
system-error LED or
information LED
also lit
BOARD The I/O-board shuttle 1. Find the failing board by checking the
or microprocessor LEDs on the I/O-board shuttle and
board has failed. microprocessor board.
2. If the I/O board LED is lit, reseat the
I/O board (see “Removing the
I/O-board shuttle” and “Replacing the
I/O-board shuttle” in IBM System x3850
X5 and x3950 X5 Types 7145 and 7146
Problem Determination and Service Guide).
If the I/O board LED remains lit,
replace the I/O board.
3. If the I/O board LED is not lit, remove
memory card 7 to see if the
microprocessor-board LED is lit (see
“Removing a memory card” in IBM
System x3850 X5 and x3950 X5 Types
7145 and 7146 Problem Determination and
Service Guide).
4. (Trained service technician only) If the
microprocessor-board LED is lit, reseat
the microprocessor board (see
“Removing the microprocessor-board
assembly” and “Replacing the
microprocessor-board assembly” in IBM
System x3850 X5 and x3950 X5 Types
7145 and 7146 Problem Determination and
Service Guide). If the
microprocessor-board LED remains lit,
replace the microprocessor board.
Attention: Use the master power switch to turn cabinet power on and off
completely.
v When the server is turned off, the operating system does not run, and all core
logic except for the service processor is shut down.
v However, if the switched-off server is connected to an alternating current (ac)
power source, it can still respond to requests from the service processor (also
called the baseboard management controller). The power-on light-emitting diode
(LED) flashes to indicate that the server is connected to ac power but is not
turned on.
Power problems can be difficult to solve. For example, a short circuit can exist
anywhere on any of the power distribution buses. A short circuit usually causes
the power subsystem to shut down because of an overcurrent condition.
Note: Use this procedure only along with your next level of support.
Procedure
Chapter 4. ProtecTIER server (3958 DD4, 3958 DD5, and AP1) 103
ACT/LNK A
10=OFF
100=GRN
1000=ORG
Intel=PRO
ACT/LNK A
2
3
4
5
6
7
8
ts760482
If the Ethernet controller still cannot connect to the network but the hardware
appears to be working, the network administrator must investigate other possible
causes of the error. If the Ethernet adapter is defective, see to “Replacing a
Dual-Port or Quad-Port Ethernet card and recovering Ethernet settings” on page
196.
The 3958 DD4, DD5, and AP1 servers feature two 10 Gb dual-port Ethernet
adapters that you can order in place of the 1 Gb quad-port Ethernet adapters in
PCI slots 3 and 4 in OpenStorage and FSI configurations. The 10 Gb Ethernet
adapter ports connect to Fibre Channel cables.
If the power-on password for the server is lost or forgotten, regain access to the
server by performing any of the following:
v If an administrator password is set, type the administrator password at the
password prompt. Start the Configuration/Setup Utility program and reset the
power-on password.
v Remove the battery from the server and then reinstall it. For instructions for
removing the battery, see the “Removing the battery” section in the IBM System
x3850 X5 and x3950 X5 Types 7145 and 7146 Problem Determination and Service
Guide.
v Change the position of the power-on password override jumper (J29 on the I/O
board) to bypass the power-on password check.
Attention: Before changing any switch settings or moving any jumpers, turn
off the server, then disconnect all power cords and external cables. Do not
change settings or move jumpers on any system-board switch or jumper blocks
that are not shown in the IBM System x3850 X5 and x3950 X5 Types 7145 and 7146
Problem Determination and Service Guide.
Chapter 4. ProtecTIER server (3958 DD4, 3958 DD5, and AP1) 105
Figure 34 shows the jumpers on the I/O board.
1 1 1
Wake on LAN 2 Password 2 Boot 2
ts760812
bypass (J31) 3 override (J29) 3 recovery 3
(J22)
Passwords
This topic explains the use of passwords.
If you set only a power-on password, you must type the power-on password to
complete the system startup and to have access to the full Setup utility menu.
If you set a power-on password for a user and an administrator password for a
system administrator, you can type either password to complete the system
startup. A system administrator who types the administrator password has access
to the full Setup utility menu; the system administrator can give the user authority
to set, change, and delete the power-on password. A user who types the power-on
password has access to only the limited Setup utility menu; the user can set,
change, and delete the power-on password, if the system administrator has given
the user that authority.
System recovery
Complete this task to recover the server to an operational state.
Procedure
1. Reinstall Red Hat Linux.
2. Configure the system settings to match those recorded at initial installation.
The validation tests perform checks on the Red Hat Linux cluster, including
fencing the other node, and waits for the system to reboot. This test cannot be
performed on a stand-alone configuration.
Important: Extensive discrepancies in the system time set on each server in the
cluster can result in unnecessary fencing between the nodes during system
validation testing. To avoid this, it is recommended that you synchronize the
servers' system times before you start the test. Refer to “Syncing the system times”
on page 108 for instructions.
Chapter 4. ProtecTIER server (3958 DD4, 3958 DD5, and AP1) 107
Syncing the system times
You can set the date, time, and time zone manually or sync with a timeserver.
Procedure
1. If you are not already connected to a node, plug a USB keyboard and monitor
into the back of the server.
2. Log in to the server with the username root and password admin.
3. Access the /install directory by typing the command:
cd /opt/dtc/install
4. Perform either of the following actions to begin the set clock configuration:
v From the command line, type ./ptconfig -setClock
v From the ProtecTIER Service menu (see “ProtecTIER Service menu” on page
16), select ProtecTIER Configuration > Update Time, Date, Timezone and
Timeserver(s)
The following menu is displayed:
Date, Time, Timezone & Timeserver(s) configuration
==================================================
1. Set date & time
2. Set Timezone
3. Set Timeserver(s)
Please Choose:
5. Synchronize the date and time by typing 1 in the Please Choose: field.
The following is displayed:
Please Choose:1
Please specify the date in DD/MM/YYYY format [08/11/2010]:
6. Type the date in the specified format and press Enter. If you do not enter a
date, the default (displayed in brackets) is used.
The following is displayed:
Please specify the date in DD/MM/YYYY format [08/11/2010]: 08/11/2010
Please specify the time in HH:MM:SS format [11:56:16]:
7. Type the time in the specified format and press Enter. If you do not enter a
time, the default (displayed in brackets) is used.
The following is displayed:
Date, Time, Timezone & Timeserver(s) configuration
==================================================
1. Set date & time
2. Set Timezone
3. Set Timeserver(s)
Please Choose:
Note: If you typed in new information, an asterisk (*) appears at the end of
the c. Commit changes and exit menu option to show that there are
modifications to be saved.
9. Select a country.
v Type a two-letter country code (for example, US for the United States)
v If you chose to enter the timezone manually, type the name (for example,
Italy).
If you need a list of time zone codes from countries other than the United
States, see Appendix K, “Worldwide time zone codes,” on page 303. The
following is displayed:
Time zones under US:
====================
1. America/New_York
2. America/Detroit
3. America/Kentucky/Louisville
4. America/Kentucky/Monticello
5. America/Indiana/Indianapolis
6. America/Indiana/Vincennes
7. America/Indiana/Knox
8. America/Indiana/Winamac
9. America/Indiana/Marengo
10. America/Indiana/Vevay
11. America/Chicago
12. America/Indiana/Tell_City
13. America/Indiana/Petersburg
14. America/Menominee
15. America/North_Dakota/Center
16. America/North_Dakota/New_Salem
17. America/Denver
18. America/Boise
19. America/Shiprock
20. America/Phoenix
21. America/Los_Angeles
22. America/Anchorage
23. America/Juneau
24. America/Yakutat
25. America/Nome
26. America/Adak
27. Pacific/Honolulu
10. Type a number corresponding to the timezone you want to synchronize from
the list under the country code entered. The following is displayed:
Chapter 4. ProtecTIER server (3958 DD4, 3958 DD5, and AP1) 109
Date, Time, Timezone & Timeserver(s) configuration
==================================================
1. Set date & time
2. Set Timezone
3. Set Timeserver(s)
Please Choose:
11. You can choose either c to save the changes and exit the setClock
configuration, choose q to exit the configuration without saving the changes,
or continue synchronizing the local timezone or adding the timeservers to the
system.
12. Add timeservers to the system by typing 3 in the Please Choose: field.
The following is displayed:
a. Please specify the timeserver's IP Address:
Type the timeserver's IP address. For example: 192.168.10.15
b. Would you like to set a secondary timeserver? (yes|no)
Type y to set a secondary timeserver.
c. Please specify the secondary timeserver's IP Address:
Type the secondary timeserver's IP address. For example: 192.168.12.15
You can search your region for the appropriate NTP timeserver you want to
synchronize your system with using your internet search engine. There are
several sites available, one example is the NIST Internet Time Servers web
page.
After specifying the timeserver’s addresses, the following is displayed:
Date, Time, Timezone & Timeserver(s) configuration
==================================================
1. Set date & time
2. Set Timezone
3. Set Timeserver(s)
Please Choose:
13. Confirm and save your changes to the configuration by typing c in the Please
Choose: field. A list of the configuration changes is displayed for review.
Results
Note: If only the date was modified, and not the timezones or timeservers, only
the date is displayed in the contents.
Example
If you modify the date, add a primary and secondary timeserver, and specify
America/Chicago as the time zone, when you choose c to commit the changes and
exit, the file contents are displayed as follows:
Type y to stop and restart the cluster and VTFD services on all the nodes.
Stopping Cluster Services [ Done ]
Stopping NTPD [ Done ]
Setting Time Zone [ Done ]
Setting Timeserver [ Done ]
Setting Date & Time [ Done ]
Starting NTPD [ Done ]
Starting cluster [ Done ]
Cluster Started
What to do next
When both nodes finish rebooting, use ProtecTIER Manager to verify that fencing
completed successfully and both nodes are online: To do so:
1. On the ProtecTIER Manager workstation or the TSSC, open ProtecTIER
Manager.
2. Confirm that both nodes display as Online.
v If both nodes are online, proceed to “Performing the system validation test.”
v If Node 1 is offline, manually reboot Node 1:
a. If necessary, connect the USB keyboard and monitor to Node 1 and log in
to the server with the username root and password admin.
b. At the command line, type: reboot
c. If both nodes display as online, proceed to “Performing the system
validation test.”
d. If one (or both) nodes displays as offline, contact your next level of
support.
Procedure
Chapter 4. ProtecTIER server (3958 DD4, 3958 DD5, and AP1) 111
cd /opt/dtc/install
4. From the command prompt, enter the following command:
./ptconfig -validate
7. After vtfd is stopped and the test finishes on Node 1, disconnect the keyboard
and monitor from Node 1. Connect them to Node 2.
8. Log in to Node 2 with the username root and password admin.
9. Repeat steps 3 on page 111 through 6 on Node B.
Note: Alerts might be generated as the validation tests are running. The
ProtecTIER Manager Alerts Log allows you to monitor the alerts as they occur.
To do so, on the ProtecTIER Manager workstation, click Alerts in the lower
right corner of the ProtecTIER Manager screen. The Alerts Log is displayed
(Figure 35).
ts760101
When you are finished reviewing alerts, click Clear Alerts in the lower-right
corner of the Alerts Log screen.
10. If the test succeeds on both nodes, TS7650G installation and setup is complete.
Turn the system over to the customer and refer them to the IBM System
Storage ProtecTIER User's Guide for Enterprise Edition and Appliance Edition,
GA32-0922 for information on customizing, using, and maintaining
ProtecTIER; and creating, configuring, and managing replication grids, if
applicable.
11. If the test fails on either node, contact your next level of support to resolve the
failure conditions.
Note: Depending upon the model of the TSSC and TSSC network switch being
used, the port layout might be different than shown. If so, attach and label the
cables according to the port number assignments specified, regardless of the
position of the ports on the TSSC or the TSSC network switch. Refer to the IBM
System Storage TS3000 System Console (TSSC) Maintenance Information for detailed
port information.
See “TSSC connections for DD4, DD5, and AP1 servers” for cabling for the DD4,
DD5, and AP1 servers. See “TSSC connections for DD1 and DD3 servers” on page
120 for cabling for the DD1 and DD3 servers.
See the IBM System Storage TS3000 System Console (TSSC) Maintenance Information
document for detailed information about troubleshooting and repairing the TSSC.
Ethernet connections between the TSSC and the disk controllers are not
implemented, and therefore are not shown in the figures in this section.
Note: To test operational connectivity between the server and TSSC for call home,
see “Troubleshooting the server-to-TSSC connection” on page 24.
Note: Depending upon the model of the TSSC and TSSC network switch (16-port
or 26-port) being used, the port layout might be different than shown. If so, attach
and label the cables according to the port number assignments specified, regardless
of the position of the ports on the TSSC or the TSSC network switch. Refer to the
IBM System Storage TS3000 System Console (TSSC) Maintenance Information for
detailed port information.
SC SW 2 4 6 8 10 12 14 16 Customer
10 Network
P 1 3 5 7 9 11 13 15
SC Server 9 1 2
V U3
SC KVM
24 22
U1
21 11
ProtecTIER Server A
1 1 1 1 1 P 13 P
2 2 2 2 2
3
4
E1 E3
2
1
A1 A2 B V S
Terminator E2 E4
4
3
1 2 3 4 5 6 7
8 9
6
5
23
8
7
ts760988
Figure 36. Stand-alone TS7650G (3958 DD4/3958 DD5) or TS7650 Appliance (3958 DD4)
attachment to the TSSC (x3250 M3 MT4252)
1 1 1 1 1 P P
2 2 2 2 2
3
25 4
E1 E3
A1 A2 B V S
E2 E4
1 2 3 4 5 6 7
8 9
KVM
switch 12
14
SC SW 2 4 6 8 10 12 14 16 Customer
10 Network
P 1 3 5 7 9 11 13 15
SC Server 9 1 2
V U3
SC KVM
24 22
U1
21 11
ProtecTIER Server A
1 1 1 1 1 P 13 P
2 2 2 2 2
3
4
E1 E3
2
1
A1 A2 B V S
Terminator E2 E4
4
3
1 2 3 4 5 6 7
8 9
6
5
23
8
7
ts760989
Figure 37. Clustered TS7650G (3958 DD4/3958 DD5) or TS7650 Appliance (3958 DD4)
attachment to the TSSC (x3250 M3 MT4252)
SC SW 2 4 6 8 10 12 14 16 Customer
10 Network
P 1 3 5 7 9 11 13 15
SC Server 9 1 2
V U1 U3
SC KVM
24 22
U1
21 11
ProtecTIER Server A
1 1 1 1 1 P 13 P
2 2 2 2 2
3
4
E1 E3
2
1
A1 A2 B V S
Terminator E2 E4
4
3
1 2 3 4 5 6 7
8 9
6
5
23
8
7
ts761217
|
| Figure 38. Stand-alone TS7650 Appliance (3958 DD4) attachment to the TSSC (x3250 M4
| MT 2538)
ProtecTIER Server B
1 1 1 1 1 P P
2 2 2 2 2
3
4
E1 E3
25 A1 A2 B V S
E2 E4
1 2 3 4 5 6 7
8 9
KVM 12
switch 14
SC SW 2 4 6 8 10 12 14 16 Customer
10 Network
P 1 3 5 7 9 11 13 15
SC Server 9 1 2
V U1 U3
SC KVM
24 22
U1
21 11
ProtecTIER Server A
1 1 1 1 1 P 13 P
2 2 2 2 2
3
4
E1 E3
2
1
A1 A2 B V S
Terminator E2 E4
4
3
1 2 3 4 5 6 7
8 9
6
5
23
8
7
ts761218
|
| Figure 39. Clustered TS7650 Appliance (3958 DD4) attachment to the TSSC (x3250 M4 MT
| 2538)
28
KVM U1 1 3 5 7
P
switch 2 4 6 8
Terminator
9
SC Server 1 2 Customer
U1 U3 10
V Network
22
SC KVM
24
21 23
SC SW 1 3 5 7 9 11 13 15 17 19 21 23 25 26
P
2 4 6 8 10 12 14 16 18 20 22 24
11 RAS
ProtecTIER Server A
P RAS 13 P
1 1 1 1 1
2 2 2 2 2
3
4
E1 E3
A1 A2 B V S
E2 E4
1 2 3 4 5 6 7
ts761225
8 9
|
| Figure 40. Stand-alone TS7650G (3958 DD5) attachment to the TSSC (x3250 M4 MT 2538)
28
KVM U1 1 3 5 7
P
switch 2 4 6 8
Terminator
9
SC Server 1 2 Customer
U1 U3 10 Network
V
22
SC KVM
24
21
SC SW 1 3 5 7 9 11 13 15 17 19 21 23 25 26
P
2 4 6 8 10 12 14 16 18 20 22 24
12 11 23
RAS RAS
25
ProtecTIER Server B
1 1 1 1 1 P P
2 2 2 2 2
3
4
E1 E3
A1 A2 B V S
E2 E4
1 2 3 4 5 6 7
8 9
14
RAS
ProtecTIER Server A
RAS 13
1 1 1 1 1 P P
2 2 2 2 2
3
4
E1 E3
A1 A2 B V S
E2 E4
1 2 3 4 5 6 7
ts761226
8 9
|
| Figure 41. Clustered TS7650G (3958 DD5) attachment to the TSSC (x3250 M4 MT 2538)
Note: Ethernet port connections from the 3958 DD4 and 3958 DD5 servers to the
TSSC are the same for Virtual Tape Library, OpenStorage, or File System Interface.
Table 13. Stand-alone and clustered TSSC, KVM, and customer network Ethernet connections for the TS7650
Appliance and TS7650G
On device or
Item From On device To location Remarks
9 Port 5 SC SW (TSSC Port 1 SC Server (TSSC)
Ethernet switch)
10 Port 2 TSSC Customer local Customer specified
area network device
Note: Depending upon the model of the TSSC and TSSC network switch being
used, the port layout might be different than shown. If so, attach and label the
cables according to the port number assignments specified, regardless of the
position of the ports on the TSSC or the TSSC network switch. Refer to the IBM
System Storage TS3000 System Console (TSSC) Maintenance Information for detailed
port information.
The server E2 and RSA Ethernet ports connect to the TSSC by means of the TSSC
Ethernet switch (see Figure 42 on page 121). Direct connection of the RSA Ethernet
port to the customer LAN is not supported. In addition, with RAS/BIOS firmware
versions 2.5 and higher, Ethernet connections from the storage controllers to the
TSSC Ethernet switch are no longer required.
1 2
TSSC 1 2 3 4 5 6 7 8
network 172.31.1.xx
Ethernet 10 13 14 15 16
9 11 12
switch
3958 AP1
Server A RSA
1 1 1 1 1 1
E1
ts760123
2 2 2 2 2 2 E1
E2
E2
1 2 3 4 6 7
Figure 42. Stand-alone TS7650 Appliance (3958 DD3) attachment to the TSSC
2 2 2 2 2 2 E1
E1
E2
E2
1 2 3 4 6 7
Customer
connection
TSSC
1 2
TSSC 1 2 3 4 5 6 7 8
network 172.31.1.xx
Ethernet 10 13 14 15 16
9 11 12
switch
3958 AP1
Server A RSA
1 1 1 1 1 1
2 2 2 2 2 2 E1
E1
E2
E2
ts760122
1 2 3 4 6 7
Figure 43. Clustered TS7650 Appliance (3958 DD3) attachment to the TSSC
1 2
TSSC 1 2 3 4 5 6 7 8
network 172.31.1.xx
Ethernet 10 13 14 15 16
9 11 12
switch
3958 DD1/DD3
Server A RSA
1 1 1 1 1 1
E1
ts760013
2 2 2 2 2 2 E1
E2
E2
1 2 3 4 6 7
Figure 44. Stand-alone TS7650G (3958 DD1/3958 DD3) attachment to the TSSC
2 2 2 2 2 2 E1
E1
E2
E2
1 2 3 4 6 7
Customer
connection
TSSC
1 2
TSSC 1 2 3 4 5 6 7 8
network 172.31.1.xx
Ethernet 10 13 14 15 16
9 11 12
switch
3958 DD1/DD3
Server A RSA
1 1 1 1 1 1
E1
ts760012
2 2 2 2 2 2 E1
E2
E2
1 2 3 4 6 7
Figure 45. Clustered TS7650G (3958 DD1/3958 DD3) attachment to the TSSC
Important: For TS7650 and TS7650G installations, this section applies only if the
customer back-end storage controller is an IBM System Storage DS4700 Express or
DS3950. If the customer back-end storage controller differs, use the troubleshooting
procedures that were provided by the manufacturer with the controller in the
original configuration. All attached storage to the TS7650G is supported separately,
and problems are not called home for it.
The disk controller supports sixteen 450 GB, 4 Gbps, 15 krpm, Fibre Channel disk
drive modules (DDMs).
The disk controller comes with two RAID controllers, two power supplies, and two
cooling units. It provides dual, redundant controllers, redundant cooling,
redundant power, and battery backup of the RAID controller cache. The disk
controller is designed to provide maximum host and drive-side redundancy,
enabling full redundant drive loop support to additional disk storage capacity in
the expansion frames via the drive loop port from each RAID controller. In
addition, each RAID controller supports direct attachment of two servers that
contain two Fibre Channel host bus adapters each. External cables and small
form-factor pluggable (SFP) modules connect the disk controller to the expansion
units and the servers. For optimum performance, RAID 5 configuration is
recommended.
Note: Currently, the only expansion unit that can be attached to the IBM-provided
disk controller is the 1812 81H disk expansion unit, Feature Code 3707: 4.8 TB
Fibre Channel Disk Expansion Unit.
For more support information about the disk controller, go to the IBM Support
Portal and select Disk systems > DS4700 Express.
Attention: Before upgrading the ProtecTIER code level, you must update the
DS4700 storage subsystem firmware, NVSRAM firmware, and the ESM firmware
on your EXP810 expansion drawers.
You must call IBM Service to schedule a time to update the firmware to the levels
listed below.
Do not proceed to upgrade the ProtecTIER code level to version 2.4.x until:
v The storage subsystem firmware has been upgraded to the minimum level of
07.50.13.00 or the recommended level of 07.60.28.00.
v The DS4700 NVSRAM firmware has been updated to the minimum level of
N1814D47R1050V07 or the recommended level of N1814D47R1060V04.
v The expansion unit ESM firmware has been updated to the minimum level of
98C5 or the recommended level of 98D0.
Procedure
1. Verify that the firmware is at the correct code level.
a. If you are not already logged in to the TSSC, log in with user ID service
and password service.
b. Right-click the blue desktop, and from the IBM TS3000 System Console
menu, select Browser Functions > Storage Manager GUI Functions >
Launch GUI.
Note: If you receive a message that the DS Storage Manager GUI is not
installed, you must install it using the supplied DS Storage Manager
Graphical User Interface CD version 10.60.G5.16. Use step 3 on page 127 to
perform the installation and return to this step after DS Storage Manager is
installed. If the DS Storage Manager GUI is already installed, verify the
version by selecting Help > About. If the DS Storage Manager version is
Note: During this activation, both A and B controllers are upgraded. When the
activation is completed, a New Firmware Detected window is displayed. Click
OK.
6. Upgrade the ESM level.
a. In the Manage Storage Subsystem window, click the Advanced button in
the top menu bar.
b. Select Maintenance > Download > ESM Firmware. The Download
Environmental (ESM) Card Firmware window is displayed.
c. In the Drive Enclosure section, select the enclosure to be upgraded.
d. In the ESM Firmware section, click the Select File button. A new window is
displayed.
Results
What to do next
Close any open DS Storage Manager windows, right-click the blue desktop of the
TSSC and select Logout.
If the disk controller and disk expansion unit are at the minimum level of
ProtecTIER V2.4 code, the firmware is verified and upgraded automatically. During
installation, you are asked whether you want to update the controller code.
Answer yes and the disk controller and disk expansion unit firmware is
automatically updated.
Restriction: ProtecTIER V3.1.8 does not support automatic firmware updates. You
must use the procedure for manual updates in Appendix D, “Manually upgrading
the DS4700 storage system firmware (ProtecTIER version 2.4 and later),” on page
271.
If it is necessary for you to verify and upgrade the firmware manually, see
Appendix D, “Manually upgrading the DS4700 storage system firmware
(ProtecTIER version 2.4 and later),” on page 271.
Method One
Procedure
1. Go to the Subsystem Management window and select Storage Subsystem >
View > Profile. The Storage Subsystem Profile window is displayed.
Note: The ATA translator card firmware and drive firmware are
typically packaged together as a single drive firmware upgrade file.
ESM ESM card firmware version
Method Two
Complete the applicable procedure to obtain the specified firmware version.
Click the Controller icon in the Physical View pane of the Subsystem Management
window (see Properties in the right pane). The Controller Enclosure properties
window displays the properties for that controller. You must perform this step for
each individual controller.
Click the Controller icon in the Physical View pane of the Subsystem Management
window (see Properties in the right pane). The Drive Properties window displays
the properties for that drive. You must perform this step for each individual drive.
Hot-swap capable devices enable you to maintain the availability of your system
while you remove, install, or replace a hot-swap device. The disk controller
supports hot-swap Fibre Channel hard disk drives (HDDs) and cooling fan
field-replaceable units (FRUs), so you can remove and replace these components
without turning off the disk controller. In addition, the disk controller supports
hot-swap RAID controllers and power supplies.
The hot-swap drive bays that are accessible from the front of your disk controller
are shown in Figure 46.
ts760983
Figure 46. Disk controller hot-swap drive bays
The disk controller supports up to 16 Fibre Channel hard disk drives (HDDs).
These drives are preinstalled in drive trays. The unit consisting of the drive and
the drive carrier assembly is called a drive field-replaceable unit (FRU). The Fibre
Channel drive FRU includes the drive tray, the Fibre Channel drive, and the HDD
interposer card. You install the drive FRUs in the 16 drive bays on the front of the
disk controller. There are no serviceable parts in a drive FRU. If a drive FRU fails,
it must be replaced in its entirety (drive, interposer, bezel, and tray).
Attention:
v Never hot-swap a drive FRU when its associated green activity LED is flashing.
Hot-swap a drive FRU only when its associated amber fault LED light is not
flashing, or when the drive is inactive and its associated green activity LED light
is not flashing.
v The Fibre Channel drive and its HDD card are digitally signed together at the
factory. Do not swap drives and interposers between drive FRUs. Do not replace
one Fibre Channel drive with another Fibre Channel drive. These actions result
in a drive FRU that cannot be recognized by the disk controller RAID controller.
v After you remove a drive FRU, wait at least 70 seconds before replacing or
reseating the drive FRU to allow the drive to properly spin down.
Important: If the disk drive you want to remove is not in a failed or bypass state, always
use Storage Manager either to place the drive in a failed state or to place the array that is
associated with the drive (or drives) in an offline state before you remove the drive from
the enclosure.
ts760984
1 2 3 4 5 6
Attention: If the global summary fault LED lights, refer to the client software to
determine the problem with the disk controller, and check the needs attention
LEDs on the other disk components to help isolate the fault to a specific
component.
4 5
13 12 11 10 7 6 7 6 1 2 3
9 8 9 8
8 9 8 9
ts760970
5 4 3 2 1 6 7 6 7 10 11 12 13
Off - Seven-segment
LEDs indicate enclosure
ID
5 Heartbeat Flashing Off
6 Host channel speed - L1 See Table 17 on page 136.
7 Host channel speed - L2 See Table 17 on page 136.
8 Ethernet link speed On - 100 Mbps Not applicable
Off - 10 Mbps
9 Ethernet link activity On - link established Not applicable
Flashing - activity
10 Drive channel port Off On - No valid device
bypass detected and port is
(Also off if no SFP bypassed
(One LED per port) connected)
Note: The drive channel
consists of two fibre
channel ports. This LED
indicates the drive port
bypass status of one of
the two fibre channel
ports that comprise a
drive channel. The LED
marked 13 shows the
status of the other port.
11 Drive channel speed - L1 See Table 17 on page 136.
12 Drive channel speed - L2 See Table 17 on page 136.
1 2 3 4
ts761111
4 3 2 1
The L1 and L2 LEDs for each host and drive channel combine to indicate the
status and the operating speed of each host and drive channel.
3 2 1
ts760973
1 2 3
Seven-
segment
display
ds48042
76041
Diagnostic LED
When you power on the disk controller, the seven-segment LEDs show various
codes as the controllers are going through the boot process.
When the server successfully completes the power-on boot process, the numeric
display shows the enclosure identification (enclosure ID) of the disk controller.
If an error has occurred and the Needs attention LED is on, the numeric display
shows diagnostic information. The numeric display indicates the information is
diagnostic by illuminating an LED that appears as a decimal point between the
display numbers. The diagnostic LED tuns off when the numeric display shows
the disk controller enclosure ID. The numeric display shows various diagnostic
codes as the disk controller performs the startup process after each power cycle or
reset. After diagnostics are complete, the current disk controller enclosure ID is
displayed.
Diagnostic codes in the form of Lx, where x is a hexadecimal digit, indicate disk
controller state information. In general, these codes are displayed only when the
disk controller is in a non-operational state. The disk controller might be
non-operational due to a configuration problem (such as mismatched types), or it
might be non-operational due to hardware faults. If the disk controller is
non-operational due to system configuration, the disk controller Needs attention
LED is off. If the disk controller is non-operational due to a hardware fault, the
disk controller Needs attention LED is on. The definitions for Lx diagnostic codes
are listed in Table 19 on page 138.
The enclosure ID, comprised of two seven-segment numbers, is located on the back
of each controller next to the indicator lights. It provides a unique identifier for
each enclosure in the disk storage configuration.
The controller automatically sets the enclosure ID number. You can change the
setting through the storage management software if necessary. Both controller
enclosure ID numbers are identical under normal operating conditions.
The allowable ranges for enclosure ID settings are 0-99. Do not set the enclosure ID
to 00 or any number less than 80. The disk controller enclosure ID is normally set
to a value of 85 at the factory.
Figure 53 shows the locations of controllers A and B and the Fibre Channel ports
on the back of the DS4700 disk controller.
3 4 5 6
2
ts760974
6 5 4 3
Figure 54 on page 140 shows the locations of controllers A and B and the Fibre
Channel ports on the back of the DS3950 disk controller.
ts761112
6 5 2
Table 21 provides a list of the Fibre Channel ports associated with each of the
controllers in the disk controller.
Table 21. Disk controller Fibre Channel ports
Number Description
1 Controller A
2 Controller B
3 Drive channel 1 - Port 2 on controller A
Note: Storage Manager is not included in TS7650G initial installations. This section
assumes that customers with TS7650G installations are using the IBM 1814 70H
disk controller and 1812 81H disk expansion units as back-end storage, and that
Storage Manager is installed, configured, and available at the customer site to
support those components.
You can use Storage Manager to run diagnostics. See Figure 55 on page 141.
Whenever the controller cards are replaced with new field-replaceable units
(FRUs), the firmware must be reloaded. Use the Storage Manager Download
Firmware feature to update the firmware and NV-RAM. Refer to Figure 56.
Use the Storage Manager Execute Script feature to execute scripts included on the
disk storage configuration script CD to establish the file system on the disk
components. Refer to Figure 57 on page 142.
Attention: To prevent damage to the cable or to the SFP module, unplug the
LC-LC Fibre Channel cable before you remove the SFP module.
Procedure
1. Remove the LC-LC Fibre Channel cable from the SFP module.
2. Unlock the SFP module latch. For SFP modules that contain plastic tabs, unlock
the SFP module latch by pulling the plastic tab outward at an angle of ten
degrees, as shown in Figure 58 on page 143.
Protective cap
10 o
SFP module
ts760975
Figure 58. Unlocking the SFP module latch - plastic variety
3. With the SFP latch in the unlocked position, extract the SFP module. For SFP
modules that contain plastic tabs, slide the SFP module out of the port.
4. Replace the protective cap on the SFP module.
5. Place the SFP module into a static-protective package.
6. Replace the protective cap on the port.
What to do next
For additional information see the IBM System Storage DS4700 Express Storage
Subsystem Installation, User's and Maintenance Guide.
Attention: Do not remove the metal plate that covers the disk controller speed
setting. Do not adjust the disk controller speed setting.
Attention: In the initial installation of the disk controller, you can add only new
disk expansion modules to the disk controller. This means that there must be no
existing configuration information on the disk expansion module that you want to
install. Improper drive migration can cause loss of configuration and other disk
controller problems.
Important:
For TS7650G installations, this section does not apply. This section is for the TS7650
Appliance only. The back-end storage controller is as specified in Table 22.
Table 22. Back-end storage controllers
Systems purchased... Back-end storage controller
Before August 2010 DS4700
August 2010 or later DS3950
The disk expansion units are IBM System Storage EXP810 Storage Expansion Units.
If the customer has other back-end storage expansion units, use the
troubleshooting procedures that were provided by the manufacturer with the
expansion units in the original configuration.
The disk expansion unit supports sixteen 450 GB, 4 Gbps, 15 krpm, Fibre Channel
disk drive modules (DDMs). For optimum performance, RAID 5 configuration is
recommended.
The disk expansion unit supports redundant, dual-loop configurations with the
disk controller and other disk expansion units. External cables and small
form-factor pluggable (SFP) modules connect the controller to the disk expansion
unit. The disk expansion unit provides continuous, reliable service and uses
hot-swap technology.
For more support information about the disk expansion unit, go to the IBM
Support Portal and select Disk Systems > DS4700 EXP810 Storage Expansion
Unit.
It is important to check all the LEDs on the front and back of the disk expansion
unit when you turn on the power. During power-on, the indicator LEDs flash
intermittently as the disk expansion unit and components complete the power-on
exp81021
1 2 3 4 5 6
6 Link rate switch The Link rate switch should N/A
be set to 2 Gbps or 4 Gbps,
depending on your
configuration.
1 2 3 6 5 4 7 8 9
A
ESM
B
ESM
exp81022
9 8 7 4 5 6 3 2 1
4 ESM port 1 in bypass Off - Cable connected On, flashing for 30
seconds
(This port is labeled 1A) On - No cable connected
5 ESM ports 1 & 2 One LED is lit if 2 Gbps N/A
Figure 62 on page 149 shows the numeric display and the diagnostic LED.
ds48042
76041
Diagnostic LED
When you power on the disk expansion modules, the seven-segment LEDs show
88 before the boot firmware starts to run. After the boot code starts, - - is
displayed.
When the application code starts, the display shows either aa or bb, depending on
whether it is ESM A or B.
When the disk expansion module has completely started, the numeric display
shows the enclosure identification (enclosure ID) of the disk expansion module.
If an error has occurred and the Needs attention LED is on, the numeric display
shows diagnostic information. The numeric display indicates the information is
diagnostic by illuminating an LED that appears as a decimal point between the
display numbers. The diagnostic LED tuns off when the numeric display shows
the storage subsystem enclosure ID. The numeric display shows various diagnostic
codes as the enclosures perform the startup process after each power cycle or reset.
After diagnostics are complete, the current storage subsystem enclosure ID is
displayed.
Attention: Do not remove the metal plate that covers the disk expansion unit
speed setting. Do not adjust the disk expansion unit speed setting.
Documentation CD
The IBM System Storage TS7650 with ProtecTIER Publications CD contains
documentation for the TS7650 Appliance and the TS7650G Gateway.
Note: If the reload is of R2.4 code or above, the RAS/BIOS and Firmware update
is no longer available. These functions are included in the ProtecTIER software.
Recovery disk
There are multiple levels of Recovery Disks.
Make sure the Recovery Disk matches the level of ProtecTIER software you are
recovering.
Software CDs
This topic addresses the CDs included in the software ship group.
For an overview of the CDs included in the software ship group, refer to “Software
ship group DVDs” on page 2.
Field-replaceable units unique to the 3958 DD4, 3958 DD5, and 3958
AP1 servers
This topic lists the field-replaceable units (FRUs) that are unique to the 3958 DD4,
3958 DD5, and 3958 AP1 servers.
CAUTION:
The Ethernet and Fibre Channel adapter FRUs in server peripheral component
interconnect (PCI) slots 1 through 7 are not hot-swap capable because the server's
operating system does not permit the adapters to be varied offline prior to
removal. Request that the customer shut down the server before attempting to
replace any of these adapters.
Use Table 29 and Table 30 on page 162 to order FRUs for the 3958 DD4, DD5, and
AP1 servers.
Table 29. 3958 DD4, DD5, and AP1 server FRUs
IBM part
number FRU
59Y4815 Top cover (all models)
46M0000 PWR 3.0 m ROHS
Power supply filler (see Filler Kit)
49Y7760 Power supply (models 1Rx 2Rx 2Sx 3Rx 4Dx 4Rx 4Sx 5Dx 5Rx 5Sx ARx)
42D0500 Emulex 8 Gb dual-port FC adapter, Model Lpe12002
42D0516 8 Gb QLogic FC dual-port HBA adapter
46M0003 I/O board shuttle (all models)
59Y4814 Chassis assembly (models 1Rx, 2Rx, 2Sx, 3Rx, 4Dx, 4Rx, 4Sx, 5Dx, 5Rx, 5Sx,
ARx)
Hard disk drive backplane filler (see Filler Kit)
Hard disk drive backplane carrier (see Miscellaneous hardware parts kit)
Hard disk drive backplane power cables and carrier (see Cabling Kit)
RAID card carrier (see Miscellaneous hardware parts kit)
44E8690 ServeRAID-BR10i SAS/SATA Controller (all models except 5Dx)
46M0851 ServeRAID-M5015 SAS/SATA Controller (optional)
81Y4451 ServerRAID-M5015 BBU Battery Backup Unit ( M5000 battery )
49Y2004 Hard disk drive, 600 GB 10K, 2.5-inch SAS (optional), for DD5
42D0638 Hard disk drive, 300 GB 10K, 2.5-inch SAS (optional)
42D0673 Hard disk drive, 73 GB 15K, 2.5-inch SAS (optional)
42D0678 Hard disk drive, 146 GB 15K, 2.5-inch SAS (optional)
42D0633 Hard disk drive, 146 GB (optional)
Table 30. Additional FRUs for feature code 3447, Cluster Connection Kit
IBM part
number FRU
46M2177 Ethernet Switch 1, SMC8126L2 26-Port 1Gb Switch
46M2177 Ethernet Switch 2, SMC8126L2 26-Port 1Gb Switch
For FRUs not listed in Table 29 on page 161 and Table 30 on page 162, see the IBM
System x3850 X5 and x3950 X5 Types 7145 and 7146: Problem Determination and
Service Guide.
CAUTION:
The Ethernet and Fibre Channel adapter FRUs in server peripheral component
interconnect (PCI) slots 1 through 7 are not hot-swap capable because the server's
operating system does not permit the adapters to be varied offline prior to
removal. Request that the customer shut down the server before attempting to
replace any of these adapters.
Use Table 31 and Table 32 on page 164 to order FRUs for the 3958 DD3 server.
Table 31. 3958 DD3 server FRUs
IBM part
number FRU
15F8409 Battery system planar board
23R3245 PWR 3.0 m ROHS
26K8680 Hard disk drive (HDD) filler
39M5377 Alternate current (AC) power cord
39R6528 Qlogic 4 Gbps 2-port Fibre Channel host bus adapter (HBA)
39Y6128 Dual-port Ethernet card
39Y7355 1440 watt H/S power supply
41Y2770 2 GB Memory dual inline memory module (DIMM)
42D3062 Slide kit
43W4283 Server RAID-MR10k battery
43W7512 Emulex 4 Gbps 2-port Fibre Channel HBA
43W8673 Kodiak power backplane
43W9559 Central processing unit (CPU) heat sink
43W9578 92 mm cooling fan
Table 32. Additional FRUs for feature code 3447, Cluster Connection Kit with Alpha
Networks switch
IBM part
number FRU
22R4916 Alpha Networks Ethernet switch
22R5835 208 V dc jump I/O
23R5136 2.8 m C13/C19 power cord
45E5808 Western Telematic Inc. (WTI) 1 Gbps network power switch
95P4759 3.2 m Ethernet cable
For FRUs not listed in Table 31 on page 163 and Table 32 on page 164, see the IBM
System x3850 M2 and x3950 M2 Types 7141, 7233, 7144, and 7234 Problem
Determination and Service Guide.
CAUTION:
The Ethernet and Fibre Channel adapter FRUs in server PCI slots 1 through 7
are not hot-swap capable because the server operating system does not permit
the adapters to be varied offline prior to removal. Request that the customer
shut down the server before attempting to replace any of these adapters or FRUs
unless listed as a concurrent replacement.
Use Table 34 and Table 35 on page 166 to order FRUs for the 3958 DD1 server.
Table 34. 3958 DD1 server FRUs
IBM part
number FRU
95P2819 100 ft Ethernet cable
23R5136 2.8 m cable
44E4555 Tigerton 2.93 GHz/8M processor
39Y6128 2-port Ethernet card
95P4759 3.2 m Ethernet cable
41Y2851 4 GB DIMM
43W7512 Emulex 4 Gbps 2-port Fibre Channel HBA
15F8409 Battery System planar board
44E4564 System x3850 M2 bezel
44E4357 Archimides 2.5 inch HDD backplane
43W8673 Kodiak power backplane
44E4568 7–cable kit
44E4566 Enterprise cable management arm
40K6556 Scalable cable management arm
39M5377 AC power cord
44E4573 System chassis
Table 35. Additional FRUs for feature code 3447, Cluster Connection Kit with Alpha
Networks switch
IBM part
number FRU
22R4916 Alpha Networks Ethernet switch
22R5835 208 V dc jump I/O
For FRUs not listed in Table 34 on page 165 and Table 35 on page 166, see the IBM
System x3850 M2 and x3950 M2 Types 7141, 7233, 7144, and 7234 Problem
Determination and Service Guide.
The replacement procedures are common, but the firmware update commands are
different.
v For ProtecTIER software V2.3 or earlier, use the install commands on the IBM
System Storage RAS/BIOS and Firmware Update DVD.
v For ProtecTIER software V2.4 or V2.5, use the following command:
/opt/dtc/install/ptconfig -updateFirmwares
v For ProtecTIER V3.1 or later, select ProtecTIER Configuration > Update
Firmware from the ProtecTIER Service menu (see “ProtecTIER Service menu” on
page 16).
The TS7650 Appliance and TS7650G Gateway use standard field-replaceable units
(FRUs) and some unique FRUs. To replace a FRU, refer to the service guide that
shipped with the component in which you are installing the FRU. Some FRUs are
hot-swap capable and can be hot-swapped if the service guide instructs you to do
so (for example, power supplies). For FRUs that are not hot-swap capable (for
example, Fibre Channel adapters and the RSA), you might have to turn off the
component.
Note: Update the server base firmware and run either the /opt/dtc/install/
ptconfig -updateFirmwares command or the IBM System Storage RAS/BIOS and
Firmware Update DVD V3.2.25 or later for each FRU (except as noted below) after
any of the following maintenance actions:
v Server BIOS update
v RSA replacement
v RSA firmware update
v QLogic Fibre Channel adapter replacement
v Emulex Fibre Channel adapter replacement
v Ethernet adapter replacement
v System planar replacement
v MegaRAID (MR10k) adapter card replacement
v Replacement of RAID controller with M5015 ServeRAID adapter (in 3958 DD4,
DD5, and AP1 servers)
Use one the following Problem Determination Guides for removing or installing a
specific hot-swap component for any additional procedures that you might have to
perform before you remove or install the component.
v IBM System x3850 M2 and x3950 M2 Types 7141, 7233, 7144, and 7234 Problem
Determination and Service Guide
v IBM System x3850 X5 and x3950 X5 Types 7145 and 7146 Problem Determination and
Service Guide
Note: Adapters in the TS7650 (including the hot-swap adapter ports) are not
hot-swap capable due to the limitations of the Linux operating system. The system
must be shut down when an adapter needs to be replaced.
For systems running PT software 2.3 and below, see the IBM System Storage TS7600
with ProtecTIER Installation Instructions for the RAS Package, BIOS, and Firmware
updates following a FRU replacement for models 3958 DD1, 3958 DD3, and 3958 AP1,
PN 46X2459 and “Updating the server firmware and basic input/output system
settings” on page 228 for more information. You must follow the directions in this
document on how to update the firmware and to power down the TS7650
Appliance and TS7650G Gateway.
After replacing a Fibre Channel HBA or Ethernet adapter, additional steps might
be required in order to complete the replacement. The Fibre Channel HBA might
require ports set to target or initiator mode. To set the IP address on the Ethernet
adapter, see the IBM System Storage ProtecTIER User's Guide for Enterprise Edition
and Appliance Edition, GA32-0922.
Field replaceable units include the system planar, MR10k RAID adapter, Ethernet
adapters, and SAS disk drives. Often a FRU does not have the current tested level
of BIOS or firmware. New FRUs need to have their BIOS or firmware levels set to
the tested level for assured reliability. Perform the procedures for the FRU that
needs replacement and the BIOS or firmware level set.
Important: For the internal SAS drives on the server, when a disk drive failure
occurs on one of the drives, and a replacement is needed, after a physical
replacement is complete, the mirroring process on the remaining drive restores the
configuration to the new drive. The rebuild process takes several hours to
complete. Complete this drive replacement process concurrently while the system
is up and operational. This replacement process relates to either disk drive in bay 0
or 1. For a failure on both drives, contact your next level of support for recovery.
The replacement drive must be either new or newly formatted.
Attention: The Ethernet and Fibre Channel adapter FRUs in server peripheral
component interface (PCI) slots 1 through 7 are not hot-swap capable because the
server's operating system does not permit the adapters to be varied offline prior to
removal. You must stop all jobs and turn off the system and the server before
attempting to replace any of these adapters.
Procedure
1. Have the customer stop all activities from attached hosts and quiesce all jobs.
2. Enable Service Mode on the server:
a. From a command line on Node A, access the ProtecTIER Service Menu by
entering menu.
b. From the ProtecTIER Service Menu, select Health Monitoring > Service
Mode.
c. Type yes and press Enter to enable Service Mode.
d. Repeat steps 2a thru 2c on Node B.
3. Stop the VTFD and cluster services:
a. From a command line on Node B, type the following commands:
service vtfd stop
service ptcluster stop
b. Repeat step 3a on Node A.
4. Shut down the node on which the FRU must be replaced:
a. Right-click the system console desktop and click Terminal.
b. At the Terminal command prompt, enter the command tsys and select the
system to which you want to connect.
CAUTION:
The Ethernet and Fibre Channel adapter FRUs in server peripheral component
interface (PCI) slots 1 through 7 are not hot-swap capable because the server's
operating system does not permit the adapters to be varied offline prior to
removal. Before attempting to replace any of these adapters, request that the
customer stop all jobs and turn off the system and the server (see “Preparing the
system for FRU replacement” on page 170). See the appropriate FRU replacement
section in this document for each PCI adapter.
Important:
v Use the lists of FRUs in Chapter 9, “Parts catalog,” on page 161 to find FRU
part numbers for most components in those servers.
v When replacing PCI Express adapters, pay particular attention to the adapter
slots. The slots identified with the orange tabs are hot-swap capable slots.
The other slots require a complete system shutdown in order to replace the
adapters. Refer to Chapter 9, “Parts catalog,” on page 161 for important
information regarding the adapter slots. The adapters in the ProtecTIER
server are not hot-swap capable due to the Linux operating system
limitations. The system must be powered down for replacement.
2. After identifying the location, determine whether the FRU is secured by
retaining screws. If the screws are in place, remove them. Working from the
front of the frame, remove the retaining screws from the server front panel and
set them aside.
3. For FRU replacement procedures, refer to the IBM System x3850 X5 and x3950
X5 Types 7145 and 7146 Problem Determination and Service Guide (or, for the 3958
DD3 and 3958 DD1 servers, IBM System x3850 M2 and x3950 M2 Types 7141,
7233, 7144, and 7234 Problem Determination and Service Guide).
Important: If the IBM System x3850 M2 and x3950 M2 Types 7141, 7233, 7144,
and 7234 Problem Determination and Service Guide replacement procedures ask for
the server to be turned off, insert the IBM System Storage RAS/BIOS and
Firmware Update DVD (for R2.3 only) or the R2.4 TS7650 or TS7650G DVD into
the server DVD drive and follow the steps in “Powering off the ProtecTIER
server version 2.5 or earlier” on page 243 and “Powering on the ProtecTIER
server version 2.5 or earlier” on page 245 to cycle power to the server.
7. Make a note of the displayed levels and compare them to the levels listed in
this document.
8. If any firmware displayed is lower than the recommended level, follow the
instructions to update that component (using RAS package version 3.2 or later
or the /opt/dtc/install/ptconfig -updateFirmwares command). If the
firmware level is up to date, you can skip the firmware update.
Results
What to do next
If the remote supervisor adapter (RSA) field replaceable unit (FRU) was replaced,
continue to “Configuring the RSA card” on page 204 to set the IP addresses and
Procedure
1. Connect a USB keyboard and monitor to the 3958 server or use the TSSC to
switch to the node you are working on.
2. Insert the appropriate software DVD.
3. ProtecTIER software V2.3 only: If you cannot boot from the IBM System
Storage RAS/BIOS and Firmware Update DVD (the system displays an Abort,
Retry, Fail? error), remove the DVD and insert the TS7600 HBA Firmware CD
(PN 46X5505, EC M11251). Boot from this CD. (No DVD is needed for V2.5 or
later.)
4. Power on the node.
ProtecTIER software V2.3 and V2.4 only
The CD/DVD should be the default boot device. If it is not, when the
IBM logo and the function key options are displayed, press F12 to
select the CD/DVD drive as the boot device. The IBM TS7600 BIOS and
Firmware tool menu is displayed. Skip to step 5.
ProtecTIER software V2.5 or later
Wait until the node has completed power-on. From the command line,
type the following command to update the firmware:
/opt/dtc/install/ptconfig -updateFirmwares
After the firmware update is complete, type the command reboot. Skip
to step 7 on page 175.
ProtecTIER software V3.1 or later
Wait until the node has completed power-on. To update the firmware,
go to the ProtecTIER Service menu (see “ProtecTIER Service menu” on
page 16) and, select ProtecTIER Configuration > Update Firmware.
After the firmware update is complete, type the command reboot. Skip
to step 7 on page 175.
5. Select the option to Update the Emulex Firmware or Update the Qlogic
Firmware, depending on which Host Bus Adapter (HBA) card you replaced:
[0] Reboot
Note: The Reboot option restarts the server and ejects the CD-ROM.
The firmware is updated. A message states that the firmware update completed
successfully.
6. Select [0] to reboot. The system completes the boot cycle.
7. Verify that no errors are reported on screen or from the error indicators on the
light path display panel.
Note: After the reboot completes, the following message might be presented at
the login prompt:
The module sctp cannot be unloaded due to unsafe usage...
Ignore this message and press the enter key to return to the login prompt.
Results
The firmware update procedure for the Emulex or Qlogic adapter is complete.
What to do next
v If a customer uses a fibre switch to connect the TS7650 to the cache, warn the
customer that replacing a Qlogic adapter creates a different WWNN from the
one that is currently used. If the customer is zoning the switch, they must
rezone it with the new Qlogic WWNN (see “Obtaining WWNN and WWPN
values for a Qlogic adapter” on page 177).
v If the customer has configured LUN Mapping on the backend disk system the
LUN Mapping must be reconfigured to the new WWNN.
v If your system uses ProtecTIER V2.4 or earlier, you might need to manually reset
the World Wide Node Name (WWNN) after updating the firmware. For
instructions, see “Changing the World Wide Node Name on an Emulex
adapter.”
v If your system uses ProtecTIER V2.5 or later, the WWNN for the new Emulex
adapter is automatically reset to its original value by the firmware update and
reboot in step 4 on page 174.
Note: This task cannot be performed on a system running ProtecTIER V2.5 or later.
+------------------------------------------------------------+
| MAIN MENU |
+------------------------------------------------------------+
| [1] Show available FE ports names |
| [2] Edit WWN by port instance |
| [3] View Changes |
| [4] Submit or discard changes |
| [x] Exit |
+------------------------------------------------------------+
3. Select option 1 to show the available FE port names. The Display Names By
Port menu is displayed (Figure 65).
+-----------------------------------------------------------------+
| DISPLAY NAMES BY PORT |
+-----------------------------------------------------------------+
| [port] original nodename original portname |
| ====== ================= ================= |
| |
| [ 0] 20000000c97e3434 10000000c97e3434 |
| [ 1] 20000000c97e3435 10000000c97e3435 |
| [ 2] 20000000c97e2dc8 10000000c97e2dc8 |
| [ 3] 20000000c97e2dc9 10000000c97e2dc9 |
| [m] back to main menu |
+-----------------------------------------------------------------+
+-----------------------------------------------------------------+
| CHANGE NAME BY PORT |
+-----------------------------------------------------------------+
| [port] original nodename original portname |
| ====== ================= ================= |
| [ 0] 20000000c97e3434 10000000c97e3434 |
+-----------------------------------------------------------------+
What to do next
If you are performing any other FRU or firmware updates, proceed to the
appropriate section. Otherwise, go to Chapter 12, “End-of-call procedure,” on page
249.
Procedure
1. At the command line, type the following command:
/opt/QLogic_Corporation/SANsurferCLI/scli
The Main Menu for the Qlogic adapter is displayed (Figure 68).
v1.7.3 Build 32
Main Menu
1: General Information
2: HBA Information
3: HBA Parameters
4: Target/LUN List
5: iiDMA Settings
6: Boot Device
7: Utilities
8: Beacon
9: Diagnostics
10: Statistics
11: Virtual
12: Help
13: Exit
Enter Selection:
Figure 68. Main Menu for Qlogic adapter command line interface
v1.7.3 Build 32
v1.7.3 Build 32
1: Information
2: HBA Alias
3: HBA Port Alias
4: VPD
5: Return to Previous Menu
6. Record the WWNN value of the selected port (displayed next to the label Port
Name).
7. To return to the HBA Information Menu, press Return and then select option 5
(Return to Previous Menu).
8. Repeat steps 4 on page 178 through 7 as needed to obtain the WWNN values
for other ports.
9. From the HBA Information Menu, select option 6 to return to the Main Menu.
10. Select option 13 to exit from the Qlogic adapter command line interface.
11. Deliver the list of WWPN and WWNN values to the customer.
For 3958 DD4 and 3958 DD5 servers, it is important to adhere closely to the
replacement procedure of the “microprocessor-board assembly” in the IBM System
x3850 X5 and x3950 X5 Types 7145 and 7146 Problem Determination and Service Guide.
Procedure
1. Connect a USB keyboard and monitor to the server or use the TSSC keyboard
and monitor to access the command line of the node.
Results
The process of updating the server system planar board firmware and basic
input/output system (BIOS) settings is complete.
For both 7141 (3958 DD1) and 7233 (3958 DD3), the updated firmware levels are as
follows:
v brcm_fw_nic_2.1.0a_linux_32-64.bin (Sysplanar Broadcom Ethernet firmware
update)
v ibm_fw_bios_a3e166a_linux_i386.sh (Sysplanar BIOS)
v ibm_fw_bmc_a3bt54a_linux_i386.sh (Baseboard Management Controller)
v ibm_fw_fpga_a3ud23c_linux_i386.sh (Field Programmable Gate Array)
v ibm_fw_dsa_3.00_dsyt55a_linux_32_64.bin (Dynamic System Analysis)
What to do next
v The Broadcom firmware update script updates only active ports with unique IP
addresses. Sysplanar Port 1 (eth4) is used for the replication feature. Replication
configuration is required before the firmware for Port 1 is updated. Sysplanar
Port 2 (eth5) is used for connection to the TSSC. Installation of the RAS package
is also required before firmware for Port 2 is updated. 3958 DD4 and 3958 DD5
servers in VTL configuration have eth4 and eth5 onboard. Port eth5 is already
active as a replication port, but eth4 must be temporarily reconfigured as an
another active replication port for correct firmware update. After the firmware
update, port eth4 can be restored to its previous configuration.
v For 3958 DD4 and 3958 DD5 servers in OpenStorage or File System Interface
configurations, ports eth12 and eth13 are onboard. Both ports must be
temporarily reconfigured as active replication ports (with unique IP addresses)
for correct firmware update. After the firmware update, eth12 must be restored
to its previous configuration. (Restoring eth13 to its previous configuration is
optional.) You must reboot the server to allow the new IMM, uEFI, FPGA, DSA
pre-boot and BRCM firmware to take effect.
v After firmware updates have been completed on 3958 DD4 and 3958 DD5
servers, you must rebuild the RAS package.
– For ProtecTIER V2.5 or earlier, type the following command:
/opt/dtc/install/ptconfig -configRAS
– For ProtecTIER V3.1 or later, select ProtecTIER Configuration > Configure
RAS from the ProtecTIER Service menu (see “ProtecTIER Service menu” on
page 16).
After the rebuild is complete, continue with Chapter 12, “End-of-call procedure,”
on page 249.
v If you are performing any other FRU or firmware updates, proceed to the
appropriate section.
Attention: This procedure initializes both of the HDDs in the server and requires
a reload of the Red Hat Linux operating system and ProtecTIER code.
Note: Only a fresh field-replaceable unit HDD can be used for hot-swapping.
However, if a used HDD, with the same FRU part number, from another 7141
(3958 DD1) or 7233 (3958 DD3) system with the MR10K RAID controller, is
available for use as a replacement FRU, it must be placed in one of the two HDD
slots when the 3958 DD1/3958 DD3 is powered down. The same applies to a used
HDD, previously used in a 3958 DD4 or 3958 DD5 (Type 7145) with a ServeRAID
M5015 controller, and to be re-installed in a 7145 machine. Plug the used HDD as a
replacement FRU into the appropriate HDD slot when the 7145 is powered down.
Procedure
Procedure
Perform the following steps to remove the existing RAID controller from the server
and install an M5015 ServeRAID adapter:
1. Turn off the server and peripheral devices, and disconnect the power cords
and all external cables as necessary to replace the device.
2. Remove the top cover.
Attention: Operating the server for more than 2 minutes with the top cover
removed might damage server components. For proper cooling and airflow,
replace the top cover before you turn on the server.
a. Slide the server out of the rack until the slide rails lock into place.
Cover release
latch
Top cover
ts760917
Figure 72. Removing the top cover
Latch
ts760918
Figure 73. Removing the top cover bracket
4. Pull the blue handle on the RAID adapter carrier up to extract it from the
server.
RAID card
carrier
Slot
ts760920
Figure 74. Extracting the RAID adapter carrier
ts760921
Tabs
7. If a battery is installed on the RAID adapter, remove the battery carrier card
and the battery from the RAID adapter. You must remove the three screws to
separate them (Figure 76).
ts760922
Figure 76. Removing the battery carrier card from the adapter
Note: Attach the SAS cable from port 0 to the lower backplane and the SAS
cable from port 1 to the upper backplane (if one is installed). See Figure 77
and Figure 78 for port locations.
ts760919
Port 1 Port 0
ts760923
Port 1 Port 0
12. Slide the RAID adapter carrier and RAID adapter assembly into the slot on
the side of the server (Figure 79 on page 188). Make sure the carrier is flat
against the side wall of the server so that the adapter is installed in the
connector correctly.
RAID card
carrier
Tab
ts760924
Figure 79. Inserting the RAID adapter carrier
Procedure
ts760917
Figure 80. Removing the top cover
Latch
ts760918
Figure 81. Removing the top cover bracket
4. Pull the blue handle on the RAID adapter carrier up to extract it from the
server.
RAID card
carrier
Slot
ts760920
Figure 82. Extracting the RAID adapter carrier
Note: Mark the cables before removing them so they can be returned to their
original positions.
6. Press the release latch and remove the RAID adapter from the carrier
(Figure 83 on page 193).
ts760921
Tabs
7. Remove the battery carrier card and the battery from the RAID adapter. You
must remove the three screws to separate them (Figure 84 on page 194).
Note: The location of the screws will be different depending on the model of
RAID adapter. The most current RAID adapter contains the three screws on
the top of the BBU.
8. Install the replacement BBU onto the adapter using the three screws.
9. Install the RAID adapter onto the RAID adapter carrier.
10. Connect the SAS cables to the RAID adapter.
Note: Attach the SAS cable from port 0 to the lower backplane and the SAS
cable from port 1 to the upper backplane (if one is installed). See Figure 85 for
port locations.
Port 1 Port 0
Slot
RAID card
carrier
Tab
ts760924
Attention: (ProtecTIER V2.3 and earlier only) When using the Ethernet
Replacement Tool procedure, you must load the latest RAS package (verson 3.1.x
or higher) prior to running the tool from the menu selection. Refer to the RAS
package update information in the IBM System Storage TS7600 with ProtecTIER
Installation Instructions for the RAS Package, BIOS, and Firmware updates following a
FRU replacement for models 3958 DD1, 3958 DD3, and 3958 AP1, PN 46X2459 and
then return to this procedure.
Procedure
Perform the following steps to replace a Dual-Port or Quad-Port Ethernet card and
recover Ethernet settings.
1. Prepare the system for FRU replacement (see “Preparing the system for FRU
replacement” on page 170).
2. Replace the Ethernet adapter (see “Replacing the field-replaceable unit” on
page 171). After replacing the FRU, continue with step 3.
3. Replug the disconnected Ethernet cables and power on the server.
4. Log in with user ID ptadmin and password ptadmin.
5. Run the Ethernet card replacement tool on the server by using one of the
following methods.
v ProtecTIER V2.5 or later: The Ethernet card replacement tool is activated
automatically on bootup of the server. (This activation includes the automatic
update of the quad-port Ethernet replacement card on the 3958 DD4 or 3958
DD5 server.) To check that all is repaired as expected, from the ProtecTIER
Service menu (see “ProtecTIER Service menu” on page 16), select the options
to run a full check on the node.
– For ProtecTIER V2.5 or earlier, select System Health Monitoring > Run a
full check on this node.
– For ProtecTIER V3.1 or later, select Health Monitoring > Run a full check
on this node.
If no degraded check points are indicated, then the Ethernet settings of the
replaced adapter have been successfully recovered.
After the health check has been successfully completed, STOP. Skip the
remaining steps of this procedure and continue with Chapter 12, “End-of-call
procedure,” on page 249.
v ProtecTIER V2.4 or earlier, at the command line, type:
rsCerCfgUpdateEthConnections.
- OR -
v ProtecTIER 2.4 or earlier, select Manage Configuration > Update Eth
Connections.
6. To verify, ping other parties on the network.
The following example shows activation of this tool for the dual-port Ethernet card
on a system running ProtecTIER V2.4 or earlier.
1. Type the command rsCerCfgUpdateEthConnections.
# /opt/ras/bin/rsCerCfgUpdateEthConnections
2. The Ethernet replace tool identifies the replaced card:
The following replacement(s) were discovered for 2 port Ethernet card on slot 3
eth1
Old MAC address = 00:15:17:1e:2d:71
New MAC address= 00:15:17:7f:28:8d
eth0
Old MAC address = 00:15:17:1e:2d:70
New MAC address= 00:15:17:7f:28:8c
IP: 9.148.41.82
Do you wish to correct the ethernet configuration? YES/NO
3. Disconnect all Ethernet cables from the replaced card, and press Enter.
Verifying cables are disconnected... may take a few minutes
Configuring eth0
Please connect ethernet cable to eth0
press Enter to continue or type exit
4. Connect a single cable to the requested port (Eth0 in our case) and press
Enter.
Checking for connected cable... may take a few minutes
Note: In some instances, you are requested to connect the other cable as well.
In such a case you do not have to follow steps 6 through 8 on page 198.
This action would also update eth1
If you wish to connect a cable to eth1
Please connect it before typing YES
Please confirm YES/NO
6. In the following steps, the configuration for the port on the replaced card is
continued.
Configuring eth1
Please connect ethernet cable to eth1
press Enter to continue or type exit
7. If you wish to configure the port, connect the Ethernet cable to the port and
press Enter.
Checking for connected cable... may take a few minutes
[0] Reboot
What to do next
Procedure
1. Pay close attention to the steps in the procedure “Replacing the
microprocessor-board assembly” that refer to the use of the Advanced Settings
Utility (ASU) to restore the product name, machine type, and serial number of
the server. The ASU is used when the server is booted into the operating
system. Consult with the trained xSeries representative to imprint this vital
information via ASU or from a saved configuration file.
2. If necessary, set the system product fields (Product Name, Model/Type, and
Serial Number) on the new microprocessor board.
a. Locate the product label (1 in Figure 87 on page 200) on the right side of
the server.
b. Enter the correct values using the following commands:
cd /usr/bin
./asu64 set SYSTEM_PROD_DATA.SysInfoProdName "XXXXXXX"
./asu64 set SYSTEM_PROD_DATA.SysInfoProdIdentifier "System x3850 X5"
./asu64 set SYSTEM_PROD_DATA.SysInfoSerialNum "nnnnnnn"
where XXXXXXX is the Type and Model shown on the label 1, and
nnnnnnn is the seven-digit serial number.
ts761191
Figure 87. Product label on server
What to do next
v Following the replacement of the I/O board Shuttle assembly, it is necessary to
update firmware to the two onboard BroadCom (BRCM) Ethernet interfaces. Go
to “Updating the onboard Ethernet adapter firmware” on page 202.
v When new MAC addresses are recorded on bootup (due to replacement of an
Ethernet card with the new microprocessor-board assembly or due to
replacement of the I/O board shuttle), the MAC addresses are correctly
configured to the appropriate Ethernet ports. Further configuration of the
Ethernet ports to their previous settings occurs automatically on startup of the
Ethernet ports. See “Replacing a Dual-Port or Quad-Port Ethernet card and
recovering Ethernet settings” on page 196.
v If problems with automatic reconfiguration of the network ports occur during
startup, you must reinstall the Red Hat Linux operating system and the
ProtecTIER code.
Note: The server S/N and system product name must be gathered before
conducting a remove and replace of the microprocessor board.
If the server is still active, issue the following commands and record the
information.
1. dmidecode -s system-serial-number
A seven-digit serial number is displayed. Write down the serial number.
2. dmidecode -s system-product-name
The system product name information is displayed. Write down the text in
bold. For example: IBM 3850 M2 / x3950 M2 -[7233PAF]-
If the server is not accessible, verify the serial number and product name from the
previous Call Home concerning this issue.
Procedure
Perform the following steps to change the S/N of the new microprocessor board.
1. Download the x3850 M2 BIOS 170A.ISO file from the IBM Fix Central website,
or contact IBM Support (Level 2 Support) for ProtecTIER to forward this .iso
file to you, and save it to your laptop.
Note: If the .iso file is not readily available at Fix Central, consult immediately
with IBM Support for ProtecTIER to provide the .iso file to you.
2. Burn the x3850 M2 BIOS 170A.ISO file to a new CD. Use of ROXIO (on your
laptop) is recommended to burn the .iso file to the new CD. The .iso file
must be the only file on the CD.
3. Use the new CD on the x3850 M2 system to boot up to DOS.
4. Select option 1 “Update POST/BIOS” from the first menu.
5. Continue through the menus as if you are going to flash the BIOS.
6. You are prompted to enter a new S/N or keep the existing one.
7. You are prompted to enter a new system product name or keep the existing
one.
Note: BIOS Firmware (FW) version 1.16 is flashed. This is the latest level of
BIOS FW currently available. If BIOS FW code has been updated to a level
higher than version 1.16, then select the “Update Firmware” menu option after
boot up, or execute the command:
/opt/dtc/install/ptconfig -updateFirmwares
This upgrades the BIOS FW, if a new version greater than version 1.16 has been
incorporated in the ProtecTIER code.
9. Upon completion of the reboot, the S/N (and the system product name if
modified) is changed.
What to do next
v Following the replacement of the I/O board Shuttle assembly, it is necessary to
update firmware to the two onboard BroadCom (BRCM) Ethernet interfaces. Go
to “Updating the onboard Ethernet adapter firmware.”
v If MAC addresses are not correctly recorded on replacement of the
microprocessor-board assembly or of the I/O board shuttle on bootup, see
“Replacing a Dual-Port or Quad-Port Ethernet card and recovering Ethernet
settings” on page 196.
v The procedure “Replacing a Dual-Port or Quad-Port Ethernet card and
recovering Ethernet settings” on page 196 might not recover the new MAC
addresses for the appropriate network ports. In this case, you must reinstall the
Red Hat Linux operating system and the ProtecTIER 2.4.x (or lower) code and
reconfigure the system.
v When ProtecTIER 2.5.x or 3.1.x is loaded in the 3958 DD1/3958 DD3, all MAC
addresses are correctly configured to the appropriate Ethernet ports. Further
configuration of the Ethernet ports to their previous settings occurs
automatically on startup of the Ethernet ports. See “Replacing a Dual-Port or
Quad-Port Ethernet card and recovering Ethernet settings” on page 196.
v If problems with automatic reconfiguration of the network ports occur during
startup, you must reinstall the Red Hat Linux operating system and the
ProtecTIER code.
Procedure
Perform the following steps to update the onboard Ethernet adapter firmware.
1. Contact IBM Support (Level 2 Support) for ProtecTIER to forward the brcm_fw
file to you, and save it to your laptop.
2. Copy the brcm_fw file to a writable CD.
- OR -
mount /dev/cdrom /mnt/cdrom (for DD3)
b. cd /mnt/cdrom
c. cp brcm_fw /var/tmp
d. cd /var/tmp
e. eject (remove the CD from the DVD drive)
5. Verify that the BRCM firmware is present in the /opt/dtc/firmwares directory
by typing the following commands:
cd /opt/dtc/firmwares
ls | grep brcm
Note: The exact filename of the BRCM firmware file must be:
brcm_fw_nic_2.1.7a_linux_32-64.bin
6. Go to the /var/tmp directory by typing the command: cd /var/tmp.
7. Update the firmware by typing the command: ./brcm_fw.
8. Allow the script to finish, then reboot the server.
Note: The script displays progress of the firmware update. The updated
firmware levels are shown at the end of the script, as well as a reminder to
reboot the system. Reboot is necessary to activate the updated firmware code
level.
9. Verify that the firmware is updated:
a. Type menu at the command prompt and press Enter. The ProtecTIER Service
menu is displayed.
b. Select Version Information > Display Firmware Versions. Example:
Resource ID Version Description
-------- -- ------- -----------
Ethernet bnx2 2.x.xx Broadcom Corporation NetXtreme II
Ethernet bnx2 2.x.xx Broadcom Corporation NetXtreme II
Note: The updated firmware version for both of the onboard Ethernet ports
should be: bc 6.2.0 NCSI 2.0.11.
Procedure
After the RSA card has been replaced, perform the following procedure to update
the IP addresses.
1. Power on the 3958 server. When the IBM logo displays and the function key
options appear, press F1 (multiple times) to enter Setup Mode.
2. From the Configuration/Setup Utility menu, select Advanced Setup RSA II
Settings. The RSA II settings are displayed (Figure 88).
RSA II Settings
RSA II MAC Address 00-21-5E-2A-06-4F
DHCP IP Address 000.000.000.000
DHCP Control [ Use Static IP ]
What to do next
The RSA II firmware can be updated from the server's command line, or from the
ProtecTIER Service menu (V3.1 or later), if the server's firmware levels are at the
levels listed in Figure 63 on page 173.
Note: To update the RSA II firmware for ProtecTIER V2.4 or later, do not use the
IBM System Storage RAS/BIOS and Firmware Update DVD.
Two firmware files are required for the RSA II update, which are transferred from
the IBM System Storage RAS/BIOS and Firmware Update DVD (ProtecTIER V2.3 only)
in the x3850 M2 server to the TSSC service directory (/home/service) and then
flashed remotely to the RSA II via the TSSC web browser.
Note: This procedure requires the use of the TSSC with functioning Ethernet
connections to the 3850 M2 xServer (IP address 172.31.1.xx) and to the RSA II Web
interface (IP address 172.31.1.xx+1). The IBM System Storage RAS/BIOS and
Firmware Update DVD (ProtecTIER V2.3 only) must be mounted in the server
before you begin. Both the xSeries 7141 and 7233 servers use the same RSA II card
and firmware.
ts761189
Figure 90. Login Profile dialog box
ts761190
What to do next
Use the procedures described within this document to upgrade all the other
firmware to the latest levels.
The following procedure explains how to display the current levels so you can
determine whether the procedure to update the levels needs to be performed. If
the installed levels are up to date, the server can be turned back over to the
customer. If this is a clustered system, you must check both servers.
Note: If the browser is inactive for the number of minutes in the timeout
value, the RSA II logs off the Web interface. You must log in again.
6. Check the current levels of the firmware. At the System Status page in the left
navigation frame under Monitors, click on Vital Products Data.
Note: To verify the correct model of server you are upgrading, check the RSA
Vital Product Data panel under the top panel heading of Machine Level
VPD. Make a note of the Machine Type (7141 or 7233).
7. Scroll down to ASM VPD to verify that the build level of the two firmware
files is at the same, updated level (Example: A3EP29B). If the firmware code
needs to be updated, proceed to the next step. If the firmware does not need
to be updated, proceed to step 33 on page 211. The current levels of firmware
for both 7141 and 7233 model types are listed below:
v Main Application: A3EP41B PAETMNUS.PKT 07-30-09 18
v Boot ROM: A3BP41B PAETBRUS.PKT 07-30-09 18
Both with date codes of 07-30-09.
8. If the RSA II firmware needs to be updated, right-click the desktop and from
the IBM TS3000 System Console menu, select System Console Actions, then
select CD Copy to Console. A CD Copy to Console window is displayed. See
Figure 92 on page 209.
Note: This directory is only used for storing temporary data. It does not
contain any files that are needed for future use. It is safe to erase the contents
of this directory.
11. Press y and then Enter to continue.
12. After the copy is completed, press Enter to exit. The terminal closes and the
DVD is ejected automatically.
13. Right-click on the desktop and from the IBM TS3000 System Console menu,
select Terminal.
14. Change directories to the /home/service/offload directory by entering the
following command:
cd /home/service/offload
15. Expand the file using the following command:
tar -xvf rsacode.tar -C /
The following directories and files are placed on the TSSC in the
/home/service directory:
v TSSC_server:/home/service/ # tar -xvf rsacode.tar -C /
– /home/service/Code/
– /home/service/Code/7141/
– /home/service/Code/7141/PAETBRUS.PKT
– /home/service/Code/7141/RTALERT.MIB
Results
The process of loading the RSA firmware from the TSSC is complete.
Recovering a node
Reinstalling the Red Hat Linux operating system overlays all the Reliability,
Availability, and Serviceability (RAS), basic input/output system (BIOS), and
firmware scripts and utility rpms. As a result, an IBM Service Representative must
reload the RAS package after the customer reloads Red Hat Linux.
Attention: TS7600 products require a TSSC. The TSSC is a vital part of the service
strategy. For the RAS package to gather logs at the time of the failure and send a
call home packet, a TSSC is required. If the RAS package is not installed, the
customer must call 1(800) IBM SERV (7378) (in North America, only) or visit the
IBM Directory of worldwide contacts web page to obtain hardware support.
ProtecTIER must also be reinstalled following reinstallation of Red Hat Linux. This
is typically a customer responsibility. Refer to the system recovery information in
the IBM System Storage ProtecTIER User's Guide for Enterprise Edition and Appliance
Edition, GA32-0922, provided on the IBM System Storage TS7650 with ProtecTIER
Publications CD.
The recovery initiates the automatic node replacement procedure. This includes
missing node identification and performing the following tasks:
v Configures network settings
v Configures hostname
v Configures bond
v Configures Red Hat Linux cluster
v Restarts all cluster activities on the other node in the cluster if in a clustered
installation
v Discovers and registers the repository file systems in the /etc/fstab directory
v Configures multipath
For more detailed information about installing the RAS package on a system
running ProtecTIER software V2.3 or earlier, see the IBM System Storage TS7600
with ProtecTIER Installation Instructions for the RAS Package, BIOS, and Firmware
updates following a FRU replacement for models 3958 DD1, 3958 DD3, and 3958 AP1,
PN 46X2459.
v To perform the RAS package installation, you must first establish a server
connection.
v If you are installing the RAS package in a clustered configuration, perform the
installation on Server A (the bottom server) first.
v If this is the second node installation in a clustered configuration, make sure the
first node is up and running before proceeding.
Procedure
1. Insert the IBM System Storage RAS/BIOS and Firmware Update DVD into the
DVD drive on the 3958 server on which you are installing this package.
Important: Use the same frame number for both nodes when clustered
together in the same frame.
4. The 3958 DD1 first release (August 2008) supported RSA Ethernet connections
on the local customer LAN. This procedure alters that configuration to now
connect the 3958 RSA ethernet connection to connect to the TSSC private
network (172.31.1.xxx). Disconnect the local LAN Ethernet connection to the
RSA adapter port, and connect a new Ethernet cable between the RSA Ethernet
port and the TSSC SMC hub. You are prompted for this action during the
installation. If the connection between the RSA Ethernet port and the TSSC
SMC hub is already established, proceed to the next step.
Note: Depending upon the model of the TSSC and TSSC network switch being
used, the port layout might be different than shown. If so, attach and label the
cables according to the port number assignments specified, regardless of the
position of the ports on the TSSC or the TSSC network switch.
5. After the installation is complete, unmount the IBM System Storage RAS/BIOS
and Firmware Update DVD and eject it using the following commands:
cd /
eject /dev/hda
6. Repeat steps 1 on page 212 through 5 for the second 3958 server if applicable.
You are prompted for “frame number”.
7. Enter the same number that you entered for the first node. For example, if you
configured the frame on the first node to be 110, configure the frame on the
second node to 110 also, provided the two servers are being clustered together.
8. When all procedures have been completed and you are returned to a command
line, type exit to log out of the server.
Results
The process of reinstalling the RAS package (for ProtecTIER software V2.3 or
earlier only) is complete.
Attention: Before upgrading the ProtecTIER code level, you must update the
DS4700 storage subsystem firmware, NVSRAM firmware, and the ESM firmware
on your EXP810 expansion drawers.
You must call IBM Service to schedule a time to update the firmware to the levels
listed below.
Do not proceed to upgrade the ProtecTIER code level to version 2.4.x until:
v The storage subsystem firmware has been upgraded to the minimum level of
07.50.13.00 or the recommended level of 07.60.28.00.
v The DS4700 NVSRAM firmware has been updated to the minimum level of
N1814D47R1050V07 or the recommended level of N1814D47R1060V04.
v The expansion unit ESM firmware has been updated to the minimum level of
98C5 or the recommended level of 98D0.
Procedure
1. It is necessary to power off the server (or servers) before starting work on the
Disk Storage controller. Refer to “Power procedures for ProtecTIER version 3.1
or later” on page 237 or “Powering off the ProtecTIER server version 2.5 or
earlier” on page 243.
2. Identify the location of the FRU or CRU to be replaced and follow the
replacement procedures for the FRU or CRU. Refer to the IBM System Storage
DS4700 Express Storage Subsystem Installation, User's and Maintenance Guide.
3. Once the location has been identified, use the IBM System Storage DS4700
Express Storage Subsystem Installation, User's and Maintenance Guide for
replacement procedures.
4. Power on the server (or servers), refer to “Power procedures for ProtecTIER
version 3.1 or later” on page 237 or “Powering on the ProtecTIER server
version 2.5 or earlier” on page 245.
5. Referring to the IBM System Storage DS4700 Express Storage Subsystem
Installation, User's and Maintenance Guide, perform any checkout procedures.
Results
The process of removing and replacing Disk Storage controller FRUs is complete.
Attention: Before upgrading the ProtecTIER code level, you must update the
DS4700 storage subsystem firmware, NVSRAM firmware, and the ESM firmware
on your EXP810 expansion drawers.
You must call IBM Service to schedule a time to update the firmware to the levels
listed below.
Do not proceed to upgrade the ProtecTIER code level to version 2.4.x until:
v The storage subsystem firmware has been upgraded to the minimum level of
07.50.13.00 or the recommended level of 07.60.28.00.
v The DS4700 NVSRAM firmware has been updated to the minimum level of
N1814D47R1050V07 or the recommended level of N1814D47R1060V04.
v The expansion unit ESM firmware has been updated to the minimum level of
98C5 or the recommended level of 98D0.
Procedure
1. It is necessary to power off the server (or servers) before starting work on the
DS controller. Refer to “Power procedures for ProtecTIER version 3.1 or later”
on page 237 or “Powering off the ProtecTIER server version 2.5 or earlier” on
page 243.
2. Identify the location of the FRU to be replaced and follow the replacement
procedures for the FRU. Refer to the IBM TotalStorage DS4000 EXP810 Storage
Expansion Unit Installation, User's and Maintenance Guide.
3. Once the location has been identified, use the IBM TotalStorage DS4000 EXP810
Storage Expansion Unit Installation, User's and Maintenance Guide for replacement
procedures.
4. Power on the server (or servers), refer to “Power procedures for ProtecTIER
version 3.1 or later” on page 237 or “Powering on the ProtecTIER server
version 2.5 or earlier” on page 245.
5. Referring to the IBM TotalStorage DS4000 EXP810 Storage Expansion Unit
Installation, User's and Maintenance Guide, perform any checkout procedures.
Results
The process of removing and replacing disk expansion module FRUs is complete.
Note: If you are required to remove either switch, refer to “Removing the Ethernet
switch.”
Procedure
9
10
3
4 12 2
11
5
ts760895
Figure 93. Removing the Ethernet switch (3958 AP1 installations)
5. Remove side brackets and power cord. Save all screws and power cord for
reuse. See Figure 93.
Procedure
Perform the following steps to complete the reinstallation of the Alpha Networks
Ethernet switch.
Note: Use caution during this process to avoid dislodging or damaging the cabling
within the frame.
1. To attach the mounting brackets and power cords to the Ethernet switches,
perform the following steps:
a. Attach the mounting brackets to the short-sides of the Ethernet switch
(Figure 94 on page 218), making sure to align the perforated fan vents on
the switch with the cut-out areas on the brackets.
b. Thread the power cord through the holes in the mounting bracket, making
sure that the tab extensions on the power cord are positioned on the outside
of the bracket.
9
10
3
4 12 2
11
5
ts760895
Figure 94. Installing the Ethernet switch (3958 AP1 installations)
2. To install the Ethernet switches into the air duct housing, perform the following
steps (working from the back side of the frame):
a. Carefully slide one of the Ethernet switches into the air duct housing at
position 3 on the frame, with the Ethernet ports facing outward (Figure 94).
b. Align the top hole in the mounting bracket on the Ethernet switch with the
guide pins on the rails.
c. Insert a short flanged screw into the bottom holes on each side of the
mounting bracket, and tighten the screws to secure the Ethernet switch to
the rails.
d. Repeat the steps above for the other Ethernet switch, if applicable.
e. Insert the power cord into the power receptacle on the Ethernet switch to
reapply power.
Procedure
Perform the following steps to complete the reinstallation of the SMC Ethernet
switch.
Note: Use caution during this process to avoid dislodging or damaging the cabling
within the frame.
1. From the rear of the frame, route the female end of the two power cables for
the Ethernet switch through the oblong opening on the right side (as you face
it from the rear) of the frame (Figure 95 on page 219). Perform this action now
ts760894
Figure 95. Ethernet switch power cable routing into the frame
2. Install the right mounting rail 1 and the left mounting rail 2 in the frame
using one screw 3 each (Figure 96).
9
10
3
4 12 2
11
5
ts760895
3. From the front of the frame, route the female end of the Ethernet power cable
from step 1 on page 218 through square hole 12 in the left mounting rail
ts760893
Figure 97. Ethernet switch power cable routing through the mounting rail
4. Attach the left and right mounting brackets 4 and 5 to the Ethernet switch
9 with four screws 10 each.
5. Attach the left side of the switch 9 to the left mounting rail 2 using one
screw 11 through the mounting bracket 5.
6. Attach the right side of the switch 9 to the right mounting rail 1 using
one screw 11 through the mounting bracket 4.
Note: The switch might hang slightly below the bottoms of the mounting
rails.
7. From the front of the frame, connect the Ethernet switch power cable to the
Ethernet switch.
8. To configure the newly installed switch, access the ProtecTIER Service menu
(see “ProtecTIER Service menu” on page 16).
9. Select ProtecTIER Configuration. The ProtecTIER Configuration menu is
displayed (Figure 98 on page 221).
B) Back
E) Exit
----------------------------------------------------------------------
>>> Your choice?
----------------------------------------------------------------------
ProtecTIER Service Menu running on rasddx
ProtecTIER Configuration (...)
IP Network Configuration (...)
----------------------------------------------------------------------
1) Configure hostname
2) Configure ProtecTIER’s IP interfaces
3) Configure Static Routes
4) Configure cluster Ethernet switches
B) Back
E) Exit
----------------------------------------------------------------------
>>> Your choice?
WTI Network
Power Switch
Empty (1u)
Empty (1u)
TJ8 P Empty (1u)
BJ8 P Empty (1u)
Empty (1u)
1 Empty (1u)
Empty (1u)
2
Empty (1u)
3 Empty (1u) Top Power
4 Empty (1u) Distribution
Empty (1u) Unit (PDU)
5 Empty (1u)
6 Empty (1u)
7 Empty (1u)
8 Empty (1u)
Empty (1u)
Empty (1u)
E Empty (1u)
Empty (1u)
Empty (1u)
BJ5 TSSC (1u) TJ5
BJ4 TSSC Network Switch & KVM (1u) TJ4
3958 DD3
7233-PCS
3850 M2 (4u)
Bottom Power
Empty (1u) Distribution
Unit (PDU)
3958 DD3
7233-PCS
3850 M2 (4u)
Empty (1u)
1 GB Ethernet Switch (1u) TJ1
BJ1 1 GB Ethernet Switch (1u)
ts760234
Empty (1u)
Empty (1u)
1 3 5 7 9 11 13 15
Ethernet switch 2 2 4 6 8 10 12 14 16
P
1 3 5 7 9 11 13 15
Ethernet switch 1 P
2 4 6 8 10 12 14 16
WTI
Network
Power Switch 3958 Server B
P
1 1 1 1 1 1 A P
2 2 2 2 2 2 U
E1
R S
P E2 P
V
1 2 3 4 5 6 7
1
5
3958 Server A
6
7 1 1 1 1 1 1 A P
8 2 2 2 2 2 2 U
E1
R S
E2 P
E V
ts760241
1 2 3 4 5 6 7
Figure 102. Clustered 1 Gb Ethernet switch Ethernet connections (3958 DD1/3958 DD3)
1 3 5 7 9 11 13 15 17 19 21 23 25 26
Ethernet switch 1 P
2 4 6 8 10 12 14 16 18 20 22 24
WTI
3 2 7
network
power switch
ProtecTIER Server B
8
P
1 1 1 1 1 P1 P2
2 2 2 2 2
P
3
4 A1 A2 B V S E1 E3
A1
E2 E4
A2 1 2 3 4 5 6 7 8 9
A3 26 6 Customer Network
A4 Replication 1 27 Replication 2
1 Customer Network
B1
B2 ProtecTIER Server A
B3
1 1 1 1 1 P1 P2
B4
2 2 2 2 2
4 3
E
4 A1 A2 B V S E1 E3
E2 E4
1 2 3 4 5 6 7 8 9
ts760862
19 20
Replication 1 Replication 2
Table 37. Clustered 1 Gb Ethernet switch Ethernet connections for VTL configuration
Callout From On Device To On Device/Location
1 Slot 5, port 1 Server A Customer network Customer designated device
2 Port 1 Ethernet switch 1 Slot 5, port 2 Server A
3 Port 1 Ethernet switch 2 Port A1 Server A
4 Port 3 Ethernet switch 2 Ethernet port, E WTI network power switch
| 5 Port 26 Ethernet switch 2 Port 26 Ethernet switch 1
6 Slot 5, port 1 Server B Customer network Customer designated device
7 Port 2 Ethernet switch 1 Slot 5, port 2 Server B
8 Port 2 Ethernet switch 2 Port A1 Server B
19 Slot 5, port 3 Server A Customer Customer specified device
replication network
20 Port A2 Server A Customer Customer specified device
replication network
26 Slot 5, port 3 Server B Customer Customer specified device
replication network
27 Port A2 Server B Customer Customer specified device
replication network
1 3 5 7 9 11 13 15 17 19 21 23 25 26
P
Ethernet switch 1 2 4 6 8 10 12 14 16 18 20 22 24
WTI 2 7
3
network
power switch
ProtecTIER Server B
8
P
1 1 1 1 1 P1 P2
2 2 2 2 2
P
3 3 3
4 4 4 A1 A2 B V S E1 E3
A1
E2 E4
A2 1 2 3 4 5 6 7 8 9
A3 18 6 Customer Network
A4 17 Replication 2
Replication 1
B1 1 Customer Network
B2 ProtecTIER Server A
B3
1 1 1 1 1 P1 P2
B4
2 2 2 2 2
4
3 3 3
E
4 4 4 A1 A2 B V S E1 E3
E2 E4
1 2 3 4 5 6 7 8 9
ts760863
16
15 Replication 2
Replication 1
Table 38. Clustered 1 Gb Ethernet switch connections for OpenStorage configuration, Feature Code 3456 (For
ProtecTIER V3.1.8 or earlier)
Callout From On Device To On Device/Location
1 Port 1, slot 5 Server A Customer network Customer designated device
2 Port 1 Ethernet switch 1 Port 2, slot 5 Server A
3 Port 1 Ethernet switch 2 Port A1 Server A
4 Port 3 Ethernet switch 2 Ethernet port, E WTI network power switch
| 5 Port 26 Ethernet switch 2 Port 26 Ethernet switch 1
6 Port 1, slot 5 Server B Customer network Customer designated device
7 Port 2 Ethernet switch 1 Port 2, slot 5 Server B
8 Port 2 Ethernet switch 2 Port A1 Server B
15 Port 4, slot 3 Server A Customer replication Customer specified device
network
16 Port 4, slot 4 Server A Customer replication Customer specified device
network
17 Port 4, slot 3 Server B Customer replication Customer specified device
network
1 3 5 7 9 11 13 15 17 19 21 23 25 26
P
Ethernet switch 2 2 4 6 8 10 12 14 16 18 20 22 24 5
1 3 5 7 9 11 13 15 17 19 21 23 25 26
P
Ethernet switch 1 2 4 6 8 10 12 14 16 18 20 22 24
WTI 2 7
3
network
power switch
ProtecTIER Server B
8
P
1 1 1 1 1 P1 P2
2 2 2 2 2
P
3 3 3
4 4 4 A1 A2 B V S E1 E3
A1
E2 E4
A2 1 2 3 4 5 6 7 8 9
A3 6 Customer Network
18
A4 17 Replication 2
Replication 1
B1 1 Customer Network
B2 ProtecTIER Server A
B3
1 1 1 1 1 P1 P2
B4
2 2 2 2 2
4
3 3 3
E
4 4 4 A1 A2 B V S E1 E3
E2 E4
1 2 3 4 5 6 7 8 9
16
ts761115
Replication 2
15
Replication 1
Table 39. Clustered 1 Gb Ethernet switch connections for OpenStorage configuration, Feature Code 3456 (For
ProtecTIER V3.2)
Callout From On Device To On Device/Location
1 Port 1, slot 5 Server A Customer network Customer designated device
2 Port 1 Ethernet switch 1 Port 2, slot 5 Server A
3 Port 1 Ethernet switch 2 Port A1 Server A
4 Port 3 Ethernet switch 2 Ethernet port, E WTI network power switch
| 5 Port 26 Ethernet switch 2 Port 26 Ethernet switch 1
6 Port 1, slot 5 Server B Customer network Customer designated device
7 Port 2 Ethernet switch 1 Port 2, slot 5 Server B
8 Port 2 Ethernet switch 2 Port A1 Server B
|
1 3 5 7 9 11 13 15 17 19 21 23 25 26
Ethernet switch 2 P
2 4 6 8 10 12 14 16 18 20 22 24 5
1 3 5 7 9 11 13 15 17 19 21 23 25 26
Ethernet switch 1 P
2 4 6 8 10 12 14 16 18 20 22 24
WTI
network 3 2 7
power switch
ProtecTIER Server B
P 8
1 1 1 1 1 P1 P2
P
2 2 2 2 2
3
A1
4 A1 A2 B V S E1 E3
A2 E2 E4
1 2 3 4 5 6 7 8 9
A3
A4 6 Customer Network
17
18 Replication 2
B1 Replication 1
1 Customer Network
B2
B3 ProtecTIER Server A
B4
4 1 1 1 1 1 P1 P2
2 2 2 2 2
E
3
4 A1 A2 B V S E1 E3
E2 E4
1 2 3 4 5 6 7 8 9
ts761282
15 16
Replication 1 Replication 2
|
| Figure 106. Clustered 1 Gb Ethernet switch connections for OpenStorage configuration,
| Feature Code 3457
6. Power on the lower ProtecTIER Node A. The message Updating WTI firmware
might be displayed. The update takes at least five minutes.
7. After ProtecTIER Node A is completely powered on, power on ProtecTIER
Node B.
Note: (ProtecTIER V2.3 or earlier only) Only the items in the menu are updated.
To update all components in the x-System, perform the procedure in IBM System
Storage TS7600 with ProtecTIER Installation Instructions for the RAS Package, BIOS,
and Firmware updates following a FRU replacement for models 3958 DD1, 3958 DD3 and
3958 AP1, IBM part number 46X1339.
Procedure
Perform the following steps to update server firmware and BIOS settings.
1. Insert the appropriate software DVD.
Results
The firmware update process is complete. Return to the procedure that sent you
here or you can check the system health to verify that everything is OK.
Example
For both 7141 and 7233 the updated firmware levels are as follows:
v ibm_fw_fpga__linux_i386.sh (Field Programmable Gate Array)
v brcm_fw_nic__linux_32-64.bin (Sysplanar Broadcom Ethernet firmware update)
v ibm_fw_dsa__linux_32_64.sh (Dynamic System Analysis)
Notes:
1. The Broadcom firmware update script updates only active ports. Port 1 is used
for the replication feature. Replication configuration is required before the
firmware for Port 1 is updated. Port 2 is used for connection to the TS3000
System Console (TSSC).
2. Installation of the Reliability, Availability, and Serviceability (RAS) package is
also required before firmware for Port 2 is updated.
The mrdump script is provided to gather the MegaRAID logs needed by IBM
System x support to help diagnose problems with the MR10k card and battery.
Enter the following commands on the command line:
The script creates two log files (mrlog.log and MegaSAS.log) in the same directory
for support.
The ProtecTIER software V2.4.0 or later installs the MegaCli during the autorun
command.
1. Check whether the /opt/MegaRAID/MegaCli/mrdump.sh command is present. If it
is not, you must install the MegaCli commands for ProtecTIER software V2.3
and below.
2. Display the current firmware versions.
v For ProtecTIER software V3.1 or later, from the ProtecTIER Service menu,
select Version Information > Display Firmware Levels (see “ProtecTIER
Service menu” on page 16).
v For ProtecTIER software V2.4 or V2.5, from the ProtecTIER Service menu,
select Display Firmware Versions.
v For ProtecTIER software V2.3 or earlier, from a command line type versions.
If the command is not found, type the following commands to mount the
IBM System Storage RAS/BIOS and Firmware Update DVD and display the
firmware versions:
mount /dev/hda /mnt/cdrom
cd /mnt/cdrom
./versions
Make a note of the displayed levels and compare them to the levels listed in
this document.
3. Update the MR10K firmware.
v ProtecTIER V2.3 or earlier: If any firmware displayed is lower than the
recommended level, use the Reliability, Availability, and Serviceability (RAS)
package version 3.2 or later and follow the instructions in this section to
update that component. If the firmware level is current, you can skip the
firmware update.
v ProtecTIER V2.4 or later: Type the following command:
/opt/dtc/install/ptconfig -updateFirmwares
The update takes several minutes to complete. All components are updated to
supported levels
The MR10k MegaRAID card provides battery backup of the hard disk drive (HDD)
cache data. This card also provides the mirroring of the two HDDs used for the
operating system (OS). See the IBM System Storage Problem Determination and Service
Guide for the TS7650 ProtecTIER Deduplication Appliance and TS7650G ProtecTIER
Deduplication Gateway, GA32-0923, included on the IBM System Storage TS7650 with
ProtecTIER Publications CD, for more information. This procedure is used to update
the firmware on the MR10k after a hardware replacement of the adapter.
Procedure
1. Connect a universal serial bus (USB) keyboard and monitor to the server.
2. Insert the appropriate software DVD.
Note: The version listed is subject to change. Download the latest version as
necessary.
6. Enter the following command to update the MegaRAID command line utilities:
./installMRCli
7. After a successful update of the MRCli, eject the IBM System Storage RAS/BIOS
and Firmware Update DVD by entering the following commands:
cd /
eject /dev/hda
Results
The process of replacing the MR10k MegaRAID card and updating the firmware is
complete.
What to do next
If you are performing any other field replaceable unit (FRU) or firmware updates,
proceed to the appropriate section. Otherwise, go to Chapter 12, “End-of-call
procedure,” on page 249.
If one or more of these parts has been replaced and the firmware is not at the
current level, the following procedure must be performed to update the firmware
to current supported field levels.
Procedure
1. Connect a universal serial bus (USB) keyboard and monitor to the server.
2. Update the firmware.
v For ProtecTIER software V2.4 or later, type the following command at a
command line:
/opt/dtc/install/ptconfig -updateFirmwares
When this process completes, go to step 10 on page 233.
v For ProtecTIER software V2.3 or earlier, type versions at a command line. If
the command is not found, perform the following steps.
3. If the IBM System Storage RAS/BIOS and Firmware Update DVD is not already
installed in the CD/DVD drive of the 3958 server, insert it into the CD/DVD
drive.
4. Log with the ID root and the password admin.
5. From the command line, enter the following commands to start the firmware
load process:
mkdir /mnt/cdrom
Note: The SAS hard disk update can take up to one hour to complete.
./installHDDFW
Note: When updating the SAS hard disk firmware, multiple reboots might be
required. Press Enter when asked to continue.
Important: After the firmware updates have been applied, the system must be
shut down to a standby state and restarted for them to take effect.
7. Following the successful update of the components, enter the following
command from the command line to shut down the system:
poweroff
After the poweroff command has been entered, ignore any further error
messages presented.
8. Press the power button to restart the server.
9. As needed: Remove the IBM System Storage RAS/BIOS and Firmware Update
DVD as soon as the CD/DVD drive allows. After the system completes the
boot cycle, verify that no errors are reported on screen or from the error
indicators on the light path display panel.
Note: After the reboot completes, a message might be presented at the login
prompt that the module sctp cannot be unloaded due to unsafe usage...
Ignore this message and press the Enter key to return to the login prompt.
10. Display the current firmware versions.
v For ProtecTIER software V3.1 or later, from the ProtecTIER Service menu,
select Version Information > Display Firmware Levels (see “ProtecTIER
Service menu” on page 16).
v For ProtecTIER software V2.4 or V2.5, from the ProtecTIER Service menu,
select Display Firmware Versions.
v For ProtecTIER software V2.3 or earlier, from a command line type
versions. If the command is not found, type the following commands to
mount the IBM System Storage RAS/BIOS and Firmware Update DVD and
display the firmware versions:
mount /dev/hda /mnt/cdrom
cd /mnt/cdrom
./versions
Results
The process of replacing the SAS hard disk drive and updating the firmware is
complete.
If you are performing any other field replaceable unit (FRU) or firmware updates,
proceed to the appropriate section. Otherwise, go to Chapter 12, “End-of-call
procedure,” on page 249.
Procedure
1. Log in to the RSA using the procedure in Appendix A, “Logging into the 3958
DD1/3958 DD3 server through the RSA connection,” on page 255. the System x
logo screen shown in Figure 107 is displayed.
cvt00029
This is the recommended manual power off sequence for a single node
(stand-alone TS7650 server) TS7650G or TS7650 Appliance server.
Procedure
1. Ask the customer to suspend I/O to the server in the TS7650G or TS7650
Appliance. Indicate below when the customer reports that I/O is suspended.
2. If the TS7650G or TS7650 Appliance does not use a TSSC, attach a keyboard
and monitor to the TS7650 server. Log in with the ID ptconfig and the
password ptconfig to access the ProtecTIER Service Menu.
3. From the ProtecTIER Service Menu, select the Manage ProtecTIER services
option.
----------------------------------------------------------------------
ProtecTIER Service Menu running on rasddx
----------------------------------------------------------------------
1) ProtecTIER Configuration (...)
2) Manage ProtecTIER services (...)
3) Health Monitoring (...)
4) Problem Alerting (...)
5) Version Information (...)
6) Generate a service report
7) Generate a system view
8) Update ProtecTIER code
E) Exit
----------------------------------------------------------------------
>>> Your choice?
4. From the Manage ProtecTIER services menu, select Stop all services.
B) Back
E) Exit
----------------------------------------------------------------------
>>> Your choice?
When the services are stopped, the system shows the following messages:
Stopping ptrasd [ Done ]
Stopping vtfd [ Done ]
Stopping ptcluster [ Done ]
5. Press Enter to return to the ProtecTIER Service Menu.
6. Power off the server.
v ProtecTIER V3.1.8 or earlier:
a. Type E to exit from the ProtecTIER Service Menu.
b. Log in to the server with the user ID root, and the password admin.
c. From the server command line, type poweroff and press Enter.
v ProtecTIER V3.2:
a. From the ProtecTIER Service Menu, select Manage ProtecTIER Services.
b. In the Manage ProtecTIER Services menu, select Poweroff This Node.
----------------------------------------------------------------------
ProtecTIER Service Menu running on rasddx
Manage ProtecTIER Services (...)
----------------------------------------------------------------------
1) Display services status
2) Start all services
3) Stop all services
4) Stop ProtecTIER services only (including GFS)
5) Stop VTFD service only
6) Poweroff This Node
7) Reboot This Node
B) Back
E) Exit
----------------------------------------------------------------------
>>> Your choice?
When the power off processes complete, the Power LED on the server's front
panel blinks steadily. This indicates that server is in standby mode.
7. Power off all attached storage controllers that are located in the frame with
the server, or in a separate storage frame. The control units must be powered
off before the expansion drawers. If multiple controllers are present, power
them all off.
CAUTION:
See storage controller/expansion unit documentation for detailed power off
instructions to prevent damage to the controllers and expansion drawer
units.
8. When the controllers have finished powering off, power off all expansion
units.
9. Shut down the TSSC. To do so:
This is the recommended manual power off sequence for a TS7650G or TS7650
Appliance cluster.
Procedure
1. Ask the customer to suspend I/O to the servers in the TS7650G or TS7650
Appliance. Indicate below when the customer reports that I/O is suspended.
2. If the TS7650G or TS7650 Appliance uses a KVM, press the PrtSc key on the
KVM to select the node you are going to access.
If the TS7650G or TS7650 Appliance does not use a KVM, then use an external
monitor and keyboard to access the nodes.
3. On Node A (the bottom server) attach a keyboard and monitor to the server.
Log in with the ID ptconfig and the password ptconfig to access the
ProtecTIER Service Menu.
4. From the ProtecTIER Service Menu, select the Manage ProtecTIER services
option.
E) Exit
----------------------------------------------------------------------
>>> Your choice?
5. From the Manage ProtecTIER services menu, select Stop all services.
----------------------------------------------------------------------
ProtecTIER Service Menu running on rasddx
Manage ProtecTIER Services (...)
----------------------------------------------------------------------
1) Display services status
2) Start all services
3) Stop all services
4) Stop ProtecTIER services only (including GFS)
5) Stop VTFD service only
B) Back
E) Exit
----------------------------------------------------------------------
>>> Your choice?
When the services are stopped, the system shows the following messages:
Stopping ptrasd [ Done ]
Stopping vtfd [ Done ]
Stopping ptcluster [ Done ]
6. Press Enter to return to the ProtecTIER Service Menu.
7. Power off the server.
v ProtecTIER V3.1.8 or earlier:
a. Type E to exit from the ProtecTIER Service Menu.
b. Log in to the server with the user ID root, and the password admin.
c. From the server command line, type poweroff and press Enter.
v ProtecTIER V3.2:
a. From the ProtecTIER Service Menu, select Manage ProtecTIER Services.
b. In the Manage ProtecTIER Services menu, select Poweroff This Node.
----------------------------------------------------------------------
ProtecTIER Service Menu running on rasddx
Manage ProtecTIER Services (...)
----------------------------------------------------------------------
1) Display services status
2) Start all services
3) Stop all services
4) Stop ProtecTIER services only (including GFS)
5) Stop VTFD service only
6) Poweroff This Node
7) Reboot This Node
B) Back
E) Exit
----------------------------------------------------------------------
>>> Your choice?
Procedure
1. Using the frame's UPO switch or the customer's circuit breaker, restore power
to the frame (or frames).
2. Power on all expansion units.
3. Power on all storage controllers.
4. Press the power button on the TSSC.
5. Log in to the TSSC with the username service and password service.
6. Press the power button on the server's front panel.
Approximately 15 minutes after the server is powered-on, following the
reboot, you can log in to the ProtecTIER Service Menu.
E) Exit
----------------------------------------------------------------------
>>> Your choice?
9. From the Manage ProtecTIER services menu, select Display services status.
----------------------------------------------------------------------
ProtecTIER Service Menu running on rasddx
Manage ProtecTIER Services (...)
----------------------------------------------------------------------
1) Display services status
2) Start all services
3) Stop all services
4) Stop ProtecTIER services only (including GFS)
B) Back
E) Exit
----------------------------------------------------------------------
>>> Your choice?
10. Wait until the display shows all the services have started.
11. The stand-alone TS7650 server power on process is complete.
Procedure
1. Using the frame's UPO switch or the customer's circuit breaker, restore power
to the frame (or frames).
2. Power on all expansion units.
3. Power on all controllers.
4. Press the power button on the TSSC.
5. Log in to the TSSC with the username service and password service.
6. On Node A, press the power button on the server's front panel.
Approximately 15 minutes after the server is powered-on, following the
reboot, you can log in to the ProtecTIER Service Menu on Node A.
E) Exit
----------------------------------------------------------------------
>>> Your choice?
9. From the Manage ProtecTIER services menu, select Display services status.
----------------------------------------------------------------------
ProtecTIER Service Menu running on rasddx
Manage ProtecTIER Services (...)
----------------------------------------------------------------------
1) Display services status
2) Start all services
3) Stop all services
4) Stop ProtecTIER services only (including GFS)
B) Back
E) Exit
----------------------------------------------------------------------
>>> Your choice?
10. Wait until the display shows that all services have started on Node A before
powering-on Node B.
To turn off all components in the TS7650 Appliance or the TS7650G in both
stand-alone and clustered installations, perform the following steps:
1. Ensure that no drive fault light-emitting diodes (LEDs) are lit on all disk
components. If any drive fault LEDs are lit, correct the fault before continuing.
2. Ask the customer to suspend I/O to the server in the TS7650 Appliance or the
TS7650G in both stand-alone and clustered installations. Indicate below when
the customer reports that I/O is suspended.
This action dismounts the file system and shuts down the server to standby
mode.
4. Ensure that the server has shut down to standby mode by verifying that the
power-on LED on the operator information panel on the front of the server is
flashing. See Figure 108 on page 245.
1 2
ts760025
Ethernet port activity LEDs
Power-control button cover Locator button/locator LED
5. If in a two-node environment, repeat steps 3 on page 244 and 4 on page 244 for
the other server in the node.
6. Turn off both power supply switches on the back of the disk controller or
controllers.
7. Turn off both power switches on the back of all disk expansion modules
connected to the disk controller or controllers.
8. Verify that no external systems are connected to frame power supplies.
9. Set the Emergency Power Off (EPO) switch on the front of the frame or rack to
the OFF ( V ) position.
This section provides tasks for turning on all components of the TS7650 Appliance
or the TS7650G in both stand-alone and clustered installations.
Procedure
To turn on all components in the TS7650 Appliance or the TS7650G, perform the
following steps:
1. Verify the following:
v All data and power cables are connected to their components
v All power cables are connected to a correct power outlet
v Both power switches on the back of all disk controllers and disk expansion
modules are turned off
v All disk drive modules are located securely in place in the drive slots of the
disk controller and disk expansion module enclosures
2. If necessary, set the emergency power off (EPO) switch on the front of the rack
or frame to the ON ( | ) position.
Note: 19-inch racks (Gateway) do not have an EPO switch. Set circuit breakers
to ON for all power distribution units in use in the rack.
Attention: You must turn on the disk expansion modules before turning on
the disk controllers. The disk controllers might not recognize the correct
configuration if the disk expansion modules are turned on after the disk
controllers.
3. Turn on both power switches on the back of all disk expansion modules.
4. Ensure that no drive fault light-emitting diodes (LEDs) or any other amber
LEDs on any disk expansion module are lit.
Chapter 11. Power off and power on procedures 245
5. Wait at least 30 seconds after starting the disk expansion modules before
performing step 6.
6. Turn on both power switches on the back of all disk controllers.
7. Ensure that no drive fault LEDs or any other amber LEDs on any disk
controller are lit.
8. Press the white recessed power-control button on the operator information
panel on the front of all servers. See Figure 109.
1 2
ts760025
Ethernet port activity LEDs
Power-control button cover Locator button/locator LED
9. Open a secure shell (SSH) session with the server using one of the following
methods:
Using Terminal on the system console:
a. Right-click the system console desktop and then click Terminal.
b. At the Terminal command prompt, type the following:
ssh ptadmin@xxx.xxx.xxx.xxx
Note: If you do not have the ProtecTIER Manager icon on your Linux
desktop, navigate to the directory where you installed ProtecTIER Manager
(the default location is /opt/IBM/PTManager) and double-click the
ProtecTIER Manager icon.
15. Turn system control back over to the customer.
This topic describes how to shut down the TS7650 Appliance or TS7650G in case of
an emergency.
Procedure
1. Stop all activity.
2. Check all of the light-emitting diodes (LEDs). Make a note of any Fault LEDs
that are lit so that you can correct the problem when you turn on the power.
3. Turn off the emergency power off (EPO) switch and back-end disk repository.
Important: 19-inch racks (Gateway) do not have an EPO switch. You can only
power off by toggling the circuit breakers on the power distribution units.
4. Unplug the power cables from the disk controller and disk expansion module.
In an optimal state, the disk controllers and disk expansion modules should
recover automatically from an unexpected shutdown and unattended simultaneous
restoration of power to system components. After power is restored, call IBM
support if any of the following conditions occur:
v The disk component logical drives and arrays are not displayed in Storage
Manager.
v The disk component logical drives and arrays do not come online.
v The disk component logical drives and arrays seem to be degraded.
Note: Always wait at least 30 seconds between the time that you turn off a power
switch and the time that you turn on the power again.
Procedure
1. Check the ProtecTIER Manager alerts and clear any open alerts. To check the
ProtecTIER Manager for alerts, select the node on which you want to check
alerts and click on the Alerts button. The Alerts window is displayed. Take
appropriate action to clear any open alerts or contact IBM Software Support for
assistance. Repeat for the second node, if applicable.
2. Check the remote supervisor adapter (RSA) log (or the IMM log, for 3958 DD4
servers) for problems and clear the log. You can check the RSA log from a
browser or a Telnet session.
To check the RSA Adapter/IMM Log from a browser, complete the following
steps:
a. At the TS3000 System Console (TSSC), log in with user ID service and
password service.
b. Right-click on the blue desktop and from the menu, select Browser
Functions > Launch Browser.
c. In the URL field, enter the TSSC IP address of the RSA adapter, which is the
assigned frame number ID+1. Example: If the frame number in the attached
systems list is 10 (172.31.1.10), then the RSA IP address is 172.31.1.11.
d. When prompted for a username and password, enter the username USERID
and the password PASSW0RD (the 0 is a zero).
e. Select the timeout value to 10 minutes and click Continue.
f. Under Monitors, select Event Log.
g. Review the log for error entries (a red E), or warnings (a yellow W) and
verify that the notifications are valid.
h. After the problems have been identified, if the fix has been implemented,
scroll to the bottom of the log and choose either the option to save the log
as a text file (recommended in case it is needed later), or to clear the log. If
you save the log as a text file, clear the log after saving.
i. After the log has been cleared, from the left navigation pane, select Log Off.
Close the browser window.
j. Continue to the next step.
To check the RSA/IMM log from a Telnet session, complete the following
steps:
a. Telnet to the IP Address of the RSA Adapter from the TSSC or a server
node. The IP addresses are 172.31.1.x1 for Node A or 172.31.1.x6 for Node B,
where x is the Frame/Subsystem number.
ap1tssc:~ # telnet 172.31.1.11
Trying 172.31.1.1...
Connected to 172.31.1.1.
Escape character is ’^]’.
username: h
c. Enter the command readlog -f. The first five entries in the list of logged
errors are displayed.
SN# K116191D0S0> readlog -f
1 I SERVPROC 07/22/10 00:43:28 Remote Login Suc
cessful. Login ID:’’USERID’ CLI authenticated from 172.31.1.1 (Telnet).’
2 I SERVPROC 07/22/10 00:42:11 Remote Login Suc
cessful. Login ID:’’USERID’ CLI authenticated from 172.31.1.1 (Telnet).’
3 I SERVPROC 07/21/10 09:53:48 Remote Login Suc
cessful. Login ID:’’Service’ CLI authenticated from 172.31.1.10 (Telnet).’
4 I SERVPROC 07/21/10 09:53:46 Remote Login Suc
cessful. Login ID:’’Service’ CLI authenticated from 172.31.1.10 (Telnet).’
5 I SERVPROC 07/21/10 09:53:44 Remote Login Suc
cessful. Login ID:’’Service’ CLI authenticated from 172.31.1.10 (Telnet).’
SN# K116191D0S0> h
d. To display the next five entries, type the command readlog. Repeat until
you reach the end of the list.
e. If all displayed errors were resolved, clear the log by typing the command
clearlog.
SN# K116191D0S0> clearlog
ok
SN# K116191D0S0> readlog
(There are no more entries in the event log.)
SN# K116191D0S0> h
3. Use the ProtecTIER Service menu (see “ProtecTIER Service menu” on page 16)
to check for open problems in each node. Close all open problems. To check for
open problems, select Problem management from the ProtecTIER Service menu
and then select List open problems from the submenu. To close (cancel) an
open problem select Problem management from the ProtecTIER Service menu
and then select Cancel problem from the submenu. See below for the detailed
procedure:
a. If this is an Appliance and a keyboard, video, mouse (KVM) is available,
press the PrtSC key and select the server and press Enter. If this is a
stand-alone server, connect a universal serial bus (USB) keyboard and
monitor at the rear of the server.
b. To check for open problems, select Problem management from the
ProtecTIER Service menu and then select List open problems from the
submenu.
c. When asked if you want to continue, type y.
d. Review any listed open problem records and verify that they have been
resolved.
e. To cancel a problem, record the 12-digit problem ID number. Example:
ProblemRecord 2: problemID=<1271435451803> From the ProtecTIER
Service menu, select Problem management > Cancel problem. When
prompted to continue, type y and type the 12-digit problem ID number to
cancel the record. This can take several moments to complete. When
prompted with a question if you are sure you want to cancel the specified
problem, type y and press Enter.
Note: If the message "RAS Service Mode Started" displays, ignore it at this
time. Service Mode is disabled in the final steps of this procedure.
f. Continue to the next step.
6. Verify that both nodes are online and operational and that the filesystems are
mounted:
v For ProtecTIER V3.1 or later, select Manage ProtecTIER Services > Display
Services Status from the ProtecTIER Service menu (see “ProtecTIER Service
menu” on page 16). Check the status of the vtfd, cman, clvmd, and gfs
services.
v For ProtecTIER V2.5 or earlier, from the command line of the server, enter
the following commands to verify that the services have all been started
(indicated by a running state):
service vtfd status
service cman status
service clvmd status
Also, verify that the filesystems are mounted by running the following
command (this is variable, but must show a list of filesystems that are
mounted):
service gfs status
The output is similar to the examples shown in Figure 110 on page 252 and
Figure 111 on page 252.
/mnt/vg0-lv_vg0
/mnt/vg1-lv_vg1
/mnt/vg2-lv_vg2
/mnt/vg3-lv_vg3
/mnt/vg4-lv_vg4
/mnt/vg5-lv_vg5
/mnt/vg6-lv_vg6
/mnt/vg7-lv_vg7
/mnt/vg8-lv_vg8
/mnt/vg9-lv_vg9
/mnt/vg10-lv_vg10
/mnt/vg11-lv_vg11
7. After verifying all the services are operational and running, check the
ProtecTIER Manager and verify that the node or nodes (if clustered) are
accessible and online. This can be checked from the TSSC.
Note: It is assumed that all nodes have already been established on the TSSC
ProtecTIER Manager interface. If the nodes have not been established, they
must be added using the customer-assigned IP address of each node, and
added using the ProtecTIER Manager Add Node option:
a. At the TSSC, if not already logged in, log in with user ID service and
password service.
b. Right-click on the blue desktop and from the menu, select Browser
Functions > ProtecTIER Manager Functions > Launch GUI.
c. In the left navigation pane, select the tab for Nodes. Select the server. At the
login for the node, click the login button. Type the user ID ptoper and the
password ptoper. Click OK.
d. Once logged into the node, from the left navigation pane, select the tab for
Systems. Verify that the node (or nodes if clustered) show a Status of OK,
the Management Service shows as Online and the VT shows as Online.
e. If the status is online and OK, close the ProtecTIER Manager. Right-click on
the TSSC blue desktop and select Logout.
What to do next
In order to use the RSA to connect to the servers, you must use the TS3000 System
Console (TSSC) (code level 5.9.x or higher required), or your IBM service laptop. If
you use your laptop, you might also need an extension cord to reach a standard
power outlet, or a 240 V to 110 V step-down voltage converter to allow the service
laptop to run off of the power distribution unit (PDU).
Important: By default, the RSA ports on both servers in a cluster have the same IP
address. This causes a remote supervisor access IP address conflict when
connecting to the RSA card through the TSSC network for the very first time. To
avoid the conflict, you must change the IP address on one of the RSA ports so that
its IP address is unique or unplug the Ethernet cable to the remote supervisor
access to which you are not connecting. To do so, perform steps 2 through 11 on
page 257. Change the RSA port IP address on Server B in the cluster. Do not
change the default RSA port IP address on Server A. New remote supervisor access
field replaceable unit (FRU) replacement's default IP address is 192.168.70.125.
Procedure
1. If you are using the TSSC, skip ahead to step 12 on page 257. Otherwise,
continue with step 2
2. Connect an Ethernet cable from the Ethernet port on your laptop to the SYS
MGMT (remote supervisor access port) on a stand-alone server, or on Server B
in a cluster. See Figure 112 on page 256.
3. In a cluster, disconnect the Ethernet cable from the RSA port on Server A.
4. On the laptop, click Start > Control Panel.
5. Double-click Network Connections.
6. Right-click Local Area Connection > Properties. See Figure 113.
7. From the list, select Internet Protocol (TCP/IP), then click Properties. See
Figure 114 on page 257.
Note: The username and password are case sensitive and must be entered in
all uppercase.
The Remote Supervisor Adapter II Refresh 1 Welcome window is displayed.
15. In the Remote Supervisor Adapter II Refresh 1 Welcome window:
a. Use the Inactive session timeout value menu to specify a reasonable
amount of time that keeps your session active until you can complete your
task. Do not select the No Timeout setting, which prevents anyone else
from logging into the RSA until you have logged out.
b. Click Continue. See Figure 115 on page 258.
Appendix A. Logging into the 3958 DD1/3958 DD3 server through the RSA connection 257
ts760200
Figure 115. Remote Supervisor Adapter II Refresh 1 Welcome window
Results
What to do next
Procedure
1. In the Remote Supervisor Adapter II Refresh 1 window, click Remote Control
in the left-hand menu under the IBM logo. See Figure 116.
Figure 116. Remote Supervisor Adapter II Refresh 1 window: Remote Control link
2. In the Remote Control window, click Start Remote Control in Single User
Mode. See Figure 116. The advanced system management (ASM) Remote
Control window is displayed.
Note: If the browser on the TS3000 System Console (TSSC) or your laptop is
configured to block pop-ups, temporarily disable this function. Accept any
Java™ versions that are displayed, as well as any Windows security prompts
displayed as a result of trying to open this window.
What to do next
Appendix A. Logging into the 3958 DD1/3958 DD3 server through the RSA connection 259
Configuring remote supervisor adapter keystrokes
Use the following procedure to specify unique remote supervisor adapter (RSA)
keystroke combinations for use during Reliability, Availability, and Serviceability
(RAS) package configuration.
Procedure
1. In the asynchronous transfer mode (ASM) Remote Control window:
a. Drag the Video Speed (bps) slider to the 5m setting (all the way to the
right) to maximize video speed. You might have to adjust this setting later
so characters show up on the screen when typed.
b. Click the Windows Maximize button to maximize the size of the ASM
Remote Control window. See Figure 117.
ts760196
Figure 117. Maximize video speed
2. In the ASM Remote Control window, click the Preferences link on the far right
side of the Windows Terminal Services toolbar. See Figure 118.
The Remote Console Preferences window opens.
ts760197
Figure 118. ASM Remote Control preferences Window
In order to use the IMM to connect to the servers, you must use the TS3000 System
Console (TSSC) (code level 5.9.x or higher required), or your IBM service laptop. If
you use your laptop, you might also need an extension cord to reach a standard
power outlet, or a 240 V to 110 V step-down voltage converter to allow the service
laptop to run off of the power distribution unit (PDU).
Procedure
1. If you are using the TSSC, skip ahead to step 13 on page 264. Otherwise,
continue with step 2.
2. Connect an Ethernet cable from the Ethernet port on your laptop to the SYS
MGMT (remote supervisor access port) on a stand-alone server, or on Server B
in a cluster. See Figure 120.
ProtecTIER Server
ts760483
3. In a cluster, disconnect the Ethernet cable from the IMM port on Server A.
4. On the laptop, click Start > Control Panel.
5. Double-click Network Connections.
6. Right-click Local Area Connection > Properties. See Figure 121 on page 263.
7. From the list, select Internet Protocol (TCP/IP), then click Properties. See
Figure 122.
Note: If you are using the TSSC browser to address the IMM at the default IP
address 192.168.70.125, you must set up a temporary alias address of
192.168.70.33 in the TSSC in order to access the IMM.
15. Press Enter.
16. In the Login window, type the user ID and password.
Note: Do not select the No Timeout setting. It prevents anyone else from
logging into the IMM until you have logged out.
b. Click Continue. See Figure 123 on page 265.
Appendix B. Logging into the 3958 DD4, DD5, and AP1 servers through the IMM web interface 265
Note: Do not select the No Timeout setting. It prevents anyone else from
logging into the IMM until you have logged out.
30. Click Continue.
What to do next
Go to “Enabling remote control.” (This might not be necessary for the IMM.)
The TSSC does not currently support access to remote control. To access remote
control, use one of the following browsers:
v Microsoft Internet Explorer
v Mozilla Firefox
v Google Chrome
Procedure
1. In the Integrated Management Module window, click Remote Control in the
left-hand menu under the IBM logo. See Figure 124.
ts760488
What to do next
Important: This procedure enables you to update IMM, UEFI, FPGA and DSA
firmware via the IMM web interface. It should only be used in an emergency,
when firmware cannot be loaded normally for inband flashing by the operating
system. Normally, firmware updates are performed during the autorun phase of
ProtecTIER code load or by selecting the Update Firmware menu option.
Note: The IMM, UEFI, FPGA and DSA firmware files can be downloaded from the
IBM website.
1. Go to IBM Fix Central.
2. Click the IBM Sign in link in the masthead and log in using your IBM user ID.
If you do not have an IBM user ID, follow the procedures to create one from
any IBM web page.
3. In the Select product area, select System x from the Product Group menu.
4. Select System x3850 X5 from the Product type menu.
5. Select 7145 from the Product menu.
6. Select All (or your specific operating system) from the Operating system menu.
7. Click Continue to display the matrix of downloadable firmware files.
8. Scroll to the IMM area or other firmware area, select the link for the firmware
update, and save the update file to your laptop or to the TSSC.
9. The IMM can use either the EXE type or BIN type of file to perform the
update.
Procedure
1. Connect an Ethernet cable from the Ethernet port on your laptop to the SYS
MGMT port on the 3958 DD4 or 3958 DD5 server (see Figure 120 on page
262).
2. To facilitate opening of a web browser on your laptop, perform preparatory
steps 4 on page 262 through 14 on page 264. Then return to this procedure.
Note: If you are using the TSSC, perform only steps 13 on page 264 and 14 on
page 264. Then return to this procedure.
You have already opened the web browser with the appropriate IP address of
the IMM Web interface to which you have to connect.
Appendix B. Logging into the 3958 DD4, DD5, and AP1 servers through the IMM web interface 267
v If the user name and password have been changed, obtain them from the
system administrator.
v Otherwise, type the initial values for the IMM:
Time (mm:ss) Checkpoint(s) Major System Components Successfully Initialized Video Display Comment
00:00 Push power button to turn on system System power up sequence starts with code format 0:x or 1:x
(digits separated by dots)
00:02 0:1 .. 0:8 Turn on 12V, Turn on 5V, 3.3V volt regs Blank .
- System processors begin executing Power On Self Test (POST) with checkpoint format yy (no dots)
00:06 0C..0d..0E..10 Checksumming BIOS flash ROM, copy flash ROM to RAM, execute from RAM Blank .
00:15 07 Check memory Checking system memory for errors, please wait ... .
00:21 d9..dA..dC Internal PCI devices Initializing PCI devices, please wait ... .
00:56 82 Report POST errors, including previous memory error Clear screen .
01:20 3C Prompt Logo: Press F1 for Setup ... Prompt to press F1, F2, or F12
01:30 1E PCIe adapters’ ROM BIOS Adapter ROM BIOS messages Time here depends on number and type of
installed PCIe adapters
00:00 Push power button to turn off system. System power down sequence starts with code format 2:x
(digits separated by dots)
00:05 2:0..2:7 Turn off processors, chipset, 3.3V and 5V volt regs, 12V Last video display when button
was pressed. .
Note: As you perform the steps to upgrade the DS4700 storage system firmware,
note that the IBM System Storage ProtecTIER Manager V3.2 DVD provides controller
firmware, NVSRAM firmware, and ESM firmware. DS3950 firmware must be
copied to the /home/service/offload directory in the TSSC from the ProtecTIER
code in the 3958 AP1 (3958 DD4 server). The steps to upgrade DS3950 firmware
are identical to the steps in this procedure for the DS4700, except that the DS3950
firmware files are provided to the TSSC from the ProtecTIER V3.1 software
installed on the 3958 AP1.
v DS3950 Controller firmware is in the /opt/dtc/IBM_DS4000/FIRMWARE/TS7650
directory.
v DS3950 NVSRAM firmware is in the /opt/dtc/IBM_DS4000/NVSRAM/TS7650
directory.
v DS3950 ESM (Enclosure) firmware is in the /opt/dtc/IBM_DS4000/ESM_HDD/
EXP810 directory.
Procedure
1. Verify that the firmware is at the correct code level.
a. If you are not already logged in to the TSSC, log in with user ID service
and password service.
b. Right-click the blue desktop, and from the IBM TS3000 System Console
menu, select Browser Functions > SM GUI Functions > Launch GUI.
Note: The storage system firmware is on the IBM System Storage ProtecTIER
Manager V3.2 DVD under a 10.60.x5.17 for Windows 32-bit directory. It is used
on the TSSC Linux version as well, and the files are the same.
4. Upgrade the DS Storage Manager application to version 10.60.x5.16 on the
TSSC.
a. Right-click the blue desktop of the TSSC and select Browser Functions >
Storage Manager GUI Functions > Install GUI. The TSSC CD tray opens
automatically.
b. Insert the TS3000 System Console Storage Manager CD, version 10.60.x5.16 (in
the TSSC ship group) into the TSSC and follow the instructions on the
screen to complete the installation.
5. Locate the firmware and NVSRAM files using the DS Storage Manager.
a. Launch the DS Storage Manager from the TSSC by right-clicking the TSSC
and selecting Browser Functions > Storage Manager GUI Functions >
Launch GUI.
b. Select the Storage Subsystem to be upgraded (identified in step 1 on page
271) by right-clicking the storage and selecting Manage Storage Subsystem.
v If prompted to enter a password, enter: ibm2serv.
v If prompted to change or set a password, click No.
v If prompted with a Synchronize Controller Clocks window, click Cancel.
c. In the Subsystem Management window, select the Advanced button in the
top menu bar.
Appendix D. Manually upgrading the DS4700 storage system firmware (ProtecTIER version 2.4 and later) 273
d. Select Maintenance > Download > Controller Firmware. The Download
Controller Firmware window is displayed. The DS Storage Manager checks
the health of the storage subsystem. This check might take several minutes.
Any problem associated with the storage is identified. You cannot continue
with the upgrade until the problem is resolved. If you cannot isolate or
resolve the error condition, contact your next level of support.
e. In the Controller Firmware panel, click the Select File button. A new
window is displayed.
f. Type the path of the controller firmware and press Enter. The path might be
similar to the following: /home/service/offload/DS4k Storage manager
v10.60.x5.17 for Windows 32-bit/DS_FW.
g. In the same window, click to highlight the FW.DS4700_07602800.dlp file
from the Files section.
Note: The storage system firmware is on the IBM System Storage ProtecTIER
Manager V3.2 DVD under a 10.60.x5.17 for Windows 32-bit directory, DS4k
Storage Manager v10.60.x5.17 for Windows 32-bit. It is used on the TSSC
Linux version as well, and the files are the same.
h. Click the OK button at the bottom of the window. The Download
Controller Firmware window is displayed.
i. In the Download Controller Firmware window, click the Transfer the
NVSRAM file with controller firmware button.
j. Click the Select File button in the NVSRAM section. A new window is
displayed.
k. Type the path of the NVSRAM and press Enter. The path might be similar
to the following: /home/service/offload/DS4k Storage manager
v10.60.x5.17 for Windows 32-bit/DS_FW.
l. In the same window, select the N1814D47R1060V04.dlp file from the Files
section.
Note: The storage system firmware is on the IBM System Storage ProtecTIER
Manager V3.2 DVD under a 10.60.x5.17 for Windows 32-bit directory, DS4k
Storage Manager v10.60.x5.17 for Windows 32-bit. It is used on the TSSC
Linux version as well, and the files are the same.
m. Click the OK button at the bottom of the window. The Download
Controller Firmware window is displayed.
6. Upgrade the firmware and NVSRAM using the DS Storage Manager. In the
Download Controller Firmware window, click the Transfer button.
7. When you are prompted with a version compatibility window to confirm the
download and you are asked Do you want to continue?, click Yes. The transfer
and activation process takes 15-20 minutes to complete.
Note: During this activation, both A and B controllers are upgraded. When the
activation is completed, a New Firmware Detected window is displayed. Click
OK.
8. Upgrade the ESM level.
a. In the Manage Storage Subsystem window, click Advanced button in the
top menu bar.
b. Select Maintenance > Download > ESM Firmware. The Download
Environmental (ESM) Card Firmware window is displayed.
c. In the Drive Enclosure section, select the enclosure to be upgraded.
Note: The storage system firmware is on the IBM System Storage ProtecTIER
Manager V3.2 DVD under a 10.60.x5.17 for Windows 32-bit directory, DS4k
Storage Manager v10.60.x5.17 for Windows 32-bit. It is used on the TSSC
Linux version as well, and the files are the same.
g. Click the OK button at the bottom of the window. The Download
Environmental (ESM) Card Firmware window is displayed.
h. Click the Start button in the Download Environmental Card Firmware
window to start the ESM download process. A Confirm Download window
is displayed.
i. Type yes and click OK to continue. This process takes 5-7 minutes for each
enclosure. The Status column displays Transferring until the process is
complete. The enclosures can only be updated one at a time.
j. Repeat substeps 8c on page 274 through 8h for each enclosure.
Results
What to do next
Close any open DS Storage Manager windows. Right-click the blue desktop of the
TSSC and select Logout.
Appendix D. Manually upgrading the DS4700 storage system firmware (ProtecTIER version 2.4 and later) 275
276 TS7650 and TS7650G Problem Determination and Service Guide
Appendix E. ProtecTIER Network Performance Validation
Utility
The objective of the pt_net_perf_util is to test maximal network performance in
order to discover potential performance bottlenecks. On VTL systems, the utility
tests maximal replication performance between two future ProtecTIER VTL
repositories by emulating the network usage patterns of the ProtecTIER Native
Replication component. On OpenStorage systems, the utility tests network
performance between an OpenStorage host and a ProtecTIER server to identify
possible bottlenecks in a backup and restore scenario.
The pt_net_perf_util utility and the iperf tool are part of the ProtecTIER software
installation. To test the replication performance, use the following tool:
v iperf 2.0.4
/usr/local/bin/iperf
This utility has two modes of operation, client and server. The server has to be
started before the client. Before running the utility, shut down all other programs
on both the client and server ProtecTIER systems. The client is the ProtecTIER
system that transmits the test data and the server is the ProtecTIER system that
receives the data (also known as the target server). Based on the data sent by the
client and received by the server, the script outputs key network parameter values
which indicate certain attributes of the network. The goal of these tests is to
benchmark the throughput of the network. The most important benchmark is the
direction that replication actually takes place, that is, test the target as the server
since the flow of data will be to that server from the client. However, it is also
important to also test the reverse direction to measure the bandwidth performance
during disaster recovery failback. Network bandwidth is not always the same in
both directions.
For the utility to work, the replication ports must be configured on the source and
the destination servers, and there must be a static route defined from source to
destination and from destination to source.
In the following procedure, the goal is to test network performance between two
machines on a WAN, server1 and server2. Each test runs for five minutes. Since
there are five tests, the process takes a total of 25 minutes.
Repeat these tests for each server and for each replication port on the server (two
tests total per server, one for each replication port).
E) Exit
----------------------------------------------------------------------
>>> Your choice?
B) Back
E) Exit
----------------------------------------------------------------------
>>> Your choice?
Note: You can also stop the VTFD services from the command line. From the
command line on the server, enter the following command:
service vtfd stop <enter>
3. Wait for the services to stop. A message similar to the following displays:
Shutting down vtfd: [ OK ]
Note: In the sample output below the test ran for only 5 seconds instead of
300.
*** Latency
PING 9.5.53.33 (9.5.53.33) 56(84) bytes of data.
Done.
See the next section for information about interpreting the results of the tests.
The utility performs five foreground tests (Tests 1-5 below), and one background
test (Test 6 below). The example outputs shown are from the client side. Each of
the first five tests ran for 300 seconds (-t 300), while the last test monitored TCP
performance during that time.
Test 1: Latency
This test checks the nominal network link latency and packet loss. Example result:
*** Latency
PING 10.0.13.194 (10.0.13.194) 56(84) bytes of data.
Appendix F. ProtecTIER Replication Network Performance Validation Utility for VTL Systems 281
This test checks maximal TCP throughput of a single data stream with default TCP
settings. Example result:
*** Throughput - Default TCP
[ 3] 0.0-120.1 sec 2.41 GBytes 173 Mbits/sec
This test checks maximal TCP throughput of a single data stream with a 1 MB
(0.95 MiB) send buffer. Example result:
*** Throughput - 1 TCP stream(s), 1MB send buffer
[ 3] 0.0-120.0 sec 2.51 GBytes 180 Mbits/sec
Example result:
*** Throughput - 16 TCP stream(s), 1MB send buffer
[SUM] 0.0-121.4 sec 5.91 GBytes 418 Mbits/sec
Example result:
*** Throughput - 127 TCP stream(s), 1MB send buffer
[SUM] 0.0-126.1 sec 8.08 GBytes 550 Mbits/sec
Example result:
Number of TCP segments sent: 1619061
Number of TCP retransmissions detected: 201038 (12%)
What to do next
You might want to run these tests again to test the reverse throughput in the
network. To run the tests in reverse, change server1 to the client and server2 to the
server and repeat the procedures.
Appendix F. ProtecTIER Replication Network Performance Validation Utility for VTL Systems 283
284 TS7650 and TS7650G Problem Determination and Service Guide
Appendix G. ProtecTIER Network Performance Validation
Utility for OpenStorage Systems
The objective of the OpenStorage pt_net_perf_util utility is to test network
performance between an OpenStorage host and a ProtecTIER server. This utility
can not predict backup and restore performance, but it can discover performance
bottlenecks.
The pt_net_perf_util utility and the iperf tool are part of the ProtecTIER software
installation. The location of the utility varies according to the platform:
v Windows plug-in: %PROGRAMFILES%\IBM\ost_plugin_tools
v AIX plug-in: /opt/IBM/ost_plugin_tools
v ProtecTIER server: /opp/dtc/app/sbin
This utility has two modes of operation, client and server. The server has to be
started before the client. Before running the utility, shut down all other programs
on both the client and server systems. The client is the system that transmits the
test data and the server is the system that receives the data (also known as the
target server). Based on the data sent by the client and received by the server, the
script outputs key network parameter values which indicate certain attributes of
the network. The goal of these tests is to benchmark the throughput of the
network. It is important to test both OpenStorage host-to-ProtecTIER server and
ProtecTIER server-to-OpenStorage host directions to measure the bandwidth
performance during both backup and restore activities. Network bandwidth is not
always the same in both directions. In addition, if an OpenStorage host or a
ProtecTIER server uses more than one IP address for OpenStorage operation, it is
essential to test each IP address separately.
For the utility to work, the OpenStorage ports must be configured on the source
and the destination servers, and there must be a static route defined from source to
destination and from destination to source.
In the following procedure, the goal is to test network performance between two
machines server1 and server2. Each test runs for five minutes. Since there are five
tests, the process takes a total of 25 minutes.
Repeat these tests for each server and for each OpenStorage port on the server
(between two and six tests total per server, one for each OpenStorage port).
E) Exit
----------------------------------------------------------------------
>>> Your choice?
Note: You can also stop the VTFD services from the command line. From the
command line on the server, enter the following command:
service vtfd stop <enter>
3. Wait for the services to stop. A message similar to the following displays:
Shutting down vtfd: [ OK ]
Note: In the sample output below the test ran for only 5 seconds instead of
300.
The utility performs five foreground tests (Tests 1-5 below), and one background
test (Test 6 below). The example outputs are from the client side. Each of the first
five tests ran for 300 seconds (-t 300), while the last test monitored TCP
performance during that time.
Test 1: Latency
This test checks the nominal network link latency and packet loss. Example result:
*** Latency
PING 10.0.13.194 (10.0.13.194) 56(84) bytes of data.
This test checks maximal TCP throughput using a single data stream with default
TCP settings. Example result:
*** Throughput - Default TCP
[ 3] 0.0-120.1 sec 2.41 GBytes 173 Mbits/sec
Appendix G. ProtecTIER Network Performance Validation Utility for OpenStorage Systems 287
This test checks maximal TCP throughput of a single data stream with a 1 MB
(0.95 MiB) send buffer. Example result:
*** Throughput - 1 TCP stream(s), 1MB send buffer
[ 3] 0.0-120.0 sec 2.51 GBytes 180 Mbits/sec
Example result:
*** Throughput - 16 TCP stream(s), 1MB send buffer
[SUM] 0.0-121.4 sec 5.91 GBytes 418 Mbits/sec
Example result:
*** Throughput - 127 TCP stream(s), 1MB send buffer
[SUM] 0.0-126.1 sec 8.08 GBytes 550 Mbits/sec
Example result:
Number of TCP segments sent: 1619061
Number of TCP retransmissions detected: 201038 (12%)
What to do next
You might want to run these tests again to test the reverse throughput in the
network. To run the tests in reverse, change server1 to the client and server2 to the
server and repeat the procedures.
Appendix G. ProtecTIER Network Performance Validation Utility for OpenStorage Systems 289
290 TS7650 and TS7650G Problem Determination and Service Guide
Appendix H. Checking the Western Telematic Inc. power ports
This topic provides a method to access the Western Telematic Inc. (WTI) switch to
view power port status and to power on or power off the power ports. This
method must be performed by an IBM service representative.
Important: The commands within this topic must be initiated from only one
server. Ports 1 and 5 supply power to Node A. Port 1 is the upper power supply of
Node A. Ports 2 and 6 supply power to Node B. Port 2 is the upper power supply
of Node B.
1 3 5 7 9 11 13 15
Ethernet switch 2 2 4 6 8 10 12 14 16
P
1 3 5 7 9 11 13 15
Ethernet switch 1 P
2 4 6 8 10 12 14 16
WTI
Network
Power Switch 3958 Server B
P
1 1 1 1 1 1 A P
2 2 2 2 2 2 U
E1
R S
P E2 P
V
1 2 3 4 5 6 7
1
5
3958 Server A
6
7 1 1 1 1 1 1 A P
8 2 2 2 2 2 2 U
E1
R S
E2 P
E V
ts760241
1 2 3 4 5 6 7
Figure 125. Dual node cluster cabling for 3958 AP1 and 3958 DD3
Figure 126 on page 292 shows the network power switch connections.
Empty (1u)
Empty (1u)
TJ8 P Empty (1u)
BJ8 P Empty (1u)
Empty (1u)
1 Empty (1u)
Empty (1u)
2
Empty (1u)
3 Empty (1u) Top Power
4 Empty (1u) Distribution
Empty (1u) Unit (PDU)
5 Empty (1u)
6 Empty (1u)
7 Empty (1u)
8 Empty (1u)
Empty (1u)
Empty (1u)
E Empty (1u)
Empty (1u)
Empty (1u)
BJ5 TSSC (1u) TJ5
BJ4 TSSC Network Switch & KVM (1u) TJ4
3958 DD3
7233-PCS
3850 M2 (4u)
Bottom Power
Empty (1u) Distribution
Unit (PDU)
3958 DD3
7233-PCS
3850 M2 (4u)
Empty (1u)
1 GB Ethernet Switch (1u) TJ1
BJ1 1 GB Ethernet Switch (1u)
ts760234
Empty (1u)
Empty (1u)
1 3 5 7 9 11 13 15 17 19 21 23 25 26
Ethernet switch 1 P
2 4 6 8 10 12 14 16 18 20 22 24
WTI
3 2 7
network
power switch
ProtecTIER Server B
8
P
1 1 1 1 1 P1 P2
2 2 2 2 2
P
3
4 A1 A2 B V S E1 E3
A1
E2 E4
A2 1 2 3 4 5 6 7 8 9
A3 26 6 Customer Network
A4 Replication 1 27 Replication 2
1 Customer Network
B1
B2 ProtecTIER Server A
B3
1 1 1 1 1 P1 P2
B4
2 2 2 2 2
4 3
E
4 A1 A2 B V S E1 E3
E2 E4
1 2 3 4 5 6 7 8 9
ts760862
19 20
Replication 1 Replication 2
Figure 127. Ethernet cable connections in the VTL configuration of two clustered 3958
DD4/3958 DD5 servers
Note: The Network Power Switch is shown vertically at the side for ease of
understanding where the connections should go. Actual placement and orientation
of the switch might differ.
2. When you are prompted, log in by typing the password. The default WTI
password is password.
Enter Password: ********
3. Identify which port is turned off. In the example above, port 2 is currently
turned off and is the port that you need to turn on. Turn on the identified port
by entering the command /On [n] where n is the port number. Type y when
asked if you are sure.
IPS> /On 2
Plugs to be turned on:
Plug 2: internal_node2
Sure? (Y/N): y
Processing - please wait ...
Internet Power Switch v1.41h Site ID: (undefined)
Note: You can use the command /H (help) to bring up further details. Also
refer to the WTI Power Switch User's Guide on the IBM System Storage TS7650
with ProtecTIER Publications CD and the documentation on the WTI website.
Display Configuration
/H Display Help Screen /G View/Set General Parameters
/S Display Plug Status /P [n] View/Set Plug Parameters
/SN Display Network Status /C View/Set Serial Parameters
/N View/Set Network Parameters
Control /T View/set Telnet Parameters
/D Set Plugs to Default /W View/Set Web Server
/Boot <n> Boot Plug n /E Save Parameters
/On <n> Turn On Plug n /R Recall Parameters
/Off <n> Turn Off Plug n /DL Download Parameters to File
/X Exit/Disconnect
Utilities
+------------------------------------+ /I Reset Network Interface
| [n] = optional plug name or number | /U Upgrade Firmware
| <n> = required plug name or number |
| n+n or n n = plug n and plug n |
| n:n = plug n through plug n |
| * = all plugs |
| ,y = bypass "Sure? (y/n)" |
+------------------------------------+
4. Exit the telnet session and type y when asked if you are sure.
IPS> /X
5. If fencing occurs again and cannot be stopped, contact your next level of
support.
Procedure
1. Log on to the lower ProtecTIER server.
2. Create an IP alias by typing in the following command:
ifconfig eth1:1 192.168.168.170
3. From the ProtecTIER server command prompt, telnet to the WTI switch.
telnet 192.168.168.168
Enter the Password : password
4. At the IPS prompt, type /H to list the configuration commands.
5. Type /G to set the General Parameters menu.
6. Change the system password.
a. Type 1.
b. Type the new password.
c. Type the new password again to confirm.
d. Press the ESC key to exit the General Parameters menu.
7. Change the plug parameters for Node A power plug.
a. Display the plug numbers and names by typing /S at the IPS prompt.
b. Type /P 1 to change the parameters for plug 1.
c. Type 1 to select the name.
d. Type the plug name internal_node1.
e. Type 3 to select the Seq/Delay parameter.
f. Type 4 to set the delay to 5 seconds.
g. Press ESC to exit the Plug #1 parameters.
h. Type /P 5 to change the parameters for plug 5.
i. Type 1 to select the name.
j. Type the plug name internal_node1.
k. Type 3 to select the Seq/Delay parameter.
l. Type 4 to set the delay to 5 seconds.
m. Press the ESC key to exit the Plug #5 parameters.
8. Change the plug parameters for Node B power plug.
a. Display the plug numbers and names by typing /S at the IPS prompt.
b. Type /P 2 to change the parameters for plug 2.
c. Type 1 to select the name.
d. Type the plug name internal_node2.
e. Type 3 to select the Seq/Delay parameter.
The following wesites can also be useful references for Linux commands:
v Computer Hope
v UNIXguide
The following table lists all of the worldwide time zone codes and the associated
time zone descriptions. Additional information about the time zone is located in
the Comments column.
If you experience difficulties when you use the PDF files and want to request a
Web-based format for a publication, send your request to the following address:
In the request, be sure to include the publication number and title. When you send
information to IBM, you grant IBM a nonexclusive right to use or distribute the
information in any way it believes appropriate without incurring any obligation to
you.
The accessibility features in ProtecTIER Manager help persons with limited vision
use the ProtecTIER Manager installation wizard and software. After preparing the
ProtecTIER Manager workstation for accessibility, you can use Windows-based
screen-reader software and a digital voice synthesizer to hear what is displayed on
the screen.
Complete the following the procedures to download and install the Java-based
tools that are required to enable full screen-reader compatibility on the ProtecTIER
Manager workstation.
Install the Java Runtime Environment (JRE) first, and then install the Java Access
Bridge (JAB). Both of these tools must be installed before you install the
ProtecTIER Manager software.
The JRE includes the Java Virtual Machine (JVM). These tools are necessary for
your computer to run Java-based applications.
Procedure
1. Go to http://www.java.com. The Java website opens.
The java.com website auto-detects the operating system and Internet browser of
the computer you use when you access the site.
2. Click Free Java Download, and proceed as appropriate:
v If the Download Java for Windows page opens, go on to step 3
v If the Download Java for... page title contains the name of an operating
system other than Windows, do the following:
a. Click the See all downloads here link.
The list of available downloads, categorized by operating system,
displays.
b. In the Windows section, click Windows 7/XP/Vista/2000/2003/2008
Online.
3. Review the information provided, and then click Agree and Start Free
Download.
The download dialog box opens.
4. Follow the on-screen instructions to save the executable (.exe) installer file to
the hard disk drive.
5. After the download is complete, find the installer file on the hard disk drive
and write down the full path to the location of the file. For example:
C:\Program Files\Java\jre6\bin\java.exe. This path is needed during ProtecTIER
Manager installation.
6. Proceed as appropriate:
The Java Access Bridge (JAB) makes it possible for you to use Java-based screen
readers with the ProtecTIER Manager installation wizard and software.
Procedure
1. Go to: http://java.sun.com/products/jfc/accessibility.html.
The Java SE Desktop Accessibility page of the Oracle website opens.
2. Read the information provided, then click Access Bridge.
3. Scroll down to the Java Access Bridge for Microsoft Windows Operating
System x.x.x (where x.x.x is the most recent version listed) section. Click the
Download Java Access Bridge x.x.x link.
The Software License Agreement page opens.
4. Read the license agreement, and then select the I agree to the Software
License Agreement check box.
The Download Java Access Bridge for Windows Operating System x.x.x
page opens.
5. In the Required Files list, click the link to download the Access Bridge x.x.x,
accessbridge-x.x.x.exe file.
The download dialog box opens.
6. Follow the on-screen instructions to save the executable (.exe) installer file to
the hard disk drive.
7. When the download is complete, locate the installer file on the hard disk drive
and proceed as appropriate:
v If you downloaded the installer by using the ProtecTIER Manager
workstation on which you are installing the JAB, go on to step 8 on page
318.
v If you downloaded the installer by using a PC other than the applicable
ProtecTIER Manager workstation, do the following:
a. Copy the installer file onto a CD, flash memory drive, or other
removable media device.
b. Copy the installer file from the removable media device to the hard disk
drive of the ProtecTIER Manager workstation.
i When entering the commands, type them exactly as shown, including any
spaces or quotation marks. Any deviation in the procedure can cause the
installation to start in the non-accessible mode, or fail completely.
Procedure
1. If your workstation is configured to automatically open DVDs, temporarily
disable the Windows AutoPlay feature for the CD/DVD device. Use the
Windows Help or other Windows documentation for instructions, and then go
on to step 2.
2. Insert the IBM System Storage ProtecTIER Manager V3.2 DVD into the CD/DVD
drive of the ProtecTIER Manager workstation.
3. Access the command prompt on the ProtecTIER Manager workstation:
a. Click Start > Run....
The Run dialog box opens.
4. In the Open field, type: cmd and click Ok.
The command window opens.
5. Browse to the ProtecTIER Manager installation directory on the DVD. To do so:
a. At the command prompt, type: D: (where D: is the letter assigned to the
CD/DVD drive of the workstation) and press <enter>.
b. At the command prompt, list the contents of the DVD. Type: dir and press
<enter>.
c. Locate the name of the ProtecTIER Manager directory on the DVD. For
example: PT_Manager_V3.2.
d. At the command prompt, change to the ProtecTIER Manager directory.
Type: cd <directory name> and press <enter>. For example:
cd PT_Manager_V3.2 <enter>.
e. At the command prompt, change to the Windows directory. Type:
cd windows and press <enter>.
To make it possible for ProtecTIER Manager display in high contrast, you must
first enable the Use High Contrast option in Windows.
Procedure
1. On the ProtecTIER Manager workstation, go to Windows > Control Panel >
Accessibility Options.
The Accessibility Options dialog box opens.
2. Select the Display tab.
3. In the High Contrast area of the Display tab, select the Use High Contrast
check box, as shown in Figure 130 on page 320:
4. Click Settings.
The Settings for High Contrast dialog box displays, as shown in Figure 131 on
page 321:
Now that you have changed the contrast scheme in Windows, you must enable the
Support system settings option in ProtecTIER Manager.
Procedure
1. Launch ProtecTIER Manager:
ts760899
Figure 132. ProtecTIER Manager window
3. On the Appearance tab, select the Support system settings check box.
You are returned to the ProtecTIER Manager window.
4. Exit and restart ProtecTIER Manager so the contrast settings take effect:
a. On the ProtecTIER Manager toolbar, click: File > Exit.
The ProtecTIER Manager window closes.
b. Click: Start > All Programs > IBM > ProtecTIER Manager > IBM
ProtecTIER Manager.
When the ProtecTIER Manager window opens, the display reflects the
contrast change, as shown in: Figure 134.
ts760883
5. Proceed as appropriate:
v If you want to change one or more of the colors used in the ProtecTIER
Manager display, continue to “Customizing the color palette” on page 324.
v If you do not want to customize the color palette, resume your regular use of
ProtecTIER Manager.
Use this procedure to customize the color palette for ProtecTIER Manager to
improve visibility in the display, or to suit your personal preferences.
Procedure
1. If necessary, start ProtecTIER Manager as described in step 1 on page 321.
2. Open the Preferences dialog box, as described in 2 on page 322.
3. Scroll down (if necessary) to see the entire Color selection list, and then select
the color you want to change.
The Color selection dialog box opens, with the Swatches tab selected, as
shown in Figure 135:
ts760881
The color that is currently defined for your selection is shown in the Preview
pane.
4. Select a new color from the color palette.
ts760882
Figure 136. Default color versus custom color
9. Proceed as appropriate. Return to the task from which you were sent to these
instructions or resume your regular use of ProtecTIER Manager.
IBM may not offer the products, services, or features discussed in this document in
other countries. Consult your local IBM representative for information on the
products and services currently available in your area. Any reference to an IBM
product, program, or service is not intended to state or imply that only that IBM
product, program, or service may be used. Any functionally equivalent product,
program, or service that does not infringe any IBM intellectual property right may
be used instead. However, it is the user's responsibility to evaluate and verify the
operation of any non-IBM product, program, or service.
IBM may have patents or pending patent applications covering subject matter
described in this document. The furnishing of this document does not give you
any license to these patents. You can send license inquiries, in writing, to:
The following paragraph does not apply to the United Kingdom or any other
country where such provisions are inconsistent with local law:
INTERNATIONAL BUSINESS MACHINES CORPORATION PROVIDES THIS
PUBLICATIONS "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER
EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED
WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS
FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or
implied warranties in certain transactions, therefore, this statement may not apply
to you.
Any references in this information to non-IBM Web sites are provided for
convenience only and do not in any manner serve as an endorsement of those Web
sites. The materials at those Web sites are not part of the materials for this IBM
product and use of those Web sites is at your own risk.
IBM may use or distribute any of the information you supply in any way it
believes appropriate without incurring any obligation to you.
All statements regarding IBM's future direction or intent are subject to change or
withdrawal without notice, and represent goals and objectives only.
This information is for planning purposes only. The information herein is subject to
change before the products described become available.
This information contains examples of data and reports used in daily business
operations. To illustrate them as completely as possible, the examples include the
names of individuals, companies, brands, and products. All of these names are
fictitious and any similarity to the names and addresses used by an actual business
enterprise is entirely coincidental.
Trademarks
The following terms are trademarks of the International Business Machines
Corporation in the United States, other countries, or both:
v AIX®
v DS4000
v Enterprise Storage Server®
v ESCON
v FICON
v i5/OS®
v iSeries®
v IBM
v ProtecTIER
v pSeries®
v S/390®
v ServeRAID
v System x
v System Storage
v TotalStorage
v Wake on LAN
v z/OS®
v zSeries®
IBM, the IBM logo, and ibm.com® are trademarks or registered trademarks of
International Business Machines Corporation in the United States, other countries,
or both. If these and other IBM trademarked terms are marked on their first
occurrence in this information with a trademark symbol ((R) or (TM)), these
symbols indicate U.S. registered or common law trademarks owned by IBM at the
time this information was published. Such trademarks may also be registered or
Adobe, the Adobe logo, PostScript, and the PostScript logo are either registered
trademarks or trademarks of Adobe Systems Incorporated in the United States,
and/or other countries.
Java and all Java-based trademarks are trademarks of Oracle, Inc. in the United
States, other countries, or both.
Intel, Intel logo, Intel Inside, Intel Inside logo, Intel Centrino, Intel Centrino logo,
Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or
registered trademarks of Intel Corporation or its subsidiaries in the United States
and other countries.
UNIX is a registered trademark of The Open Group in the United States and other
countries.
Other company, product, and service names may be trademarks or service marks
of others.
This equipment has been tested and found to comply with the limits for a Class A
digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to
provide reasonable protection against harmful interference when the equipment is
operated in a commercial environment. This equipment generates, uses, and can
radiate radio frequency energy and, if not installed and used in accordance with
the instruction manual, might cause harmful interference to radio communications.
Operation of this equipment in a residential area is likely to cause harmful
interference, in which case the user will be required to correct the interference at
his own expense.
Properly shielded and grounded cables and connectors must be used in order to
meet FCC emission limits. IBM is not responsible for any radio or television
interference caused by using other than recommended cables and connectors, or by
unauthorized changes or modifications to this equipment. Unauthorized changes
or modifications could void the user's authority to operate the equipment.
Notices 329
This device complies with Part 15 of the FCC Rules. Operation is subject to the
following two conditions: (1) this device might not cause harmful interference, and
(2) this device must accept any interference received, including interference that
might cause undesired operation.
Responsible Manufacturer:
"Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im
Wohnbereich Funk-Störungen verursachen; in diesem Fall kann vom Betreiber
verlangt werden, angemessene Mabnahmen zu ergreifen und dafür
aufzukommen."
Dieses Gerät ist berechtigt, in übereinstimmung mit dem Deutschen EMVG das
EG-Konformitätszeichen - CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk,New York 10504
Tel: 914-499-1900
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse
A.
Notices 331
People's Republic of China Class A Electronic Emission
statement
jjieta
Korean Communications Commission (KCC) Class A
Statement
Notices 333
334 TS7650 and TS7650G Problem Determination and Service Guide
Index
Numerics A CD (continued)
RAS BIOS/Firmware Update
1812 81H about this document xvii RAS BIOS/Firmware Update 228
FRUs sending comments xxiv recovery DVD 159
removing and replacing 215 ac software 159
power supply and fan unit LEDs 148 power LED 78 checkout
related publications xxiv access general
1814 70H 132 disk components 140 3958 DD1/DD3/AP1 46
battery unit LEDs 136 accessibility 315 3958 DD4/DD5/AP1 83
FRUs accessing from command line 21 server 46, 83
removing and replacing 214 adminstrator password 66 clearing system errors 45
operation 143 attaching comments, sending xxiv
rear indicators and LEDs 133 cables 237 component labels 37, 73
related publications xxiv audience of this document xvii components
seven-segment numeric display 137 available configurations 2 disk controller 131
troubleshooting 8 configuration
3958 AP1 37, 73, 75 disk components 159
configuration 234
front view 38, 74
B server (3958 DD1/DD3/AP1) 234
battery unit Configuration/Setup Utility
FRUs 163 viewing error logs 49
disk controller
power checkout 63, 103 Configuration/Setup Utility program
LEDs 136
power features 63, 102 starting 234
bays
rear view 40 configurations 2
hot-swap 131
related publications xxiv configuring
beep code
status LEDs 46, 83 RAS package 255, 261
power-on self-test (POST) 49
troubleshooting 6 using IMM connection 261
beep codes
3958 DD1 37 using RSA connection 255
power-on self-test (POST) 48, 85
configuration 234 connecting
BIOS code
front view 38 cables 237
recovering 228
FRUs 165 connectors
BMC log
power features 63, 102 rear of server 77
viewing 54
rear view 40 Remote supervisor adapter (RSA) 42
boot
related publications xxiv console
recovery jumper location 106
status LEDs 46 ProtecTIER Manager 2
button
troubleshooting 6 controls
DVD drive eject 74
3958 DD1/DD3/AP1 Remote supervisor adapter (RSA) 42
locator 75
general checkout 46 controls and LEDs
power-control 75
power-on password override 65, 105 operator information panel 75
recovering a node 211
automatic recovery 212
recovery 67, 107 C
3958 DD3 37 cable D
change serial number 201 attaching 237 dc
configuration 234 cabling 237 power LED 78
front view 38 disk components 144 DD1/DD3
FRUs 163 call home 13 TSSC connections 120
power checkout 63 enabling 13 DD4, DD5, and AP1
power features 63, 102 testing 13 TSSC connections 113
rear view 40 Call Home device driver
related publications xxiv accessing from command line 24 related publications xxiv
status LEDs 46 initiating from TSSC 26 diagnostic
troubleshooting 6 initiating from TSSC command tools overview 48, 85
3958 DD4 line 27 diagnostic programs
power checkout 103 initiating manually 25 error messages 50, 87
3958 DD4/3958 DD5 73, 75 overview 23 starting 51, 88
front view 74 software events 32 viewing error logs 49, 87
FRUs 161 Troubleshooting the server-to-TSSC diagnostic text messages 50, 88
status LEDs 83 connection 24 diagnostics 16
3958 DD4/DD5/AP1 CD BMC log 54
general checkout 83 documentation 159 Ethernet controller
overview 159 troubleshooting 64, 103
RAS BIOS/firmware update 159 failure 3
Index 337
rear rsCerHMDisplay command 21 speed setting (continued)
view 77 rsCerHMStatusCtl command 22 disk expansion unit 157
recovering starting
BIOS code 228 diagnostic programs 51, 88
server
node 211
S startup 245
status LEDs
SAS hard disk drive 231
server node 3958 DD1/DD3/AP1 46
sending
automatic recovery 212 3958 DD4/DD5/AP1 83
comments xxiv
recovery Storage Manager 140
serial
disaster 3 symptoms
connector 77
server 67, 107 general 4
serial number change 201
related publications xxiii syncing the system time 67, 108
server
remote control system
3958 AP1 37, 73
enabling 259 locator LED, front 75
3958 DD1 37
Integrated Management Module 266 system console 113
3958 DD1/DD3/AP1
remote supervisor access 255 related publications xxiv
RAS BIOS/Firmware Update
Remote Supervisor Adapter removing and replacing FRUs 215
CD 159
related publications xxiv system error log
3958 DD3 37
Remote supervisor adapter (RSA) viewing 54
3958 DD4/3958 DD5 73
controls, connectors, and system errors, clearing 45
configuration 234
indicators 42 system event log
error LEDs
remote system management viewing 91
power supply 43, 79
controls, connectors, and system health monitoring
front view 38, 74
indicators 79 command line tools 19
FRUs
removal system health monitoring display 21
unique to the 3958 DD1
Ethernet switch 216 system health monitoring status
server 165
Remote Supervisor Adapter 203 control 22
unique to the 3958 DD3 and AP1
RSA 203 system health monitoring tools 21
servers 163
removal procedures system shutdown 244
unique to the 3958 DD4/3958 DD5
SFP modules 142 system startup 245
server 161
removing system test 67, 107
FRUSs
disk controller system time 67, 108
removing and replacing 170
FRUs 214 system troubleshooting tools
general checkout 46, 83
disk expansion unit introduction 15
power checkout 63, 103
FRUs 215 system validation test 71, 111
power features 63, 102
Ethernet switch system-error LED
power supply
FRUs 216 front 75
error LEDs 43, 79
server rear 78
power-on password override 65, 105
FRUs 170 systems-management connector 77
RAS BIOS/firmware update CD 159
replacement
rear view 40
field 169
recovering a node 211
Remote Supervisor Adapter 203
RSA 203
automatic recovery 212 T
recovery 67, 107 terminology xvii
replacing
related publications xxiv test log
disk controller
status LEDs 46, 83 viewing 53, 90
FRUs 214
troubleshooting 6 testing the cluster 67, 107
disk expansion unit
server (3958 DD1/DD3/AP1) text messages
FRUs 215
power features 63, 102 diagnostic 50, 88
Ethernet switch
service xxi tools
FRUs 216
settings diagnostic 48, 85
server
disk controller speed 144 Trademarks 328
FRUs 170
disk expansion unit speed 157 troubleshooting
requirements
Setup Utility 1814 70H 8
loopback test 4
error logs 86 disk controller 8
resolving problems
seven-segment numeric display 137, 148 disk expansion unit 9
considerations for 3
SFP modules Ethernet controller 64, 103
RSA
removal 142 light path 56, 93
connection 255
ship group preferred path critical event 10
controls, connectors, and
hardware 1 problem resolution map 5
indicators 42
software 2 server
definition 255
shutdown 244 3958 DD1/DD3/AP1 6
related publications xxiv
software starting point 1
Remote Supervisor Adapter
CDs 159 troubleshooting tools
removal and replacement 203
overview 159 Call Home 23
removal and replacement 203
ship group 2 introduction 15
RSA keystrokes
speed setting system health monitoring
configuring 260
disk expansion module 144 command line tools 19
U
USB
connectors 77
front connectors 74
User Security 66
V
video
connector 77
W
web interface
logging on to 76
websites
directory of worldwide contacts xx
IBM home page xx
Support for IBM System Storage and
TotalStorage products xx
Western Telematic Inc. switch
checking power ports 291
configuring 299
removing and replacing 222
World Wide Node Name
changing 175
obtaining from Qlogic adapter 177
World Wide Port Name
obtaining from Qlogic adapter 177
wrap plugs 4
WTI switch
checking power ports 291
configuring 299
removing and replacing 222
WWNN
changing 175
obtaining from Qlogic adapter 177
WWPN
obtaining from Qlogic adapter 177
Index 339
340 TS7650 and TS7650G Problem Determination and Service Guide
GA32-0923-06