Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Stratus Technologies
R014W-19
Notice
Software described in Stratus documents (a) is the property of Stratus Technologies Bermuda, Ltd. or the third party,
(b) is furnished only under license, and (c) may be copied or used only as expressly permitted under the terms of the
license.
Stratus documentation describes all supported features of the user interfaces and the application programming
interfaces (API) developed by Stratus. Any undocumented features of these interfaces are intended solely for use by
Stratus personnel and are subject to change without warning.
This document is protected by copyright. All rights are reserved. Stratus Technologies grants you limited permission to
download and print a reasonable number of copies of this document (or any portions thereof), without change, for your
internal use only, provided you retain all copyright notices and other restrictive legends and/or notices appearing in the
copied document.
Stratus, the Stratus logo, ftServer, and the ftServer logo are registered trademarks of Stratus Technologies Bermuda,
Ltd.
The Stratus Technologies logo, the Stratus 24 x 7 logo, ActiveService, ftScalable, Automated Uptime, and Active
Upgrade are trademarks of Stratus Technologies Bermuda, Ltd.
Manual Name: Stratus ftServer System Administrator’s Guide for the Windows Operating System
Preface ix
Contents iii
Contents
iv Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Contents
5. Troubleshooting 5-1
MTBF Statistics 5-2
Taking Components Offline and Bringing Them Online 5-3
Expansion ROM Console Message During System Boot 5-3
Internal Disks 5-3
Hardware Installation Wizard 5-5
Determining That a Component Has Failed 5-6
Locating Components 5-7
When the System Does Not Respond 5-10
Remote Event Notification and Service 5-11
Making Dump Files 5-13
Disaster Recovery 5-16
Windows Hotfixes Distributed by Stratus 5-18
Tape Drives and Backup 5-19
Contents v
Contents
Index Index-1
vi Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Figures
Figures vii
Tables
viii Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Preface
Audience
This manual is intended for anyone who administers or troubleshoots ftServer 2900,
4900, or 6900 systems.
Revision Information
This manual is a revision. It includes the following changes:
Notation Conventions
This document uses the notation conventions described in this section.
! WARNING
A warning indicates a hazardous situation that, if not
avoided, could result in death or serious injury.
Preface ix
Preface
! AV E RT I S S E M E N T
Un avertissement indique une situation dangereuse qui, si
pas évitée, pourrait entraîner la mort ou des blessures
graves.
! CAUTION
A caution indicates a hazardous situation that, if not
avoided, could result in minor or moderate injury.
! MISE EN GARDE
Une mise en garde indique une situation dangereuse qui,
si pas évitée, pourrait entraîner des blessures mineures
ou modérées.
NOTICE
A notice indicates information that, if not acted on, could
result in damage to a system, hardware device, program,
or data, but does not present a health or safety hazard.
NOTE
A note provides important information about the operation
of an ftServer system or related equipment or software.
Typographical Conventions
The following typographical conventions are used in ftServer documents:
The bold font emphasizes words in text or indicates text that you type, the name of
a screen object, or the name of a programming element. For example:
Before handling or replacing system components, make sure that you are
properly grounded by using a grounded wrist strap.
x Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Preface
The italic font introduces new terms and indicates programming and command-line
arguments that the user defines. For example:
Many hardware components are customer-replaceable units (CRUs), which
can be replaced on-site by system administrators with minimal training or tools.
The monospace font indicates sample program code and output, including
message text. For example:
#include <iostream.h>
Getting Help
If you have a technical question about ftServer system hardware or software, try these
online resources first:
http://www.stratus.com/go/support
The Service Portal provides access to Knowledge Base articles for all Stratus
product lines. You can locate articles by performing a simple or advanced keyword
search, viewing recent articles or top FAQs, or browsing a product and category.
To log in to the Service Portal, enter your employee user name and password or, if
you have not been provided with a login account, click Register Account. When
registering a new account, ensure that you specify an email address from a
company that has a service agreement with Stratus.
Preface xi
Preface
Online product support for Microsoft® products. Your primary source for
support is the computer manufacturer who provided your software, or an
authorized Microsoft Support Provider. You can also find the latest technical
information about Microsoft Windows® and other Microsoft products through online
product support at the Microsoft Help and Support Web site:
http://support.microsoft.com/
If you cannot resolve your questions with these online self-help resources, and the
ftServer system is covered by a service agreement, contact the Stratus Customer
Assistance Center (CAC) or your authorized Stratus service representative. To contact
the CAC, use the Service Portal to log a support request. Click Customer Support and
Add Issue, and then complete the Create Issue form. A member of our Customer
Service team will be glad to assist you.
By clicking the site feedback link at the bottom of a Help topic. Information to
identify the topic is supplied in the StrataDOC Web Site Feedback form.
By email to Comments@stratus.com. If it is possible, please include specific
information about the documentation on which you are commenting:
– For a printed document or a document in PDF format, include the title and part
number from the Notice page and the page numbers.
– For online documentation, include the Help subject and topic title.
This information will assist Stratus Information Development in making any needed
changes to the ftServer system documentation. Your assistance is most appreciated.
Regulatory Notice
All regulatory notices are provided in the site planning guide for your system.
xii Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Chapter 1
Introduction to System Administration 1-
The following topics provide an introduction to Stratus ftServer 2900, 4900, and 6900
system administration.
ActiveService Network
Your ftServer system offers an optional, secure network, the ActiveService
Network (ASN). ASN enables the CAC or your authorized Stratus service
representative to provide remote monitoring, diagnosis, troubleshooting, and
Your system to send alerts (call-home alarm messages) to the CAC or your
authorized Stratus service representative when unusual events occur on the
system. (Since changes in network configuration can generate false alarms, ASN
does not perform call home when a network adapter team loses connectivity.)
The CAC or your authorized Stratus service representative to access the system
through a connection to the ASN
Your system can connect to the ASN through a modem connected to the system or
through the Internet.
Internet-based ASN connects over a secure path to your authorized Stratus service
representative.
After verifying a hardware problem, your authorized Stratus service representative can
send out a replacement CRU.
See the Stratus ActiveService Network Configuration Guide (R072) and Stratus
Automated Uptime Layer for Windows-based ftServer Systems: Installation and
Configuration (R002W) for more information. Also, see the online Help for instructions
on how to configure ASN alarm notification by email.
NOTE
VTM functionality is integrated into the baseboard
management controller (BMC).
You can access the VTM console over a LAN, or the Internet, if Internet connectivity is
available. You can use the VTM console even if the host system’s operating system is
unresponsive and its network connections are lost. VTM console functionality also
includes:
1-2 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
System Software Features
Remote KVM console enables you to manage the host computer remotely over the
Web-based console. Using Remote KVM console, you can view the remote system
desktop and redirect its local keyboard and mouse.
A VTM requires the assignment of two IP addresses, either static or from a Dynamic
Host Configuration Protocol (DHCP) server.
For information about initially configuring the VTMs for use, see Stratus Automated
Uptime Layer for Windows-based ftServer Systems: Installation and
Configuration (R002W). For information about using the VTM console to open a
Remote KVM session or to perform other administrative tasks, see the Stratus ftServer
Virtual Technician Module User’s Guide (R642).
Disk-Management Tools
Disk-management tools enable you to monitor disk status, mirror entire physical disks,
create logical volumes, mirror volumes across multiple disks, and perform other
data-storage operations.
Fail-Safe Software
Automated Uptime Layer works in conjunction with lockstep technology to increase
system reliability. Fail-safe software features capture issues and report them to the
CAC or your authorized Stratus service representative. Even data in memory can be
constantly protected and maintained.
Stratus's hardened device drivers prevent software failures, and Stratus provides an
embedded open-driver technology that enables third-party device drivers to access
hardening capabilities.
Stratus Services
Stratus provides software fault-tolerant services that run on your system as
Windows-based services. These services constantly monitor for, and respond to,
hardware problems. Applications do not need customization to support the services.
These services start automatically when the system boots and remain running during
normal operation. If a situation requires system administrator intervention, you can use
Computer Management to manually start and stop system services. See “Monitoring
ftServer Services” on page 3-11 for details.
See the Stratus ftServer Systems: Technical Reference Guide (R550) for descriptions
of the Stratus services.
Stratus also supplies drivers that enable ASN integration and PCI device management.
Automatic reporting of device failures to the CAC or your authorized Stratus service
representative
Notification to you by the CAC or your authorized Stratus service representative
when a device fails
Inventory reporting to the CAC or your authorized Stratus service representative
that lists all PCI devices in the system
Comprehensive debug information about all devices in the system for use by the
CAC or your authorized Stratus service representative in troubleshooting your
system
Hot-plug support, which alerts the plug-and-play manager to bring a device into
service or take it out of service. You can initiate bring-up or bring-down from ftSMC.
Hot-plug support also initiates discovery of newly attached or removed devices.
NOTE
You must remove the CPU- I / O enclosure from the
system in order to remove a PCI adapter.
Information display that indicates whether removing a device will compromise the
system. The drivers can set LEDs and values in ftSMC that indicate whether you
can safely remove a component.
1-4 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Initial Configuration
The automatic restart of failed devices. The device will be restarted after transient
faults, until it falls below its mean-time-between faults (MTBF) threshold.
The tracking of a device’s MTBF and management of the device when it
experiences faults. You can set some MTBF parameters on internal disks (see
“Setting the Hard MTBF Threshold for Disks” on page 4-23) and you can clear
MTBF settings on CPU elements, I/O elements, and Ethernet controllers (see
“Clearing MTBF Settings” on page 5-2).
Initial Configuration
Configure the following items before you use your system.
Documentation
The Stratus ftServer 29x0, 49x0, and 69x0 Systems: Site Planning and Installation
Guide (R786) lists ftServer system documentation. You can find additional information
in ftSMC Help.
Online Documentation
If you installed StrataDOC during the Automated Uptime Layer installation process, the
ftServer Help and Manuals folder on the ftServer desktop contains links to the Stratus
ftServer documents.
You can view the documents in the Help system or open them in PDF format for
viewing or printing.
To gain access to the documents on the World Wide Web, point your browser to the
ftServer StrataDOC Web site (http://stratadoc4ftserver.stratus.com).
General Help. You can access general help from the ftServer Help and Manuals
icon on the desktop, the Help button on an application’s menu bar, or by pressing
the F1 key.
Context-Sensitive Help. Context-sensitive help is available for certain items within
help-enabled applications, such as the ftSMC snap-in. If you installed StrataDOC
during the Automated Uptime Layer installation process, you can access the
ftSMC help by selecting an icon in the ftSMC system inventory tree and
pressing F1. Alternatively, right-click an ftSMC system inventory tree icon, and in
the shortcut menu, click Help.
What’s This? of ftSMC Help. What’s This? of ftSMC Help is available for items in
the details pane of the ftSMC snap-in. To access it, right-click the question mark (?)
icon in front of the item, and in the shortcut menu, click What’s this? (see
Figure 2-3 for an example). What’s This? help is also available from certain dialog
boxes having the What’s this? Help icon (?) in their upper-right corner. For
example, in a Properties dialog box in ftSMC, click the Help icon (?), then click a
box to display a pop-up window of ftSMC Help.
1-6 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Windows Documentation
Windows Documentation
This manual documents only those system administration tasks that are required for
ftServer systems in particular, not those that are required for Windows-based servers
in general. General system-administration tasks for Windows systems include:
For help with these kinds of tasks, see the associated Microsoft Help system and the
Windows OS documentation provided by Microsoft.
1-8 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Chapter 2
ftSys Management Console 2-
For information about the Virtual Technician Module (VTM), see the Stratus ftServer
Virtual Technician Module User’s Guide (R642) and ftSMC Help.
ftSMC Overview
ftSMC, an MMC snap-in, is the system management tool you use to monitor and control
the operation of your system. ftSMC presents a list of the system components in the
console tree and properties of the component in the details pane.
Use ftSMC to disable devices. Do not use Windows Device Manager to disable devices
except in documented special situations.
NOTE
MMC snap-ins, including ftSMC, do not lock edit sessions
to prevent simultaneous users from editing the same
parameters.
ftSMC Interface
The details pane, the icons used in ftSMC, and the ftSMC display refresh options are
features of the ftSMC interface. See Figure 2-2.
To expand all nodes in the snapin, select the ftServer (Local) node and press the
asterisk (*) key on the number pad.
2-2 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
ftSMC Overview
Table 2-1 describes the key elements in the ftSMC user interface.
1 Menu bar The Action menu contains MMC snap-in commands. The View
menu contains console viewing options.
2 Console tree Lists all system component nodes. Each node represents a
manageable object.
3 Details pane Displays the properties related to the node selected in the console
tree. For an explanation of a property, right-click the property and
select What’s this? from the shortcut menu to display information
from ftSMC Help. (Not all properties have What’s this? help.) The
Stratus ftServer Systems: Technical Reference Guide (R550)
includes information about properties.
Details Pane
The names and values of the properties of the components in the ftSMC tree are
displayed in the details pane. A question mark (?) icon in front of a property name
indicates that What’s this? Help is available in ftSMC Help. Right-click the property
name and click in the shortcut menu that appears. See Figure 2-3.
The empty (dimmed folder) icon indicates that the node does not contain any
information.
The plain, yellow online folder icon indicates that the node is online and contains
information or child nodes. This icon indicates online simplex components.
The offline (tools) icon indicates that the node is offline, either because of operator
intervention for maintenance or because the system has detected a problem with
a component.
The replacement requested icon shows dual yellow monitors. It indicates that the
flagged component has experienced repeated transient errors and, while still
operational, should be replaced as soon as possible to avoid loss of fault tolerance.
The duplexed (partnered) icon shows dual blue-green monitors. This indicates
that the node is duplexed with another node (for example, two CPU elements). A
duplexed component is safe to remove.
The simplexed (single-monitor) icon indicates that the component is operating
simplexed (unpartnered). Return the partner component to service as soon as
possible to restore fault-tolerant operation.
2-4 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
ftSMC Overview
NOTICE
It is unsafe to remove a simplexed component from the
system. Removing a simplexed component causes loss of
that component function to the system.
The simplexed icon can also indicate an invalid configuration. For instance, in
Figure 2-5, Network Port - 0 is duplexed while Network Port - 1 is simplexed. This
makes Ethernet Controller - 6 (and therefore, its containing I/O element) a single
point of failure, which is an invalid configuration.
The testing (magnifying glass) icon indicates that the node is offline and
undergoing diagnostic testing.
ftSMC displays the warning and error icons described in Table 2-2.
Icon Indication
No inventory ftSMC cannot retrieve inventory information for the device. The
found ftSMC status bar displays a message indicating that the properties
cannot be retrieved.
Warning icons precede the names of all parent nodes that contain nodes with an error.
Figure 2-6 shows warning icons next to the ftServer I/O Enclosures parent node and
its two child nodes, I/O Enclosure - 11 and Ethernet Controller - 6. Finally, an error
icon appears next to the child node that is offline, Network Port - 1.
Figure 2-7 shows a no-inventory-found icon next to the CPU Enclosure icons.
2-6 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Performing ftSMC Tasks
NOTICES
1. Before using ftSMC to interact with the Stratus
Customer Assistance Center (CAC) or your
authorized Stratus service representative, configure
your ASN connection as described in the Stratus
ActiveService Network Configuration Guide (R072)
and Stratus Automated Uptime Layer for
Windows-based ftServer Systems: Installation and
Configuration (R002W).
2. When working in ftSMC, generally do not change the
configured values of properties unless directed to do
so in documentation or by the CAC or your authorized
Stratus service representative.
When you click a command that does not have an ellipsis (...), the system performs the
command immediately and a dialog box displays the status of the command—for
instance, The operation completed successfully. This status does not indicate that
the selected procedure has completed, but only that the command has been
successfully initiated. Click OK in the dialog box to dismiss it. Monitor the state of the
device in the details pane or the system event log to see if the command completed
successfully.
2-8 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Using ftSMC to Manage CPU-I/O Enclosures
CPU Initiate BringUp Removed, Broken, Shot, Firmware update complete, Diagnostics
Element passed
† In ftSMC, if you select the action Initiate BringDown on the ftServer I/O Enclosures node, the
LEDs on the embedded 1GbE ports do not turn off; instead, these LEDs remain lit.
Table 2-5. ftSMC Nodes for CPU Element and I/O Element
CPU Enclosure - 1
I/O Enclosure - 11
Descriptions of CPU elements and I/O elements apply to the elements of both
CPU- I / O enclosures, unless otherwise specified.
For example, your authorized Stratus service provider might ask you to switch the
primary enclosure for troubleshooting purposes.
If the value for Op State: Reason is Secondary and the value for
ActiveCompatibilityFlag is False, you can manually set the enclosure to be the
primary enclosure.
Chapter 3, “Operating and Monitoring Your System” includes more tasks that you can
perform with the ftSMC snapin.
2-10 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Chapter 3
Operating and Monitoring Your System 3-
NOTICES
1. Always shut down the operating system before
turning the system power off.
2. Give the system time to recover automatically
from problems. If the system hangs for any reason
while it is booting, wait at least ten minutes before
intervening manually.
3. If a system contains failed components, it may
attempt to boot for an extended period of time. This is
because the system makes up to six attempts to boot,
You can accomplish an orderly shutdown of the system from a remote computer by
accessing the Windows desktop and executing a Shut Down command.
You must physically be at the system to see the message indicating that it is safe to
power off the system. To turn off power, use the power button on the machine, the
standby circuit-breaker switches, or the VTM console. Note, however, that the
Windows Shut Down command automatically powers down the system.
Discussions of CPU enclosures and I/O enclosures apply equally to the corresponding
elements of CPU-I/O enclosures, unless otherwise specified.
3-2 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Starting Up and Shutting Down a System
NOTE
Brownout is not available if the Hyper-V role is configured.
During CPU bringup, the blackout phase of resynchronization takes longer with
larger memory configurations. This extended resynchronization time can cause
certain applications to time out before the blackout phase completes, resulting in
application failure. This situation is most likely to occur in configurations where
brownout is not available (see the preceding Note) and where memory sizes are
128 GB or larger.
For example, this situation may occur on ftServer systems with 128 GB or more of
memory, in any configuration where brownout is not available, if either of the
following is true:
In this situation, you may want to defer CPU bringup to an off-peak time or a time
when you can manage your network applications (see “Managing CPU Element
Bringup” on page 3-4).
If you have concerns about these extended resynchronization times, you may want
to conduct a thorough validation in a non-production environment.
During part of the resynchronization period, the system does not respond to network
connections. Therefore, it is important to set the timeout period and/or number of retries
for network applications to be large enough to prevent timeouts during
resynchronization. The length of the resynchronization period is proportional to the
amount of system memory configured.
By default, resynchronization occurs as soon as the CPU element that was taken out
of service is determined to be operational or the failed enclosure is replaced. However,
you can defer resynchronization to a more convenient time, such as an off-peak period.
See “Managing CPU Element Bringup” on page 3-4 for information about scheduling
CPU bringup.
When Scheduler executes the commands, Enable CpuBoard Bringup changes the
CpuBringUpPolicy property under the Srabid driver to Enable Bringup. This setting
enables CPU elements that were removed from service to come back into service. It
also brings up any CPUs that are in the Removed from Service state for a reason of
Deferred BringUp, as long as they do not exceed the MTBF eviction threshold.
The eviction threshold, which appears as ThresholdEvict in the ftSMC snapin, is one
of two MTBF thresholds; the other is the replacement threshold (ThresholdReplace in
ftSMC).
3-4 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Screen Resolutions and Display Adapter
When Scheduler executes the Defer CpuBoard Bringup command, it changes the
CpuBringUpPolicy property under the Srabid driver to Defer Bringup. With this
setting, if a CPU element is removed from service, the CPU elements come back into
service at different times.
Screen Resolutions
Table 3-1 lists resolutions for Windows Server 2016 systems. Note, though, that the
video control panel of your computer lists only the modes that your monitor supports
(unless you have configured your computer to show all modes, including modes that
the monitor cannot display).
Table 3-1. Screen Resolutions: ftServer System with Windows Server 2016
† Although the Display control panel lists the refresh rate for these
modes as 64 Hz, they are actually equal to the refresh rates shown in
parentheses.
Display Adapter
On Windows systems, Device Manager lists one Display Adapter. On ftServer
systems, the adapter is the Stratus Virtual Pilot3 Video adapter.
NOTE
Users must have passwords to access remote desktops
with the Remote Desktop application. See Microsoft
documentation for details.
For information about how to use Remote Desktop, see the Windows Server online
Help.
For information about initially configuring the VTMs for use, see Stratus Automated
Uptime Layer for Windows-based ftServer Systems: Installation and
3-6 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Using Windows System Event Logs
Configuration (R002W). For information about using the VTM console to open a
Remote KVM session or to perform other administrative tasks, see the Stratus ftServer
Virtual Technician Module User’s Guide (R642).
You can use Event Viewer to view the logs and to set the size of the application and
system event logs. To prevent loss of event log data, maintain a regular schedule for
manually saving event log files or obtain a third-party tool that saves event logs
automatically.
3-8 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Monitoring Network Status
NOTICE
Data will be overwritten if one of these event logs reaches
its maximum log size.
3-10 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Monitoring ftServer Services
3. Under Windows Logs, right-click the event log you want to manage (for example,
Application) and select Properties.
The other log choices are Security, Setup, System, and Forwarded Events. (See
Windows online Help for details about these event logs.)
4. In the Log Properties dialog box, set the desired value in the Maximum log size
(KB) box. The system default size for the application and system event logs is
20 MB.
5. Click OK.
Saving the log file has no effect on the current contents of the active log.
See the Windows Server online Help for more options to the Save All Events As
command.
NOTE
If the system boots fully, simply restart it and press the F2
key again as it begins to boot.
3. Use the RIGHT ARROW and LEFT ARROW keys to select different tabs. Use the
UP ARROW and DOWN ARROW keys to select different settings on a page. Press
the ESC key to return to a previous page. Press Enter to save new settings.
NOTE
On the Security tab, you can set a user or supervisor
password. If a password is not set, you can enter the
setup utility and modify any value. If a password is set,
and you enter the setup utility without the valid password,
you can modify only the System Date and System Time,
which are on the Main tab. A user or supervisor who
enters the setup utility with the valid password can set or
modify any value.
See the Stratus ftServer Systems: Technical Reference Guide (R550) for cautions and
details about using the setup utility. See the Stratus ftServer Virtual Technician Module
User’s Guide (R642) for information on POST codes. See Stratus Automated Uptime
Layer for Windows-based ftServer Systems: Installation and Configuration (R002W)
for more information about configuring boot settings in the setup utility.
3-12 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Using Serial Ports
If no ASN modem is configured for the ftServer system, Serial Port B is also always
available for use by the operating system and accessible to devices connected to
the serial port connector labeled 2 at the rear of the ftServer system.
If an ASN modem is configured for the ftServer system, Serial Port B is dedicated
to the ASN modem, is not available for general use by the operating system, and
is not accessible to other devices connected to its serial connector at the rear of
the ftServer system. To free Serial Port B for other purposes, perform one of the
following actions:
You can determine the presence or absence of the ASN modem by examining the
ftServer Call Home Modem - 0 node in ftSMC, which will be shown as Empty if the
modem is not configured.
NOTICE
Although the serial ports have a nominal maximum speed
of 115,200 bps, they do not operate reliably at speeds
higher than 57,600 bps. Therefore, do not attempt to run
them at higher speeds.
For information about qualified UPS models, see the information for your system at
the following Web site:
http://stratadoc.stratus.com/genref/refsell/qualified_equipment.html
For UPS installation details, see Stratus Automated Uptime Layer for
Windows-based ftServer Systems: Installation and Configuration (R002W).
If your ftServer system is not connected to a qualified UPS, the system may not have
time to shut down gracefully in the event of a power failure, thus limiting your recovery
procedure options. In that case, you have to reboot the system and verify that your file
systems were not corrupted. Contact the Stratus CAC or your authorized Stratus
service representative for further assistance.
NOTE
Assign a static IP address to the network interface that
provides the connection to the UPS.
3-14 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
SNMP on ftServer 2900, 4900, and 6900 Systems
ClearMTBF—Resets the MTBF to 0 (Unknown) and erases the record of all faults.
SetMtbfThreshold—Sets the MTBF threshold and views the current count
(seconds), number of faults, and date and time of last fault. The MTBF threshold is
set at the factory; you should not change it unless instructed to do so by your
support representative.
InitiateBringUp—Powers the component on and brings it into service (online). The
component must be offline, broken, or shot.
InitiateBringDown—Takes the component out of service (offline) and turns off
power to it. You cannot use this action on a simplex component. The component
must be online and duplex, offline with firmware update complete or diagnostics
passed, or broken or shot.
BurnFirmware—Burns a new firmware image on the target CPU element.
SetCPUBoardPriority—Sets the priority for the CPU element. Supported priorities
are High and Medium. Setting the priority of one element affects the priorities of the
other elements. CPU elements must be in duplex mode to change priority; you
cannot change the priority of a simplex element.
SetMtbfType—Defines how the MTBF threshold value is used when the MTBF of
a device goes below the threshold. One of the following values:
– Use Threshold
– Never Restart
– Always Restart
Normally set to Use Threshold, which indicates that the threshold value is used to
determine whether or not to bring the device back into service automatically.
SNMP also reports detailed system information in the following MIB tree nodes:
SNMP Monitoring
The ftServer SNMP Agent provides a standard mechanism for monitoring the status of
systems and for receiving alarms from the systems.
Unsupported (integer value 1). The system does not support the specified object,
meaning, the sraSiOverallSystemStatus query itself. Because the agent supports
the query, this value will never be returned.
No Faults (integer value 2). The system is fully duplexed, the disks are mirrored,
and no hardware faults are detected in major components. This status does not
include the status of Ethernet cards or other connected communications media.
3-16 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Understanding Updates and Hotfixes
System Fault (integer value 3). A major component has a fault, or the system is not
duplexed. Use the ftSMC snapin to investigate the status further.
System Down (integer value 4). The system is down. If this is true, the agent on
that system will not be operational, so a value of 4 will never be returned. If the
system is down, the sraSiOverallSystemStatus query will fail.
http://www.stratus.com/go/support/osupdatereleasepolicy
Avoiding Problems
To take full advantage of your system’s fault-tolerant features and to avoid problems,
observe the following when setting up and operating your system:
Do not use the “Repair Your Computer” option from the Windows operating
system “Advanced Boot Options” menu (accessed by pressing the F8 key
during system boot), unless you first power down the secondary CPU-I/O
enclosure. If you do so, the operating system will change the disk signature
(disk ID) of the secondary enclosure’s boot disk without warning, making the disk
in the secondary enclosure unbootable. (The boot disks still appear to be
duplexed.) Note that you can safely use the Safe Boot option from the Advanced
Boot Options menu. If the operating system changes the disk signature on a boot
disk, it will be unable to boot from that disk and will fault-resilient boot over to the
other CRU. When the operating system successfully boots, RDR software will
repair the disk signature problem, so even if the system ends up in this state, it will
automatically recover.
Do not attempt to boot the system from a network drive. Booting the system
from a network drive is not supported.
Do not use Virtual Disk Service (VDS) RAID 1 (disk mirroring); it is not
supported. Use only RDR disk mirroring. If you use VDS mirroring, the disks will
not be mirrored after an I/O element is returned to service.
You can use VDS or other volume-mirroring software only if you want to create
RAID 0 (striped disks) over the RAID 1 (disk mirroring) provided by RDR (for
RAID 1 + 0), or if you want to configure your system to boot from external storage,
as described in Stratus Automated Uptime Layer for Windows-based ftServer
Systems: Installation and Configuration (R002W).
take devices out of service and return them to service is through the ftSMC snap-in.
However, to handle special situations, the system documentation may instruct you
to use Windows Device Manager.
Do not install devices unless they have been tested and certified as
compatible with ftServer systems. Installing devices that have not been tested
and certified to be compatible with your ftServer system can reduce the fault
tolerance of your system. If you want to install a device that has not been certified,
contact your account team.
3-18 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Chapter 4
Disk Storage Management 4-
If you are not using RDR or G3MC, see your disk-management tool documentation
for information on how to create striped volumes across the mirrored internal disks.
NOTICE
Create a backup of the system disk with which to restart
the system in the unlikely event of a system failure. See
“Deporting a Physical Disk from an RDR Virtual Disk” on
page 4-20.
You may use any slots for any disks, with the following two exceptions. First, pair disks
in partner I/O elements, not within the same element, to achieve fault-tolerance.
Second, use paired SAS disks for the system boot disks and locate them in Slot 0
(upper slot, far left) of each enclosure. Ensure that the disks are a matched pair of the
same model, firmware, level, and geometry.
After the IPL is complete, use RDR technology to mirror the system disk with its partner
disk in the secondary enclosure for fault-tolerant operation.
Because the system disk and boot partition hold system-critical files, such as the
paging file and system event files, limit the traffic to this disk to system traffic only.
NOTICES
1. A paging file with a minimum size of 2 GB is required
on the boot drive. Otherwise, memory dumps may not
be captured correctly under certain conditions.
Non-boot drives can optionally have additional paging
files.
2. Dynamic disks and Storage Spaces are not
supported on the system disk; they are supported
only on internal RDR data disks.
4-2 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Guidelines for Internal Disks
Data Disks
Use paired SAS disks for internal data disks and locate them in Slot 1 (lower slot, far
left) to Slot 7 (lower slot, far right) of each enclosure.
Use Windows Disk Management to initialize the disks and then use RDR technology to
mirror each pair of disks for fault-tolerant operation.
The manner in which you initialize the disks in Disk Management depends on the
storage technology you plan to use to manage the disks:
NOTICE
It is best to decide what type of disk management
technology and disk partitioning style to use on data disks
during the initial configuration of your system, because
changing them later may require reinitializing the disks.
Basic disks
Basic disks are the default type of disks in Windows Server. A basic disk contains
partitions that are typically formatted with a file system to create one or more
simple volumes for file storage.
If you plan to create basic data disks on a system running Automated Uptime Layer
for Windows-based ftServer Systems, initialize each pair of disks with the default,
GPT partition style, mirror the disks with RDR, and then create volumes on the
RDR virtual disk.
Dynamic disks
NOTES
1. Although dynamic disks are still supported,
Automated Uptime Layer for Windows-based ftServer
Systems is optimized for best compatibility with the
newer Storage Spaces technology.
2. Mirroring disks with RDR already provides a layer of
redundancy. Use dynamic disks to mirror data disks
only if your policies require it.
If you plan to create dynamic data disks on a system running Automated Uptime
Layer for Windows-based ftServer Systems, initialize each pair of disks with the
Master Boot Record (MBR) partition style, mirror the disks with RDR, convert the
RDR virtual disk to a dynamic disk, and then create volumes.
Storage Spaces
Storage Spaces is a newer technology in Windows Server 2012 or higher that
allows you to group 3 or more disks together into a storage pool and allocate disk
space from that pool to create storage spaces.
If you plan to use Storage Spaces on a system running Automated Uptime Layer
for Windows-based ftServer Systems, initialize each pair of data disks with the
default, GPT partition style, mirror the disks with RDR, create a storage pool with
3 or more RDR virtual disks, and then create storage spaces.
For more information about using Storage Spaces, see “Using Storage Spaces on
Data Disks” on page 4-33
NOTICE
To mirror the internal disks in your system, you must use
RDR mirroring. If you use any other type of volume
mirroring on the internal disks, the disks will not be
mirrored after an I/O element is returned to service. You
can use VDS or other volume-mirroring software only if
you want to create RAID 0 (striped disks) over the RAID 1
(disk mirroring) provided by RDR (or RAID1+0).
The data on one disk of the RDR virtual disk pair is mirrored in full—or duplexed—on
its corresponding partner disk, irrespective of volume or partition layout.
However, an RDR virtual disk may also be simplex (consisting of only one physical
disk). A simplex RDR virtual disk may contain mirrored volumes created using
third-party disk-management facilities. If all volumes on a simplex RDR disk are
mirrored with other disks in the system, then the simplex RDR disk is effectively
duplexed (mirrored).
4-4 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Using RDR for Mirroring and Fault Tolerance
If you are upgrading a system’s disks from a non-RDR to an RDR configuration, the
existing disks must provide adequate space for metadata. Metadata is “data about
data” that the RDR software stores on disks to implement disk management. When you
configure a disk for RDR, the system displays an error message if the disk does not
have adequate space. The disk being upgraded should be a basic disk. A basic disk
is one that is divided into partitions.
The details pane presents the location of the physical disks in the RDR virtual disk in
the following properties:
See “Finding Physical Disks” on page 4-40 to match the RDR virtual disk members to
the physical disk in the CPU-I/O enclosure.
The DevicePathID property that appears at the top of the details pane shows a number
like 39/2. This number does not indicate a hardware component and does not signify
the physical disk’s location in the system; it only identifies the virtual disk to
system-management software.
However, a virtual disk cannot be larger than the smaller of its two physical disks. For
example, if each physical disk of an RDR virtual disk is 300 gigabytes (GB), the virtual
disk is a 300-GB virtual disk. If you replace one 300-GB physical disk with a 600-GB
physical disk, the virtual disk remains a 300-GB disk, the smaller of the two physical
disks. But if you replace the other 300-GB physical disk with a 600-GB physical disk,
the RDR virtual disk automatically updates to 600 GB.
ftSMC reports the capacity of physical disks and virtual disks in the Capacity property
in two different locations:
The physical disk Capacity property appears in the ftSMC details pane under
ftServer I/O Enclosure - 10, Storage Enclosure - 40, Slot - n, Disk - n.
The virtual disk Capacity property appears in the ftSMC details pane under
Logical Disk Information, RDR Virtual Disk - n.
Automatic virtual disk expansion does not work on dynamic virtual disks—that is, RDR
virtual disks that have been converted to dynamic disks by Windows Disk
Management. (A dynamic disk consists of a single partition that is divided into volumes.
Dynamic disks can be combined with other disks to form volumes that expand beyond
a single disk.)
4-6 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing RDR Disks
To use RDR to mirror dynamic disks, see Stratus Automated Uptime Layer for
Windows-based ftServer Systems: Installation and Configuration (R002W).
Be sure to read about RDR reserved partitions before you start using RDR.
You can also set disk parameters from the SCSI Port Duplex Driver - Sradisk_Driver
node shortcut menu in the ftSMC.
Because the ftServer software creates an RDR reserved partition when it creates the
RDR disk, the RDR disk is never completely empty, even when Disk Management
shows the partition as empty. (The RDR reserved partition is simply too small to appear
in the bar-graph display in Disk Management.) As a result:
You cannot toggle an RDR disk between the MBR and GPT partition styles. As
described in “Guidelines for Internal Disks” on page 4-2, it is best to select the disk
management technology and disk partition style to use on data disks during the
initial configuration of your system.
The RDR pair will duplex even before you create any partitions. However, this
behavior should not affect your disk configurations in any way.
On disks with the MBR partition style, the system may assign a logical drive letter
to an RDR reserved partition. If needed, you can safely remove the drive letter in
Disk Management, but do not delete the RDR reserved partition itself unless
you intend to completely wipe and reinitialize the disk.
Deleting the last volume from an RDR disk no longer automatically converts that
disk to a basic disk. If you need to convert a dynamic disk to basic disk, follow the
procedure described in “Converting a Dynamic Disk to a Basic Disk” on page 4-17.
NOTICE
If you delete an RDR reserved partition, you must not add
the associated RDR virtual disk to a Storage Spaces
storage pool, because doing so makes the disk
unmanageable with ftSMC or ftServer Manager.
The RDR reserved partitions also appear in the output of the Windows DiskPart utility,
as shown in Figure 4-3. Partition 2 that ends at offset 1024 KB is an RDR reserved
partition.
4-8 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing RDR Disks
If you create the first RDR disk (plex) on a disk that already has data on it (for instance,
a system disk), make sure that the first disk and its intended partner disk have the same
capacity, that they are basic disks (not dynamic), and that the second disk (the one to
be the mirror) has no partitions.
NOTE
When you create an RDR disk, the ftServer software
automatically creates a small partition. This means that
the RDR disk is never completely empty, even when Disk
Management shows the partition as empty. See “RDR
Reserved Partitions” on page 4-7 for more information.
If you are upgrading disks to use RDR, be sure that the disks are basic disks. (Basic
disks are divided into partitions. Dynamic disks consist of a single partition that is
divided into volumes. Dynamic disks can be combined with other disks to form volumes
that expand beyond a single disk.) If the disks have been configured as dynamic disks,
convert the disks to basic disks before upgrading them to RDR (refer to Windows online
help).
Although you can create RDR disks only from basic disks, you can make RDR disks
dynamic if your policies require it.
If you insert an uninitialized (that is, having no disk signature or volume ID),
non-RDR-configured disk into a slot previously occupied by an RDR disk whose
partner slot is currently occupied by an RDR disk, the system automatically adds the
inserted disk to the RDR virtual disk. In Windows Disk Manager, the disk is shown as
Not Initialized.
Not all RDR commands and procedures are valid on all physical disks. Error messages
appear if you attempt an invalid RDR command. Error messages also suggest possible
corrective actions, where applicable.
Some RDR configuration changes, especially those made on simplex RDR virtual
disks, may require that the operating system be restarted in order to complete. When
the Windows restart warning appears, you can either restart the system immediately or
defer restarting until a more convenient time.
1. In ftSMC, expand I/O Enclosure, Storage Enclosure - 40 and Slot - n, and then
select the disk (the target disk) that you want to add to the RDR virtual disk.
4-10 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing RDR Disks
3. Click Next.
The Select a source disk dialog box appears, identifying the target disk and the
disk or disks available for being chosen as sources.
NOTE
The target disk must have the same sector size and
partition style as the existing RDR virtual disk.
A source disk will already be selected by default in the Select a source disk dialog
box, if storage-enclosure rules allow.
4. If a source disk is not already selected in the dialog box, choose a source from the
available disks listed in the right-hand box within the dialog box, and click Next.
The RDR virtual source disk will overwrite the unconfigured target disk, making it
a mirror of the RDR virtual source disk.
In this example, only one source disk, the disk in 10/40/2/2, is available.
The following screen appears, identifying the source and target disks.
4-12 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing RDR Disks
5. Click Finish.
The system mirrors the source disk to the target disk. The mirroring process may
take a half-hour or more, depending on the size of the disks.
While the disks are resynchronizing, LEDs blink to indicate read and write
operations, and ftSMC displays a value of Op State: State Syncing in the details
pane for the disk.
When the disks are synchronized, the LEDs on both disks are lit steady green and
ftSMC displays a value of Op State: State: Duplex in the details pane for the disk
and for the virtual disk. The value next to Status for the virtual disk is None.
NOTE
If you receive an error message when using the Add
Physical Disk To RDR Virtual Disk command, the disk
you are trying to add may not have sufficient free space for
the metadata. Be sure the disk has a minimum of 10MB
free space.
A System Shutdown message may appear, stating that the system will reboot in
two minutes. If the message appears, let the system restart.
NOTES
1. If you added a system disk to an RDR virtual disk, a
message notifies you that the system will reboot. The
message gives you the option of confirming or
cancelling the request.
2. The system restarts if you add a data disk to an RDR
virtual disk and the software cannot unmount all data
volumes on the disk.
3. You cannot add a physical disk to RDR if the disk is
dynamic or a nonblank data disk.
If a source disk fails or is removed from service during resynchronization, the target
disk is removed from the system and treated as a deported disk. If the target disk fails
or is removed from service during resynchronization, the source disk continues
operating in simplex mode.
If the system starts to shut down while an RDR virtual disk is resynchronizing, RDR
records the state of the resynchronization and, upon system reboot, resumes
resynchronization from that point.
4-14 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing RDR Disks
Write Caching
To optimize system performance, you can enable write caching on RDR virtual disks.
By default, write caching is disabled, except during disk resynchronization, when it is
automatically enabled.
Write caching does not work on a simplex RDR virtual disk (except when it is
resynchronizing), even if it is enabled. However, if you enable write caching on a
simplex disk that is later duplexed, write caching will work.
NOTICES
1. Although enabling write caching typically optimizes
system performance, it does so at the risk of losing
data in the event of certain hardware and system
failures. To ensure the highest level of fault tolerance,
do not enable write caching for disks in ftServer
systems.
2. If you do enable write caching on RDR virtual disks,
do so only if the system is protected by a UPS.
You can enable or disable read-load balancing either for each physical disk or globally,
for all RDR physical disks across the system.
NOTE
Global read-load balancing settings do not overwrite
read-load balancing settings on individual RDR physical
disks. So, for example, if the Lun Load Balancing field for
a particular RDR physical disk is set to Off, you must set
it to On before a global load-balancing command can take
effect on that disk.
4-16 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing RDR Disks
NOTICE
Converting a dynamic disk to a basic disk removes all
data from the disk. Before you convert the disk to a basic
disk, back it up so that you can recover the data.
1. Go to Computer Management.
2. Expand Storage and click Disk Management.
3. Right-click the icon of the target disk in the lower portion of the Disk Management
details pane.
4. Select Convert to Basic Disk.
5. If necessary, recover the information from backup media.
To upgrade the basic disks to RDR, see “Creating a First RDR Virtual Disk” on
page 4-9.
You might use Set As Active RDR Plex, for example, when using the chkdsk
command to check the performance of a particular physical disk in an RDR virtual disk.
To do this, issue the Set As Active RDR Plex command for the physical disk you want
to check; this makes it the active disk. Then disable read-load balancing on the RDR
virtual disk, and run chkdsk on the active disk.
NOTE
When you run the chkdsk utility, the video display
remains blank even though the utility is executing.
However, the system is not hung. Therefore, do not
power-cycle or reset the system during this time. You can
see the results of the chkdsk utility by viewing the
Windows Application log.
1. In ftSMC, expand I/O Enclosure - 10 (or 11), Storage Enclosure - 40, Slot - n,
and then right-click on Disk - n.
2. Select Set As Active RDR Plex.
The RDR metadata is deleted, but other data remains on the disk.
The disk is no longer an RDR disk and is not part of an RDR virtual disk.
You can reconfigure the disk as an RDR disk again, using the Add Physical Disk
To RDR Virtual Disk command.
NOTICES
1. If you delete the RDR configuration from a simplex
data disk, then all of the New Technology File System
(NTFS) partitions on that data disk lose their drive
letters.
2. Deleting the RDR configuration from a simplexed or
duplexed boot disk, dynamic disks, or data disks with
one or more paging files can result in unexpected
results, including an operating system crash.
4-18 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing RDR Disks
NOTE
Disregard any “Unsafe Removal of Device” messages that
appear when you remove an internal physical disk from
the enclosure.
To disable automounting, open a command prompt and issue the following command:
mountvol /n
mountvol /e
Executing the Remove a Physical Disk from RDR Virtual Disk command on a
partnered (duplex) RDR virtual disk (comprising two physical disks):
Erases all of the data on the disk, including all Windows Disk Management
dynamic disk metadata and all RDR metadata.
Returns the physical disk to a basic, non-RDR-configured, blank disk.
The system does not warn you when you issue this command because the target disk
has a surviving partner physical disk that still contains all of the data. For that reason,
the system does not need to restart.
If you execute the Remove a Physical Disk from RDR Virtual Disk command on an
unpartnered (simplex) RDR virtual disk (comprising one physical disk), the system
prompts you to remove all partition and volume data on the disk before the command
can continue. The prompt alerts you that this operation may result in the loss of all data
on this virtual disk since there is only one physical disk.
NOTE
You use the Deport Physical Disk from RDR Virtual
Disk command to create backup disks and spare,
bootable, RDR virtual disks.
Once a disk is deported, it is hidden from the system and is inaccessible from ftSMC.
It does not become accessible again until you remove it from and reinsert it into the
system. You can physically remove and then reinsert the disk, or you can issue the
InitiateBringdown and InitiateBringup commands for the disk slot.
If you insert a disk deported from one system into another system, the disk appears in
ftSMC as Active, Configured, Deported.
A deported disk inadvertently left in the system or reinserted into its original slot
remains an intact backup. It is not automatically resynchronized or overwritten by the
primary RDR virtual disk.
The last physical disk of an RDR virtual disk if it is bootable or contains a paging file
An unmirrored boot disk
A dynamic disk
A resynchronizing disk
4-20 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing RDR Disks
The Deport a Physical Disk from an RDR Virtual Disk command should be grayed
out if the physical disk is not in the valid state (for example, if it is a dynamic disk or a
simplex boot disk) for this command to work properly. A valid deport operation should
rarely fail.
If the Deport a Physical Disk from an RDR Virtual Disk operation requires that the
system reboot, a system-restart dialog box appears, giving you the option to restart the
system immediately or later. If the operation succeeds without requiring a reboot, a
message box indicating success appears.
If Deport a Physical Disk from an RDR Virtual Disk fails, a dialog box indicating the
reason for the failure appears. In this case, the RDR virtual disk is not deported and the
system does not reboot.
After a successful Deport a Physical Disk from an RDR Virtual Disk command, no
drive letters or mount points associated with volumes on the deported disk are
preserved.
3. Click OK.
A message appears, telling you to remove the deported disk.
Label and store this disk, which is now the spare bootable disk, in a safe place.
NOTICE
If the original boot disk is still in a system into which you
insert a spare (deported) boot disk, the original boot disk
loses its RDR configuration and is no longer an
RDR-enabled disk.
Dynamic disks
Simplex RDR virtual disks
RDR virtual disks while they are resynchronizing
RDR virtual disks if either the global LUN Verify Policy (the LunVerifyPolicy
property of the SCSI Port Duplex Driver - Sradisk_Driver) or the virtual disk’s
Verify RDR Virtual Disk Policy is set to Disable.
1. Verify the RDR virtual disk that you are going to back up to ensure that both plexes
(that is, physical disks) of the RDR virtual disk are healthy.
2. In ftSMC, expand ftServer I/O Enclosure, I/O Enclosure - 10 (or 11, as is
appropriate for your configuration), Storage Enclosure - 40, and then select the
appropriate slot to access the target disk. (See “System Elements and Internal
Storage Disk Locations” on page 5-8 for information about identifying disks and
slots.)
3. Right click on the target disk and select Break Physical Disk from RDR Virtual
Disk.
This command splits the plex from the RDR virtual disk, creates an exact copy of
its volumes, and adds them to Disk Management.
4-22 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing RDR Disks
Setting the hard or soft MTBF threshold sets the MTBF threshold and views the current
count (seconds), number of faults, and date and time of last fault. Stratus presets the
MTBF thresholds.
The hard MTBF threshold refers to the threshold for disk failures that are serious
enough to remove a disk from service permanently.
The soft MTBF threshold refers to the threshold for disk failures that are not serious
enough to remove a disk from service permanently.
The data on the stored-data disk is overwritten with the data on the disk in its
companion slot.
The stored-data disk remains an RDR disk.
Wait for the disk to be fully resynchronized before removing the stored disk or before
removing power from the system.
4-24 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing RDR Disks
NOTE
To delete the RDR configuration from a stored-data disk
but retain its data, see “Deleting the RDR Configuration on
a Physical Disk” on page 4-18.
1. Make sure no disk occupies the slot that is the partner slot to the disk whose RDR
configuration you are removing.
2. If your disk was a dynamic disk, use your disk-management tool to convert the disk
to basic, deleting all data on the disk.
If your disk was a basic disk, use your disk-management tool to delete all volumes
and partitions.
3. In ftSMC, select and right-click the disk, and click Remove Physical Disk From
RDR Virtual Disk.
NOTICE
You must complete the final step. Otherwise, if the former
RDR virtual disk partner of the newly erased disk is
inserted into the partner slot of the erased (but not
deleted) disk, the newly inserted disk is mirrored to the
empty disk, deleting the data on the newly inserted disk.
Resynchronize This Physical Disk From RDR Virtual Disk. This command
resynchronizes the chosen physical disk with its partner.
Resynchronize Physical Disks Of This RDR Virtual Disk. This command simply
makes the two physical disks of an RDR virtual disk identical. The software
arbitrarily chooses which disk to make the master and which the copy, irrespective
of which disk was last written to.
While the disks are resynchronizing, the disk LEDs blink to indicate read and write
operations. You can also use ftSMC to monitor the status of the resynchronization
process.
When the resync process is finished, the Status value for the RDR virtual disk is None,
and the LED on each physical disk in the RDR pair is lit steady green.
When RDR disks are resynchronizing, the Logical to Physical Mapping details pane
shows the disk that is resynchronizing as SYNCING, and shows the partner disk as
SIMPLEX.
NOTE
If you insert an uninitialized disk into the companion slot
of an RDR virtual disk, the uninitialized disk is
automatically configured and synchronized to the existing
RDR disk.
See “Disk Failures During Resynchronization” on page 4-14 for instructions on what to
do if the source RDR virtual disk fails during resynchronization.
4-26 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing RDR Disks
As shown in the RDR Virtual Disk icon and in the Op State: State property, the
RDR Virtual Disk changes from Duplex to Simplex.
When resynchronization completes, the RDR Virtual Disk changes from Simplex
to Duplex, as shown in the RDR Virtual Disk icon and the Op State: State
property.
The backup boot disk automatically resynchronizes with the original partner. The
command takes some time to run.
NOTE
Disregard any “Unsafe Removal of Device” messages that
appear when you remove an internal physical disk from
the enclosure.
2. Replace the defective disk and add the disk to an RDR virtual disk.
In the unlikely event of a system failure resulting from disk failures, restore the system
by performing one of the following procedures, as appropriate:
Restore the system by using a backup system disk. See the discussion of creating
spare bootable disks in “Deporting a Physical Disk from an RDR Virtual Disk” on
page 4-20 or Stratus Automated Uptime Layer for Windows-based ftServer
Systems: Installation and Configuration (R002W) for information.
Reload the Automated Uptime Layer, as described in Stratus Automated Uptime
Layer for Windows-based ftServer Systems: Installation and
Configuration (R002W).
Running the Verify RDR Virtual Disk command enables you to detect disk errors
before an application needs to use the affected data or before the errors result in disk
failure. Disks deemed to be at high risk of failure should be replaced. However, the
Verify RDR Virtual Disk command requests disk I/O and can degrade disk
performance while running. Therefore, your use of Verify RDR Virtual Disk should
take into account the benefits of disk verification versus the impact on disk
performance. If you are uncertain about the most appropriate use of Verify RDR
Virtual Disk, contact the CAC or your authorized Stratus service representative.
You can execute the Verify RDR Virtual Disk command on specific disks manually on
an individual basis, or automatically, by means of scheduling and batch processes. See
the online help for ftSys Management Console (ftSMC) for a detailed description of the
RDR commands.
4-28 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing RDR Disks
NOTE
Scheduled surface scans and the Verify RDR Virtual
Disk action perform the same disk verification; however,
scheduled surface scans occur automatically (though you
can modify the schedule), whereas the Verify RDR
Virtual Disk action is a command that you issue to verify
a specific RDR virtual disk once.
“Using the LunVerifyPolicy Property to Run a Batch Verify Task” on page 4-29
“Changing the Schedule of the Batch Verify Task” on page 4-30
“Globally Re-enabling Verify Tasks on Individual RDR Virtual Disks” on page 4-30
“Changing the Schedule of Surface Scans for Individual RDR Virtual Disks” on
page 4-30
“Enabling or Disabling Scheduled Surface Scans on Individual RDR Virtual Disks”
on page 4-31
“Disabling All Surface Scans” on page 4-31
Changing the Schedule of Surface Scans for Individual RDR Virtual Disks
By default, the surface scans for individual RDR virtual disks are scheduled to run once
a month, shortly after midnight on the day that corresponds to the RDR virtual disk’s
LUN number. You can change the schedule in ftSMC or in the Windows Task
Scheduler, which shows, by default, one verify task for each LUN and one batch verify
task.
To change the schedule of surface scans for an individual RDR virtual disk
1. In ftSMC, expand ftServer (Local) and click Logical Disk Information.
2. Right-click the appropriate RDR virtual disk.
3. Select Schedule Verify On This RDR Virtual Disk. The RdrVerifyLunn window
appears.
4-30 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing RDR Disks
NOTE
Scheduling or descheduling a surface scan has no effect
on the RdrLunVerifyPolicy (or LunVerifyPolicy) setting.
4. In the RdrVerifyLunn window, click the Schedule tab. Modify the schedule as
desired.
5. After you set the schedule, click OK to close the RdrVerifyLunn window.
You may need to recover updated data if your applications start to run on a stale disk.
When you are sure that the system and data disks on the boot enclosure are updated,
restore power to the second enclosure.
4-32 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Using Storage Spaces on Data Disks
For more information about Storage Spaces, see the Microsoft website. In particular,
the following site provides detailed information about using Server Manager to create
storage pools and virtual disks:
https://blogs.technet.microsoft.com/filecab/2012/12/06/using-the-storage-pools-page-i
n-server-manager-to-create-storage-spaces/
Storage Spaces is not supported on any part of the system (boot) disk.
Before you can add a new disk to a storage pool, you must create RDR disks as
described in “Managing RDR Disks” on page 4-6.
NOTES
1. When you initialize a new disk for use with Storage
Spaces, be sure to select the GPT partition style,
which is the default.
2. RDR configuration does not allow you to specify a
partition style.
NOTICES
1. After you create RDR disks, check the disks in the
ftServer Management Console (ftSMC). The value of
the Vendor property must be Stratus, and the value
of the ProductID property must be Data Duplex
LUN. If these properties show any other values, the
disks are not configured correctly as RDR disks and
cannot be used with Storage Spaces.
2. After you create RDR disks, but before you add the
disks to a storage pool, you must bring the disks
online in Disk Management. If you do not, the disks
will be persistently in the Online state in ftSMC, and
safe-to-pull calculations will be broken.
After you create RDR disks, use Server Manager to implement Storage Spaces by first
creating a storage pool and then creating a virtual disk. The Stratus-specific differences
in this process are as follows:
4-34 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Using Storage Spaces on Data Disks
NOTICE
Do not clear the RDR configuration’s metadata on boot
disks.
3. Re-create the RDR configuration on one of the data disks in each pair. See
“Creating a First RDR Virtual Disk” on page 4-9.
4. Re-add the partner data disk to the re-created RDR virtual disk plex. See “Adding
a Physical Disk to an RDR Virtual Disk” on page 4-10.
At this point, the data disks resynchronize automatically.
NOTE
During storage-pool creation, any existing data on the
data disks is not touched. Therefore, when you create a
storage pool and add RDR disks with existing data, only
the portion of those disks that is unused or unallocated is
eligible to be added to the storage pool.
If you do not perform the preceding procedure on your existing data disks, Storage
Spaces does not work correctly.
The InStoragePool property indicates, with a Yes or No value, whether the disk is
in a storage pool.
The StoragePoolName property provides the name of the disk’s storage pool.
NOTE
If your disks are configured for Storage Spaces and have
storage pool/virtual disk associations, the partition table
on the data disks contains the necessary Storage Space
configuration information.
2. Reinstall Automated Uptime Layer only on the boot disk. See Stratus Automated
Uptime Layer for Windows-based ftServer Systems: Installation and
Configuration (R002W)for information about reinstalling Automated Uptime Layer.
3. Open the command prompt and run the Windows Diskpart utility. Issue the
following command, and then exit the utility.
SAN POLICY=OnlineAll
4. After the reinstallation, reinsert all data disks into CPU-I/O enclosure 10. These
disks should appear in ftSMC as Online and Deported.
5. Insert the partner disks into CPU-I/O enclosure 11. These disks should appear in
ftSMC as Online and Deported.
6. Create RDR virtual disks on CPU-I/O enclosure 10’s disks. See “Creating a First
RDR Virtual Disk” on page 4-9.
7. Remove physical disks from all partner disks in CPU-I/O enclosure 11. This step
deletes all data (including the Storage Spaces partition table) from the data disks.
See “Removing a Physical Disk from an RDR Virtual Disk” on page 4-19.
NOTICE
Be sure to use the Remove a Physical Disk from RDR
Virtual Disk command. Do not use the Delete RDR
4-36 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Using Volume Mirroring for Fault Tolerance on Data Disks
8. Add the physical disks to the RDR for each of CPU-I/O enclosure 11’s disks to their
partners in CPU-I/O enclosure 10. See “Adding a Physical Disk to an RDR Virtual
Disk” on page 4-10.
9. The data disks resynchronize automatically. Wait for the resynchronization to
complete before performing the next step.
10. Reboot the system.
11. In Server Manager, click File and Storage Services, then click Storage Pools.
12. Each of the Storage Pools, Virtual Disks, and Physical Disks areas has a
TASKS button. Click each button to view the appropriate tasks for recovering and
importing the data disks on CPU-I/O enclosure 10 to the newly reinstalled system,
then perform these tasks.
13. Verify that RDR pairs have the correct storage pool information in ftSMC.
14. Repair storage pools and virtual disks, if necessary.
15. Manually remount the virtual-disk volumes that are in the storage pools, if
necessary.
Configure disks as dynamic disks. Dynamic disks organize data within volumes on
a disk and support fault-tolerant operation and disk migration between systems.
(See “Data Disks” on page 4-3 for guidelines.)
Format volumes as NTFS volumes.
Mirror each volume on two or more physical disks.
NOTICES
1. You must use RDR to mirror system disks. Dynamic
disks are supported only on data disks.
2. Failure to set up your disk storage as required results
in storage that is not fault tolerant.
A mirror is an identical copy of data that you can access if your primary data becomes
unavailable. The resulting mirrored pair of disks work together to provide a
fault-tolerant volume. If one disk fails, the volume automatically uses the data from the
other disk.
Mirror all data volumes. Mirroring volumes as documented provides fault tolerance,
because volumes are mirrored on separate physical disks on separate FC buses.
When mirroring volumes, pair identical physical disks (same capacity, model, and
manufacturer) so that in the event of a disk failure, an entire physical disk can be
replaced without any data loss.
After a disk is replaced or brought back online, the mirror between it and the disk that
remained in operation must be resynchronized. During resynchronization, the system
copies the format and the contents of the good mirrored volume(s) to their mirror(s) on
the disk most recently brought back online.
NOTE
Because mirror resynchronization can severely affect
performance, you should resynchronize mirrors when the
system is least busy.
Mirror resynchronization can occur automatically, or, in some cases, you must
resynchronize the mirror manually:
If you insert a replacement disk without a disk signature into the same slot as the
failed disk, the system automatically attempts to resynchronize the disk.
If you replace a failed disk with a blank basic or dynamic disk that contains a
signature, you must resynchronize the disks manually.
The system automatically attempts to resynchronize the disks if:
4-38 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Using Hardware RAID Controllers for Fault Tolerance
For example, if the system was shut down normally, but it was shut down while
synchronization was in progress, you need to explicitly reactivate the mirror. On the
other hand, if the system shutdown was due to an unexpected event like a power
failure, the disks begin the remirroring process automatically after booting.
Before mirroring system disks with a disk management tool, make sure either that both
disks are plexes of an RDR virtual disk or that neither disk is. Do not partner RDR and
non-RDR system disks in a mirror. Otherwise, the disks will fully resynchronize after
every reboot.
The boot disks do not require resynchronization if the RDR disk is the primary boot
volume from which the system is booted.
NOTE
You may remove disks from or insert disks into the internal
storage as needed.
Slot
Disk
Slot Node
The Slot node represents the slots where disks reside. System disks must reside in
certain slots (see “Mirroring Disks in ftScalable Storage Systems” on page 4-40 for
details).
Disk Node
Each Disk node represents a disk that resides in a slot in a storage enclosure. You use
the DevicePathID property to locate a physical disk. See “Finding Physical Disks” on
page 4-40 for details.
4-40 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Finding Physical Disks
For RDR virtual disks, the detail pane presents information in the following format:
NOTE
The information for non-RDR disks is the same, except
that it does not include LUN (RDR virtual disk) or PLEX
(physical disk) information, which pertains only to RDR
disks.
For RDR disks, Duplex indicates that the two physical disks of the RDR virtual
disks are partnered. If the two physical disks were not partnered, the displayed
value would be Simplex. For non-RDR disks, the values in this position indicate the
disk’s operating status.
The location of the disk appears in the DevicePathID property, which presents the
location information in a format similar to the following: 10/40/1/0. This DevicePathID
indicates the following:
4-42 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Chapter 5
Troubleshooting 5-
The following topics can help you troubleshoot your ftServer system:
NOTE
See the sections on reviving a nonresponsive system and
the various diagnostic codes in the Stratus ftServer Virtual
Technician Module User’s Guide (R642) for more
information on troubleshooting a system.
Troubleshooting 5-1
MTBF Statistics
MTBF Statistics
The fault-tolerant services software automatically maintains
mean-time-between-failure (MTBF) statistics for internal disks, CPU elements, and I/O
elements. Administrators can view this information in ftSMC.
The system stores the current MTBF values associated with each device in Windows
registry keys. Whenever an MTBF value changes, the registry reflects the change.
Therefore, when the driver stops and restarts, or the system restarts, the system
retains the current MTBF information.
For changing the MTBF settings on internal disks, see the sections in Chapter 4, “Disk
Storage Management” relating to MTBF. For clearing MTBF on CPU elements, I/O
elements, and internal disks, see “Clearing MTBF Settings” below.
For information on how MTBF is calculated and on MTBF types, see the Stratus
ftServer Systems: Technical Reference Guide (R550).
1. In ftSMC, expand ftServer (Local) until you locate the target component.
2. Right-click the target component and click Clear MTBF on the shortcut menu.
3. If the target component is a disk, choose the Clear Hard MTBF or Clear Soft
MTBF command, as appropriate.
If the target component is a CPU or I/O element, a drop-down menu appears,
displaying the available fault options. Select the type of fault you want to clear. The
choices are as follows:
All
Uncorrectable
Correctable (available only on CPU elements)
Microsync (available only on CPU elements)
Software (available only on I/O elements)
(See the Stratus ftServer Systems: Technical Reference Guide (R550) for
descriptions of these fault types.)
5-2 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Taking Components Offline and Bringing Them Online
4. Click Finish, then click OK on the dialog box indicating that the operation
completed successfully.
NOTE
Clearing the MTBF does not bring a component back into
service automatically.
If the state of device that you cleared is Broken, correct the state.
User actions that can cause the broken state to change include the Initiate BringUp
and Initiate BringDown commands, and removing the board.
You can safely ignore this message. You can avoid receiving this message by disabling
Option ROM Scan Monitoring for one or more PCI adapter slots in the System
Management submenu of the Server menu in the system BIOS setup utility. See
“Managing BIOS Settings” on page 3-12 as well as the Stratus ftServer Systems:
Technical Reference Guide (R550) for information about how to change BIOS settings.
Internal Disks
If a disk’s status is not healthy, you may be able to recover the disk, or you may need
to replace the disk.
Recovering a Disk
In your disk-management tool, try to recover the disk manually if:
Troubleshooting 5-3
Internal Disks
Verify that the system does not have a problem that is causing the mirror to break on
an internal or external disk. Ensure that the disk is securely seated in the drive and that
it has power. Perform the following procedures to recover the disk.
To recover an RDR disk, see “Recovering and Replacing RDR Disks” on page 4-27.
NOTICE
If a disk continues to have the status of Online (Errors),
replace it as soon as possible.
If the recovery procedures fail to reactivate the disk, or if the status does not return to
Healthy, you must physically replace the failed disk.
5-4 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Hardware Installation Wizard
To use Windows Disk Management to replace a failed disk with an initialized disk
1. Physically remove the failed disk and replace it with an initialized disk. For details,
see the operation and maintenance guide for your system.
2. Examine the type of the initialized replacement disk:
If the replacement disk’s type is Foreign Dynamic, go to step 3.
If the replacement disk’s type is Basic, go to step 4.
3. In Disk Management, right-click the foreign dynamic replacement disk, and then
click Convert to Basic. The replacement disk should now appear as a basic disk.
4. If the basic replacement disk has any volumes, delete them. Right-click the volume
on the replacement disk, and then click Delete Partition. Repeat for each volume
on the replacement disk. Verify that the basic replacement disk now appears as a
single unallocated space.
5. Right-click the replacement disk and click Convert to Dynamic.
6. Remove the mirrors from the missing disk that was physically removed in step 1.
Right-click a volume on the missing disk and then click Remove Mirror. In the
Remove Mirror dialog box, select the missing disk and click the Remove Mirror
button.
NOTICE
Be sure to select the missing disk, not the remaining good
disk. You will lose data if you select the remaining good
disk.
Troubleshooting 5-5
Determining That a Component Has Failed
The enclosure LEDs represent the combined states of the CPU element, I/O element,
BMC, the internal disks, and the PCI adapters.
The LED states are described in detail in the hardware troubleshooting chapter of the
operation and maintenance guide for your system. See the Getting Started guide for
your storage system for information about the ftScalable Storage system LEDs.
3. Click the problem node and check the details pane for the cause of the error.
In this example, the Op State: Reason property shows Media Disconnect; that is,
Network Port - 1 is disconnected.
5-6 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Locating Components
Locating Components
ftSMC displays device IDs and device paths for system components that it monitors.
Component location diagrams show the device IDs of major elements, enclosures, and
internal disks. Two commands assist you in locating select system components:
Start Slot Identification. Causes the LEDs on the component to flash so that you
can physically locate it.
Stop Slot Identification. Stops the LEDs on the component from flashing after you
have physically located it.
CPU-I/O enclosures, I/O enclosures, and internal disks support these slot identification
commands.
Hardware devices in the system inventory have device ID numbers. For example, the
device ID of the ftScalable Storage subsystem in Figure 5-2 is 75. The figure illustrates
the following hierarchically presented components and their device IDs:
A device path is a slash-separated sequence of device IDs that identifies the location
of a component in the system. Device paths appear in the details pane of ftSMC. They
use the following syntax:
Troubleshooting 5-7
Locating Components
Call-Home Modem 0
Device IDs and device paths apply only to hardware components. Logical devices and
software components do not have device IDs even if they appear in the ftSMC
interface.
5-8 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Locating Components
CPU0, I/O 10
Slot 0 Slot 2 Slot 4 Slot 6
Slot 1 Slot 3 Slot 5 Slot 7
To find which disk in ftSMC corresponds to which physical disk in the CPU-I/O
enclosure, match the slot number shown in ftSMC (0 through 7) with the physical slot
on the enclosure (1 through 8).
For example, in Figure 5-4, the disk (Disk - 2) associated with Slot - 4 in ftSMC is
located in the slot labeled 3 on the CPU-I/O enclosure. (In this example, Slot - 3
does not contain a disk. If Slot - 3 contained a disk, it would be Disk - 2 and the
subsequent disk numbers would be incremented by 1.)
NOTE
The disk numbers that ftSMC displays in Disk - n
correspond to the disk numbers that Windows assigns
and that the Disk Administrator displays.
Troubleshooting 5-9
When the System Does Not Respond
peg021
1. Create a dump file so that the Stratus Customer Assistance Center (CAC) or your
authorized Stratus service representative can analyze the problem, if necessary.
See “Making Dump Files” on page 5-13” for more information.
2. Use the VTM console or the power button to restart the system. Related
information can be found in the Stratus ftServer Virtual Technician Module User’s
Guide (R642).
5-10 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Remote Event Notification and Service
Event reporting to your supporting Stratus CAC or your authorized Stratus service
representative
Remote service to your system by service support personnel
You use the Stratus Customer Service Portal to configure all remote event reporting
and remote service features of the ASN. See the Stratus ActiveService Network
Configuration Guide (R072) for details.
Alarm Messages
Alarm messages placed in the system event log contain information about device state
changes such as changing modes from duplex to simplex, failure of diagnostics on a
device, exceeded thresholds, memory dumping, and failed device startups. Alarm
messages appear with Stratus Alarms as the event source. See the Stratus ftServer
Systems: Technical Reference Guide (R550) for descriptions of these alarm
messages.
You can configure your system to send alarm messages by the following options:
Troubleshooting 5-11
Remote Event Notification and Service
To change the property, update your configuration in the Stratus Customer Service
Portal as described in the Stratus ActiveService Network Configuration Guide (R072),
and then import the new ASN configuration file to your system as described in Stratus
Automated Uptime Layer for Windows-based ftServer Systems: Installation and
Configuration (R002W). If you change the property only in ftSMC, it is overwritten the
next time you import a configuration file.
Event-Log Entries
System software may generate an alarm in response to a sequence of several events.
To indicate what these events are, an alarm message may include event-log entries.
These event-log entries contain items that represent an event from both the system
event logs (SELs) and application event logs (AELs). The number of event-log entries
included with an alarm message is determined by specifying how many seconds of the
event log to include. System software may configure this time differently for each type
of alarm, or it may exclude some event-log entries.
The BMC on each CPU-I/O logs SELs for its CPU-I/O enclosure. The Windows System
Manager (WSM) reads the SELs from each BMC and saves them to the AELs. In the
AELs, the system events are simply interleaved with one another, not merged, so
events reported on both enclosures appear in duplicate.
Remote Service
If you have a service contract with the CAC or your authorized Stratus service
representative, your service representative can connect to your system through either
the Internet or a phone line.
When the connection is through a phone line, the VTM validates that the phone call is
from a permitted source, disconnects the incoming call, and calls back to make a
connection to the validated CAC.
5-12 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Making Dump Files
You can configure the system to allow the CAC or your authorized Stratus service
representative to connect to your system through the Internet, using secure tunneling
over IP, to diagnose problems and perform data-transfer functions. These accesses
are subject to validation by the CAC database, by the VTM on your system, and by you
as the system administrator. Connecting through the Internet provides higher-speed,
more reliable connections than connecting through a phone line.
The CAC or your authorized Stratus service representative may contact you to
determine the seriousness of the condition. If a CRU has failed, the CAC or your
authorized Stratus service representative will ship a new part to your site.
Press a button
Configure your system to create memory dumps
In ftSMC, right-click a CPU Enclosure, and select the Dump and Go command
In ftSMC, right-click a certain component, and select either the Dump and Go
command or the Dump command. (These commands are not available for all
components.)
Use the VTM console
You can also use Windows commands to configure the creation of memory dumps.
See the Windows documentation.
Note that if you install Automated Uptime Layer for Windows-based ftServer systems
and then you install additional memory into your ftServer system, you must adjust the
size of the NtKernel.dmp file using the Windows CrashControl DumpFileSize registry
value.
Troubleshooting 5-13
Making Dump Files
Press the non-maskable interrupt (NMI) button on the primary (active) CPU-I/O
enclosure for four to seven seconds to initiate a dump. The primary enclosure is the
one whose PRIMARY LED is lit. In order to see the PRIMARY LED, you must remove
the system bezel. However, you can see the enclosure status in ftSMC. The NMI button
is also referred to as the dump button. It is located on the front of the CPU-I/O
enclosure, though it is obscured by a bezel. (See your operation and maintenance
guide for details.)
The DumpFileSize registry value is set during the installation of Automated Uptime
Layer based on the total amount of physical RAM present in the system using the
following formula:
Using this formula, for example, the DumpFileSize registry value (and, thus, the size of
the NtKernel.dmp file) is set to 16 GB for a system with 512 GB of installed RAM.
You can adjust the NtKernel.dmp file size at the following location in the Windows
registry:
HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\CrashControl\DumpFileSize
5-14 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Making Dump Files
NOTE
Quick dumps are not supported on ftServer systems that
boot from an external storage system. Only standard
Windows dumps are available on these systems.
Dump files beyond this number overwrite the existing saved dump files. For
instance, if the value is 3, then the fourth saved dump file overwrites the first,
existing dump file.
If, at the time of a system crash, the DumpQuickEnabled property of the Board
Instance Driver node is set to True and the CPU element is duplexed, the system
creates a memory dump and stores the dump in the memory of one CPU element. The
system restarts with the remaining CPU element. After the system restarts, the dump
is automatically copied to a disk file.
After a system reboot where a dump has been generated, do not attempt to read or
copy the Memory.dmp file created. After the system has completed writing the
Memory.dmp file, it renames the file Memoryn.dmp, where n is a numeral. You can
read or copy the Memoryn.dmp file. It may take up to five minutes after the reboot to
complete writing the Memory.dmp file before it is renamed and can be read.
NOTICE
If you attempt to read or copy the Memory.dmp file while
the system is writing it, you may corrupt the file.
Troubleshooting 5-15
Disaster Recovery
Dump and Go
The Dump and Go command captures a snapshot dump of a running system or a
particular component without significantly interfering with the system’s or component’s
operation. The system must be duplexed if you use this method.
To capture a snapshot dump, right-click the CPU Enclosure node of interest in the
ftSMC console tree and select Dump and Go from the Action menu. The system:
Takes the selected CPU element offline with its memory intact
Dumps a snapshot of the memory to C:\ftSysDump\MEMORYn.DMP
Displays the status of the operation
Returns the CPU element back to online status
NOTICE
Making a dump file of an entire system automatically
takes the operating system offline and then restarts it.
Disaster Recovery
To help you to log on and access system resources after a computer disaster has
occurred, you can restart the system in safe mode or boot from a spare disk.
Safe Mode
You use the Safe Mode startup options to start the system with only the minimal
necessary services. Safe Mode options include Last Known Good Configuration,
Safe Mode, Safe Mode with Networking, and Safe Mode with Command Prompt.
For more information, see Safe Mode startup options in Windows Help.
5-16 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Disaster Recovery
NOTICE
You must disable the Watchdog Timer by disabling boot
monitoring in the BIOS before attempting to boot in Safe
Mode. See Stratus Automated Uptime Layer for
Windows-based ftServer Systems: Installation and
Configuration (R002W) for a description of how to disable
boot monitoring.
To boot a system from an RDR boot disk, see “Booting a System with a Backup RDR
Boot Disk” on page 4-24.
The following procedure explains how to boot a system from a non-RDR system disk.
Troubleshooting 5-17
Windows Hotfixes Distributed by Stratus
NOTE
When you mirror a system disk, click OK when a pop-up
window with the following message is displayed:
Windows Disk Manager could not update the boot file
for any boot partitions on the target disk. Verify your
arcpath listings in file boot.ini or through the
bootcfg.exe utility. Do not modify the boot.ini file. This
message does not apply to your ftServer system.
9. When the disk is 100% resynchronized, verify that the original system disk and the
upgraded system disk are healthy, resynching, dynamic, mirrored volumes. Note
that the drives are not duplexed until the resynchronization activity is complete.
10. After all resynchronization activity has finished, verify that all dynamic disks are
healthy, dynamic, mirrored volumes.
11. Start all applications to verify that your system and applications are working
properly.
5-18 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Tape Drives and Backup
1. Connect the IBM Ultrium 3580 SAS tape drive to the external port on the U114
Eight-Port SAS PCI-Express Adapter.
2. Install Symantec™ NetBackup™ 7.6 (or later) for Windows Servers. During
configuration, click Cancel at the following Volume Configuration Wizard prompt:
Select the robot that you want to inventory or the type of standalone drive
for which you want to configure volumes.
Troubleshooting 5-19
Tape Drives and Backup
5-20 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Appendix A
Configuring and Managing
Ethernet PCI Adapters A-
On ftServer 2900, 4900, and 6900 systems, you configure and manage Ethernet PCI
adapters with Load Balancing/Failover (LBFO) technology, using the Stratus LBFO
teaming script and Windows Server NIC Teaming utility to perform the following tasks:
Configure Ethernet PCI adapters into teams. For planning information, see
“Managing Ethernet PCI Adapters” on page A-2, “Types of Fault-Tolerant Teams”
on page A-6, “Fault-Tolerant Ethernet Teams and IP Multicasting” on page A-7, and
“Teaming Ethernet Network Adapters in a Hyper-V Environment” on page A-8, if
applicable. For information about how to configure Ethernet teams, see
“Configuring Ethernet Teams” on page A-10.
If needed, assign virtual LANs to a team. For planning information, see “Virtual
LANs” on page A-7. For information about how to configure the VLANs, see
“Configuring Virtual LANs Over a Team” on page A-20.
If needed, configure Wake On LAN (WOL) power management for a team or
individual adapters. See “Enabling Ethernet Wake On LAN Power Management”
on page A-22.
Manage a team by physically locating Ethernet PCI adapters, setting a persistent
MAC address for the team, removing or adding adapters, or deleting the team. See
“Managing Ethernet Teams” on page A-24.
Troubleshoot a team or individual adapters. See “Troubleshooting Ethernet Teams
and Ethernet PCI Adapters” on page A-38.
NOTICES
1. You can physically remove one of two partnered
CPU-I/O enclosures without disrupting the network
only if the active Ethernet PCI adapters in both of
the enclosures are teamed with each other. With
teams configured for load balancing, your system will
experience a drop in throughput while the enclosure
is absent.
2. For fault tolerance, configure your Ethernet PCI
adapters into teams. Although ftServer systems
Use the Stratus LBFO teaming script (ftSysLbfo.ps1) when you want to create
basic, fault-tolerant teams or to automatically assign a MAC address to a team. See
“Using the Stratus LBFO Teaming Script to Manage Ethernet Teams” on page A-2.
Use the Microsoft NIC Teaming utility (LbfoAdmin.exe), a component of Server
Manager, to configure specific teaming modes or load-balancing settings, to
configure virtual LANs (VLANs), or to add, remove, or replace Ethernet PCI
adapters on ftServer systems. This appendix does not discuss the NIC Teaming
utility in detail. For a summary of usage and restrictions, see “Using the NIC
Teaming Utility to Manage Ethernet Teams” on page A-5. For complete information
about NIC teaming technology and using the NIC Teaming utility to configure
specific teaming modes or load-balancing settings, see the Microsoft Windows
Server 2016 NIC and Switch Embedded Teaming User Guide.
C:\Program Files\ftSys\CustomerService\SUPPORTTOOLS\ftSysLbfo.ps1
NOTICE
The ftSysLbfo.ps1 script relies on the PowerShell module
ftSysLogToFile.psm1, which is located in the same folder.
Do not delete or modify these modules.
To use the ftSysLbfo.ps1 script, execute the script in the Windows PowerShell utility,
which is installed automatically on your system when you install Automated Uptime
Layer.
A-2 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing Ethernet PCI Adapters
You must type y (for yes) and press Enter to complete the change. Later, after you
finish running the script to configure teams, remember to close the PowerShell
utility or set the execution policy back to Restricted to secure your system.
NOTE
If running scripts is restricted on your system, executing a
script in PowerShell will display the message
script-name cannot be loaded because
running scripts is disabled on this system.
5. Switch to the C: drive (if needed) and change the directory to the location of the
script:
> c:
> cd "\Program Files\ftSys\CustomerService\SUPPORTTOOLS\"
If you execute the script with no options, it displays a summary of the network
adapter information (that is, the -show option is turned on by default).
Table A-1 describes the ftSysLbfo.ps1 script options. The rest of this chapter describes
each of the options in detail.
Option Description
-create Create new teams of four Ethernet PCI adapters in slots with no team.
-create Create one team containing 2 to 8 Ethernet PCI adapters. The format
io:slot:port,io:slot: io:slot:port represents an adapter’s device path ID (that is, its I/O
port ... enclosure number, PCI slot number, and logical port number).
-delete Delete all existing teams. See “Deleting All Ethernet Teams” on
page A-37.
-rename Rename all Ethernet PCI adapters to reflect the names of their device
path IDs. See “Renaming Ethernet PCI Adapters According to
Location” on page A-13.
-verify Check that the existing teams have a valid Ethernet teaming
configuration. See “Verifying the Configuration of Ethernet Teams” on
page A-18.
Each time the script runs, it creates a new log file named
ftSysLbfoTeams-MMDDYY-HHMMSS.txt in the following folder:
C:\Program Files\ftSys\management\logs
In the log file name, the format MMDDYY-HHMMSS represents the month, day, year,
hour, minute, and second that the file was created.
For more information about using the PowerShell utility, execute help in PowerShell
or see the Microsoft Web site. For example:
https://msdn.microsoft.com/en-us/powershell/scripting/powershell-scripting
A-4 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing Ethernet PCI Adapters
If you use the NIC Teaming utility to manage Ethernet teams, keep the following
guidelines and restrictions in mind:
If you are creating new Ethernet teams, first create them with the LBFO teaming
script, then, if needed, use the NIC Teaming utility to modify the teaming modes or
load-balancing settings. Initially creating teams with the LBFO teaming script
automatically ensures that the teams have an Ethernet PCI adapter from each
I/O element and that the teams have a persistent MAC address assigned.
If you use both the LBFO teaming script and NIC Teaming utility to manage
Ethernet teams, use only one utility at a time to manage a team.
Use the NIC Teaming utility only when both partnered CPU- I / O enclosures are
online. All NICs to be teamed must be present and enabled to be added to a team.
If you examine a team with the NIC Teaming utility when some team members are
not present, the utility may fail. See “NIC Teaming Utility Fails When a CPU-I/O
Enclosure Is Offline” on page A-39.
In the NIC Teaming utility, click OK in each dialog box to preserve your changes. If
you do not click OK before closing the dialog box, the configuration procedure is
incomplete, and you must repeat it.
If your account does not have administrative privileges, right-click Server Manager
and click Run as Administrator.
For complete information about NIC teaming technology and using the NIC Teaming
utility to manage teams, see the Microsoft Windows Server 2016 NIC and Switch
Embedded Teaming User Guide.
In some teams, one Ethernet PCI adapter is the active adapter (or primary adapter),
and all other Ethernet PCI adapters are standby (or secondary) adapters; a standby
adapter becomes active only if the active adapter fails. In other teams, most or all
members of the team actively pass traffic to increase overall transmit or receive
throughput.
Typically, the permanent Media Access Control (MAC) address (or Ethernet Address)
of the primary Ethernet PCI adapter becomes the current MAC address for all Ethernet
PCI adapters in one team.
Switch-independent mode
The team does not require the Ethernet switch to participate in the teaming. The
switch does not know that the Ethernet PCI adapter is part of a team in the host.
The adapters may be connected to different switches, if applicable.
Active/active teaming
Two or more Ethernet PCI adapters provide redundant connections to the network.
The active adapter(s) execute all transmit and receive functions for the team. If you
optionally configure a standby adapter, it becomes active only if a failover occurs
(typically, if an active adapter fails).
Outbound loads are distributed in real time based on a hash of the TCP ports and
IP addresses. Inbound loads are distributed by the destination MAC address (or
VM MAC address), as if Hyper-V switch port distribution were in use.
These are the only teaming modes that you can create and manage with the
ftSysLbfo.ps1 script. For detailed descriptions of these teaming modes, and
information about more advanced teaming configurations that you optionally manage
A-6 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Fault-Tolerant Ethernet Teams and IP Multicasting
with the Microsoft Windows NIC Teaming utility (LbfoAdmin.exe), see the Microsoft
Windows Server 2016 NIC and Switch Embedded Teaming User Guide.
You can assign any Ethernet PCI adapter that Stratus supplies to any type of team
supported by the NIC Teaming utility, with the exception of assigning U117 PCIe
adapters or embedded 1GbE adapters to a Static Link Aggregation team, as
summarized in “Do Not Configure U117 PCIe Adapters or Embedded 1GbE Adapters
as SLA Teams” on page A-41.
Each network host in an IP multicast group uses the Internet Group Management
Protocol (IGMP) to acknowledge and maintain its IP multicast group membership with
a switch.
In some LBFO team configurations, it is possible that only one Ethernet PCI adapter
(the active adapter) is registered to receive IP multicast packets. When failover to
another adapter occurs, the host cannot automatically update its multicast group
membership using the new active adapter; therefore, the host stops receiving multicast
packets. If the switch is configured to periodically query the host for its registration, or
if static IGMP is configured on the switch for each adapter in the team, multicasting to
the host will eventually resume, but possibly after an extended interruption.
If you use IP multicasting, you should test the team configuration on site to verify if the
behavior of IP multicasting after failover is acceptable. Alternatively, you can configure
either static or dynamic link aggregation teams. IP multicasting functions more reliably
with qualified switches that support these modes, because all Ethernet PCI adapters in
a link aggregation team are registered to receive the IP multicast packets. The team
continues to receive IP multicast packets unless all links to the switch are lost and then
restored (in which case, there is also an extended interruption).
For more information about IP multicasting, see the documentation for your switch, or
contact your network administrator.
Virtual LANs
You can add a Virtual LAN (VLAN) to any type of Ethernet team.
A VLAN allows you to isolate network traffic for a specific group of network hosts. Using
VLANs, you can organize networked systems into logical workgroups, such as
Marketing or Engineering, that span a building, complex, or an entire enterprise
network. Members of a particular VLAN receive traffic only from other members of the
same VLAN.
VLANs are particularly useful for limiting broadcast storms, reducing security problems,
and simplifying network management.
Implicit VLANs, which are configured entirely at the switch level. The switch does
not alter or tag packets to enforce an implicit VLAN.
Explicit VLANs, which are configured throughout the network, on each adapter and
link partner. Switches identify and route traffic based on a four-byte tag (802.3ac)
in each packet header.
To support either type of VLAN, switches on your network must support 802.1Q
VLANs. Also, to configure a VLAN on your system, you need to contact your network
administrator to obtain the VLAN ID for your workgroup, which must be identical to the
VLAN ID programmed into the switches.
In an environment with many VLAN workgroups, you can optionally implement the
Generic Attribute Registration Protocol (GARP) VLAN Registration Protocol (GVRP) to
dynamically create, change, and remove VLANs. To support GVRP, all adapters and
switches on your network must have GVRP enabled. For more information about
GVRP, follow the links in the online help available on the VLANs tab, see the
documentation for your switch, or contact your network administrator.
NOTICE
Create teams only in the parent partition. Do not create
teams in the child partitions (VMs).
If you need to assign virtual LAN (VLAN) IDs to your network connections, configure
the VLAN settings as follows:
If you need to enable VLAN identification for a team that is assigned to a specific
VM, assign the VLAN ID in the VM’s settings in Hyper-V Manager. To support
multiple VLANs in a VM, provide the VM with multiple Hyper-V switch ports and
associate each port with a VLAN.
A-8 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Teaming Ethernet Network Adapters in a Hyper-V Environment
If you need to enable VLAN identification for a team that is shared between the
parent partition and the VMs, use Virtual Network Manager to assign the VLAN ID
to the associated virtual network switch.
If you need to enable VLAN identification for an Ethernet team that is reserved
exclusively for the parent partition, configure the VLAN setting in the NIC Teaming
properties for the Ethernet team in the parent partition.
When you team network adapters in a Hyper-V environment, observe the following
restrictions:
Intel Virtualization Technology for Directed I/O (VT-d) interfaces, Single Root I/O
Virtualization (SR-IOV) interfaces, and Microsoft Virtual Machine Queue (VMQ)
technology are not supported because they conflict with fault-tolerant operation.
Switch Embedded Teaming (SET) is not supported because SR-IOV is not
supported.
You cannot create a virtual network and associate it with an Ethernet team that
contains disabled members. If you do so, the following error is displayed: Error
Applying New Virtual Network Changes. Setup switch failed. The
operation failed because the file was not found. To correct the
problem, enable all members of the team in the Network and Sharing Center
control panel.
You cannot change the type of a team if that team is tied to a virtual network. If you
try to do so, the team properties indicate that the type has been successfully
changed when it has not. To change the team type, remove the virtual network in
Hyper-V Manager, change the team type in the NIC Teaming utility, and then create
a new virtual network with the team.
You cannot use the LINK state of an adapter in a VM to determine if the VM has
network connectivity.
A virtual network indicates that a network connection exists even if all members of
an Ethernet team assigned to that network lose their network link. Attempts to ping
the IP address of the team in the VM or in the parent partition from a remote system
will time out. However, if the virtual network is accessible from the parent partition,
you can still successfully ping the VM from the parent partition, and ping the parent
partition from the VM.
To determine the correct LINK state for an Ethernet team, view the status of the
team in the parent partition. For example, view the status of the associated
connection in the Network and Sharing Center control panel, or the status of the
Ethernet team in the NIC Teaming utility or ftSMC.
For more information about using NIC Teaming in a Hyper-V environment, see the
Microsoft Windows Server 2016 NIC and Switch Embedded Teaming User Guide. For
information about configuring Hyper-V on ftServer systems, see Stratus Automated
To configure Ethernet PCI adapters into fault-tolerant teams, perform the procedures
described in the following sections, in the order listed.
A-10 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Configuring Ethernet Teams
When you create basic teams with the LBFO teaming script, the script automatically
ensures that the teams are fault tolerant by including matching Ethernet PCI adapters
from each I/O element. No additional planning is needed unless you require a custom
configuration. For example, if you want to manually create teams using adapters that
you select, or use the NIC Teaming utility to create teams with specific teaming modes
or load-balancing settings, see “Locating Ethernet PCI Adapters” on page A-24 for
information about locating and selecting the adapters to ensure fault-tolerant operation.
NOTE
For diagrams showing the location of the embedded
Ethernet PCI adapters and PCI slots in your system, see
the Stratus ftServer Systems: PCI Adapter Guide (R461).
> .\ftSysLbfo.ps1
or:
The following example shows a system where all of the Ethernet PCI adapters are
enabled and unteamed. The En (for Enabled) column for each adapter displays Up (for
enabled and link Up), the T (for Teamed status) column for each adapter displays N (for
No), and the team summary displays No LBFO Teams found, which means that all
of the adapters are available for teaming. (These items are highlighted for emphasis in
the example.)
> .\ftSysLbfo.ps1
If any Ethernet PCI adapters are already teamed, you can still create teams with the
remaining available adapters, or you can delete the existing teams as described in
“Deleting a Single Ethernet Team” on page A-36 or “Deleting All Ethernet Teams” on
page A-37.
A-12 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Configuring Ethernet Teams
Typically, Windows-based systems assign each Ethernet PCI adapter in the system a
unique Connection Name in the format Ethernet nn (for example, Ethernet 31 or
Ethernet 28, as shown in “Displaying the Current Configuration of Ethernet PCI
Adapters” on page A-11). The -rename option automatically replaces the default
names with new names in the format IonnSlotnnpn, where Ionn is the I/O element
location, Slotnn is the PCI slot location, and pn is the logical port location (for
example, Io11Slot03p1 is I/O element 11, PCI slot 3, port 1).
These location-based Connection Names are persistent and can be used to confirm
the location of adapters in the LBFO teaming script, the NIC teaming utility, the Network
and Sharing Center, and other networking utilities. They are also helpful to use when
adding, removing, or replacing adapters in your system, as described in “Managing
Ethernet Teams” on page A-24. (Most examples in this appendix show the
location-based Connection Names for clarity.)
NOTE
Before changing the Connection Names, check with your
network administrator to ensure that they do not use a
different, custom naming scheme.
The following example shows a system where all of the embedded Ethernet PCI
adapters have been renamed (the new names in this example are highlighted for
emphasis):
> .\ftSysLbfo.ps1
For information about other methods for locating Ethernet PCI adapters in the system,
see “Locating Ethernet PCI Adapters” on page A-24.
The term duplex in duplex mode is unrelated to the term duplexed used when referring
to PCI adapters that are duplexed for fault-tolerance. Duplex mode refers to the ability
of a communications line (which is attached to an Ethernet PCI adapter) to send and
receive data simultaneously. In full-duplex mode, the line can send and receive data
simultaneously. In half-duplex mode, the line can only send or receive data at any given
A-14 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Configuring Ethernet Teams
moment. Duplex mode does not refer to whether or not a PCI adapter is one of a pair,
or is teamed, for fault-tolerance.
NOTICE
When you manually set these values, make sure that the
values are identical for the Ethernet port and the link
partner to which it connects, and that the settings for all
Ethernet ports in one team are the same.
As described in “Types of Fault-Tolerant Teams” on page A-6, the LBFO teaming script
automatically creates teams in a switch-independent configuration with active/active
teaming and dynamic distribution load balancing.
The LBFO teaming script will team the adapters even if they do not currently have a
network connection.
If you prefer to select which adapters to team, you can also run the LBFO teaming script
with the -create io:slot:port,io:slot:port... option to specify each
adapter. This option creates a custom team containing two to eight Ethernet PCI
adapters.
When you create Ethernet teams with the LBFO teaming script:
All specified Ethernet PCI adapters must be present, enabled, and unteamed.
The LBFO teaming script does not check for Ethernet PCI adapter types or link
speeds, so you must ensure that the Ethernet PCI adapters are of the same type
and link speed.
Creating Ethernet teams may take a minute or two to complete.
If you are creating custom teams, you must specify at least two ports, and one port
must be in each I/O enclosure. Also, you can create only one custom team each
time the script runs.
NOTICES
1. Creating an Ethernet team disrupts the network traffic
over all Ethernet PCI adapters in the team.
2. Do not use Remote Desktop when creating or
managing Ethernet teams, because you may lose
your connection to the system when you modify the
network settings.
1. Open the Windows PowerShell utility and prepare to run the ftSysLbfo.ps1 script.
2. Execute ftSysLbfo.ps1 -create to automatically team the Ethernet PCI
adapters in the ftServer system:
> .\ftSysLbfo.ps1 -create
a. Verify the proper configuration of the teams. See “Verifying the Configuration
of Ethernet Teams” on page A-18.
b. If needed, configure VLANs. See “Configuring Virtual LANs Over a Team” on
page A-20.
c. If needed, configure the TCP/IP settings for the new team. See “Assigning an
IP Address to a Team or VLAN” on page A-21.
A-16 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Configuring Ethernet Teams
The following example shows output from creating basic Ethernet teams, where all
Ethernet PCI adapters are teamed and correctly configured:
The following example creates a custom team containing two Ethernet PCI adapters
from matching slots in different I/O elements:
The following example creates a custom team containing eight Ethernet PCI adapters,
each from matching slots in different I/O elements:
The next example shows output where the LBFO script aborts creating a custom team
containing two Ethernet PCI adapters because the resulting team would contain
adapters from the same I/O element, which would not be fault tolerant.
To create the team from the preceding example, select matching Ethernet PCI
adapters from different I/O elements (for example, 10:6:0, 11:6:0) and run the script
again.
NOTE
Both CPU- I / O enclosures must be online to use the
-verify option.
A-18 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Configuring Ethernet Teams
The following example shows output for a system where all Ethernet PCI adapters are
teamed and correctly configured:
The following example shows a system where some Ethernet adapers are unteamed:
To correct the issue from the preceding example, ensure that you have created at least
one Ethernet team with matching adapters from each I/O element for fault-tolerant
operation. Teaming all Ethernet PCI adapters is not required, but it is best to do so for
maximum redundancy. The -verify option will continue to warn if any Ethernet PCI
adapters are unteamed.
The next example shows a system where a persistent MAC address, the Locally
Administered Address (LAA), has not been set for one of the teams (the registry value
is blank):
To correct the issue from the preceding example, delete the team as described in
“Deleting a Single Ethernet Team” on page A-36 and create it again with the LBFO
teaming script to automatically set the LAA.
It is best to create teams with the LBFO teaming script to ensure that the LAA is
properly set; however, if needed, you can manually set the LAA as described in “Setting
the Locally Administered Address for a Team or VLAN” on page A-30.
If VLANs are needed, use the Microsoft NIC Teaming utility to create them.
NOTICES
1. Configuring a VLAN over an Ethernet team disrupts
the network traffic over the team.
2. If you create multiple VLAN interfaces for a team, you
must set a persistent MAC address for each additional
VLAN interface, as described in “Setting the Locally
Administered Address for a Team or VLAN” on
page A-30.
A-20 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Configuring Ethernet Teams
3. Under VLAN membership, click Specific VLAN and specify a VLAN ID.
As you enter the VLAN ID, the NIC Teaming utility modifies the name of the team
to include the VLAN ID. For example, if you enter 120 in the Specific VLAN field,
the utility automatically appends VLAN 120 to the team name.
(If you initially create a team with the LBFO teaming script, it automatically sets the
MAC address for the default teaming interface, but not additional VLAN interfaces.)
If you need to create additional VLAN interfaces, use the following procedure.
NOTE
Your TCP/IP property settings must match the
configuration of the network to which the team or VLAN is
connected. Your system can obtain the IP address
automatically only if the network supports it. If you choose
the option to obtain the IP address automatically and the
network does not support it, the team may be unusable.
NOTES
1. Only the embedded 1GbE adapters support WoL.
The embedded 10GbE adapters and non-embedded
adapters do not.
2. If you want to use WoL, do not connect the embedded
1GbE port to a 10GbE switch. The embedded port
does not respond to a WoL packet if the port is
connected to a 10GbE switch.
3. Do not enable WoL on embedded 1GbE ports in an
SLA team, as described in “Enabling WoL on
Embedded Adapters Can Cause SLA Network
Outages” on page A-41.
A-22 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Enabling Ethernet Wake On LAN Power Management
For more information about settings in the BIOS setup utility, see the Stratus ftServer
Systems: Technical Reference Guide (R550).
After enabling WoL in the BIOS, enable it in Windows Server for each adapter that
should respond to a wake packet. If the Ethernet PCI adapters are part of a team,
enable WoL for each adapter in the team.
NOTES
1. Whenever you modify an existing team, you might
temporarily interrupt the flow of network traffic to the
team.
2. When you add or remove Ethernet PCI adapters in a
team that uses LBFO Static Teaming mode, the link to
the adapters you are adding or removing must be
down; otherwise, the switch forwards traffic to the
adapters before the configuration is complete.
3. When you remove or replace a teamed adapter, you
might need to reset the MAC address of the team as
described in “Setting the Locally Administered
Address for a Team or VLAN” on page A-30. In
addition, contact your network administrator to update
any network filtering or security settings that are
based on the MAC address of the Ethernet PCI
adapter you removed.
A-24 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing Ethernet Teams
If you use the LBFO teaming script to create teams, it automatically teams adapters in
matching slots of each I/O element. However, you might need to know the physical
location of the adapters in the system if you want to:
Use the LBFO teaming script to manually specify the adapters to create a team.
Use the NIC Teaming utility to create a more advanced teaming configuration.
Add, remove, or replace Ethernet PCI adapters in existing teams.
NOTE
For diagrams showing the location of the embedded
Ethernet PCI adapters and PCI slots in your system, see
the Stratus ftServer Systems: PCI Adapter Guide (R461).
There are a few methods for confirming the location Ethernet PCI adapters in the
system:
Use the -rename option of the LBFO teaming script to rename the Ethernet PCI
adapters according to their I/O element, PCI slot, and logical port location, which
makes the adapters easier to manage. See “Renaming Ethernet PCI Adapters
According to Location” on page A-13, earlier in this appendix.
Use the -show option of the LBFO teaming script to display a list of Ethernet PCI
adapters and their location in the system. See “Locating Ethernet PCI Adapters
with the LBFO Teaming Script” on page A-25.
If needed, physically confirm the location of an Ethernet PCI adapter by
disconnecting a network cable from a physical adapter port and checking the status
of the matching logical port with the LBFO teaming script or ftSMC. See “Physically
Locating Ethernet PCI Adapters in the ftServer System” on page A-28.
> .\ftSysLbfo.ps1
or:
The following output shows the location of the Ethernet PCI adapters in the first three
columns. Using this output as an example, if you wanted to create a 2-adapter team
that contained a matched pair of embedded 1GbE ports from different I/O elements,
you could use Ethernet 31 (10/6/0) and Ethernet 28 (11/6/0) (highlighted for emphasis
in the output):
> .\ftSysLbfo.ps1
A-26 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing Ethernet Teams
To create the embedded 1GbE team from the preceding example, you would execute
the following command:
The following output shows an ftServer system that includes both embedded and
non-embedded Ethernet PCI adapters. Using this output as an example, if you wanted
to create an Ethernet team of all 10GbE ports in paired embedded and non-embedded
Ethernet PCI adapters from different I/O elements, you could use Ethernet 16 (10/3/0),
Ethernet 17 (10/3/1), Ethernet 18 (10/12/0), Ethernet 19 (10/12/1),
Ethernet 6 (11/3/0), Ethernet 8 (11/3/1), Ethernet 7 (11/12/0), and
Ethernet 10 (11/12/1) (highlighted for emphasis in the output):
> .\ftSysLbfo.ps1
To create the embedded and non-embedded 10GbE team from the preceding
example, you would execute the following command:
.\ftSysLbfo.ps1 -create
10:3:0,10:3:1,10:12:0,10:12:1,11:3:0,11:3:1,11:12:0,11:12:1
NOTICE
Do not use Remote Desktop for the following procedures,
because you may lose your connection to the system
when you disconnect the network cables.
A-28 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing Ethernet Teams
In the preceding output, the status of the adapter in slot 10/6/0 (10 6 0) is currently
Up, indicating that the adapter has an active link.
3. Disconnect the network cable from the Ethernet PCI adapter port that you verified
as Up in step 2.
4. Execute the LBFO script again to check the status of the adapter in slot 10/6/0:
> .\ftSysLbfo.ps1
.
.
.
------------- Network Adapter Summary Information
---------------------------------------------------
IO Sl F ConnectionName Device Manager Name En T
MAC(only if up)
10 6 0 Ethernet 31 Stratus emb-I350 2-Port Gigabit Adapter #12 Ld N
74-3A-65-84-16-14
In the preceding output, the status of the adapter in slot 10/6/0 (10 6 0) is Ld,
indicating that the adapter link is down. This confirms that the adapter you
disconnected is the correct adapter.
If you initially create teams with the LBFO teaming script, it automatically sets the LAA
for each team; however, you may need to manually set the LAA if you:
A-30 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing Ethernet Teams
Initially create teams with the NIC teaming utility instead of the LBFO teaming
script.
Typically, a team’s LAA is the same as the MAC address of the primary adapter in the
team. If you replace the primary adapter, the team continues to use the LAA that
matches the MAC address of the original primary adapter unless you manually update
the LAA to match the MAC address of the new primary adapter. If you have configured
multiple VLAN interfaces for the team, you must also update the LAA for each VLAN
interface to the new primary MAC address.
NOTICES
1. Set the LAA only on team or VLAN interfaces. Do not
set the LAA for the individual Ethernet PCI adapters
in the team.
2. You cannot manually set the LAA for a team that
includes embedded 10GbE adapters. If you remove
an Ethernet PCI adapter from a team that contains
embedded 10GbE adapters, delete the team and
re-create it using the LBFO teaming script. Use the
LBFO script -verify option to ensure that the LAA
for the team is properly set (no warning displayed).
NOTE
Although the NIC teaming utility displays the Permanent
address for the team, the address is not persistent until
you complete this procedure, which preserves the team’s
MAC address in the system registry.
10. Click OK to set the address and close the dialog box.
11. If the team has multiple interfaces (including VLANs), repeat steps 3-10.
After setting the LAA, confirm the setting by using one of the following procedures.
To confirm the LAA is set for a team with the LBFO teaming script
In PowerShell, execute ftSysLbfo.ps1 -verify, as described in “Verifying the
Configuration of Ethernet Teams” on page A-18. If the LAA is set, the output has no
warnings, as follows:
If the LAA is not set, the script displays a warning similar to the following, in which the
registry value is missing:
Verifying all NICs are teamed and teams have registry MAC values.
WARNING: Team MAC Address doesn't match registry value for team
name Team-emb-I350-Slot-6!! Registry value may have error.
WARNING: Team MAC value is 743A65841617 but registry value is
To confirm the LAA is set for a team or VLANs with Windows PowerShell
You can verify that the LAA has been set by executing a command similar to the
following in Windows PowerShell, where team-name is the name of the default team
interface or VLAN interface:
A-32 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing Ethernet Teams
Inserting the wildcard (*) with the team-name* displays information for all interfaces
associated with the team (including the VLANs), as follows:
The preceding example shows a team with two VLANs, 103 and 102. The LAA,
8CDF9DD3715F, has been set for both VLANs and no further action is needed.
The next example shows the same VLAN configuration, except one of the VLANs is
missing the LAA:
The preceding example shows a team with two VLANs, 103 and 102. The LAA,
8CDF9DD3715F, has been set only for VLAN 102. Follow the instructions in this
section to set the LAA for VLAN 103. If there are multiple VLAN interfaces in the team,
set the same LAA on each interface in the team.
NOTE
Remove Ethernet PCI adapters from only one team at a
time. Complete the procedure before removing an adapter
from another team.
If you are not physically replacing the Ethernet PCI adapters that you removed, update
the MAC address of the team at this time. See “Deleting a Single Ethernet Team” on
page A-36.
A-34 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing Ethernet Teams
NOTICE
You must remove an adapter from the team in the NIC
Teaming utility before physically removing the adapter
from the system. The NIC Teaming utility may not work
properly if a teamed adapter is not present (see “NIC
Teaming Utility Fails When a CPU-I/O Enclosure Is
Offline” on page A-39).
NOTE
If you intend to use the old Ethernet PCI adapters in other
systems, do not install them in other systems until you
reset the MAC address of the team (that is, when you
reach step 4 of “Adding Ethernet PCI Adapters to a
Team”).
5. Add the new Ethernet PCI adapters to the team, as “Adding Ethernet PCI Adapters
to a Team” describes.
NOTES
1. Add Ethernet PCI adapters to only one team at a time.
Complete the procedure before adding an adapter to
another team.
2. One team can have a maximum of eight Ethernet PCI
adapters.
1. In the NIC Teaming utility, under TEAMS, select the team to which you want to add
an adapter.
2. Under ADAPTERS and INTERFACES, click the Network Adapters tab. Locate an
adapter that you want to add to the team. Right-click the adapter and select Add
to team name, where name is the team you selected in step 1.
Repeat this step for any adapters you want to add to the team.
3. Under ADAPTERS and INTERFACES, verify that the adapters you added now
appear under the appropriate team entry.
4. If necessary, reset the MAC address for the team. See “Deleting a Single Ethernet
Team” on page A-36.
5. If necessary, connect the new Ethernet PCI adapters to the network.
6. Verify the configuration of the team. See “Verifying the Configuration of Ethernet
Teams” on page A-18.
NOTICE
When you remove a team, you lose the connection to the
network until you install and configure a new team.
A-36 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Managing Ethernet Teams
Deleting all teams may take a minute or two to complete. The following example shows
a system where all of the Ethernet teams are deleted:
After deleting all teams, execute ftSysLbfo.ps1 to show the current configuration
and verify that all teams were successfully deleted.
Use Windows Remote Desktop if you need to start Device Manager remotely. You can
then create or modify a team if it is not carrying your connection to the system.
A-38 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Troubleshooting Ethernet Teams and Ethernet PCI Adapters
NOTICE
Modifying the team that provides your connection to the
system will result in a loss of network connection.
If you do need to disable a team, disable the team and also each individual Ethernet
PCI adapter that is part of the team. You can disable network interfaces in Device
Manager or in the Network and Sharing Center by clicking Change adapter settings
to access controls in the Network Connections window.
Use the NIC Teaming utility only when both partnered CPU- I / O enclosures are online,
and close the utility if you need to remove a CPU- I / O enclosure for maintenance.
If you have already physically removed an adapter and the NIC Teaming utility is
failing, you can recover access to the utility as follows:
Name : Team-Slot3-Port0
Members : {Ethernet 12, Ethernet 4}
TeamNics : Team-Slot3-Port0
TeamingMode : SwitchIndependent
LoadBalancingAlgorithm : Dynamic
Status : Up
Name : Team-Slot6-Port0
Members : {Ethernet 5, Ethernet 7}
TeamNics : Team-Slot6-Port0
TeamingMode : SwitchIndependent
LoadBalancingAlgorithm : Dynamic
Status : Up
4. Execute a command similar to the following to remove the adapter from the team.
Use the information you collected in steps 2-3 to specify the name of the adapter
(NIC-Name) that you physically removed and the name of the team (Team-Name)
from which to remove the adapter:
PS> Remove-NetLbfoTeamMember -Name NIC-Name -Team Team-Name
For example, to remove the adapter Ethernet 4 listed in step 3, you would enter:
5. After removing the adapter, start the NIC Teaming utility and verify that it is working.
6. If applicable, add a new adapter as soon as possible to ensure the fault-tolerant
operation of your system. See “Adding Ethernet PCI Adapters to a Team” on
page A-36.
To recover the NIC Teaming utility, start it from the command line with the
/ResetConfig option. For example:
A-40 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Troubleshooting Ethernet Teams and Ethernet PCI Adapters
The /ResetConfig option resets the list of servers associated with the NIC Teaming
utility, which prevents the utility from searching for the previous host name. Resetting
the list of servers does not change your team configuration, nor does it disrupt your
network connections.
Long Delay If Disabling U117 PCIe Adapter in IEEE 802.3ad Dynamic Link
Aggregation Teams
If U117 Dual-Port Fiber 10-Gbps Ethernet PCI-Express Adapters are configured in an
IEEE 802.3ad Dynamic Link Aggregation team, you may experience a long delay of up
to two minutes if you disable any of the adapters in the team. If applicable, you can
reduce the delay by setting the Link Aggregation Control Protocol (LACP) timeout to
short on the Ethernet switch.
A-42 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Index Index-
Index-1
Index
Index-2 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Index
Index-3
Index
Index-4 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Index
Index-5
Index
N PCI adapters
Ethernet teams, 1-5
network adapter teams. See Ethernet PCI
ftScalable Storage system, 4-40
adapter teams
managing Ethernet, A-1
network adapters. See Ethernet PCI adapters
Performance Counter nodes, 3-9
Network Port node, 2-5
plexes, 4-4
NIC Teaming. See Ethernet PCI adapter teams
active RDR, 4-17
NIC teaming utility. See LbfoAdmin.exe utility
power button
NMI button, 5-14
enabling, 3-2
nodes
powering down system, 3-2
Board Instance Driver, 5-15
primary enclosures, 2-10, 5-14
CPU enclosure, 2-9
switching, 2-10
Disk, 4-40
procedures
expanding, 2-2
archiving log files, 3-11
ftServer, 5-6
booting from spare disk, 5-17
ftServer Call Home Modem, 3-13
changing IP address for UPS, 3-14
I/O enclosure, 2-9
configuring a UPS, 1-5
Logical Disk Information, 4-5
deleting RDR configuration on a physical
Logical to Physical Mapping, 4-40
disk, 4-18
Network Port, 2-5
deporting physical disk from RDR virtual
Performance Counter, 3-9
disk, 4-21
RDR Virtual Disk, 4-26
determining component failure, 5-6
SCSI Port Duplex Driver -
enabling read-load balancing
Sradisk_Driver, 4-7
globally, 4-16
Slot, 4-40
single disk, 4-16
Storage Enclosure, 4-40
enabling Remote Desktop, 3-6
viewing, 2-2
enabling write caching, 4-15
warning and error icons, 5-6
obtaining list of Windows hotfixes, 5-18
nonmaskable interrupts (NMI)
prioritizing resynchronization
with NMI button, 5-14
for all RDR disks, 4-32
with VTMs, 5-16
for individual RDR disks, 4-32
recovering
O offline non-RDR disk, 5-4
offline component identification, 5-6 RDR disk, 4-27, 5-4
Offline or Missing disk status, 5-4 when Windows hangs, 5-10
optimizing system performance, 4-15 removing physical disk from RDR virtual
disk, 4-20
P replacing a failed disk, 5-4
replacing a failed physical RDR disk, 4-28
paging files, 4-20 resynchronizing RDR data disks, 4-26
parent partition, Hyper-V saving log files, 3-11
verifying LINK state for virtual setting size of application and system event
machines, A-9 logs, 3-10
partition style verifying an RDR disk, 4-28
GPT, 4-2, 4-3, 4-4, 4-7, 4-34 Properties command, 2-8
MBR, 4-4, 4-7, 4-8
Index-6 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Index
Index-7
Index
Index-8 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)
Index
V wizard
Add Physical Disk to RDR Virtual
VDS. See Virtual Disk Service
Disk, 4-11
Verify RDR Virtual Disk command, 4-28
hardware installation, 5-5
verifying LINK for virtual machines, A-9
write caching on RDR virtual disks, 4-15
viewing all nodes, 2-2
Virtual Disk Service (VDS), 3-17
Virtual LANs. See VLANs Y
virtual machine, Hyper-V yellow folder icon, 2-4
verifying LINK state, A-9
virtual machines (VMs)
assigning VLAN IDs, A-8
virtual network, Hyper-V
verifying LINK state, A-9
Virtual Technician Module. See VTMs
VLANs, A-7
assigning in Hyper-V, A-8
explicit, A-8
GARP VLAN Registration Protocol
(GVRP), A-8
implicit, A-8
VLAN ID, A-8, A-21
volume mirroring, 4-38
VTMs
console overview, 1-2
creating system dump file (nonmaskable
interrupts, NMI), 5-16
overview, 1-2
W
Wake On LAN (WoL), A-22
enabling in BIOS, A-23
enabling in Windows Server 2016, A-23
warning icons, 5-6
What’s this? Help
accessing, 1-6
for details pane properties, 2-3
Windows
accessing desktop, 3-6
installing hotfixes, 3-17
installing updates, 3-17
not responding, 5-10
obtaining list of hotfixes, 5-18
Windows Device Manager
disabling devices, 3-17
Index-9
Index
Index-10 Stratus ftServer System Administrator’s Guide for the Windows Operating System (R014W)