Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
MSC Server
ZTE CORPORATION
NO. 55, Hi-tech Road South, ShenZhen, P.R.China
Postcode: 518057
Tel: +86-755-26771900
Fax: +86-755-26770801
URL: http://ensupport.zte.com.cn
E-mail: support@zte.com.cn
LEGAL INFORMATION
Copyright 2012 ZTE CORPORATION.
The contents of this document are protected by copyright laws and international treaties. Any reproduction or
distribution of this document or any portion of this document, in any form by any means, without the prior written
consent of ZTE CORPORATION is prohibited.
Revision History
Revision No.
Revision Date
Revison Reason
R1.0
2012-10-31
First edition
SJ-20120730093520-018|2012-10-31 (R1.0)
Contents
About This Manual ......................................................................................... I
Chapter 1 Routine Maintenance Overview............................................... 1-1
1.1 Classification ..................................................................................................... 1-1
1.2 Purposes ........................................................................................................... 1-2
1.3 Precautions........................................................................................................ 1-2
1.4 Basic Requirements for Maintenance Personnel................................................... 1-4
1.5 Routine Maintenance Tools ................................................................................. 1-4
1.6 Introduction to Routine Maintenance Operations .................................................. 1-5
II
SJ-20120730093520-018|2012-10-31 (R1.0)
Intended Audience
This manual is intended for:
l
l
Summary
Forms
Appendix B, Cautions for Routine
Maintenance
I
SJ-20120730093520-018|2012-10-31 (R1.0)
Related Documentation
The following documentation is related to this manual:
l
l
l
l
l
Conventions
This manual uses the following typographical conventions:
Typeface
Meaning
Italics
Variables in commands. It may also refer to other related manuals and documents.
Bold
Menus, menu options, function names, input fields, option button names, check
boxes, drop-down lists, dialog box names, window names, parameters, and
commands.
Constant
Text that you type, program codes, filenames, directory names, and function names.
width
[]
Optional parameters.
{}
Mandatory parameters.
II
SJ-20120730093520-018|2012-10-31 (R1.0)
Chapter 1
Routine Maintenance
Overview
Table of Contents
Classification ..............................................................................................................1-1
Purposes....................................................................................................................1-2
Precautions ................................................................................................................1-2
Basic Requirements for Maintenance Personnel ........................................................1-4
Routine Maintenance Tools ........................................................................................1-4
Introduction to Routine Maintenance Operations ........................................................1-5
1.1 Classification
Description
To effectively implement the functions of the equipment and prevent potential accidents,
it is necessary to perform routine maintenance on the running equipment. Routine
maintenance is a preventive measure, which aims to discover and remove the defects
and hidden risks of the equipment in time. Regular check and maintenance improve the
safety, stability and reliability of the equipment.
Category
The routine maintenance can be divided into two categories:
l
l
1.2 Purposes
Purpose of Daily Maintenance
Daily maintenance helps you to:
l
Find out the alarms of the equipment or hidden risks in time and adopt appropriate
measures to handle and recover the faults so as to maintain the healthy level of the
equipment and reduce equipment faults.
Find out the abnormalities of link status or connection during service running in time
and adopt appropriate measures to handle and recover system faults so as to ensure
normal service running.
Grasp the running status of the equipment and network in real time, know the running
trend of the equipment and network, and improve the sudden-event handling efficiency
of the maintenance personnel.
Make the equipment always remain in good condition, and ensure the safe, stable
and reliable running of the system.
Find out hidden troubles, including natural aging, function expiry and performance
deterioration during the running of the equipment in time, and handle these troubles
to prevent the accidents.
1.3 Precautions
Read and observe the following precautions when you perform routine maintenance.
l
SJ-20120730093520-018|2012-10-31 (R1.0)
l
l
l
l
l
emergency, and security measures. Back up the data before you do any change to
them. Do not delete the backup data until the equipment runs properly for a time
period (usually a week) after you change the data.
Set different NMS passwords with different access rights, put them under strict
management, and make changes to them periodically. The passwords should be
provided only to the maintenance personnel.
Do not play games or surf the Internet on PC terminals. Do not install, run or copy any
other software not related to the system on the terminal. Do not use the PC terminal
for any other purposes.
Have the regularly used tools and meters ready. The regularly used tools and meters
include screwdrivers (flathead and crosshead), signaling tester, network cable pliers,
multi-meter, AC power, telephone line, and network cables. Test and calibrate the
instruments and meters regularly to ensure their accuracy.
Check spare components regularly to ensure they are sufficient in quantity and are in
good condition, and make sure that the components are not affected by dampness or
mould. Keep them away from those faulty ones removed during maintenance. Return
the faulty boards for repair in time and prepare sufficient spare parts for the major
boards.
Keep the software and documents that might be used during the maintenance in a
handy place.
Keep the equipment room in normal temperature and humidity. Keep the environment
neat and clean. Take dust prevention and damp prevention measures. Keep rats and
insects out.
Ensure the primary power of the system is stable. Check the system grounding and
lightning protection ground periodically. Check the lightning protection system before
and after the storm season to make sure that related facilities are in good condition.
Make sure that the light in the equipment room is bright enough for the maintenance.
Any damaged lamp should be repaired in time.
The maintenance personnel must perform regular checks and tests each day and
make records by referencing related suggestions in the manual.
Handle any fault as soon as it is detected. Record the detailed information of the
problem that cannot be solved, and then contact local ZTE office or customer service
center for help.
If there is an emergency in the equipment room, the maintenance personnel must
be calm and perform troubleshooting according to the Troubleshooting Manual, and
contact local ZTE office or customer service center immediately.
Keep the contact information of ZTE local office in a noticeable place and let all
maintenance personnel notice it, so they can contact ZTE in time when they need
help. Maintain the contact information regularly to keep it up to date.
1-3
SJ-20120730093520-018|2012-10-31 (R1.0)
l
l
l
l
Failure observer
Signal trace
1-4
SJ-20120730093520-018|2012-10-31 (R1.0)
l
l
l
l
Fault management
Dynamic management
Performance management
Log management
You can perform related routine maintenance operations on the Local Maintenance
Terminal (LMT).
To perform related routine maintenance operations on a GPBB0/GPBX1 board, use
either of the following method to log in .
Connect the display, keyboard and mouse respectively to the VGA display, PS/2
keyboard and PS/2 mouse port on the GPI1 board.
Note:
The Windows operating system (Windows XP or Windows 2003) should be
already installed on the debugging computer. The debugging computer can
communicate properly with the blade.
For the operations on a Windows operating system, this manual takes Windows 2003
for example.
1-5
SJ-20120730093520-018|2012-10-31 (R1.0)
1-6
SJ-20120730093520-018|2012-10-31 (R1.0)
Chapter 2
Purpose
Make sure that there is no abnormality in the system.
Procedure
1. Log in to the Local Maintenance Terminal(LMT).
2. In the web page of the LMT, click (Fault Management) button in the lower left corner
to open the Fault Management tab page as shown in Figure 2-1.
Figure 2-1 Fault Management Tab Page
2-1
SJ-20120730093520-018|2012-10-31 (R1.0)
3. When checking the system alarms, you should check the following items in every check
period.
l Current real-time alarms and current alarm recovery information
l Historical alarms occurred in the past 24 hours
l Current real-time notifications
l Historical notifications occurred in the past 24 hours
Criteria
There is no abnormality alarm or notification in the system. Any alarm or notification found
should be analyzed and processed in time.
1. You can double-click the alarm message on the tab page to view the alarm handling
suggestions and eliminate the related faults according to the suggestions.
2. If the problem cannot be solved, contact ZTE technical personnel immediately.
Purpose
Make sure that the equipment work under normal temperature.
Procedure
Check the temperature shown on the air conditioner.
Criteria
In normal cases, the temperature in the equipment room is 15 C~25 C.
If the temperature does not reach the standard, you should adjust it in time so that it meets
the requirements for normal running of the equipment.
Purpose
Make sure that the equipment works under normal humidity.
2-2
SJ-20120730093520-018|2012-10-31 (R1.0)
Procedure
Check the relative humidity shown at the air conditioner.
Criteria
In normal cases, the relative humidity in the equipment room is 30 %~70 %.
If the humidity does not reach the standard, you should adjust it in time so that it meets
the requirements for normal running of the equipment.
Purpose
Make sure each board of the ZXUN iCX(MSCS) runs properly.
Procedure
1. Check the board alarms in the Fault Management tab page of the LMT.
a. Log in to the LMT.
b. In the web page of the LMT, click (Fault Management) button in the lower left
corner to open the Fault Management tab page.
c.
Check the alarms of each board to see whether there is any abnormality on the
board.
Criteria
l
l
There is no alarm for any board fault in the Fault Management tab page of the LMT.
The indicators of each board are proper. The normal status of each commonly used
indicator is shown in the following table.
Indicator
Meaning
Normal Status
Remarks
OOS
Off
OK
Flashing green
H/S
Hotswap indicator
Off
ACT
Active/standby status
Active: On
indicator
Standby: Off
Running/Alarm indicator
Flashing green
HOST
2-3
SJ-20120730093520-018|2012-10-31 (R1.0)
Indicator
Meaning
Normal Status
Remarks
HD1/HD2
The indicators
read/write operations
GPBB0/GPBX1
boards only.
Note:
For more details about indicators, see ZXUN iCX(MSCS) MSC Server Hardware
Description.
If any indicator is found abnormal, locate and handle the fault according to the
corresponding alarm messages reported on the LMT. If you cannot solve the problem,
contact technical support personnel of ZTE.
Purpose
Check whether the performance index values of the MTP3 signaling links have obvious
fluctuations.
Procedure
1. Log in to the LMT.
2. In the web page of the LMT, click (Performance Management) button in the lower
left corner to open the Performance Management tab page.
l
l
Note:
Before performing the query for the first time, you must create the corresponding
measurement task.
The path to locate the required type of measurement (MTP3 Link) on the
navigation tree of performance management is as follows: Signal Measurement
> N7 Measurement > MTP3 Signaling Link Measurement.
2-4
SJ-20120730093520-018|2012-10-31 (R1.0)
3. Query the measurement task of measurement type MTP3 Link. Measure the
performance index values during busy hours based on a collection granularity of 5
minutes. The performance indices to be queried include: Average Traffic Signaling
Link and MTP3 Link Load.
4. Check the performance indices of the MTP3 signaling links during busy hours. Check
whether the load is evenly shared among the signaling links or the load of some signaling links is too high.
Criteria
l
The difference of load between any two links to the same office should not exceed 20
%.
l The load of 2 M signaling link should not exceed 0.2 Erl, and that of 64 k signaling link
should not exceed 0.4 Erl.
In case of abnormalities in performance statistical data, contact the maintenance personnel
of related NEs to handle together. If necessary, contact ZTE.
Purpose
Check whether the performance index values of the SCTP links have obvious fluctuations.
Procedure
1. Log in to the LMT.
2. In the web page of the LMT, click (Performance Management) button in the lower
left corner to open the Performance Management tab page.
l
l
Note:
Before performing the query for the first time, you must create the corresponding
measurement task.
The path to locate the required type of measurement (SCTP Link) on the
navigation tree of performance management is as follows: Signal Measurement
> SIGTRAN Measurement > SCTP Link Measurement.
3. Query the measurement task of measurement type SCTP. Measure the number of
messages sent and received during busy hours based on a collection granularity of 5
minutes.
2-5
SJ-20120730093520-018|2012-10-31 (R1.0)
4. Check the performance indices of the SCTP links during busy hours. Check whether
the load is evenly shared among the signaling links or the load of some signaling links
is too high.
Criteria
l
l
The difference of load between any two associations to the same office should not
exceed 20 %.
The load of each SCTP link should not exceed 0.2 Erl.
2-6
SJ-20120730093520-018|2012-10-31 (R1.0)
Chapter 3
Purpose
Make sure that the time of the OMP module is synchronous with the SNTP server
(generally, the OMM server).
Procedure
1. Log in to the LMT. The default tab page is Terminal.
Note:
To open the Terminal tab page, in the web page of the LMT, click
in the bottom left corner.
(Terminal) button
SJ-20120730093520-018|2012-10-31 (R1.0)
4. Check the time of the OMM server. In the web page of the LMT , the time in the bottom
right corner is the OMM servers time .
5. Check whether the OMP time is the same as the OMM server time.
Criteria
l
The configuration of the SNTP server (the IP address of the SNTP server) is correct.
If the SNTP server configuration is not correct, you can change it with the SET SNTP
command.
The OMP time is the same as the OMM server. If the OMP time is not correct, you
can change it with the UPD TIME command.
Purpose
Make sure that the clock works in the "TRACE" status.
Procedure
1. Log in to the LMT. The default tab page is Terminal.
Note:
To open the Terminal tab page, in the web page of the LMT, click
in the bottom left corner.
(Terminal) button
2. On the command navigation tree in the left pane, select Maintenance Management
> Patrol Maintenance > Equipment Management > Check System Clock Status
to open the CHECK CLOCK STATUS command configuration interface.
button to execute the command.
3. Click
4. Check the status of the HOST and T/C indicators (corresponding to the clock sub-card)
on the panel of the SWI1/SWI2 board.
3-2
SJ-20120730093520-018|2012-10-31 (R1.0)
Criteria
l
l
The execution result of the CHECK CLOCK STATUS command shows that the clock
works in "TRACE" status.
The HOST indicator on the panel of the SWI1/SWI2 board flashes green and the clock
status indicator T/C is steady on.
If the clock status if abnormal, handle according to alarms. If you cannot remove this
problem, contact ZTE for help.
Purpose
Make sure the OMP module has sufficient storage space.
Procedure
1. Log in to the GPBB0 board where the OMP module is located using a remote control
tool such as SCRT.
2. Run the df -k command in the Terminal tab page to check the hard disk space.
Criteria
The free space of each file system on the hard disk of the GPBB0 board where the OMP
module is located should be above 10 % of the total space. Otherwise, you should remove
the files of the previous version or expired log files from the disk.
Purpose
Make sure that the OMM server has sufficient CPU and memory resources.
Procedure
1. Log in to the GPBX1 board of the OMM server using a remote control tool such as
vncviewer.
3-3
SJ-20120730093520-018|2012-10-31 (R1.0)
2. Run the vmstat command in the Terminal window to check the CPU/memory usage.
Note:
You can also check the memory usage of the OMM server with the CHECK MEMORY
STATUS command in the Terminal tab page of the LMT.
Criteria
l
l
Normally, the CPU usage should not exceed 50 %. If the CPU usage exceeds 50 %,
check for abnormal processes.
Normally, the available memory should be more than 90 % of the total. If the memory
usage has exceeded 90 % for a long time, you should expand the physical memory.
Purpose
Make sure that the OMM server has sufficient hard disk resource.
Procedure
1. Log in to the LMT. The default tab page is Terminal.
Note:
To open the Terminal tab page, in the web page of the LMT, click
in the bottom left corner.
(Terminal) button
2. On the command navigation tree in the left pane, select Maintenance Management >
Patrol Maintenance > OMM Server > Check OMM HDC Status to open the CHECK
HDC STATUS command configuration interface.
3. Click
button to execute the command.
Criteria
l
The file systems where the operating system and the database are located should
respectively have at least a free space of 800 MB, and the free space of each should
3-4
SJ-20120730093520-018|2012-10-31 (R1.0)
be above 10 % of the total size of the file system. If the free space is not enough,
delete some useless files.
The free space in each of the other file systems should also be above 10 % of the
total size of the file system. If the free space is not enough, delete some useless files.
Purpose
Make sure that NCMM runs properly.
Procedure
Check the related indicators of NCMM.
Criteria
l
l
l
l
If any indicator is found in improper status, locate the fault according to the corresponding
alarm given on the LMT and perform troubleshooting. If the problem still exists, contact
ZTE for help.
Purpose
Make sure that the switching board runs properly.
Procedure
Check the indicators on the front panel of the switching board.
3-5
SJ-20120730093520-018|2012-10-31 (R1.0)
Criteria
l
l
The service status indicator marked with OOS and the hot-swap indicator marked with
H/S are off, and the health status indicator marked with OK flashes green.
The indicators of the two blades (active and standby) should be in the same status.
If any indicator is found in improper status, locate the fault according to the corresponding
alarm given on the LMT and perform troubleshooting. If the problem still exists, contact
ZTE for help.
Purpose
Make sure that the power module of the shelf runs properly.
Procedure
Check the -48/-60 VA, -48/-60 VB, H/S and OK indicators on the shelfs power module.
Criteria
l
l
l
If any indicator is found in improper status, locate the fault according to the corresponding
alarm given on the LMT and perform troubleshooting. If the problem still exists, contact
ZTE for help.
Purpose
Make sure that the fan module of the shelf runs properly.
Procedure
Check the RUN, ALM and H/S indicators on the shelfs fan module.
3-6
SJ-20120730093520-018|2012-10-31 (R1.0)
Criteria
l
l
l
If any indicator is found in improper status, locate the fault according to the corresponding
alarm given on the LMT and perform troubleshooting. If the problem still exists, contact
ZTE for help.
Purpose
Make sure that the GPBB0/GPBX1 boards are running properly.
Procedure
Check the indicators status on the front panel of each GPBB0/GPBX1 board.
Criteria
l
l
The service status indicator marked with OOS and the hot-swap indicator marked with
H/S are steadily off, and the health status indicator marked with OK flashes green.
The active/standby status indicator marked with ACT is normal (steady on on the
active board and off on the standby board), and the running status/alarm indicator
marked with HOST flashes green.
During the hard disk read/write operations, the hard disk status indicator marked with
HD1/HD2 flashes green.
If any indicator is found in improper status, locate the fault according to the corresponding
alarm given on the LMT and perform troubleshooting. If the problem still exists, contact
ZTE for help.
Purpose
Make sure that the KVM interface of each GPI1 board is running properly.
3-7
SJ-20120730093520-018|2012-10-31 (R1.0)
Procedure
1. Check the VGA port, PS/2 keyboard port and the PS/2 mouse port on the GPI1 board
to see whether the connection of each is proper. Check whether you can use the KVM
to operate the GPBX1 board normally.
2. Check whether the holding screws for the cable joints on the GPI1 board are loose.
Criteria
The VGA port, PS/2 keyboard port and the PS/2 mouse port on the GPI1 board all function
normally.
If you find anything abnormal and cannot solve the problem, contact of ZTE for help.
Purpose
Prevent the Windows operating systems from being infected by viruses.
Procedure
Check whether the anti-virus software is installed on each host where the Windows
operating system is installed and whether the corresponding virus database is updated
respectively.
Criteria
The anti-virus software should be already installed on each host where the Windows
operating system is installed.
Purpose
Prevent virus infection in the Windows operating system.
3-8
SJ-20120730093520-018|2012-10-31 (R1.0)
Procedure
Check the virus database update dates and update the virus databases on each host
where the Windows operating system is installed.
Criteria
l
l
Purpose
Make sure that the scheduled tasks of backup data run properly.
Procedure
1. Log in to the LMT. The default tab page is Terminal.
Note:
To open the Terminal tab page, in the web page of the LMT, click
in the bottom left corner.
(Terminal) button
2. On the command navigation tree in the left pane, select Configuration Data
Maintenance > Backup And Restore > Automatic Backup Management > Show
Automatic Backup Strategy to open the SHOW AUTO STRATEGY command
configuration interface.
Note:
When you are selecting a command under Configuration Data Maintenance
> Backup And Restore > Automatic Backup Management on the command
button (which displays commonly-used commands
navigation tree, do not click the
only) above the navigation tree.
3. Click
SJ-20120730093520-018|2012-10-31 (R1.0)
Note:
To add an automatic backup strategy, execute the ADD AUTO STRATEGY command.
Criteria
The scheduled task of backup the data is correctly configured and activated. The system
produces the related backup data in the corresponding path for saving the backup files on
schedule.
Note:
The scheduled task of backup the data is often set to back up every day. The data to
be backed up includes alarm data, performance data, security data , log data, basic
configuration data and IP stack configuration data .( Execute the SHOW SYS SET
command to query the relation between the ID and Name of System Object Set.)
Purpose
Make sure that the module resource load is proper without obvious fluctuation.
Procedure
1. Log in to the LMT.
2. In the web page of the LMT, click (Performance Management) button in the lower
left corner to open the Performance Management tab page.
3-10
SJ-20120730093520-018|2012-10-31 (R1.0)
l
l
Note:
Before performing the query for the first time, you must create the corresponding
measurement task.
The path to locate the required type of measurement (Resource Load
Measurement) on the navigation tree of performance management is as follows:
Global Measurement > Resource Load Measurement.
Criteria
The performance indices of the module resource load are within the normal range. The
mean ratio of the CPU usage should not exceed 90 %, and the mean ratio of the memory
usage should not exceed 95 %. During the same busy hours, the indices are stable, and
comparing with the historical data, the fluctuation should not exceed 10 %.
l
If the average load of all services or signaling modules is very high for a long time,
consider capacity expansion according to the current designed capacities and actual
network capacities.
If the average load of only a few main processors is very high, check whether the
service load-sharing an the signaling load-sharing are proper, and whether some
CPUs bear heavy load. Plan the improper load-sharing again.
If the load-sharing is properly and the actual capacity is far less than the designed
capacity, but the average load of some main processors is still high, pay attention to it
and contact the local ZTE office to collect relevant log files so as to analyze whether
any hidden safety problem exists.
Purpose
Make sure that the MscServer load is proper without obvious fluctuation.
Procedure
1. Log in to the LMT.
2. In the web page of the LMT, click (Performance Management) button in the lower
left corner to open the Performance Management tab page.
l
l
Note:
Before performing the query for the first time, you must create the corresponding
measurement task.
The path to locate the required type of measurement (MscServer Load
Measurement) on the navigation tree of performance management is as follows:
Global Measurement > MscServer Load Measurement.
Criteria
The performance indices of the MscServer load are within the normal range. The mean
ratio of the CPU usage should not exceed 90 %, and the mean ratio of the memory
usage should not exceed 95 %. During the same busy hours, the indices are stable, and
comparing with the historical data, the fluctuation should not exceed 10 %.
l
If the average load of all services or signaling modules is very high for a long time,
consider capacity expansion according to the current designed capacities and actual
network capacities.
If the average load of only a few main processors is very high, check whether the
service load-sharing an the signaling load-sharing are proper, and whether some
CPUs bear heavy load. Plan the improper load-sharing again.
3-12
SJ-20120730093520-018|2012-10-31 (R1.0)
If the load-sharing is properly and the actual capacity is far less than the designed
capacity, but the average load of some main processors is still high, pay attention to it
and contact the local ZTE office to collect relevant log files so as to analyze whether
any hidden safety problem exists.
3-13
SJ-20120730093520-018|2012-10-31 (R1.0)
3-14
SJ-20120730093520-018|2012-10-31 (R1.0)
Chapter 4
Monthly Routine
Maintenance
Table of Contents
Checking Versions......................................................................................................4-1
Checking System Alarms and Notifications.................................................................4-2
Checking Heat Radiation of Racks .............................................................................4-3
Checking for Viruses ..................................................................................................4-3
Purpose
Make sure that the version running on each CPU of each board is consistent with the OMM
version.
Procedure
1. Log in to the LMT. The default tab page is Terminal.
Note:
To open the Terminal tab page, in the web page of the LMT, click
in the bottom left corner.
(Terminal) button
4-1
SJ-20120730093520-018|2012-10-31 (R1.0)
c.
Click the Operation Record tab. If the Result is Success, click the SHOW
RUNVER command to view the detailed execution results in the right of the tab
page.
Criteria
The CPU version of each board is consistent with the version of the server. If you find any
inconsistency, confirm the cause and contact ZTE for help if necessary.
Purpose
Make sure that there is no abnormality in the system.
Procedure
1. Log in to the LMT.
2. In the web page of the LMT, click (Fault Management) button in the lower left corner
to open the Fault Management tab page.
3. When checking the system alarms and notifications, you should check the following
items in every check period.
l Current real-time alarms, current alarm recovery information and real-time
notifications
l Historical alarms occurred in a month
l Notifications occurred in a week
Criteria
There is no abnormality alarm or notification in the system. Any alarm or notification found
should be analyzed and processed in time.
1. You can double-click the alarm message on the tab page to view the alarm handling
suggestions and eliminate the related faults according to the suggestions.
2. If the problem cannot be solved, contact ZTE technical personnel immediately.
4-2
SJ-20120730093520-018|2012-10-31 (R1.0)
Purpose
Make sure that the air intakes and outtakes of the racks are not blocked and the fans are
working properly.
Procedure
1. Check when the last cleaning was made and whether the dust filter needs to be
cleaned. Clean dust filters regularly.
2. Check whether air intakes and outtakes of the racks are blocked.
3. Check whether the fans are working normally, whether their rotation speeds are
consistent, and whether there is any abnormal sound.
4. Check whether the blank panels at the front and back slots are fixed well.
Criteria
l
l
l
l
The dust filters are clean. If they are not clean, clean them.
The air intakes and outtakes are not blocked. If they are blocked, clear the substances
blocking them. If necessary, contact ZTE for help.
The fans are working properly. If they are not, handle the problem. If necessary,
contact ZTE for help.
The blank panels are fixed well. If they are not, fix them.
Purpose
Prevent the Windows operating system from being infected by viruses.
Procedure
Use the anti-virus software to check for any virus on all the hard disks of each host where
the Windows operating system is installed by following the anti-virus software instructions.
Criteria
Normally, no virus is found.
4-3
SJ-20120730093520-018|2012-10-31 (R1.0)
If any virus is found during the check, kill the virus by following the virus handling
suggestions given by the anti-virus software, and restart the host as long as the normal
running of services is not affected.
4-4
SJ-20120730093520-018|2012-10-31 (R1.0)
Chapter 5
Quarterly Routine
Maintenance
Table of Contents
Cleaning the Dust Screens .........................................................................................5-1
CGSL Operating System Check .................................................................................5-2
Checking Remote Control Tools .................................................................................5-5
Purpose
Clean the dust screens periodically to prevent dust from entering the cabinet, which may
affect the heat dissipation of the cabinet.
Procedure
1. Unlock and open the cabinet door.
2. Loosen the fastening bolts of the dust-proof subrack on the cabinet.
3. Evenly pull out the dust-proof subrack with two hands, pay attention that do not scatter
the dust on the dust-proof subrack to the cabinet.
4. Dismantle the dust-proof subrack, and take out the dust screens.
5. Clean the dust screens with warm water (lower than 40 ) and dry them.
Caution!
Do not install the wet dust screens.
Criteria
The cleaned dust screens should be clean and dust-free.
5-1
SJ-20120730093520-018|2012-10-31 (R1.0)
Purpose
Make sure that the hard disk of the GPBX1 board has enough space.
Procedure
1. Log in to the GPBX1 board of the OMM server using a remote control tool such as
Xmanager.
2. Run the df -k command in the Terminal window to check the hard disk space. The
following is an instance of the command execution result.
# df -k
Filesystem
K-blocks
/dev/sda2
9920624
5939384
3469172
/dev/sda6
18782896
3834264
13979112
22% /swap
64% /
/dev/sda5
29753556
176204
28041540
1% /home
/dev/sda1
497829
16333
455794
4% /boot
tmpfs
517568
517568
0% /dev
Criteria
The hard disk usage should not be higher than 90 %. Otherwise, delete some useless
files.
Purpose
Make sure that the swap space of the GPBX1 board is not used abnormally.
Procedure
1. Log in to the GPBX1 board of the OMM server using a remote control tool such as
vncviewer.
5-2
SJ-20120730093520-018|2012-10-31 (R1.0)
2. Run the swapon -s command in the Terminal window to check the swap space. The
following is an instance of the command execution result.
# swapon -s
Filename
Type
Size
Used
/dev/sda3
partition
16763816
63481
Priority
-1
Note:
In this example, the swap space is 16 GB.
Criteria
The swap space usage of the GPBX1 board should not exceed 5 %. Otherwise, pay
attention to the running of the GPBX1 board. If there is any abnormality, contact ZTE to
ask related technical personnel to troubleshoot the problems.
Purpose
Make sure that the CPU/memory usage of the GPBX1 board is proper.
Procedure
1. Log in to the GPBX1 board of the OMM server using a remote control tool such as
vncviewer.
2. Run the vmstat command in the Terminal window to check the CPU/memory usage.
The following is an instance of the command execution result.
# vmstat
procs -----------memory---------- --swap-- ----io--- --system-- -----cpu---r
0
0
swpd
free
buff
cache
si
so
bi
bo
in
cs us sy
id wa st
0 1052 1182
0 100
0 1051 1168
0 100
Criteria
The CPU usage of the GPBX1 board should not exceed 50 % for a long time and the
memory usage of the GPBX1 board should not exceed 90 %. Otherwise, pay attention to
the running of the GPBX1 board. If there is any abnormality, contact ZTE to ask related
technical personnel to troubleshoot the problems.
5-3
SJ-20120730093520-018|2012-10-31 (R1.0)
Purpose
Make sure the GPBX1 board runs properly.
Procedure
1. Log in to the GPBX1 board of the OMM server using a remote control tool such as
vncviewer.
2. Check log file /var/log/messages on the GPBX1 board.
Criteria
There is no word like panic, fault or error in the log file. Otherwise, contact ZTE to ask
related technical personnel to troubleshoot the problems.
Purpose
Make sure the disk I/O of the GPBX1 board is normal.
Procedure
1. Log in to the GPBX1 board of the OMM server using a remote control tool such as
vncviewer.
2. Run the iostat 1 10 command in the Terminal window to check the disk I/O. The
following is an instance of the command execution result.
5-4
SJ-20120730093520-018|2012-10-31 (R1.0)
Criteria
The iowait should not exceed 30 % continuously for a long time. Otherwise, contact ZTE
to ask related technical personnel to troubleshoot the problems.
Note:
The iowait refers to the percentage of the time for CPU to wait for I/O in a statistical period
of time.
Purpose
Make sure that the remote control tools works properly.
Procedure
1. On the OMM server, check whether the remote desktop is enabled.
a. Choose System > Preferences > Remote Desktop to open the Remote Desktop
Preference dialog box, as shown in Figure 5-2.
5-5
SJ-20120730093520-018|2012-10-31 (R1.0)
b. In the dialog box as shown in Figure 5-2, check whether select Allow other
users to view your desktop and Allow other users to control your desktop,
and check whether select Require the user to enter this password and set a
password that is the login password for vncviewer.
c.
2. On the LMT terminal where vncviewer is installed, check whether vnviewer can
log in to the OMM server properly.
a. Double-click the vncviewer software to open the dialog box as shown in Figure
5-3.
Figure 5-3 Login Dialog Box
b. Fill in the IP address of the OMM server in the VNC server text box, and then
click OK. If you have set a password in the Remote Desktop Preference dialog
box on the OMM server, the dialog box as shown inFigure 5-4 pops up. Type the
password, and then click OK. Otherwise, log in to the main window of the OMM
server directly.
5-6
SJ-20120730093520-018|2012-10-31 (R1.0)
Criteria
The LMT terminal can connected with the OMM server using remote control tool. If
connected fail, ping the IP address of the OMM server.
5-7
SJ-20120730093520-018|2012-10-31 (R1.0)
5-8
SJ-20120730093520-018|2012-10-31 (R1.0)
Chapter 6
Purpose
Make sure that the dual-channel DC power supply of the equipment works properly.
Procedure
1. Check and record the DC operating voltage displayed on the DC power supply device
in the equipment room.
2. Check the power of the equipment currently using the DC power supply. If a
dual-channel power supply is available, check both channels of the power supply.
Caution!
Generally the communication equipment uses a dual-channel power supply, please
check and confirm the two channels of the power system, so as to prevent the potential
hazards which may be caused by a channel of power failure.
Criteria
The standard value of the DC operating voltage is -48 V, and the tolerable variation range
is -57 V to 40 V.
The standard value of the DC operating voltage is -60 V, and the tolerable variation range
is -72 V to 50 V .
6-1
SJ-20120730093520-018|2012-10-31 (R1.0)
If the power supply is abnormal, restore the power supply to normal status as soon as
possible.
Purpose
Make sure that the dual-channel AC power supply of the equipment works properly.
Procedure
1. Check and record the AC operating voltage displayed on the AC power supply device
in the equipment room.
2. Check whether the two-channel power supply of the servers works properly.
Caution!
Generally the communication equipment uses a dual-channel power supply, please
check and confirm the two channels of the power system, so as to prevent the potential
hazards which may be caused by a channel of power failure.
Criteria
l
l
l
In principle, the AC power supply should adopt a dual-channel UPS power, and the
active/standby and the load sharing devices should be connected to two channels of
the power supply separately.
The work range of the AC operating voltage is 220 V22 V and 50 Hz2.5 Hz.
Estimate the AC power supply required by the equipment, and make sure that the
power supply has a power redundancy of at least 20 %.
If the power supply is abnormal, restore the power supply to normal status as soon as
possible.
Purpose
Make sure that the cables of the equipment are properly connected.
Procedure
1. Check the connections of the network cables.
2. Check the connections of the inter-cabinet cables.
Criteria
l
l
The network cables are reliably connected and not damaged, with clear labels.
The inter-cabinet cables are firmly and reliably connected. The cables should be in
good condition, with clear labels.
If the cable connection is loose, reconnect it under the direction of ZTE technical support
engineers. If the cables or labels are damaged, replace them by professionals under the
direction of ZTE technical support engineers.
Purpose
Make sure that the spare parts of the equipment are sufficient and not damaged.
Procedure
1. Check the quantity of the spare parts.
2. Check whether the spare parts are damaged.
Criteria
The necessary spare parts should be sufficient and not damaged.
If the spare parts are insufficient or damaged, buy them as soon as possible.
6-3
SJ-20120730093520-018|2012-10-31 (R1.0)
6-4
SJ-20120730093520-018|2012-10-31 (R1.0)
Appendix A
Date (D/M/Y):
Attended Time
Off-Going Person:
On-Coming Person:
From __ to __
Basic Check Item
1. Check the system alarms: Check the current and historical alarms or notifications
displayed in the Fault Management tab page of the LMT.
Normal Abnormal
Exceptional symptom:
2. Check the board running status: Check the board alarms in the Fault Management tab
page of the LMT. Check the board indicators on the rack.
Normal Abnormal
Exceptional symptom:
3. Check the MTP3 signaling links: Check whether the performance index values of the
MTP3 signaling links have obvious fluctuations in the Performance Management tab page
of the LMT. The fluctuation should not exceed 5 % except in festivals and holidays.
Normal Abnormal
Exceptional symptom:
4. Check the SCTP links: Check whether the performance index values of the SCTP links
have obvious fluctuations in the Performance Management tab page of the LMT. The
fluctuation should not exceed 5 % except in festivals and holidays.
Normal Abnormal
Exceptional symptom:
A-1
SJ-20120730093520-018|2012-10-31 (R1.0)
Office Name:
Date (D/M/Y):
Attended Time
Off-Going Person:
On-Coming Person:
From __ to __
Equipment Room
Environment
1. Temperature (In normal cases, the temperature shown at the air conditioner in the
equipment room is 15 C~25 C.)
Normal Abnormal
2. Humidity (In normal cases, the relative humidity shown at the air conditioner in the
equipment room is 30 %~70 %.)
Normal Abnormal
Duty Memo
Remaining problems:
Inspection by Team-leader:
A-2
SJ-20120730093520-018|2012-10-31 (R1.0)
Date (D/M/Y):
Maintenance Item
Result
Maintenance
Time
Person
Check whether the time of the OMP module is the same as the
SNTP server.
Check whether the indicators on clock sub-card are in proper
statuses. Check whether the clock works in TRACE status.
Check whether the OMP module has sufficient storage space.
Check whether the OMM server has sufficient CPU and memory
resources.
Check whether the OMM server has sufficient hard disk space.
Check whether the NCMM of each shelf is running properly.
Check whether the switching board of each shelf is running
properly.
Check whether the power module of each shelf is running
properly.
Check whether the fan module of each shelf is running properly.
Check whether the GPBB0/GPBX1 boards are running properly.
Check whether the KVM interface of each rear board GPI1 of the
GPBX1 board is running properly.
Check whether the anti-virus software is installed on the host
where the Windows operating system is installed.
Check the virus database update dates and update the virus
databases on the host where the Windows operating system is
installed.
Check whether the scheduled task of backing up data is running
properly.
Check whether the resource load is in the normal range, and
whether the fluctuation is lower than 10 %.
Check whether the MscServer load is in the normal range, and
whether the fluctuation is lower than 10 %.
A-3
SJ-20120730093520-018|2012-10-31 (R1.0)
Office name:
Date (D/M/Y):
Maintenance Item
Result
Maintenance
Time
Person
Problems and Troubleshooting:
Remaining problems:
A-4
SJ-20120730093520-018|2012-10-31 (R1.0)
Date (D/M/Y):
Maintenance Item
Result
Maintenance Person
Time
Remaining problems:
A-5
SJ-20120730093520-018|2012-10-31 (R1.0)
Date (D/M/Y):
Maintenance Item
Result
Maintenance
Time
Person
Clean the dust screens periodically, so as to prevent dust from
entering the cabinet, which may affect the heat dissipation of the
cabinet.
Check whether the host where the CGSL operating system is
installed has enough hard disk space.
Check whether the swap space of the host where the CGSL
operating system is installed is used abnormally.
Check whether the host where the CGSL operating system is
installed has enough CPU and memory resources.
Check the log files to see whether the host where the CGSL
operating system is installed runs properly.
Check whether the disk I/O of the host where the CGSL operating
system is installed is normal.
Check whether the remote control tool is normal.
Problems and Troubleshooting:
Remaining problems:
A-6
SJ-20120730093520-018|2012-10-31 (R1.0)
Date (D/M/Y):
Maintenance Item
Result
Maintenance Person
Time
Remaining problems:
A-7
SJ-20120730093520-018|2012-10-31 (R1.0)
A-8
SJ-20120730093520-018|2012-10-31 (R1.0)
Appendix B
Caution
Board-related
operations
active board.
panel at will.
Pressing the RST button on the active board panel will reset
the active board. The consequence is the same as that of
"Unplugging online active board".
Cable-related
operations
related operations
at will.
B-1
SJ-20120730093520-018|2012-10-31 (R1.0)
B-2
SJ-20120730093520-018|2012-10-31 (R1.0)
Glossary
ATCA
- Advanced Telecommunications Computing Architecture
GUI
- Graphical User Interface
H.248
- ITU-T Rec. H.248 Gateway Control Protocol
IP
- Internet Protocol
KVM
- Keyboard, Video and Mouse
LMT
- Local Maintenance Terminal
MAP
- Mobile Application Part
MTP3
- Message Transfer Part layer 3
NCMM
- New Chassis Management Module
NFCM
- New Fan Controller Module
NPEM
- New Power Entry Module
OMM
- Operation & Maintenance Module
OMP
- Operation & maintenance Main Processor
SCTP
- Stream Control Transmission Protocol
SIGTRAN
- Signalling Transport
SNTP
- Simple Network Time Protocol
SS7
- Signaling System No. 7
I
SJ-20120730093520-018|2012-10-31 (R1.0)
TCP
- Transfer Control Protocol
UPS
- Uninterruptible Power Supply
VGA
- Video Graphic Adapter
II
SJ-20120730093520-018|2012-10-31 (R1.0)