
Mesca-X Training iCare (Insight Care)

C.Vangramberen 2011 May


SUMMARY
- Overview
- Monitoring
- System Control
- Configuration
- Maintenance

2 ©Bull, 2010 © Bull Confidentiel - Reproduction interdite sans accord préalable


Overview

Insight Care (iCare) is a software product that provides tools for the
maintenance of hardware units.
Targets are:
Mesca-T (Tukwila based) maintenance for Helios Project (a.k.a Helios 4),
Mesca-X (Xeon based),
bullx server blade,
Water cool Rack
Main functionalities
Easy setup of managed units:
. Automatic discovery of hardware units (MESCA, INCA, Cool Cabinet)
. Import from XML file
. Manual creation
Reception of alert traps. Traps are stored in a PostgreSQL database.
Alert Monitoring
Autocalls sent to the GTS application as an XML file
Intervention report
Other Maintenance tools
An automatic Clear System Event Log option is available.
Connection to the embedded hardware console.
Connection to the serial console of a managed server with SOL (Serial Over LAN).



Overview
Global Architecture

Water cool rack



Overview
Software components

The iCare application relies on the following Open Source components:


Apache Web Server v2

PHP v5.2

PostgreSQL database v8.3

Net-SNMP v5.4

iCare Web site (PHP and Ajax technology)

iCare trap analysis and diagnostic tool (a set of PHP scripts)

The iCare product runs on:



Windows XP

Windows Vista

Windows Server 2003

Windows Server 2008

Linux Red Hat (not yet available)



Connection/Disconnection
Connect to

Disconnect by



Configuration
Global Configuration tab: Topology
Used to import managed units and to organize them in groups (the result can be exported as an XML file):
. Network Discovery (on the same subnetwork)
. Import from an XML file (template available)

. Manual creation (from a form)
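
To illustrate the XML import path, here is a minimal sketch that parses a small topology file into a list of units tagged with their group. The element and attribute names (`topology`, `group`, `unit`, `name`, `type`, `ip`) and the sample values are hypothetical; the real layout is defined by the template that iCare provides.

```python
import xml.etree.ElementTree as ET

# Hypothetical topology file; the real schema comes from the iCare template.
SAMPLE = """
<topology>
  <group name="Lab">
    <unit name="Rome5_linux" type="MESCA" ip="192.0.2.10"/>
    <unit name="unit02" type="MESCA" ip="192.0.2.11"/>
  </group>
</topology>
"""

def load_units(xml_text):
    """Return one dict per <unit>, tagged with its parent group name."""
    root = ET.fromstring(xml_text.strip())
    units = []
    for group in root.findall("group"):
        for unit in group.findall("unit"):
            units.append({
                "group": group.get("name"),
                "name": unit.get("name"),
                "type": unit.get("type"),
                "ip": unit.get("ip"),
            })
    return units

units = load_units(SAMPLE)
for unit in units:
    print(unit["group"], unit["name"], unit["ip"])
```

Keeping the group name on each unit makes it easy to rebuild the group view or to export the same structure again.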



Configuration
Topology: Groups
To organize resources

New: user name and password can be set per server



Configuration
Groups: Resource Management
To move resources to a group



Configuration
Groups: Resource Management
Enable / Disable resources

When a resource is disabled, the iCare server is removed from its list of
alert destinations and the resource is no longer monitored.
Configuration
Global Configuration tab: Resource Viewer
Displays summary properties
Direct link to open SHC on the target



Configuration
Global Configuration tab: iCare configuration / Users
To create or delete users and to change passwords



Configuration
Global Configuration tab: iCare configuration / super User Password
To change the password of the 'super' user (it must match the BMC parameters)



Configuration
Global Configuration tab: iCare configuration
Site information: automatically embedded in the ARPackage and in Autocalls



Configuration
Global Configuration tab: SEL
Clear Policy: avoids missing recent events in the local SEL



Configuration
Global Configuration tab: Autocalls / General Settings

Select 'Local Dispatch mode' if the GTS application is installed locally;
otherwise select 'FTP Dispatch mode' or 'EMAIL Dispatch mode'.
Define the Heartbeat period, then enable Autocalls.

Test Autocall
Configuration
Global Configuration tab: Autocalls / Filters
Lets you create and customize your own filters

One template per resource type

Select the appropriate resource type



Configuration
Global Configuration tab: Autocalls / Filters
- Editing filters

. Threshold: masks isolated, non-significant events

. Clipping: limits the number of identical events
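
The following sketch shows one way the two filter settings could interact. The semantics are assumptions, not iCare's documented behaviour: the threshold suppresses an event until it has occurred a given number of times within a sliding window, and clipping caps how many identical events are forwarded within the same window. The class name, parameters and event key are hypothetical.

```python
from collections import defaultdict, deque

class EventFilter:
    """Assumed semantics, not iCare's actual implementation:
    - threshold: suppress an event until it has occurred `threshold`
      times within `window` seconds;
    - clip: forward at most `clip` identical events per `window` seconds.
    """

    def __init__(self, threshold=1, clip=None, window=3600):
        self.threshold = threshold
        self.clip = clip
        self.window = window
        self.seen = defaultdict(deque)       # key -> timestamps of occurrences
        self.forwarded = defaultdict(deque)  # key -> timestamps of forwarded events

    def _trim(self, stamps, now):
        # Drop timestamps that have fallen out of the sliding window.
        while stamps and now - stamps[0] >= self.window:
            stamps.popleft()

    def accept(self, key, now):
        """Return True if the event should be forwarded."""
        seen, sent = self.seen[key], self.forwarded[key]
        self._trim(seen, now)
        self._trim(sent, now)
        seen.append(now)
        if len(seen) < self.threshold:
            return False  # isolated event, masked by the threshold
        if self.clip is not None and len(sent) >= self.clip:
            return False  # enough identical events already forwarded
        sent.append(now)
        return True

event_filter = EventFilter(threshold=3, clip=1, window=60)
results = [event_filter.accept("DIMM_CE", t) for t in (0, 10, 20, 30, 100)]
print(results)  # [False, False, True, False, False]
```

In this run the first two events are masked by the threshold, the third is forwarded, the fourth is clipped, and the fifth arrives after the window has emptied, so it counts as isolated again.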



Configuration
Global Configuration tab: Autocalls / Global Policies

Choose 'Autocalls for Customer Filter Events'


Select a previously created filter (this becomes the default policy for this resource type)



Configuration
- Specific Configuration tab
To give a unit a policy different from the default, select a policy type.
For a 'Custom Filter Events' policy, select a filter.



Monitoring
The Monitoring tab is used to query the iCare database

It can be used to build complex queries
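
As a rough illustration of what a "complex query" combines, the sketch below assembles a parameterized SQL statement from optional criteria (severity, state, date range). The table and column names (`alerts`, `severity`, `state`, `received_at`) are hypothetical and do not reflect the real iCare schema; the `%s` placeholders follow the style used by PostgreSQL drivers for Python such as psycopg2. The statement is only built here, not executed.

```python
def build_alert_query(severities=None, states=None, date_from=None, date_to=None):
    """Return (sql, params) for a parameterized SELECT.

    Table and column names are hypothetical, not the real iCare schema.
    """
    clauses, params = [], []
    if severities:
        placeholders = ", ".join(["%s"] * len(severities))
        clauses.append("severity IN (" + placeholders + ")")
        params.extend(severities)
    if states:
        placeholders = ", ".join(["%s"] * len(states))
        clauses.append("state IN (" + placeholders + ")")
        params.extend(states)
    if date_from:
        clauses.append("received_at >= %s")
        params.append(date_from)
    if date_to:
        clauses.append("received_at <= %s")
        params.append(date_to)
    sql = "SELECT * FROM alerts"
    if clauses:
        sql += " WHERE " + " AND ".join(clauses)
    return sql + " ORDER BY received_at", params

sql, params = build_alert_query(
    severities=["Critical", "Non-recoverable"],
    states=["received"],
    date_from="2010-01-30 17:01:52",
)
print(sql)
print(params)
```

Passing the values separately from the SQL text lets the database driver handle quoting, which matters as soon as criteria come from a web form.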



Monitoring
Offers problem-tracking facilities
- The alert state can be changed from received to in-review.
- A comment can be added.

Recommended action



Monitoring
Offers problem-tracking facilities
- Once the anomaly is fixed, change the alert state from in-review to concluded.
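
The alert life cycle (received, then in-review, then concluded) can be modelled as a small state machine. This is a sketch under assumptions: the class, method names and comment text are hypothetical, and only the two transitions mentioned in the slides are allowed; whether iCare permits other moves, such as reopening a concluded alert, is not stated.

```python
# Only the transitions described in the slides are modelled here.
TRANSITIONS = {
    "received": {"in-review"},
    "in-review": {"concluded"},
    "concluded": set(),
}

class Alert:
    def __init__(self, alert_id):
        self.alert_id = alert_id
        self.state = "received"  # every alert starts as received
        self.comments = []

    def move_to(self, new_state, comment=None):
        """Change state if the transition is allowed, optionally with a comment."""
        if new_state not in TRANSITIONS[self.state]:
            raise ValueError(f"cannot go from {self.state} to {new_state}")
        self.state = new_state
        if comment:
            self.comments.append(comment)

alert = Alert(42)
alert.move_to("in-review", comment="DIMM replacement scheduled")
alert.move_to("concluded")
print(alert.state, alert.comments)
```

Keeping the allowed transitions in one table makes it simple to extend the model if further states or moves turn out to exist.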



Monitoring
Event concluded



Monitoring
BIOS Log Viewer
Used to query the iCare database and retrieve BIOS Log files
(requires EMM 11.09.xx, BIOSX008.10 or above)

The BIOS Log is in binary format; use the 'amel' tool to get a readable view



Monitoring
MCE Status
Used to query the iCare database and retrieve DIMM Corrected Errors



System Control
The System Control tab lets you open a console connection:
- SHC: web connection to the BMC
- Remote Console: connection to the system console via the KVM over IP function
- Telnet Console: use CLP on the BMC, or open a terminal session on the system over a serial access



System Control: Telnet Console



System Control: Telnet Console (cont'd)

A way to get the BIOS trace



Maintenance
Maintenance tab: Intervention Report Creation.
Used to build Intervention Reports and store them in the iCare database (one report can cover
multiple servers)



Maintenance
Maintenance tab: Intervention Report Viewer.



Maintenance
Maintenance tab: Action Request Package.
Collects information (firmware versions, logs, FRU, IC …) and generates a zip file.
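
The packaging step can be pictured with a short sketch that bundles several collected text files into one zip archive. The file names and contents below are placeholders for illustration only; the actual files produced by iCare are not described here.

```python
import io
import zipfile

def build_package(parts):
    """Pack {file name: text} into an in-memory zip and return its bytes."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        for name, text in parts.items():
            archive.writestr(name, text)
    return buffer.getvalue()

# Hypothetical file names and contents, for illustration only.
package = build_package({
    "firmware_versions.txt": "BMC: <version>\nBIOS: <version>\n",
    "sel.log": "<SEL entries>\n",
    "fru.txt": "<FRU data>\n",
})

# Re-open the archive to check what it contains.
with zipfile.ZipFile(io.BytesIO(package)) as archive:
    names = archive.namelist()
print(names)
```

Building the archive in memory keeps the sketch free of temporary files; writing `package` to disk would produce an ordinary zip file.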



Maintenance
Maintenance tab: Action Request Package.
Contains 3 files:

Site name: FrCL


Customer name: Support
Site number: 1
Site engineer name: VGB
Site engineer phone number: 12345678909
Town: Les Clayes sous Bois
Country code: FR
Request submitted on resource: Rome5_linux
SEL request submitted on: 2010-02-02 17:11:18
Number of events matched: 45
Operator name: admin
AR Reference: WJ0067S789
AR Description: For test
Event Severity:
  Critical Event
    Non-recoverable: Yes
    Critical: Yes
  Warning Event: Yes
  Information Event
    Return to OK: No
    Information: No
    Monitor: No
    Unspecified: No
Event State:
  Received: Yes
  In review: Yes
  Concluded: Yes
Date Range:
  From: 2010-01-30 17:01:52
  To: 2010-02-02 17:01:52

Page 1

