Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
If your Solaris system panics and reboots, it'll probably create a crash dump in
/var/crash. You can also force a crash dump either online (using "savecore -L")
or as part of a reboot (using "reboot -d").
Normally this is where I stop and upload the /var/crash/vmdump.0 file up to Orac
le to find out what the problem was. However, you can do some basic investigatio
ns yourself using the following steps:
# savecore f vmdump.0 /somedirectory
# cd /somedirectory
# mdb *0
mdb> ::status
mdb> ::panicinfo
mdb> ::stack
mdb> ::msgbuf
mdb> ::cpuinfo
mdb> ::ps
mdb> ::arc
mdb> ::memstat
(*If the vmdump file is called vmdump.1 then use 1 instead of 0 in the above ste
ps)
--------------------------------------------------------------------------------
--------------------------
http://eldar.aydayev.com/how-to-analyze-oracle-solaris-os-crash-dump/
First of first we need collect memory dump file from dump device. By default aft
er system crash reboot, when is coming up
Solaris save core tool store dump file under /var/crash/server/vmdump.0. If file
system under /var/ have not enough space then
you can save dump output file in separate storage place of NFS shared resource.
Here is command how to do it:
To find out dump device check it by command dumpadm:
# dumpadm
Dump content: kernel pages
Dump device: /dev/md/dsk/d3 (swap) dump device
Savecore directory: /var/crash/mexico
Savecore enabled: yes
Save compressed: on
Now we will save crash dump file to the NFS shared resource on ifs server:
# savecore -vd -f /dev/md/dsk/d3 /net/nfsserver/dump/server01/crash_dump/
We will got file /net/nfsserver/dump/server01/crash_dump/vmdump.0
root@server # savecore -vd -f /dev/md/dsk/d3 /net/nfsserver/dump/server01/crash_
dump/
savecore: System dump time: Sun Nov 27 23:08:00 2011
savecore: Saving compressed system crash dump in /net/nfsserver/dump/server01/cr
ash_dump/vmdump.0
savecore: Copying /dev/md/dsk/d3 to /net/nfsserver/dump/server01/crash_dump/vmdu
mp.0
savecore: Decompress the crash dump with
savecore -vf /net/nfsserver/dump/server01/crash_dump/vmdump.0'
3:15 dump copy is done
Now we need decompress vmdump.0 file. We can make it with savecore command:
# savecore -vd -f vmdump.0 /net/nfsserver/dump/server01/crash_dump/
System dump time: Sun Nov 27 23:08:00 2011
Constructing namelist /net/nfsserver/dump/server01/crash_dump/unix.0
Constructing corefile /net/nfsserver/dump/server01/crash_dump/vmcore.0
: 3902138 of 3902138 pages saved
Now we have crash dump output files: unix.0 (Oracle Solaris kernel) and vmcore.0
(Oracle Solaris Memory snapshot)
Before continue you need install SUNWscat Solaris Crash Analyzer Tool. You can g
et it from Oracle Support.
After installing scat will be located in /opt/SUNWscat/bin/scat
There two type of generating crash dump analyze report interactive or as explore
r collection.
First interactive :
# /opt/SUNWscat/bin/scat /net/nfsserver/dump/server01/crash_dump/vmcore.0
Interactive will provide next output:
Solaris[TM] CAT 5.2 for Solaris 10 64-bit UltraSPARC
SV4990M, Aug 26 2009sn
Copyright 2009 Sun Microsystems, Inc. All rights reserved.
Use is subject to license terms.
Feedback regarding the tool should be sent to SolarisCAT_Feedback@Sun.COM
Visit the Solaris CAT blog at http://blogs.sun.com/SolarisCAT
opening ./vmcore.0 dumphdr
WARNING: ./vmcore.0 incomplete/corrupt. size: 32167952384, expected: 32167960576
symtab core done
loading core data: modules symbols CTF done
core file: /net/nfsserver/dump/server01/crash_dump/vmcore.0
user: UNIX Administrator Eldar Aydayev (eldara:5002)
release: 5.10 (64-bit)
version: Generic_144488-17
machine: sun4u
node name: mexico
domain: mtn.com.ng
hw_provider: Sun_Microsystems
system type: SUNW,SPARC-Enterprise (SPARC64-VII)
hostid: 847c1fc7
dump_conflags: 0x10000 (DUMP_KERNEL) on /dev/md/dsk/d3(125G)
kmem_flags: 0xf (AUDIT|DEADBEEF|REDZONE|CONTENTS)
time of crash: Sun Nov 27 23:06:55 WAT 2011 (core is 74 days old)
age of system: 13 days 1 hours 51 minutes 3.68 seconds
panic CPU: 210 (56 CPUs, 224G memory, 3 nodes)
panic string: BAD TRAP: type=34 rp=2a1040da720 addr=deadbeefdeadc187 mmu_fsr=0
sanity checks: settings
NOTE: /etc/system: symbol not found "set noexec-stack=0x1"
NOTE: /etc/system: lwp_default_stksize set to 0x6000 2 times
NOTE: /etc/system: rpcmod:svc_default_stksize set to 0x6000 2 times
vmem CPU
WARNING: TS thread 0x303572b1120 on CPU32 using 99%CPU
WARNING: TS thread 0x376abc6e140 on CPU105 using 100%CPU
WARNING: CPU136 has cpu_intr_actv for PIL 6
WARNING: PIL6 interrupt thread 0x2a101917ca0 on CPU136 pinning TS thread 0x30351
a6d200
WARNING: TS thread 0x4cecc9e7b00 on CPU208 using 100%CPU
sysent
WARNING: unknown module acctctl seen 4 times in sysent table
clock misc
WARNING: hat_kpr_enabled is 0
WARNING: 80 severe kstat errors (run "kstat xck")
done
SolarisCAT(./vmcore.0/10U)>
Now we will check analyze output from crash dump:
SolarisCAT(./vmcore.0/10U)> analyze
core file: /net/nfsserver/dump/server01/crash_dump/vmcore.0
user: UNIX Administrator Eldar Aydayev (eldara:5002)
release: 5.10 (64-bit)
version: Generic_144488-17
machine: sun4u
node name: server01
domain: aydayev.com
hw_provider: Sun_Microsystems
system type: SUNW,SPARC-Enterprise (SPARC64-VII)
hostid: 847c1fc7
dump_conflags: 0x10000 (DUMP_KERNEL) on /dev/md/dsk/d3(125G)
kmem_flags: 0xf (AUDIT|DEADBEEF|REDZONE|CONTENTS)
time of crash: Sun Nov 27 23:06:55 WAT 2011 (core is 74 days old)
age of system: 13 days 1 hours 51 minutes 3.68 seconds
panic CPU: 210 (56 CPUs, 224G memory, 3 nodes)
panic string: BAD TRAP: type=34 rp=2a1040da720 addr=deadbeefdeadc187 mmu_fsr=0
You can also use Sun Crash Tool explorer output based report to send it to Vendo
r (Oracle) support department to get right solution to solving current issues: