TiMOS R16 rebooting (Fatal Error: Core0 DEAD)
Posted: Fri Oct 30, 2020 2:28 pm
Hi Guys!
My lab (TiMOS R16) keeps rebooting whenever 3 or more nodes are up and running: after a short time the nodes start rebooting with "Fatal Error: CORE0 dead". Does anyone know how to solve this issue?
See more details below.
### My MiniPC config ( Intel i9 / 64 GB RAM / 1 TB SSD / Ubuntu 20 )
EVE-NG Pro running on VMware Workstation
---- VMware Application Settings ( Memory Allocation: 56224 MB RAM )
-------- EVE-NG Pro Image Settings ( 4 Processors with 3 Cores Per Processor = 12 Cores / 52128 MB RAM / Virtualize Intel VT-x: Enabled )
### TiMOS Nodes Settings
# Nokia 7750 SR - Distributed - CPM Card Nodes ###
CPU: 2 / CPU Limit: Enabled / RAM (MB): 4096 / Ethernets: 2
Management Address: 192.168.2.204/24@active
TiMOS License Path: cf3:\Universal-License.txt
TiMOS Line: slot=A chassis=SR-12 card=sfm4-12
QEMU Custom Options: -machine type=pc,accel=kvm -serial mon:stdio -nographic -no-user-config -nodefaults -rtc base=utc
# Nokia 7750 SR - Distributed - IOM Card Nodes ###
CPU: 2 / CPU Limit: Enabled / RAM (MB): 2048 / Ethernets: 9
TiMOS Line: slot=2 chassis=SR-12 card=iom3-xp-b mda/1=m12-1gb+2-10gb-xp mda/2=isa-tunnel control-cpu-cores=2
QEMU Custom Options: -machine type=pc,accel=kvm -serial mon:stdio -nographic -no-user-config -nodefaults -rtc base=utc
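For reference, the settings above can be added up to see how much of the EVE-NG VM a given set of nodes requests. This is only a rough sketch: the per-node figures are copied from the settings above, while the node counts in the example and the names `NODE_TYPES`, `lab_budget` and `report` are hypothetical. It only sums the requested values and does not show whether CPU or disk contention is actually behind the watchdog reboot.

```python
# Figures copied from the settings above.
EVE_VM_VCPUS = 4 * 3      # 4 processors x 3 cores per processor
EVE_VM_RAM_MB = 52128

# Per-node requests as configured in EVE-NG for each card type.
NODE_TYPES = {
    "cpm": {"cpus": 2, "ram_mb": 4096},
    "iom": {"cpus": 2, "ram_mb": 2048},
}


def lab_budget(node_counts):
    """Return (vCPUs requested, RAM requested in MB) for the given node counts."""
    cpus = sum(NODE_TYPES[kind]["cpus"] * n for kind, n in node_counts.items())
    ram = sum(NODE_TYPES[kind]["ram_mb"] * n for kind, n in node_counts.items())
    return cpus, ram


def report(node_counts):
    """Format the requested totals against what the EVE-NG VM was given."""
    cpus, ram = lab_budget(node_counts)
    return f"{cpus}/{EVE_VM_VCPUS} vCPUs, {ram}/{EVE_VM_RAM_MB} MB RAM"


if __name__ == "__main__":
    # Hypothetical lab: 2 CPM nodes and 2 IOM nodes.
    print(report({"cpm": 2, "iom": 2}))
```

The totals ignore whatever EVE-NG and the Ubuntu guest use themselves, so the real headroom is smaller than the printed figures suggest.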
### ERROR LOG
Watchdog: Task 0x1eb192b0 (sysMonitor) blocked for 146 ticks.
Fatal Error: CORE0 dead.
***************************************************************
Disabling switch fabric and mgmt ethernet communications
***************************************************************
9a4da0d vxTaskEntry +1d : sysMonitorTask (0, 0, 0, 0)
6fcd67f sysMonitorTask +26f: wdCheckForFailedCores (0, 0, 0, 0)
6fca663 wdCheckForFailedCores+133: timosCrashDumpGeneral (20d98660, 0, 6564203045524f43, 6461)
1bc6115 timosCrashDumpGeneral+25 : timosCrashDumpSystemState (0, 1, 20d986b0, 6fca668)
1bc5f68 timosCrashDumpSystemState+a8 : debugDisplayBootLog (5f9c1bdc20d98650, 1a, 2054434f20495246, 37353a3331203033)
1be3791 debugDisplayBootLog+11 : debugCloseBootLog (1, 1, 20d98610, 1bc5f6d)
1be36f8 debugCloseBootLog+28 : debugSaveBootLog (20d98590, 1bc4c65, 12a410, 1)
1be3667 debugSaveBootLog+77 : debugWriteBootLog (20d983e0, 9abe632, a8386b0, ffffffff)
1be35c7 debugWriteBootLog+57 : closeURL (ffffffff00000077, 0, 1, 0)
1ab4790 closeURL +10 : urlCloseFile (20d983b0, 1be35cc, ffffffff00000077, 0)
1ab1cb2 urlCloseFile +52 : close (1468, 1468, 20d981d0, 1ab4792)
9abce44 close +4 : iosClose (20d981c0, 1ab1cb7, 1468, 1468)
9abe4dc iosClose +5c : dosFsClose (400000000000000, 20d981e0, 0, 20d981e0)
9a5d0e0 dosFsClose +180: cbioIoctl (12a410, 1fba1ed0, 77, 0)
9b158a3 cbioIoctl +33 : dpartIoctl (20d98110, 1f8345e0, 1f839fa0, 1f82e510)
9a63f3b dpartIoctl +11b: dcacheIoctl (1f83af38, cb100010, 0, 0)
9a57924 dcacheIoctl +2e4: dcacheQuickFlush (20d980b0, 1f83af38, cb100010, 0)
9a57f0e dcacheQuickFlush+9e : dcacheManyFlushInval (1f843d20, 0, 1911b060, 0)
9a569ab dcacheManyFlushInval+9b : dcacheFlushBatch (1f843cb0, ffffffff00000000, 0, f00000001)
9a5686e dcacheFlushBatch+1de: blkWrapBlkRW (1f83ac00, 0, 1f843d20, 1911b070)
9b161d8 blkWrapBlkRW +88 : ataBlkWrt (1f89b510, 100000000, 2a700000001, 2a600000000)
9b11d0a ataBlkWrt +a : ataBlkRW (20d97f60, 9b161db, 1f89b510, 100000000)
9b11f50 ataBlkRW +130: sysOutWordString (1, 1, 19111ed8, 0)
*** Blocked task info during crash dump - ending ***
Rebooting...
Using preloaded VxWorks boot loader at 0x0000000000008000, size 0x0007D000, entrypoint 0x0000000000008010
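To compare crashes across nodes, the watchdog line and the call chain can be pulled out of a saved console log. A minimal sketch, assuming each trace line has the form `address caller +offset : callee (args)` as in the log above; `parse_crash_log` and the two regular expressions are hypothetical helpers, not part of TiMOS or EVE-NG.

```python
import re

# "Watchdog: Task 0x... (name) blocked for N ticks."
WATCHDOG_RE = re.compile(
    r"Watchdog: Task (0x[0-9a-fA-F]+) \((\w+)\) blocked for (\d+) ticks"
)
# "<hex address> <caller> +<hex offset> : <callee> (args...)"
FRAME_RE = re.compile(r"^[0-9a-f]+\s+(\w+)\s*\+\s*[0-9a-f]+\s*:\s*(\w+)\s*\(")


def parse_crash_log(text):
    """Return (watchdog info or None, list of (caller, callee) frames)."""
    watchdog = None
    frames = []
    for raw in text.splitlines():
        line = raw.strip()
        m = WATCHDOG_RE.search(line)
        if m:
            watchdog = {
                "task_id": m.group(1),
                "task_name": m.group(2),
                "ticks": int(m.group(3)),
            }
            continue
        m = FRAME_RE.match(line)
        if m:
            frames.append((m.group(1), m.group(2)))
    return watchdog, frames


# A few lines copied from the error log above.
SAMPLE = """\
Watchdog: Task 0x1eb192b0 (sysMonitor) blocked for 146 ticks.
Fatal Error: CORE0 dead.
9b11d0a ataBlkWrt +a : ataBlkRW (20d97f60, 9b161db, 1f89b510, 100000000)
9b11f50 ataBlkRW +130: sysOutWordString (1, 1, 19111ed8, 0)
"""

if __name__ == "__main__":
    watchdog, frames = parse_crash_log(SAMPLE)
    print(watchdog)
    print("innermost frame:", frames[-1])
```

On the log above, the innermost frames are `ataBlkWrt`, `ataBlkRW` and `sysOutWordString`, so the trace was captured while the boot log was being written to disk during the crash dump.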