jakarta72
13-07-2016, 10:01
Доброго времени суток!
Стал случайно перезагружаться сервер (охлаждение в норме 18-22 С) вне зависимости от % нагрузки на ЦП и памяти, хотя по ощущениям на 70% чаще вылетает.
http://i77.fastpic.ru/big/2016/0713/d2/a75d2c5661baaacc1f4a39a60cf1f6d2.jpg (http://fastpic.ru/)
Конфигурация железа:
------------------
System Information
------------------
Time of this report: 7/13/2016, 09:26:15
Operating System: Windows Server 2012 R2 Standard 64-bit (6.3, Build 9600) (9600.winblue_ltsb.160328-1315)
Language: Russian (Regional Setting: Russian)
System Manufacturer: DEPO Computers
System Model: X9DBL-3F/X9DBL-iF
BIOS: BIOS Date: 08/09/13 15:32:09 Ver: 04.06.05
Processor: Intel(R) Xeon(R) CPU E5-2420 v2 @ 2.20GHz (24 CPUs), ~2.2GHz
Memory: 98304MB RAM
Available OS Memory: 98270MB RAM
Page File: 9143MB used, 121894MB available
Windows Dir: C:\Windows
DirectX Version: DirectX 11
DX Setup Parameters: Not found
User DPI Setting: Using System DPI
System DPI Setting: 96 DPI (100 percent)
DWM DPI Scaling: Disabled
DxDiag Version: 6.03.9600.17415 64bit Unicode
Дамп памяти с расшифровкой (прикреп), тут самый итог деяний:
WHEA_UNCORRECTABLE_ERROR (124)
A fatal hardware error has occurred. Parameter 1 identifies the type of error
source that reported the error. Parameter 2 holds the address of the
WHEA_ERROR_RECORD structure that describes the error conditon.
Arguments:
Arg1: 0000000000000000, Machine Check Exception
Arg2: ffffe00180147028, Address of the WHEA_ERROR_RECORD structure.
Arg3: 00000000b200003f, High order 32-bits of the MCi_STATUS value.
Arg4: 00000000000100b3, Low order 32-bits of the MCi_STATUS value.
2: kd> !errrec ffffe00180147028
===============================================================================
Common Platform Error Record @ ffffe00180147028
-------------------------------------------------------------------------------
Record Id : 01d1db81c69cf42e
Severity : Fatal (1)
Length : 873
Creator : Microsoft
Notify Type : Machine Check Exception
Timestamp : 7/13/2016 1:43:11 (UTC)
Flags : 0x00000000
===============================================================================
Section 0 : Memory
-------------------------------------------------------------------------------
Descriptor @ ffffe001801470a8
Section @ ffffe00180147180
Offset : 344
Length : 73
Flags : 0x00000001 Primary
Severity : Fatal
===============================================================================
Section 1 : Processor Generic
-------------------------------------------------------------------------------
Descriptor @ ffffe001801470f0
Section @ ffffe001801471c9
Offset : 417
Length : 192
Flags : 0x00000000
Severity : Fatal
Proc. Type : x86/x64
Instr. Set : x64
Error Type : Unknown error
Flags : 0x00
CPU Version : 0x00000000000306e4
Processor ID : 0x0000000000000002
===============================================================================
Section 2 : x86/x64 MCA
-------------------------------------------------------------------------------
Descriptor @ ffffe00180147138
Section @ ffffe00180147289
Offset : 609
Length : 264
Flags : 0x00000000
Severity : Fatal
Error : Unknown (Proc 2 Bank 12)
Status : 0xb200003f000100b3
Есть подозрения на память (меняли с 32 на 96, совсем, т.е старую убрали), по огромная просьба разбирающимся посмотреть логи дампа, они ли?)
Стал случайно перезагружаться сервер (охлаждение в норме 18-22 С) вне зависимости от % нагрузки на ЦП и памяти, хотя по ощущениям на 70% чаще вылетает.
http://i77.fastpic.ru/big/2016/0713/d2/a75d2c5661baaacc1f4a39a60cf1f6d2.jpg (http://fastpic.ru/)
Конфигурация железа:
------------------
System Information
------------------
Time of this report: 7/13/2016, 09:26:15
Operating System: Windows Server 2012 R2 Standard 64-bit (6.3, Build 9600) (9600.winblue_ltsb.160328-1315)
Language: Russian (Regional Setting: Russian)
System Manufacturer: DEPO Computers
System Model: X9DBL-3F/X9DBL-iF
BIOS: BIOS Date: 08/09/13 15:32:09 Ver: 04.06.05
Processor: Intel(R) Xeon(R) CPU E5-2420 v2 @ 2.20GHz (24 CPUs), ~2.2GHz
Memory: 98304MB RAM
Available OS Memory: 98270MB RAM
Page File: 9143MB used, 121894MB available
Windows Dir: C:\Windows
DirectX Version: DirectX 11
DX Setup Parameters: Not found
User DPI Setting: Using System DPI
System DPI Setting: 96 DPI (100 percent)
DWM DPI Scaling: Disabled
DxDiag Version: 6.03.9600.17415 64bit Unicode
Дамп памяти с расшифровкой (прикреп), тут самый итог деяний:
WHEA_UNCORRECTABLE_ERROR (124)
A fatal hardware error has occurred. Parameter 1 identifies the type of error
source that reported the error. Parameter 2 holds the address of the
WHEA_ERROR_RECORD structure that describes the error conditon.
Arguments:
Arg1: 0000000000000000, Machine Check Exception
Arg2: ffffe00180147028, Address of the WHEA_ERROR_RECORD structure.
Arg3: 00000000b200003f, High order 32-bits of the MCi_STATUS value.
Arg4: 00000000000100b3, Low order 32-bits of the MCi_STATUS value.
2: kd> !errrec ffffe00180147028
===============================================================================
Common Platform Error Record @ ffffe00180147028
-------------------------------------------------------------------------------
Record Id : 01d1db81c69cf42e
Severity : Fatal (1)
Length : 873
Creator : Microsoft
Notify Type : Machine Check Exception
Timestamp : 7/13/2016 1:43:11 (UTC)
Flags : 0x00000000
===============================================================================
Section 0 : Memory
-------------------------------------------------------------------------------
Descriptor @ ffffe001801470a8
Section @ ffffe00180147180
Offset : 344
Length : 73
Flags : 0x00000001 Primary
Severity : Fatal
===============================================================================
Section 1 : Processor Generic
-------------------------------------------------------------------------------
Descriptor @ ffffe001801470f0
Section @ ffffe001801471c9
Offset : 417
Length : 192
Flags : 0x00000000
Severity : Fatal
Proc. Type : x86/x64
Instr. Set : x64
Error Type : Unknown error
Flags : 0x00
CPU Version : 0x00000000000306e4
Processor ID : 0x0000000000000002
===============================================================================
Section 2 : x86/x64 MCA
-------------------------------------------------------------------------------
Descriptor @ ffffe00180147138
Section @ ffffe00180147289
Offset : 609
Length : 264
Flags : 0x00000000
Severity : Fatal
Error : Unknown (Proc 2 Bank 12)
Status : 0xb200003f000100b3
Есть подозрения на память (меняли с 32 на 96, совсем, т.е старую убрали), по огромная просьба разбирающимся посмотреть логи дампа, они ли?)