none
AMD Northbridge Machine Check Exception?

    Question

  • I recently noticed a Machine Check Exception in the event viewer that is as follows:

    (General)
    A corrected hardware error has occurred.
    
    Component: AMD Northbridge
    Error Source: Machine Check Exception
    Error Type: 14
    Processor ID: 0
    
    The details view of this entry contains further information.

    (Details)


    - <Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
    - <System>
      <Provider Name="Microsoft-Windows-WHEA-Logger" Guid="{C26C4F3C-3F66-4E99-8F8A-39405CFED220}" /> 
      <EventID>21</EventID> 
      <Version>0</Version> 
      <Level>3</Level> 
      <Task>0</Task> 
      <Opcode>0</Opcode> 
      <Keywords>0x8000000000000000</Keywords> 
      <TimeCreated SystemTime="2009-07-18T22:56:43.022400000Z" /> 
      <EventRecordID>3427</EventRecordID> 
      <Correlation ActivityID="{91CC52B1-FC9F-4EB4-9973-222564DC67B7}" /> 
      <Execution ProcessID="1432" ThreadID="1528" /> 
      <Channel>System</Channel> 
      <Computer>NOBELIUM.home.capp.us</Computer> 
      <Security UserID="S-1-5-19" /> 
      </System>
    - <EventData>
      <Data Name="ErrorSource">3</Data> 
      <Data Name="ApicId">0</Data> 
      <Data Name="MCABank">4</Data> 
      <Data Name="MciStat">0x80000010000e0c0f</Data> 
      <Data Name="MciAddr">0x0</Data> 
      <Data Name="MciMisc">0x0</Data> 
      <Data Name="ErrorType">14</Data> 
      <Data Name="Length">928</Data> 
      <Data Name="RawData">435045521002FFFFFFFF03000200000002000000A003000012381600120709140000000000000000000000000000000000000000000000000000000000000000BDC407CF89B7184EB3C41F732CB57131FE6FF5E89C91C54CBA8865ABE14913BBA1FDD6F3FA07CA0102000000000000000000000000000000000000000000000058010000C00000000102000001000000ADCC7698B447DB4BB65E16F193C4F3DB0000000000000000000000000000000002000000000000000000000000000000000000000000000018020000800000000102000000000000B0A03EDC44A19747B95B53FA242B6E1D0000000000000000000000000000000002000000000000000000000000000000000000000000000098020000080100000102000000000000011D1E8AF94257459C33565E5CC3F7E8000000000000000000000000000000000200000000000000000000000000000000000000000000007F010000000000000002040000030000420F10000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000007000000000000000000000000000000420F10000008030009208000FFFB8B170000000000000000000000000000000000000000000000000000000000000000B3F8F31CB1C5A249AA595EEF92FFA63C01000000000000009E07C080060000000000000000000000000000000000000000000000000000000000000000000000010000000200000060DECAF5FA07CA010000000000000000000000000000000000000000040000000F0C0E0010000080000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000</Data> 
      </EventData>
      </Event>

    Although there's no reference to ErrorType 14 in the MSDN documentation listed here: http://msdn.microsoft.com/en-us/library/dd568265.aspx

    I'd like to figure out exactly what the error pertains to, as I know it happened after returning from sleep, where it took out my Bluetooth transceiver in the process:

    The driver \Driver\BTHUSB failed to load for the device USB\VID_045E&PID_007E\00E4B37F00F20050.
    - <Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
    - <System>
      <Provider Name="Microsoft-Windows-Kernel-PnP" Guid="{9C205A39-1250-487D-ABD7-E831C6290539}" /> 
      <EventID>219</EventID> 
      <Version>0</Version> 
      <Level>3</Level> 
      <Task>212</Task> 
      <Opcode>0</Opcode> 
      <Keywords>0x8000000000000000</Keywords> 
      <TimeCreated SystemTime="2009-07-18T22:51:21.334400000Z" /> 
      <EventRecordID>3283</EventRecordID> 
      <Correlation /> 
      <Execution ProcessID="4" ThreadID="68" /> 
      <Channel>System</Channel> 
      <Computer>NOBELIUM.home.capp.us</Computer> 
      <Security UserID="S-1-5-18" /> 
      </System>
    - <EventData>
      <Data Name="DriverNameLength">38</Data> 
      <Data Name="DriverName">USB\VID_045E&PID_007E\00E4B37F00F20050</Data> 
      <Data Name="Status">3221226382</Data> 
      <Data Name="FailureNameLength">14</Data> 
      <Data Name="FailureName">\Driver\BTHUSB</Data> 
      <Data Name="Version">0</Data> 
      </EventData>
      </Event>

    Additionally, this is a brand new system:

    Asus M3N78-VM
    AMD Phenom II X3 710
    8GB OCZ Reaper HPC 1066MHz
    PNY GeForce 9400 GT 512MB
    Sunday, July 19, 2009 4:23 AM

Answers

  • Hi Ryan

    That event ID 14 is associated with a known bug in the Sleep/Hibernation components in Windows 7 RC.

    A general description of the bug is, "when a system resumes from Sleep or Hibernation, one or more random hardware components will fail to reinitialize which requires that the system must be restarted to resume normal operation." We have seen this affect just about every type of hardware device attached to a system.

    In this case, when this bug manifests itself, the Machine Check Exception (MCE) is initiated by the CPU because a hardware error was detected, but there is simply not enough information to define exactly where the error came from. The hardware error is usually the random hardware component that failed to reinitialize.

    The only solution is to try and refrain from using the Sleep/Hibernate functions as much as possible. Also try to save any work you are doing prior to leaving the computer for any extended period.

    This bug only seems to affect systems with particular hardware/driver configurations. The latest information we have is that this bug has been fixed in the RTM Build.


    Hope this helps.

    Thank You for testing Windows 7

    Ronnie Vernon MVP


    • Marked as answer by Ryan Capp Sunday, July 19, 2009 3:40 PM
    Sunday, July 19, 2009 5:17 AM
    Moderator

All replies

  • according to yours specs: http://ca.asus.com/products.aspx?l1=3&l2=161&l3=0&l4=0&model=2268&modelmenu=1 Here is your Chipset "NVIDIA GeForce 8200 Chipset"and here is your drivers
    for
    32bit: http://www.nvidia.com/object/win7_winvista_32bit_186.18_whql.html
    64bit: http://www.nvidia.com/object/win7_winvista_64bit_186.18_whql.html

    this should solve the north bridge and usb issues :)

    Experience is Use/Knowledge and understanding that Use/Knowledge is the key to Experience. -JJTech

    Windows 7 RC: Remember now this OS is not final and is Subject to all kinds of things that can very on different hardware. -JJTech
    • Proposed as answer by JJTech Sunday, July 19, 2009 4:59 AM
    • Unproposed as answer by Ryan Capp Sunday, July 19, 2009 3:40 PM
    Sunday, July 19, 2009 4:59 AM
  • Hi Ryan

    That event ID 14 is associated with a known bug in the Sleep/Hibernation components in Windows 7 RC.

    A general description of the bug is, "when a system resumes from Sleep or Hibernation, one or more random hardware components will fail to reinitialize which requires that the system must be restarted to resume normal operation." We have seen this affect just about every type of hardware device attached to a system.

    In this case, when this bug manifests itself, the Machine Check Exception (MCE) is initiated by the CPU because a hardware error was detected, but there is simply not enough information to define exactly where the error came from. The hardware error is usually the random hardware component that failed to reinitialize.

    The only solution is to try and refrain from using the Sleep/Hibernate functions as much as possible. Also try to save any work you are doing prior to leaving the computer for any extended period.

    This bug only seems to affect systems with particular hardware/driver configurations. The latest information we have is that this bug has been fixed in the RTM Build.


    Hope this helps.

    Thank You for testing Windows 7

    Ronnie Vernon MVP


    • Marked as answer by Ryan Capp Sunday, July 19, 2009 3:40 PM
    Sunday, July 19, 2009 5:17 AM
    Moderator
  • Currently using the nForce 15.37 chipset drivers and GeForce 186.18 graphic drivers. The only drivers I haven't updated are with the audio, as I must have some new motherboard revision or something since SiSoft Sandra and a few other utilities are saying I have a Realtek ALC1200 in addition to the known VIA VT1708B chipset as per specification.

    The only thing I noticed that stopped working was the Bluetooth mouse (Microsoft Wireless IntelliMouse Explorer for Bluetooth), but I just deleted the device from the device manager and did a rescan to reinstall it. It's been working fine ever since.
    • Edited by Ryan Capp Sunday, July 19, 2009 3:38 PM
    Sunday, July 19, 2009 3:36 PM
  • I too get this error, and I do NOT use sleep/hibernation at all. There has to be a better answer. It randomly crashes my PC as well.

    Log Name:      System
    Source:        Microsoft-Windows-WHEA-Logger
    Date:          7/21/2009 11:52:05 AM
    Event ID:      21
    Task Category: None
    Level:         Warning
    Keywords:     
    User:          LOCAL SERVICE
    Computer:      Rich-Win7RC
    Description:
    A corrected hardware error has occurred.

    Component: AMD Northbridge
    Error Source: Machine Check Exception
    Error Type: 14
    Processor ID: 0

    The details view of this entry contains further information.
    Event Xml:
    <Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
      <System>
        <Provider Name="Microsoft-Windows-WHEA-Logger" Guid="{C26C4F3C-3F66-4E99-8F8A-39405CFED220}" />
        <EventID>21</EventID>
        <Version>0</Version>
        <Level>3</Level>
        <Task>0</Task>
        <Opcode>0</Opcode>
        <Keywords>0x8000000000000000</Keywords>
        <TimeCreated SystemTime="2009-07-21T18:52:05.013600000Z" />
        <EventRecordID>4212</EventRecordID>
        <Correlation ActivityID="{F4E4BFAA-31AB-4D0C-A2D9-D331072C8E83}" />
        <Execution ProcessID="1596" ThreadID="2040" />
        <Channel>System</Channel>
        <Computer>Rich-Win7RC</Computer>
        <Security UserID="S-1-5-19" />
      </System>
      <EventData>
        <Data Name="ErrorSource">3</Data>
        <Data Name="ApicId">0</Data>
        <Data Name="MCABank">4</Data>
        <Data Name="MciStat">0x80000010000e0c0f</Data>
        <Data Name="MciAddr">0x0</Data>
        <Data Name="MciMisc">0x0</Data>
        <Data Name="ErrorType">14</Data>
        <Data Name="Length">928</Data>
        <Data Name="RawData">435045521002FFFFFFFF03000200000002000000A003000008331200150709140000000000000000000000000000000000000000000000000000000000000000BDC407CF89B7184EB3C41F732CB57131FE6FF5E89C91C54CBA8865ABE14913BBA2F9C630340ACA0102000000000000000000000000000000000000000000000058010000C00000000102000001000000ADCC7698B447DB4BB65E16F193C4F3DB0000000000000000000000000000000002000000000000000000000000000000000000000000000018020000800000000102000000000000B0A03EDC44A19747B95B53FA242B6E1D0000000000000000000000000000000002000000000000000000000000000000000000000000000098020000080100000102000000000000011D1E8AF94257459C33565E5CC3F7E8000000000000000000000000000000000200000000000000000000000000000000000000000000007F010000000000000002040000030000230F10000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000007000000000000000000000000000000230F10000008020009208000FFFB8B170000000000000000000000000000000000000000000000000000000000000000B3F8F31CB1C5A249AA595EEF92FFA63C01000000000000009E07C080060000000000000000000000000000000000000000000000000000000000000000000000010000000200000000269F35340ACA010000000000000000000000000000000000000000040000000F0C0E0010000080000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000</Data>
      </EventData>
    </Event>

    Thursday, July 23, 2009 12:00 AM
  • Have the same error.

    Description:
    A corrected hardware error has occurred.

    Component: AMD Northbridge
    Error Source: Machine Check Exception
    Error Type: 14
    Processor ID: 0

    Harware: nFrorce 730a (GeForce 8300) motherboard, AMD Athlon 64 X2 7750+ BE CPU, DDR1066 2x2048Mb.

    Software: OS - Windows 7 x64 Build 7600, Drivers - nForce Driver 15.26, GeForce/ION Driver Release 190.38.

    And I'm just interesting is that really hardware or just software bug of Windows 7. I don't use sleep/hibernation too.

     

    Saturday, August 01, 2009 9:02 AM
  • Maxim, your problems are definitely from a hacked version of Win 7. Build 7100 is the version supported here and the only official public version.

    All,
    I found this interesting response from a Vista thread which relates to AMD processors and the machine check error. Here's the Nov 1, 2008 response:

    "AMD provides a tool called MCAT (Machine Check Exception Analysis Tool). You can find it on the AMD site in the downloads/processor utilities section. Choose the version appropriate to your processor. It installs a command line utility. To use it, execute mcat on the command line. There are several ways to use it. Execute mcat /? to see the possible options. Hopefully it can shed some light on the source of your problems.

     

    Modern processors such as yours support advanced self-diagnostic capabilities, such that they analyze their own behavior and are capable of recognizing conditions that "can't happen". When one of these not-possible conditions occur, the CPU raises a "machine check exception". Windows is simply retrieving the information provided by the processor and recording it in the event log.

     

    You mention on this thread that you've tried many different processors. That would be my first recommendation. This eliminates the processor as a potential source of the problem. Several other possible causes still remain.

    • Poor voltage regulation (i.e. power supply problem, voltage regulator malfunction, capacitor degradation)
    • Damage due to power spikes
    • Static damage to the motherboard
    • Incorrect processor voltage setting in the BIOS (too low or too high)
    • Overclocking
    • Permanent motherboard or power supply damage caused by prior overclocking
    • Excessive temperature caused by insufficient airflow (possibly caused by fan failure or blockage of air inlet/outlet)
    • Improper BIOS initialization (the BIOS configuring the motherboard or CPU incorrectly)
    • Installation of a processor that is too much for your motherboard to handle (excessive power requirement, incompatibility)
    • Defective hardware that may be drawing excessive power or otherwise disrupting proper voltage regulation

    The TLB is a part of the processor that manages translation of virtual addresses to physical addresses (to implement advanced memory management required for a multitasking operating system). The processor uses "page tables" to map virtual addresses to physical addresses. The TLB caches information from the page tables so it won't have to keep looking up this information. The machine check exception in your post shows that, while the processor was attempting to fetch instructions to be executed, it detected invalid information in the TLB. This can lead to major software errors, so it raises a machine check exception. If windows doesn't crash, the processor may have detected a benign error, but still an error worth reporting since it "can't happen". It is highly unlikely that this sort of error could be caused by software (i.e. Windows), since the operation of the TLB is entirely automatic and managed by the processor itself.

     

    Machine check exceptions are almost always caused by a hardware problem.

     

    Doug Gale [MSFT]"


    The problem with troubleshooting is that trouble shoots back.
    Saturday, August 01, 2009 11:31 AM
  • Thank you for your quick answer. I'm really know all main possible reasons of that error, but... I've tested all components of my PC in all kind of tests (including stress test in S&M) and there was no errors (not S&M reported of them, nor events in system log). All temperatures in normal ranges in all modes of operation (at stand-by and at heavy load) and it's not strange case I have Hi-End coolers on all components (I've installed them personally, so I'm sure the contact of cooler surface with chips is good). PSU have more than enough power reserve for my configuration. I didn't noticed that this error crashes my system. There is no BSOD and no problems with stability. But... I'm still not really happy with this error messages. And one more detail. They are occurs every time I reboot the system. At system start-up actually. And they are not occurred at a time of stress or any other type of tests. And that is really strange you know... So I assumed, that this error have a software nature. Of course I'm not sure and that is the problem:).
    About your advices. I've download the AMD Utility and I'm shame to admit it, but I don't understand how it works:). I have experience of UNIX based systems administration, but I don't understand what should I do with this stuff:). Maybe all this case I worked today toooo much:). The Vista thread describes not exactly the same situation and I've read it already:). Can you please tell me some more about self-diagnostic capabilities of my kind of CPU? Is there any utilities, that can help Me run this self-diagnostic tests and get results of their work?
    Thank you again for your time. And sorry if my english is not very good:).
    Sunday, August 02, 2009 1:30 AM
  • I'd like to figure out exactly what the error pertains to, as I know it happened after returning from sleep, where it took out my Bluetooth transceiver in the process:


    Additionally, this is a brand new system:

    Asus M3N78-VM
    AMD Phenom II X3 710
    8GB OCZ Reaper HPC 1066MHz
    PNY GeForce 9400 GT 512MB
    I've noticed the similar connection of this error to "Bluetooth Support Service", but at start-up in my case. Don't use sleep mode, but I guess I'll check it too. One more thing. The service "Portable Device Enumerator Service" cause exactly the same error in event log and at start-up too. Didn't noticed any other service causing the same effect yet.
    And have the similar hardware by the way.

    ASUS M3N78-EM
    AMD Phenom II X2 550 & Athlon X2 7750+
    Kingston HyperX 2x2Gb 1066MHz
    ASUS ENGTS250DK TOP & MSI N9500GT-MD1G-OC

    Installed Phenom II X2 550 instead of Athlon X2 7750+. No changes. So problem is not the CPU for sure. Memory was replaced earlier, so it's not the RAM too. All possible hardware reason I can see is MB. Seems like bug described by Ronnie Vernon with only one difference. I don't use sleep/hibernation and even if send PC to sleep, don't got this warning, but got it at system start-up. One or more devices fail to initialize correctly at system start-up. It can be because of MB hardware malfunction, OS or drivers bugs.

    • Edited by Maxim Luppov Wednesday, August 12, 2009 12:10 PM
    Sunday, August 02, 2009 2:49 PM
  • Maxim, your problems are definitely from a hacked version of Win 7. Build 7100 is the version supported here and the only official public version.
    Have the build 7100 of x86 Windows 7 installed too. The same thing. Exactly the same error at the same conditions.
    And my Windows is definitely not hacked. It's RTM release.
    Sunday, August 02, 2009 2:54 PM
  • Just an update: Yesterday I uninstalled the Asus ATK0110 ACPI driver that Windows Update originally installed (driver dated 2004) and installed a more recent version dated May 5, 2009 that came with Asus PC Probe II. I did this because the software installation would always hang at five percent and eventually error out about trying to install the included ACPI driver. Moreover, when I uninstalled the driver, I noticed Windows Update had an updated driver as well dated July 27, 2009.

    As it currently stands, my computer is actually going to sleep again after it randomly stopped doing so for several weeks. With everything running for the past 24 hours, including a few sleeps and reboots, I haven't noticed any more machine check exceptions. Although, the Bluetooth transceiver (Microsoft) is still acting up with "Bluetooth HID device (00:50:f2:e3:5c:83) either went out of range or became unresponsive." messages, but it's still usable. Anyway, I'm picking up a new Bluetooth transceiver from Newegg today (cirago BTA-6210 USB 2.0 Micro Bluetooth Dongle support Bluetooth 2.1) which uses the Toshiba stack in order to get more profile support--and it also has better range. So I'll see if those messages go away as well and post here with any more findings.


    Device Manager - ASUS ATK0110 ACPI UTILITY
    • Edited by Ryan Capp Thursday, August 13, 2009 6:28 PM Image
    Thursday, August 13, 2009 6:22 PM
  • Update 2: Still problem-free.
    Saturday, August 15, 2009 12:55 PM
  • I've updated the driver, but the error persists:(.
    Thursday, September 03, 2009 5:24 PM
  • Where can I get Sandra?  Thank you,  Michael
    Michael vallino
    Wednesday, September 09, 2009 10:42 PM
  • Maxim Luppov,
    ASUS has recently posted Win 7 x64 drivers for M3n78-EM. Have you tried those? Get them here. 0606 BIOS, etc.
    ASUS P4PE-P4 Extreme 3.4gHz-2gb G.Skill PC3200-XPSP3:55gb-Win7RC:17gb-DATA:500gb-ASUS N6800GT
    Monday, September 21, 2009 1:57 PM
  • Updated the BIOS to 0704 and MB drivers to 15.45 and it seems that the problem is solved:).
    Thanks, man. I've didn't checked the ASUS support site for a while. Meanwhile they released 2 new versions of BIOS and new drivers:).
    So, as I supposed from the start the problem had a software nature:).
    Thursday, October 01, 2009 10:13 AM