Removing the watchdog is not a proper solution.

Interpreting (decoding) NMI sources from IML log messages
Apr. 25, 2009, in BladeSystem, Operations, ProLiant

If you are using the HP health drivers for ProLiant servers (or at least the hp-wdt

iLO Event Log

[ 5492.505988] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 3.10.0-123.9.2.el7.x86_64 #1
[ 5492.605615] Hardware name: HP ProLiant DL380p Gen8, BIOS P70 08/02/2014
[ 5492.692636] ffffffffa03ae2d8 17844fa82b224426 ffff880fffa06de0
The only error I found was the one above, in Onboard Administrator --> IML Log: System error ---> An Unrecoverable System Error (NMI) has occurred (System error code 0x0000002B, 0x00000000) ASR

It is also helpful to determine whether the issue can be reproduced without an OS.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation on how to enable and use -proposed.

This bug affects 3 people.
linux (Ubuntu): Fix Released, importance High, assigned to Andy Whitcroft, milestone Ubuntu ubuntu-15.03
Precise: Fix Released, importance High, assigned to Andy Whitcroft

https://access.redhat.com/solutions/1309033
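When comparing entries across several servers it can help to pull the two hexadecimal codes out of the IML text mechanically. The following is only a minimal sketch: the function name parse_iml_nmi and the regular expression are illustrative assumptions based on the two message variants quoted on this page ("System error code" and "Service Information"), not an HP-provided format specification.

```python
import re

# Assumed pattern: covers the two wordings quoted on this page, e.g.
# "... (System error code 0x0000002B, 0x00000000)" and
# "... (Service Information: 0x7fbce8f6, 0x00000000)".
IML_NMI_RE = re.compile(
    r"Unrecoverable System Error \(NMI\) has occurred\s*"
    r"\((?:System error code|Service Information):?\s*"
    r"(0x[0-9a-f]{8}),\s*(0x[0-9a-f]{8})\)",
    re.IGNORECASE,
)


def parse_iml_nmi(message):
    """Return the two codes as integers, or None if the text does not match."""
    match = IML_NMI_RE.search(message)
    if match is None:
        return None
    return int(match.group(1), 16), int(match.group(2), 16)


entry = ("An Unrecoverable System Error (NMI) has occurred "
         "(System error code 0x0000002B, 0x00000000)")
print(parse_iml_nmi(entry))
```

The sketch only extracts the codes; what a given code means still has to be looked up in the vendor documentation.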
06-02-2014, 07:51 AM #5, kaito.7:
Regarding the usage of watchdog on top of corosync and synchronization, yes I agree...

I will try that and let you know if that resolves the issue.
This is something I'll pursue.

iLO Application Watchdog Timeout NMI, Service Information 0x0000002b, 0x00000000

intel_idle+0xe7/0x160
[ 5493.663438] [
An Unrecoverable System Error (NMI) has occurred (Service Information: 0x7fbce8f6, 0x00000000)

There's an excellent chance there's something physically wrong with the server, and you should probably run a diagnostic check as soon as possible.

https://community.hpe.com/t5/ProLiant-Servers-ML-DL-SL/An-Unrecoverable-System-Error-has-occurred/td-p/4559531

Other distros run the watchdog timers just fine.
Please test the kernel and update this bug with the results.

It's sort of like built-in "SmartStart" diagnostics.

Per kernel team comments (on the kernel-team mailing list):
"""
We have been seeing random crashes from various HP systems; this has been tracked to loading of the hpwdt watchdog modules.

Integrated Management Log (IML) 2.
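Since the kernel team comment points at the hpwdt module, a first check on an affected Linux host is whether that module is loaded at all. This is a rough sketch, assuming the usual /proc/modules layout where the module name is the first whitespace-separated field; the helper names module_loaded and hpwdt_loaded are made up for illustration.

```python
from pathlib import Path


def module_loaded(name, modules_text):
    """Return True if `name` is the first field of any line in /proc/modules-style text."""
    for line in modules_text.splitlines():
        fields = line.split()
        if fields and fields[0] == name:
            return True
    return False


def hpwdt_loaded(proc_modules=Path("/proc/modules")):
    # /proc/modules only exists on Linux; treat an unreadable file as "not loaded".
    try:
        return module_loaded("hpwdt", proc_modules.read_text())
    except OSError:
        return False


if __name__ == "__main__":
    print("hpwdt loaded:", hpwdt_loaded())
```

This only reports whether the module is present; as noted at the top of the page, removing the watchdog is not a proper solution.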
Depending on your system, the reason for the NMI is logged in any one of the following resources: 1.

https://community.hpe.com/t5/ProLiant-Servers-ML-DL-SL/Unrecoverable-System-Error-NMI-has-occurred/td-p/4385412

early_idt_handlers+0x120/0x120
[ 5494.343686] [
intel_idle+0xe7/0x160 [ 5493.734432] <
So it looks like it is a hardware issue.

repair_env_string+0x5c/0x5c
[ 5494.262390] [
Edward Bustos (edward-bustos) wrote on 2015-03-18: #5
Per Dan Zink (HP FW/BIOS): I agree with Linda.

Thank you!
OA Syslog 3.

BRs,
Spyros
Rafael David Tinoco (inaddy) wrote on 2015-04-07: #11
Doing verification right now...

This seems to be a kernel/driver/firmware/platform issue that prevented the watchdog NMI from being reported in customer-friendly terms.

Has anyone had any progress with this fault?
Try removing them.
3.

I still do not have a solution for this.
Regards,
Gerald
That's why I wrote that our systems run SLES 11 with SP2.

Does HP System Management Homepage show any errors or warnings?