Support in other languages: 
Reply
Ctrl-Alt-Del
blinkingled
Posts: 21
Registered: ‎09-09-2011
Location: United States
0
Accepted Solution

X220 Machine Check - Thermal

[ Edited ]

I have a new X220 with the updated 1.21 BIOS. I am running Linux on it and I periodically get MCE in the system logs - these were present when I ran Windows too. These only appear when the system is under load - when doing compilations, video encoding etc.

 

My question - is that normal with this laptop or should I be concerned? The MCEs all seem to be Thermal/Throttling related (as opposed to bad RAM for e.g.)

 

 2062.384597] CPU1: Core temperature/speed normal
[ 2062.384600] CPU2: Package temperature/speed normal
[ 2062.384602] CPU0: Core temperature/speed normal
[ 2062.384603] CPU3: Package temperature/speed normal
[ 2062.384605] CPU0: Package temperature/speed normal
[ 2062.384607] CPU1: Package temperature/speed normal
[ 2098.375910] [Hardware Error]: Machine check events logged
[ 2362.011683] CPU0: Core temperature above threshold, cpu clock throttled (total events = 32474)
[ 2362.011687] CPU3: Package temperature above threshold, cpu clock throttled (total events = 35277)
[ 2362.011689] CPU1: Core temperature above threshold, cpu clock throttled (total events = 32474)
[ 2362.011692] CPU2: Package temperature above threshold, cpu clock throttled (total events = 35277)
[ 2362.011694] CPU1: Package temperature above threshold, cpu clock throttled (total events = 35277)
[ 2362.011697] CPU0: Package temperature above threshold, cpu clock throttled (total events = 35277)
[ 2362.013644] CPU0: Core temperature/speed normal
[ 2362.013647] CPU3: Package temperature/speed normal
[ 2362.013649] CPU1: Core temperature/speed normal
[ 2362.013651] CPU2: Package temperature/speed normal
[ 2362.013653] CPU1: Package temperature/speed normal
[ 2362.013655] CPU0: Package temperature/speed normal
[ 2398.018594] [Hardware Error]: Machine check events logged


Punch Card
Amphibulus
Posts: 75
Registered: ‎07-22-2011
Location: Canada
0

Re: Machine Check - Thermal

What is the value of the threshold?

ThinkPad X220, I7 2620M, 8GB RAM, Intel 320 160GB SSD, IPS screen, USB 3.0, Bluethoot, Cam, Win 7 pro 64, BIOS: 1.17, Model: 4286-CTO, Fabrication date: 07/11
Ctrl-Alt-Del
blinkingled
Posts: 21
Registered: ‎09-09-2011
Location: United States
0

Re: Machine Check - Thermal


Amphibulus wrote:

What is the value of the threshold?


Do you mean the temperature threshold? Where can I find that?

 

Thanks

Ctrl-Alt-Del
blinkingled
Posts: 21
Registered: ‎09-09-2011
Location: United States

Re: Machine Check - Thermal

[ Edited ]

If I clamp down scaling_max_freq to 2.4Ghz the errors disappear. As a bonus I don't get the infrequent throttling related MCEs - ie no overheating. Sounds like the the machine is not designed to really work out the CPU/GPU at it's maximum capacity.

For anyone else with X220 running Linux and having MCE errors logged (package power limit) - do this -

for i in 0 1 2 3; do echo 2400000 > /sys/devices/system/cpu/cpu$i/cpufreq/scaling_max_freq; done