VMware Cloud Community
cz1138
Contributor
Contributor

Guest Servers Shutting Down - What is Triggering This?

I may be posting this in the wrong place, if so, please direct me to the proper forum.

I have an ESXi server with four guest machines.  Two of the four shut down on Monday or Tuesday, sometime between 5 AM and 10 AM.  The guest machines are running Windows server.  The logs say:

 

"The kernel power manager has initiated a shutdown transition.

Shutdown Reason: Kernel API"

I suspect that ESXi is shutting the servers down, but not sure why or how.  Would love some help in trying to root this out.

Please advise.

Tags (1)
0 Kudos
11 Replies
scott28tt
VMware Employee
VMware Employee

I have reported your post to the forum moderators so one of them should move it to the correct area for you (please don't create a duplicate)

-------------------------------------------------------------------------------------------------------------------------------------------------------------

Although I am a VMware employee I contribute to VMware Communities voluntarily (ie. not in any official capacity)
VMware Training & Certification blog
0 Kudos
miladmeh8
Hot Shot
Hot Shot

are you sure that windows machines have been activated currectly?

0 Kudos
cz1138
Contributor
Contributor

Yes.

 

I've attached a screenshot of the Events page for this server.  

Nothing I've found so far has helped me root out what is triggering this shutdown.  Since it's only two of the four VMs on this box, I think ESXi is doing this.

 

 

 

 

0 Kudos
markey165
Expert
Expert

There are a lot of information gaps here. Is this a standalone ESXi host or is it managed by vCenter. Do you have the option of moving the VMs to another host to see if the problem follows the VM(s)?

 

It would help to know which version of ESXi you're running? What version(s) of Windows are the VMs affected? and what OS are the VMs that are unaffected?

 

As asked above by @miladmeh8,  is Windows properly activated, as an expired license can cause such behaviour. Its a good question and it needs confirming the license status shows as "activated"

HTH

 

 

_____________________________________________
If this post helps you, please leave Kudo | or mark this reply as an answer
0 Kudos
miladmeh8
Hot Shot
Hot Shot

@markey165 

Yea, if the windows is downloaded from microsoft website directly for a test or trial version without activating that defenately after some hours of working it will be powered off.

0 Kudos
Alfista_PS
Hot Shot
Hot Shot

Hi,

 

do you have running HA and DRS on your system and do you have enough resources on the host to run in it?

When do you have enabled HA and DRS and you don't have enough resources on your hosts then the system will disable (shutdown) some hosts for that isn't enough resources. I had the same problem.

Look at it.

 

Alfista
----------------------
Audio-Video Accessories
Selling and Integration of Audio & Video Accessories and Technology
If my answer has resolved your problem please mark as RESOLVED or if it has only was a good help then give me the KUDOS. Thanks.
0 Kudos
AnaghB
Enthusiast
Enthusiast

Hello @cz1138 ,

VMs Powering OFF automatically can be caused by many reasons.

1. Windows is not Liscenced.

2. We have allocated more CPU and Memory to VMs and Esxi host is unable to satisfy the allocation.

3. The HA condition is not met hence it is Powering OFF the VMs.

Please select the Cluster on which you have the hosts and the VMs and then select Monitor - Tasks and see what is the task or event that is triggerred while Powering OFF the VM.

Share that Task/Event and we will have the answer.

Anagh B
VCIX-DCV6.5, VSAN Specialist
Please mark help full or correct if my answer is use full for you
0 Kudos
cz1138
Contributor
Contributor

@AnaghB 

 

Thank you for the input.

Looking at the UI Host > Monitor > Events, I only see entries for today.  "Tasks" are the same.  Doesn't look like there's a way to look at previous dates.  

0 Kudos
cz1138
Contributor
Contributor


@Alfista_PS:

 

I'm a bit new to this side of VMWare, I'm not sure what "HA" and "DRS" mean in this context.  

Someone else mentioned the possibility of a resource overrun creating an issue.  I do wonder about that.

There are two processors on the host machine.  They are Intel Xeon Silver 4309Y CPUs with 2.80GHz clocks.  They have 8 cores each.

The first server that shuts down appears to have the following configuration:

For CPU the first value is 16, Cores per Socket is 8.  So this VM was configured for a system that has 16 CPUs with 8 cores per socket, correct?  Looking at the second VM I see the same configuration.

I reckon these may be our smoking gun.

 

0 Kudos
AnaghB
Enthusiast
Enthusiast

Hello @cz1138 ,

 

The HA and DRS are resource allocation tools used to protect such events. For this issue we need to get on zoom session for 20 mins and need to check the config so the cause of the issue can be identified.

Anagh B
VCIX-DCV6.5, VSAN Specialist
Please mark help full or correct if my answer is use full for you
0 Kudos
Alfista_PS
Hot Shot
Hot Shot

@cz1138 

You have only one host cluster?

 

HA - high availability

enable that if one host is down that other will take its jobs

DRS - Distributed Resource Scheduler

distribute resources across all host that all host are equal usability or how you have it set in the Cluster -Configure - Services settings.

 

I thing I have find your problem. You have the added to the VM's more resources as you have and your additional settings of the host (ESXi or vCenter) then shut down to VM's while the host resources are overloaded.

You need to follow some rules by configuration:

  1. mare a table where you will have all CPU's (cores x 2 if your CPU has HT). Then count all added vCPUs added to all VM's and it shouldn't be more as are real on the host.
  2. the same as in point 1. you shout do it with memory.
  3. For Win VM's don't use more cores per CPU and also too many CPU's while the OS can't correctly use it and the vCPU's only block this resources
  4. its needed to check in reconfiguration in VM settings - power management. can't cause the power shutdown.

 

Alfista
----------------------
Audio-Video Accessories
Selling and Integration of Audio & Video Accessories and Technology
If my answer has resolved your problem please mark as RESOLVED or if it has only was a good help then give me the KUDOS. Thanks.
0 Kudos