Please let us know the version of the Linux agent installed in Ubuntu 20.04 VM.
Do you use multiple monitors?
If so, please check if the whole resolution exceeds 8192(8K) in any direction, landscape or portrait. For details, please consult the section "Multiple Monitors" in the page https://docs-staging.vmware.com/en/VMware-Horizon/2006/linux-desktops-setup/GUID-67F7E8D6-E98C-4242-...
Hi,
Thanks for the reply:
~# cd VMware-horizonagent-linux-x86_64-2103-8.2.0-17771892/
~/VMware-horizonagent-linux-x86_64-2103-8.2.0-17771892# cat Product.txt
VMware-horizonagent-linux-x86_64-2103-8.2.0-17771892
Hi,
Our setup for the VM does support multiple monitors but I did the launch attempt from a single monitor using my laptop monitor that has display size 14" and resolution set to 1920 x 1080 (resolution recommended by Win10).
Hi,
As the limitation of llvmpipe, it panics if the resolution exceeds 8K. But seems your case is different.
Could you provide the logs collected by running /usr/lib/vmware/viewagent/bin/dct-debug.sh?
Thanks,
Hang
Hi Hang,
Thanks for your time. Can you provide me a secure way to deliver the logs? An email address or maybe open a new case for us on the closed Customer Support side? My organization has an account there.
Best,
Juhana
Hi Juhana,
If possible, you can send the logs to me and our team, hangl(at)vmware(dot)com and linux-agent-bj-dev(at)vmware(dot)com
I'll give it an investigation and respond ASAP.
Thanks,
Hang
Hi Hang,
Thanks, logs are in the email now. The attachment size is 28MB but I will much rather deliver them like this than post here publicly as sensitive information may be enclosed.
But thanks for helping us.
Best,
Juhana
Hi Juhana,
I didn't get the mail. Please double check.
Regards,
Hang
Hi Hang,
Thanks for the notification. I received an autoreply email from vmware.com that the mail was too large to send. I will use a local delivery system for large files next. You should receive a download link via email.
Hi Juhana,
In syslog, there are some suspicious errors,
May 4 09:35:24 vdi-*** gnome-shell[5096]: Getting invalid resource scale property
May 4 09:35:24 vdi-*** kernel: [ 621.258117] traps: llvmpipe-0[5103] trap invalid opcode ip:7f741c022087 sp:7f7451e8d340 error:0
Did the customer ever set the scale factor?
Hi Hang,
Thanks for the reply and your input - and sorry for the delay. I don't think it's possible that the customer has set the scale factor. The crash happens right after when a session is started via VMWare Horizon client and no desktop ever appears, just a black screen for 1-2 seconds and then disappears.
I wonder if we can set and force a valid scale factor for the session from the command line somewhere?
Moreover, our setup for these VMs should be that the MATE desktop is loaded by default so I'm surprised to see gnome-shell entries in the log. I need to investigate that all settings are OK regarding the default desktop.
Hi Juhana,
Yes, your desktop environment is MATE. However the greeter(for login) is still gnome-shell and the issue just occurred at that time.
And seems that the gnome-shell was updated from 3.36.4 to 3.36.7. So I suggest to downgrade the gnome-shell and reinstall viewagent see if the issue is still there.
Hi Hang,
Thanks for the advice. I removed the vGPU from the instance, uninstalled the NVIDIA module (we had installed this between our correspondence here as a workaround for the customer). Then I downgraded gnome-shell:
~# apt install gnome-shell-common=3.36.4-1ubuntu1~20.04.2 gnome-shell=3.36.4-1ubuntu1~20.04.2
...
~# gnome-shell --version
GNOME Shell 3.36.4
Then re-installed the viewagent, rebooted the machine. Unfortunately, the "traps"-entries still appear in the logs and the behavior is the same, in VMWare Horizon Client no session opens to this machine, a black screen briefly appears, then disappears and the client returns to the list of available VM pools.
-Juhana
Hi Juhana,
Since the issue occurs in gnome-shell(greeter), I do suspect this is strongly related to it.
Please try below two ways separately.
1. Reinstall gnome-shell and reboot without reinstalling viewagent.
2. Disable all gnome-shell extensions.
gsettings set org.gnome.shell disable-user-extensions true
and move all sub-folders out of /usr/share/gnome-shell/extensions/
has anyone tested this or hace a solution?
I had the same basic issue using Proxmox as the hypervisor instead of VMware. This forum post got me in the right direction.
I was able to solve the problem by removing /var/lib/gdm3/.cache/mesa_shader_cache that was somehow causing the scaling error, and the resulting llvm problems. My VM image had been created from a system using an NVidia GPU for display, in case that's helpful.
Hi all,
This issue seems to be corrected in Ubuntu 20.04 kernel 5.11.0-27-generic
Hi,
Apparently this is not a kernel issue but a rare occasion being tied to an image, where the VM is cloned from.
In case that the original image has been previously running on a newer processor architecture, there might be precompiled caches at /var/lib/gdm3/.cache/mesa_shader_cache pointing to instructions, which are not supported on the current hardware. Hence invalid opcode.
Apparently there is a fix on its way. Newest Redhat has it, maybe Ubuntu soon as well.
https://bugzilla.redhat.com/show_bug.cgi?format=multiple&id=1982746
And personally I'm running the following snippet as part of the post-customization userscript (the 'log' there is just a function call to a logging wrapper)
CACHEDIR=/var/lib/gdm3/.cache/mesa_shader_cache
if [ -d "$CACHEDIR" ]
then
rm -rf $CACHEDIR
log "removed $CACHEDIR"
else
log "$CACHEDIR not found"
fi