<div dir="ltr">Thanks Ulrich!<div><br></div><div>Yes indeed the system was using the libGL of nvidia. I did these steps to recover the system:</div><div><br></div><div>yum reinstall libglvnd* mesa*<br></div><div><br></div><div>That fixes most of the symlinks of libGL in /usr/lib64 to again point to the system's libGL.</div><div><br></div><div>Except for this one:</div><div> # ll /usr/lib64/libGLX_indirect.so.0 <br>lrwxrwxrwx. 1 root root 26 Jul 1 16:03 /usr/lib64/libGLX_indirect.so.0 -> libGLX_nvidia.so.470.42.01<br></div><div><br></div><div>So I did this:</div><div><br># cd /usr/lib64<br># rm -f libGLX_indirect.so.0<br># ln -s libGLX_mesa.so.0.0.0 libGLX_indirect.so.0<br># ll /usr/lib64/libGLX_indirect.so.0 <br>lrwxrwxrwx. 1 root root 20 Jul 14 11:43 /usr/lib64/libGLX_indirect.so.0 -> libGLX_mesa.so.0.0.0<br></div><div><br></div><div>Now glxinfo (and firefox) don't crash nxagent!</div><div><br></div><div>Not sure of the full implications of manually modifying that symlink.</div><div><br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Tue, 13 Jul 2021 at 20:07, Ulrich Sibiller <<a href="mailto:uli42@gmx.de">uli42@gmx.de</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">I think the problem is that the system now uses the libGL of nvidia<br>
instead of the libGL that came with the system. So try to reinstall<br>
the lib. See <a href="https://forums.developer.nvidia.com/t/multiple-glx-client-libraries-in-the-nvidia-linux-driver-installer-package/41308" rel="noreferrer" target="_blank">https://forums.developer.nvidia.com/t/multiple-glx-client-libraries-in-the-nvidia-linux-driver-installer-package/41308</a><br>
for some details regarding the libGL handling in the nvidia driver.<br>
<br>
Hope that helps!<br>
<br>
Uli<br>
<br>
On Tue, Jul 13, 2021 at 9:05 AM Norman Gaywood <<a href="mailto:ngaywood@une.edu.au" target="_blank">ngaywood@une.edu.au</a>> wrote:<br>
><br>
> We have some centos 7 systems with GPUs that users access with x2go to run their machine learning task.<br>
><br>
> After an update to cuda on the centos 7 systems, nxagent now segfaults when I run glxinfo or firefox.<br>
><br>
> nxagent-3.5.99.26-1.el7.x86_64<br>
> x2goserver-4.1.0.3-9.el7.x86_64<br>
> cuda-11.4.0-1.x86_64<br>
> kmod-nvidia-latest-dkms-470.42.01-1.el7.x86_64<br>
><br>
> What's also interesting is that if I x2go into a host that does not have cuda installed, and then:<br>
> ssh -Y cudahost glxinfo<br>
> then the nxagent on the non-cuda host segfaults.<br>
><br>
> This happens with glxinfo and when trying to start firefox. google-chrome works fine.<br>
><br>
> This was all working fine until I updated cuda and the GPU driver. On Centos 7.9.2009<br>
> This happens on a host with a K80 GPU and another host that has a V100 GPU.<br>
><br>
> I also have cuda-11.4 on a Fedora 34 host with a V100 GPU.<br>
> If I x2go (or ssh -Y) to the Fedora 34 host, glxinfo (and firefox) run fine.<br>
><br>
> nxagent-3.5.99.26-1.fc34.x86_64<br>
> x2goserver-4.1.0.3-10.fc34.x86_64<br>
> cuda-11.4.0-1.x86_64<br>
> kmod-nvidia-latest-dkms-3:465.19.01-1.fc33.x86_64<br>
><br>
> Any suggestions on how I might provide some debugging information to the developers?<br>
><br>
><br>
> abrt-cli list --since 1626070479<br>
> id 35db2c461122be7229abdbe219ddfd92d0613da8<br>
> reason: nxagent killed by SIGSEGV<br>
> time: Mon 05 Jul 2021 09:37:02 AEST<br>
> cmdline: x2goagent -nolisten tcp -nolisten tcp -dpi 97 -D -auth /home/ngaywood/.Xauthority -geometry 2560x1440 -name X2GO-ngaywood-50-1625441817_stDMATE_dp24 :50<br>
> package: nxagent-3.5.99.26-1.el7<br>
> uid: 5125 (ngaywood)<br>
> count: 21<br>
> Directory: /var/spool/abrt/ccpp-2021-07-05-09:37:02-2523<br>
><br>
><br>
><br>
> --<br>
> Norman Gaywood, Computer Systems Officer<br>
> School of Science and Technology<br>
> University of New England<br>
> Armidale NSW 2351, Australia<br>
><br>
> <a href="mailto:ngaywood@une.edu.au" target="_blank">ngaywood@une.edu.au</a> <a href="http://turing.une.edu.au/~ngaywood" rel="noreferrer" target="_blank">http://turing.une.edu.au/~ngaywood</a><br>
> Phone: +61 (0)2 6773 2412 Mobile: +61 (0)4 7862 0062<br>
><br>
> Please avoid sending me Word or Power Point attachments.<br>
> See <a href="http://www.gnu.org/philosophy/no-word-attachments.html" rel="noreferrer" target="_blank">http://www.gnu.org/philosophy/no-word-attachments.html</a><br>
> _______________________________________________<br>
> x2go-user mailing list<br>
> <a href="mailto:x2go-user@lists.x2go.org" target="_blank">x2go-user@lists.x2go.org</a><br>
> <a href="https://lists.x2go.org/listinfo/x2go-user" rel="noreferrer" target="_blank">https://lists.x2go.org/listinfo/x2go-user</a><br>
</blockquote></div><br clear="all"><div><br></div>-- <br><div dir="ltr" class="gmail_signature"><div dir="ltr"><div>Norman Gaywood, Computer Systems Officer</div><div>School of Science and Technology<br>University of New England</div><div>Armidale NSW 2351, Australia<br><br><a href="mailto:ngaywood@une.edu.au" target="_blank">ngaywood@une.edu.au</a> <a href="http://turing.une.edu.au/~ngaywood" target="_blank">http://turing.une.edu.au/~ngaywood</a></div><div>Phone: +61 (0)2 6773 2412 Mobile: +61 (0)4 7862 0062<br><br>Please avoid sending me Word or Power Point attachments.<br>See <a href="http://www.gnu.org/philosophy/no-word-attachments.html" target="_blank">http://www.gnu.org/philosophy/no-word-attachments.html</a><br></div></div></div>