[X2Go-User] nxagent SIGSEGV when running glxinfo (and firefox) on a host with cuda installed
Norman Gaywood
ngaywood at une.edu.au
Wed Jul 14 03:51:10 CEST 2021
Thanks Ulrich!
Yes indeed the system was using the libGL of nvidia. I did these steps to
recover the system:
yum reinstall libglvnd* mesa*
That fixes most of the symlinks of libGL in /usr/lib64 to again point to
the system's libGL.
Except for this one:
# ll /usr/lib64/libGLX_indirect.so.0
lrwxrwxrwx. 1 root root 26 Jul 1 16:03 /usr/lib64/libGLX_indirect.so.0 ->
libGLX_nvidia.so.470.42.01
So I did this:
# cd /usr/lib64
# rm -f libGLX_indirect.so.0
# ln -s libGLX_mesa.so.0.0.0 libGLX_indirect.so.0
# ll /usr/lib64/libGLX_indirect.so.0
lrwxrwxrwx. 1 root root 20 Jul 14 11:43 /usr/lib64/libGLX_indirect.so.0 ->
libGLX_mesa.so.0.0.0
Now glxinfo (and firefox) don't crash nxagent!
Not sure of the full implications of manually modifying that symlink.
On Tue, 13 Jul 2021 at 20:07, Ulrich Sibiller <uli42 at gmx.de> wrote:
> I think the problem is that the system now uses the libGL of nvidia
> instead of the libGL that came with the system. So try to reinstall
> the lib. See
> https://forums.developer.nvidia.com/t/multiple-glx-client-libraries-in-the-nvidia-linux-driver-installer-package/41308
> for some details regarding the libGL handling in the nvidia driver.
>
> Hope that helps!
>
> Uli
>
> On Tue, Jul 13, 2021 at 9:05 AM Norman Gaywood <ngaywood at une.edu.au>
> wrote:
> >
> > We have some centos 7 systems with GPUs that users access with x2go to
> run their machine learning task.
> >
> > After an update to cuda on the centos 7 systems, nxagent now segfaults
> when I run glxinfo or firefox.
> >
> > nxagent-3.5.99.26-1.el7.x86_64
> > x2goserver-4.1.0.3-9.el7.x86_64
> > cuda-11.4.0-1.x86_64
> > kmod-nvidia-latest-dkms-470.42.01-1.el7.x86_64
> >
> > What's also interesting is that if I x2go into a host that does not have
> cuda installed, and then:
> > ssh -Y cudahost glxinfo
> > then the nxagent on the non-cuda host segfaults.
> >
> > This happens with glxinfo and when trying to start firefox.
> google-chrome works fine.
> >
> > This was all working fine until I updated cuda and the GPU driver. On
> Centos 7.9.2009
> > This happens on a host with a K80 GPU and another host that has a V100
> GPU.
> >
> > I also have cuda-11.4 on a Fedora 34 host with a V100 GPU.
> > If I x2go (or ssh -Y) to the Fedora 34 host, glxinfo (and firefox) run
> fine.
> >
> > nxagent-3.5.99.26-1.fc34.x86_64
> > x2goserver-4.1.0.3-10.fc34.x86_64
> > cuda-11.4.0-1.x86_64
> > kmod-nvidia-latest-dkms-3:465.19.01-1.fc33.x86_64
> >
> > Any suggestions on how I might provide some debugging information to the
> developers?
> >
> >
> > abrt-cli list --since 1626070479
> > id 35db2c461122be7229abdbe219ddfd92d0613da8
> > reason: nxagent killed by SIGSEGV
> > time: Mon 05 Jul 2021 09:37:02 AEST
> > cmdline: x2goagent -nolisten tcp -nolisten tcp -dpi 97 -D -auth
> /home/ngaywood/.Xauthority -geometry 2560x1440 -name
> X2GO-ngaywood-50-1625441817_stDMATE_dp24 :50
> > package: nxagent-3.5.99.26-1.el7
> > uid: 5125 (ngaywood)
> > count: 21
> > Directory: /var/spool/abrt/ccpp-2021-07-05-09:37:02-2523
> >
> >
> >
> > --
> > Norman Gaywood, Computer Systems Officer
> > School of Science and Technology
> > University of New England
> > Armidale NSW 2351, Australia
> >
> > ngaywood at une.edu.au http://turing.une.edu.au/~ngaywood
> > Phone: +61 (0)2 6773 2412 Mobile: +61 (0)4 7862 0062
> >
> > Please avoid sending me Word or Power Point attachments.
> > See http://www.gnu.org/philosophy/no-word-attachments.html
> > _______________________________________________
> > x2go-user mailing list
> > x2go-user at lists.x2go.org
> > https://lists.x2go.org/listinfo/x2go-user
>
--
Norman Gaywood, Computer Systems Officer
School of Science and Technology
University of New England
Armidale NSW 2351, Australia
ngaywood at une.edu.au http://turing.une.edu.au/~ngaywood
Phone: +61 (0)2 6773 2412 Mobile: +61 (0)4 7862 0062
Please avoid sending me Word or Power Point attachments.
See http://www.gnu.org/philosophy/no-word-attachments.html
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.x2go.org/pipermail/x2go-user/attachments/20210714/002cbecb/attachment.htm>
More information about the x2go-user
mailing list