Nouveau Driver and Frequent Crashes

Been having frequent crashes requiring reboot.

Using: Ubuntu MATE (UM) 18.04 beta
Nvidia GeForce 6015 SE

Tests performed to date:

  1. Ran memtest…all good.

  2. Ran a live session of 18.04 beta…all good (at least for the 4 hours I was testing).

  3. Checked /var/log/syslog. At Apr 5 23:12:45 the Nouveau driver faults. This is precisely when the last crash occurred.

Q: Does this mean it’s a driver problem, and should I try another…if there is one?
Below are the /var/log/syslog and /var/log/kern.log excerpts with the Nouveau fault references. Any assistance would be greatly appreciated.

Syslog File
$ tail -f /var/log/syslog
Apr 5 23:10:34 hp systemd[1]: Starting Hostname Service...
Apr 5 23:10:34 hp dbus-daemon[762]: [system] Successfully activated service 'org.freedesktop.hostname1'
Apr 5 23:10:34 hp systemd[1]: Started Hostname Service.
Apr 5 23:12:45 hp kernel: [ 2747.266794] nouveau 0000:00:0d.0: bus: MMIO write of 024f0001 FAULT at 00b030
Apr 5 23:12:45 hp kernel: [ 2747.281668] nouveau 0000:00:0d.0: bus: MMIO write of 00000000 FAULT at 00b030
Apr 5 23:17:01 hp CRON[4340]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Apr 5 23:20:06 hp kernel: [ 3188.882687] nouveau 0000:00:0d.0: bus: MMIO write of 00000000 FAULT at 00b020
Apr 5 23:20:31 hp kernel: [ 3214.047023] nouveau 0000:00:0d.0: bus: MMIO write of 00dd0001 FAULT at 00b020
Apr 5 23:20:31 hp kernel: [ 3214.066994] nouveau 0000:00:0d.0: bus: MMIO write of 00000000 FAULT at 00b020
Apr 5 23:21:05 hp systemd-resolved[753]: Using degraded feature set (UDP) for DNS server 192.168.1.1.

Kernel Log File
$ tail -f /var/log/kern.log
Apr 5 22:54:49 hp kernel: [ 1671.645782] nouveau 0000:00:0d.0: bus: MMIO write of 00000000 FAULT at 00b030
Apr 5 23:12:45 hp kernel: [ 2747.266794] nouveau 0000:00:0d.0: bus: MMIO write of 024f0001 FAULT at 00b030
Apr 5 23:12:45 hp kernel: [ 2747.281668] nouveau 0000:00:0d.0: bus: MMIO write of 00000000 FAULT at 00b030
Apr 5 23:20:06 hp kernel: [ 3188.882687] nouveau 0000:00:0d.0: bus: MMIO write of 00000000 FAULT at 00b020
Apr 5 23:20:31 hp kernel: [ 3214.047023] nouveau 0000:00:0d.0: bus: MMIO write of 00dd0001 FAULT at 00b020
Apr 5 23:20:31 hp kernel: [ 3214.066994] nouveau 0000:00:0d.0: bus: MMIO write of 00000000 FAULT at 00b020
Apr 5 23:27:29 hp kernel: [ 3631.427793] kauditd_printk_skb: 28 callbacks suppressed
Apr 5 23:27:29 hp kernel: [ 3631.427798] audit: type=1400 audit(1522985249.352:40): apparmor="ALLOWED" operation="open" profile="libreoffice-soffice" name="/home/ork2k/.cache/mesa_shader_cache/index" pid=4608 comm="soffice.bin" requested_mask="wrc" denied_mask="wrc" fsuid=1000 ouid=1000
Apr 5 23:27:29 hp kernel: [ 3631.447134] nouveau 0000:00:0d.0: bus: MMIO write of 00c70001 FAULT at 00b020
Apr 5 23:27:30 hp kernel: [ 3632.648700] nouveau 0000:00:0d.0: bus: MMIO write of 00000000 FAULT at 00b020
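
To see at a glance how often each register address faults, the log lines above can be tallied. This is only a rough sketch: the function name `nouveau_fault_summary` is made up for illustration, and it assumes the log lines end with the faulting address exactly as in the excerpts above.

```shell
# Tally nouveau MMIO faults per register address in a kernel log file.
# Assumes each fault line ends with the address, e.g. "... FAULT at 00b030".
nouveau_fault_summary() {
    awk '/nouveau/ && /MMIO/ && /FAULT at/ { count[$NF]++ }
         END { for (addr in count) print addr, count[addr] }' "$1" | sort
}

# Example (reading the log may need sudo on some systems):
#   nouveau_fault_summary /var/log/kern.log
```

Each output line is an address followed by the number of faults logged for it, which makes it easier to compare one boot or one install against another.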

Hi @LoRider,

Have you opened Control Centre > Additional Drivers to see if there are any other drivers available for you?

Choose the recommended (tested) driver with the highest version number, should any be available to you! :smiley:

Wolfman

Thanks for your prompt response. Yes, I’ve checked for additional drivers; none are available.

There is one on the Nvidia site, but I’m a little hesitant to try that one. It mentions that it might be better to use the one supplied with the OS.

Does this mean I’ll be needing a new video card or is there another course of action that you can suggest?

Thanks for your help.

Yes

Maybe a less radical option would be to use another kernel. Since your hardware appears stable in the live session, maybe use the kernel from the LiveUSB.

Hi ouroumov

Was going to chroot the kernel from the Live USB…but opted to re-install 18.04 beta instead. I have nothing saved on this computer, so it seemed like an easier and quicker way to get a fresh kernel installed.

After booting, I ran the software updater a couple of times, rebooted, installed the drivers, updated the firmware and codecs, and rebooted again.

Once back in, I clicked on the MATE Tweak link and the same crash occurred: diagonal lines in the colors of the underlying window.

At the moment I’m watching /var/log/kern.log in the terminal and seeing numerous nouveau write faults appearing again.

Since the problem is pointing to a driver issue, I’m wondering whether I should have updated the drivers after I re-installed the OS. Not sure what video driver is set up upon installation. No additional drivers are available for the card either.

Not sure where to go from here…any help greatly appreciated. Thanks

So, I’m not an expert on driver issues, and maybe someone else will be able to offer a better suggestion, but if the update is causing the driver to stop working, maybe hold the update for now. This is not a viable long term solution but I don’t really see another one for the short term. :/

In order to determine the video driver used at any point, you should install the inxi command if you don’t have it already (it’s been nixed from the Bionic seed but Wimpy is going to put it back) and use it like so:

sudo inxi -G

You can check the installed version of the nouveau firmware package using:

apt-cache policy nouveau-firmware
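
Another way to see which kernel driver is actually bound to the card is `lspci -k`. The following is a small sketch, assuming `lspci` (from pciutils) is installed; the helper name `gpu_kernel_driver` is made up for illustration, and it only looks at the first device listed as a "VGA compatible controller".

```shell
# Print the kernel driver bound to the first VGA-class device.
# Reads `lspci -k` output on stdin.
gpu_kernel_driver() {
    awk '/VGA compatible controller/ { in_gpu = 1; next }
         in_gpu && /Kernel driver in use:/ { print $NF; exit }
         in_gpu && /^[^ \t]/ { in_gpu = 0 }'
}

# Example:
#   lspci -k | gpu_kernel_driver
```

If this prints `nouveau`, the open-source driver is in use; after switching to a proprietary driver it would be expected to print that driver's module name instead.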

Well, thanks for all your help, ouroumov. I will try to reinstall and not update to see what happens, checking the nouveau version with inxi -G, which I already installed a few days ago.

Been researching this problem extensively and, as a newbie to the Linux world, going on my 7th day actually with this wonderful OS, have learned quite a bit about Linux and its inner workings and find it fascinating.

This crashing video problem is a bit of a thorn in my side, I must admit…trying to chip away at some of my newbie enthusiasm…but I just find this OS too good to give up on it.

Maybe Wolfman whom I’ve spoken to recently has some further suggestions or ideas as to a course of action…or anyone else for that matter if they happen to stumble on this post. I’ll welcome any insight.

One last question for you ouroumov:

Any recommendations on video card models or brands that are more compatible and known to work well with UM?

Thanks again for all your help…

Hi LoRider,

It might well be a corrupt ISO download, but I really don’t know. I also have an Nvidia card, which is old now but still works and still has drivers available!

Try a new daily build ISO and do a fresh install again. I know it is a pain, but it is really all I can think of for now:

http://cdimage.ubuntu.com/ubuntu-mate/daily-live/current/HEADER.html :smiley:

Hi Wolfman

Well, some progress on the crash front: a combination of the ideas posted by ouroumov and yourself.

The Nouveau driver was faulting in 18.04, and you, Wolfman, suggested I install the proprietary driver from Additional Drivers…but none were available in 18.04.

Ouroumov said I should try another version or run off the Live USB to reload a fresh kernel. I only tested with the 18.04 Live USB.

So yesterday I installed 16.04 and got the option to choose the Nvidia legacy binary driver from Additional Drivers. Had to cycle through it a few times and change server to get the radio button to stick.

Then upon reboot I checked the log report, Nvidia driver was up and running and no faulting to be found anywhere and no instability.

Then I knew we had found something and it felt good…y’all know the feeling.

Then tried the same thing with 17.10…worked just as well. Currently running stable on 17.10 with the Nvidia driver.

The next thing I’ll be performing will be an upgrade to 18.04 and hope it ports the proprietary driver.

If it doesn’t, then I’ll try to install the driver through the terminal and cross my fingers. Will have to track down the terminal command for a video driver update …unless you all know it…
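
On Ubuntu-based systems the terminal counterpart of Additional Drivers is the `ubuntu-drivers` tool (from the ubuntu-drivers-common package). The sketch below pulls the recommended package name out of its listing. The helper name `recommended_driver` is made up for illustration, and the assumed layout of the `driver : <package> - ... recommended` lines may differ between releases. Whether a proprietary driver is offered for this particular card on 18.04 is not something this sketch can tell in advance.

```shell
# Print the first driver package marked "recommended".
# Reads `ubuntu-drivers devices` output on stdin; assumes lines shaped like
#   driver   : <package> - distro non-free recommended
recommended_driver() {
    awk '/^driver/ && /recommended/ { print $3; exit }'
}

# Example:
#   ubuntu-drivers devices | recommended_driver
# To install whatever the tool recommends in one step:
#   sudo ubuntu-drivers autoinstall
```

If the helper prints nothing, no driver is flagged as recommended for the hardware on that release.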

So problem solved!…thanks Wolfman and ouroumov. With your insight and excellent suggestions, and a little bit of sweat on my part, it all works!!!

Flying high again…thanks

Hello LoRider, thanks for detailing the issue.
I have similar crashes, using Ubuntu 18.04 LTS and kernel 5.4.0-51-generic with the default driver installed.
I have tried alternative drivers under the same OS, but I couldn't get them set up properly: between a recursive logout issue, not being able to set the screen resolution right, etc., I gave up and went back to the built-in driver.
Every now and then I have a crash. The log never or rarely shows the same report, so it is hard to investigate (plus I am also not that good at that level of debugging).

I am using a Dell Precision M4800, a very good and expensive laptop, so it is quite annoying that Nvidia don't do their job properly on the driver side.

Could you confirm you have managed to update to 18.04 or 20.04 without the crash?

For instance, my crash log today is the following:
[Mon Oct 19 07:42:09 2020] nouveau 0000:01:00.0: fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT]
[Mon Oct 19 07:42:09 2020] nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
[Mon Oct 19 07:42:09 2020] nouveau 0000:01:00.0: fifo: channel 4: killed
[Mon Oct 19 07:42:09 2020] nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
[Mon Oct 19 07:42:09 2020] nouveau 0000:01:00.0: fifo: fault 01 [WRITE] at 0000000120004000 engine 00 [GR] client 07 [GPC2/T1_2] reason 03 [VA_LIMIT_VIOLATION] on channel -1 [4008704000 unknown]
[Mon Oct 19 07:42:09 2020] nouveau 0000:01:00.0: Xorg[1362]: channel 4 killed!

Stranger still: I had never managed it before, but this time I used the SysRq s command and it came back to life without rebooting.

Thanks for any help, and if an Nvidia guy is reading this... please comment too!
Thanks

Hi mbristol,

try the following. :smiley: