Matthew Brost
452bca0edb
drm/xe: Don't suspend device upon wedge
When wedging a device we shouldn't be suspending device as state for
debug will be lost.
Also this appears to not work as the below stack trace pops upon trying
to resume a wedged device:
[ 304.245044] INFO: task cat:12115 blocked for more than 151 seconds.
[ 304.251333] Tainted: G W 6.10.0-rc7-xe+ #3518
[ 304.257617] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 304.265459] task:cat state:D stack:13384 pid:12115 tgid:12115 ppid:3986 flags:0x00000006
[ 304.265465] Call Trace:
[ 304.265467] <TASK>
[ 304.265469] __schedule+0x3c4/0xdf0
[ 304.265478] schedule+0x3c/0x140
[ 304.265481] rpm_resume+0x1cc/0x740
[ 304.265484] ? __pfx_autoremove_wake_function+0x10/0x10
[ 304.265489] __pm_runtime_resume+0x49/0x80
[ 304.265494] guc_info+0x6b/0xb0 [xe]
[ 304.265538] ? __pfx___drm_printfn_seq_file+0x10/0x10
[ 304.265541] ? __pfx___drm_puts_seq_file+0x10/0x10
[ 304.265545] seq_read_iter+0x111/0x4c0
[ 304.265551] seq_read+0xfc/0x140
[ 304.265556] full_proxy_read+0x58/0x80
[ 304.265560] vfs_read+0xa7/0x360
[ 304.265563] ? find_held_lock+0x2b/0x80
[ 304.265568] ksys_read+0x64/0xe0
[ 304.265571] do_syscall_64+0x68/0x140
[ 304.265575] entry_SYSCALL_64_after_hwframe+0x76/0x7e
[ 304.265578] RIP: 0033:0x7f4254d14992
[ 304.265580] RSP: 002b:00007ffc558666f8 EFLAGS: 00000246 ORIG_RAX: 0000000000000000
[ 304.265583] RAX: ffffffffffffffda RBX: 0000000000020000 RCX: 00007f4254d14992
[ 304.265584] RDX: 0000000000020000 RSI: 00007f4254ebb000 RDI: 0000000000000003
[ 304.265586] RBP: 00007f4254ebb000 R08: 00007f4254eba010 R09: 00007f4254eba010
[ 304.265587] R10: 0000000000000022 R11: 0000000000000246 R12: 0000000000022000
[ 304.265588] R13: 0000000000000003 R14: 0000000000020000 R15: 0000000000020000
[ 304.265593] </TASK>
[ 304.265594]
Showing all locks held in the system:
[ 304.265598] 1 lock held by khungtaskd/57:
[ 304.265599] #0: ffffffff8273b860 (rcu_read_lock){....}-{1:2}, at: debug_show_all_locks+0x36/0x1c0
[ 304.265607] 3 locks held by kworker/6:1/90:
[ 304.265610] 1 lock held by in:imklog/547:
[ 304.265611] #0: ffff88810498cd88 (&f->f_pos_lock){+.+.}-{3:3}, at: __fdget_pos+0x76/0xc0
[ 304.265620] 1 lock held by dmesg/1310:
v2: Drop local 'err' variable (Jonathan)
Fixes: 8ed9aaae39 ("drm/xe: Force wedged state and block GT reset upon any GPU hang")
Cc: Rodrigo Vivi <rodrigo.vivi@intel.com>
Signed-off-by: Matthew Brost <matthew.brost@intel.com>
Reviewed-by: Jonathan Cavitt <jonathan.cavitt@intel.com>
Link: https://patchwork.freedesktop.org/patch/msgid/20240716063902.1390130-2-matthew.brost@intel.com
2024-07-17 12:01:34 -07:00
..
2024-06-14 09:15:28 +02:00
2024-05-24 08:43:25 -07:00
2024-06-05 16:58:09 +02:00
2024-05-04 18:59:47 +02:00
2024-06-06 14:53:34 +02:00
2024-05-06 18:26:47 -07:00
2024-05-22 10:45:12 -07:00
2024-06-12 11:31:42 -04:00
2024-06-05 12:12:54 -06:00
2024-05-23 12:04:36 -07:00
2024-05-22 20:14:47 -04:00
2024-05-15 12:59:55 -06:00
2024-06-24 16:20:58 +10:00
2024-05-29 12:31:02 -07:00
2024-05-22 09:56:00 -07:00
2024-04-23 13:16:03 +02:00
2024-06-03 18:00:23 +02:00
2024-05-16 08:50:32 -07:00
2024-05-23 12:04:36 -07:00
2024-05-28 16:09:17 -07:00
2024-05-19 09:21:03 -07:00
2024-05-09 00:30:37 +09:00
2024-05-21 11:15:56 -07:00
2024-06-21 11:06:56 +10:00
2024-05-13 16:53:53 -07:00
2024-06-04 11:29:52 +02:00
2024-05-03 22:09:50 +02:00
2024-05-09 01:03:39 +09:00
2024-05-27 07:34:58 +09:00
2024-06-06 09:39:36 -07:00
2024-05-22 12:26:46 -07:00
2024-06-07 10:05:21 +02:00
2024-07-17 12:01:34 -07:00
2024-05-04 18:59:41 +02:00
2024-06-08 10:48:11 -07:00
2024-05-22 12:26:46 -07:00
2024-05-30 09:05:19 -07:00
2024-05-22 12:26:46 -07:00
2024-05-23 13:38:15 +02:00
2024-05-23 00:29:19 +02:00
2024-05-22 12:26:46 -07:00
2024-05-22 20:14:47 -04:00
2024-06-03 11:10:08 +02:00
2024-05-22 20:14:47 -04:00
2024-06-04 14:00:59 +02:00
2024-06-03 18:20:00 +02:00
2024-05-02 18:12:27 +01:00
2024-05-17 09:05:46 -07:00
2024-05-31 12:39:15 -05:00
2024-04-29 10:53:31 +02:00
2024-05-28 06:55:59 -06:00
2024-05-30 16:34:59 +02:00
2024-05-17 08:53:47 -07:00
2024-04-25 11:58:52 +02:00
2024-05-14 18:25:53 -07:00
2024-05-21 10:09:28 -07:00
2024-06-19 11:38:31 +03:00
2024-05-16 08:56:49 -07:00
2024-05-21 09:51:42 -07:00
2024-04-29 08:20:07 -07:00
2024-06-06 10:15:14 +02:00
2024-05-29 13:08:31 +01:00
2024-04-25 12:53:30 -05:00
2024-05-23 12:04:36 -07:00
2024-05-31 13:50:59 -07:00
2024-05-03 07:26:39 +02:00
2024-05-30 19:43:47 -05:00
2024-05-17 13:01:24 +02:00
2024-04-25 20:55:53 -07:00
2024-06-04 12:10:05 -05:00
2024-05-23 12:09:22 -07:00
2024-05-21 11:19:18 -07:00
2024-05-22 10:41:14 -07:00
2024-06-19 11:38:31 +03:00
2024-05-27 08:18:31 -07:00
2024-05-28 21:53:51 +02:00
2024-05-20 08:55:18 -07:00
2024-04-30 21:10:37 +02:00
2024-04-29 08:20:06 -07:00
2024-06-05 16:01:16 -07:00
2024-05-10 07:30:27 +02:00
2024-05-14 23:36:19 +09:00
2024-05-28 13:22:54 +01:00
2024-05-06 13:34:12 -06:00
2024-05-18 12:48:37 -07:00
2024-05-23 12:04:36 -07:00
2024-05-07 23:40:46 +02:00
2024-05-21 12:09:36 -07:00
2024-06-06 14:40:51 -07:00
2024-05-08 19:21:51 +01:00
2024-05-22 20:14:47 -04:00
2024-05-21 11:23:36 -07:00
2024-05-29 19:12:09 +01:00
2024-05-08 19:46:11 +01:00
2024-05-22 12:11:48 -07:00
2024-05-21 13:11:44 -07:00
2024-04-29 22:29:44 +02:00
2024-05-27 13:00:00 +02:00
2024-05-10 10:25:22 +01:00
2024-05-24 08:38:28 -07:00
2024-05-30 20:40:48 -04:00
2024-04-23 13:16:03 +02:00
2024-05-22 20:14:47 -04:00
2024-05-22 08:32:48 -04:00
2024-05-20 14:56:50 -07:00
2024-05-23 12:04:36 -07:00
2024-05-27 11:08:31 +02:00
2024-05-19 09:21:03 -07:00
2024-05-23 12:04:36 -07:00
2024-05-11 11:32:06 +02:00
2024-05-24 10:24:49 -07:00
2024-04-29 16:45:53 +02:00
2024-05-10 04:34:52 +09:00