proc, coredump: add CoreDumping flag to /proc/pid/statussubmit/tizen/20200529.040605 accepted/tizen/unified/20200529.124048

Right now there is no convenient way to check if a process is being coredumped at the moment. It might be necessary to recognize such state to prevent killing the process and getting a broken coredump. Writing a large core might take significant time, and the process is unresponsive during it, so it might be killed by timeout, if another process is monitoring and killing/restarting hanging tasks. We're getting a significant number of corrupted coredump files on machines in our fleet, just because processes are being killed by timeout in the middle of the core writing process. We do have a process health check, and some agent is responsible for restarting processes which are not responding for health check requests. Writing a large coredump to the disk can easily exceed the reasonable timeout (especially on an overloaded machine). This flag will allow the agent to distinguish processes which are being coredumped, extend the timeout for them, and let them produce a full coredump file. To provide an ability to detect if a process is in the state of being coredumped, we can expose a boolean CoreDumping flag in /proc/pid/status. Example: $ cat core.sh #!/bin/sh echo "|/usr/bin/sleep 10" > /proc/sys/kernel/core_pattern sleep 1000 & PID=$! cat /proc/$PID/status | grep CoreDumping kill -ABRT $PID sleep 1 cat /proc/$PID/status | grep CoreDumping $ ./core.sh CoreDumping: 0 CoreDumping: 1 [guro@fb.com: document CoreDumping flag in /proc/<pid>/status] Link: http://lkml.kernel.org/r/20170928135357.GA8470@castle.DHCP.thefacebook.com Link: http://lkml.kernel.org/r/20170920230634.31572-1-guro@fb.com Signed-off-by: Roman Gushchin <guro@fb.com> Cc: Alexander Viro <viro@zeniv.linux.org.uk> Cc: Ingo Molnar <mingo@kernel.org> Cc: Konstantin Khlebnikov <koct9i@gmail.com> Cc: Oleg Nesterov <oleg@redhat.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> [k.lewandowsk: backport mainline commit c643401218 for userspace process managers (eg. resourced) to be able to avoid killing dead processes, breaking crash reports] Signed-off-by: Karol Lewandowski <k.lewandowsk@samsung.com> Change-Id: I7a7588752a24392d16963a7b3f78764d9bce1e2b
author: Roman Gushchin <guro@fb.com> 2017-11-17 15:26:45 -0800
committer: Karol Lewandowski <k.lewandowsk@samsung.com> 2020-05-26 15:11:29 +0200
commit: 7d599670f9864fc54e953f1a9ac3a0767968bfca (patch)
tree: c57bb41d461fa62c4db11d65427ed8507baea3fc
parent: db92f438ea5fcdb580c0190a624cd833208a7eae (diff)
download: linux-exynos-7d599670f9864fc54e953f1a9ac3a0767968bfca.tar.gz
linux-exynos-7d599670f9864fc54e953f1a9ac3a0767968bfca.tar.bz2
linux-exynos-7d599670f9864fc54e953f1a9ac3a0767968bfca.zip
2 files changed, 9 insertions, 0 deletions
diff --git a/Documentation/filesystems/proc.txt b/Documentation/filesystems/proc.txt
index 4cee34ce496e..41cacebf33e0 100644
--- a/Documentation/filesystems/proc.txt
+++ b/Documentation/filesystems/proc.txt
@@ -181,6 +181,7 @@ read the file /proc/PID/status:
   VmPTE:        20 kb
   VmSwap:        0 kB
   HugetlbPages:          0 kB
+  CoreDumping:    0
   Threads:        1
   SigQ:   0/28578
   SigPnd: 0000000000000000
@@ -254,6 +255,8 @@ Table 1-2: Contents of the status files (as of 4.8)
  VmSwap                      amount of swap used by anonymous private data
                              (shmem swap usage is not included)
  HugetlbPages                size of hugetlb memory portions
+ CoreDumping                 process's memory is currently being dumped
+                             (killing the process may lead to a corrupted core)
  Threads                     number of threads
  SigQ                        number of signals queued/max. number for queue
  SigPnd                      bitmap of pending signals for the thread
diff --git a/fs/proc/array.c b/fs/proc/array.c
index 4ac811e1a26c..95691adcb107 100644
--- a/fs/proc/array.c
+++ b/fs/proc/array.c
@@ -391,6 +391,11 @@ static void task_cpus_allowed(struct seq_file *m, struct task_struct *task)
 		   cpumask_pr_args(&task->cpus_allowed));
 }
 
+static inline void task_core_dumping(struct seq_file *m, struct mm_struct *mm)
+{
+	seq_printf(m, "CoreDumping:\t%d\n", !!mm->core_state);
+}
+
 int proc_pid_status(struct seq_file *m, struct pid_namespace *ns,
 			struct pid *pid, struct task_struct *task)
 {
@@ -401,6 +406,7 @@ int proc_pid_status(struct seq_file *m, struct pid_namespace *ns,
 
 	if (mm) {
 		task_mem(m, mm);
+		task_core_dumping(m, mm);
 		mmput(mm);
 	}
 	task_sig(m, task);
author	Roman Gushchin <guro@fb.com>	2017-11-17 15:26:45 -0800
committer	Karol Lewandowski <k.lewandowsk@samsung.com>	2020-05-26 15:11:29 +0200
commit	7d599670f9864fc54e953f1a9ac3a0767968bfca (patch)
tree	c57bb41d461fa62c4db11d65427ed8507baea3fc
parent	db92f438ea5fcdb580c0190a624cd833208a7eae (diff)
download	linux-exynos-7d599670f9864fc54e953f1a9ac3a0767968bfca.tar.gz linux-exynos-7d599670f9864fc54e953f1a9ac3a0767968bfca.tar.bz2 linux-exynos-7d599670f9864fc54e953f1a9ac3a0767968bfca.zip