strace应该如何使用?

Strace可以用作调试工具，也可以用作原语分析器。

As a debugger, you can see how given system calls were called, executed and what they return. This is very important, as it allows you to see not only that a program failed, but WHY a program failed. Usually it's just a result of lousy coding not catching all the possible outcomes of a program. Other times it's just hardcoded paths to files. Without strace you get to guess what went wrong where and how. With strace you get a breakdown of a syscall, usually just looking at a return value tells you a lot.

剖析是另一个用途。您可以使用它来分别计时每个系统调用的执行，或者作为一个聚合。虽然这可能不足以解决您的问题，但至少可以大大缩小潜在嫌疑人的范围。如果您在单个文件上看到大量的fopen/close对，那么您可能在每次执行循环时都不必要地打开和关闭文件，而不是在循环之外打开和关闭它。

Ltrace是strace的近亲，也非常有用。你必须学会区分你的瓶颈在哪里。如果执行的总时间是8秒，而你在系统调用上只花了0.05秒，那么对程序进行分段不会有什么好处，问题出在你的代码中，这通常是一个逻辑问题，或者程序实际上需要花那么长时间来运行。

The biggest problem with strace/ltrace is reading their output. If you don't know how the calls are made, or at least the names of syscalls/functions, it's going to be difficult to decipher the meaning. Knowing what the functions return can also be very beneficial, especially for different error codes. While it's a pain to decipher, they sometimes really return a pearl of knowledge; once I saw a situation where I ran out of inodes, but not out of free space, thus all the usual utilities didn't give me any warning, I just couldn't make a new file. Reading the error code from strace's output pointed me in the right direction.

2009-08-03 18:21:55

最小可运行示例

如果一个概念不清楚，有一个你没有见过的更简单的例子可以解释它。

在本例中，这个例子是Linux x86_64程序集独立(无libc) hello world:

你好。年代

.text
.global _start
_start:
    /* write */
    mov $1, %rax    /* syscall number */
    mov $1, %rdi    /* stdout */
    mov $msg, %rsi  /* buffer */
    mov $len, %rdx  /* buffer len */
    syscall

    /* exit */
    mov $60, %rax   /* exit status */
    mov $0, %rdi    /* syscall number */
    syscall
msg:
    .ascii "hello\n"
len = . - msg

GitHub上游。

组装和运行:

as -o hello.o hello.S
ld -o hello.out hello.o
./hello.out

输出期望:

hello

现在让我们在这个例子中使用strace:

env -i ASDF=qwer strace -o strace.log -s999 -v ./hello.out arg0 arg1
cat strace.log

我们使用:

env -i ASDF=qwer用于控制环境变量:https://unix.stackexchange.com/questions/48994/how-to-run-a-program-in-a-clean-environment-in-bash -s999 -v显示更详细的日志信息

Strace.log现在包含:

execve("./hello.out", ["./hello.out", "arg0", "arg1"], ["ASDF=qwer"]) = 0
write(1, "hello\n", 6)                  = 6
exit(0)                                 = ?
+++ exited with 0 +++

在这样一个最小的例子中，输出的每个字符都是不言而喻的:

执行行:显示strace如何执行hello。包括CLI参数和man execve中记录的环境写行:显示我们所做的写系统调用。6是字符串“hello\n”的长度。 = 6是系统调用的返回值，在man 2 write中记录的是写入的字节数。退出行:显示我们所做的退出系统调用。没有返回值，因为程序退出了!

更复杂的例子

当然，strace的应用是为了查看复杂程序实际上执行了哪些系统调用，以帮助调试/优化程序。

值得注意的是，您在Linux中可能遇到的大多数系统调用都有glibc包装器，其中许多来自POSIX。

在内部，glibc包装器或多或少像这样使用内联汇编:如何在内联汇编中通过sysenter调用系统调用?

你应该学习的下一个例子是POSIX write hello world:

c

#define _XOPEN_SOURCE 700
#include <unistd.h>

int main(void) {
    char *msg = "hello\n";
    write(1, msg, 6);
    return 0;
}

编译并运行:

gcc -std=c99 -Wall -Wextra -pedantic -o main.out main.c
./main.out

这一次，您将看到glibc在main之前执行了一系列系统调用，以便为main设置一个良好的环境。

这是因为我们现在使用的不是一个独立的程序，而是一个更常见的glibc程序，它允许libc功能。

然后，在每一端，strace.log包含:

write(1, "hello\n", 6)                  = 6
exit_group(0)                           = ?
+++ exited with 0 +++

因此我们得出结论，写POSIX函数使用，惊喜!， Linux写系统调用。

我们还观察到return 0导致exit_group调用而不是exit。哈，我不知道这个!这就是为什么strace这么酷。Man exit_group解释道:

这个系统调用等同于exit(2)，只是它不仅终止了调用线程，而且终止了调用进程线程组中的所有线程。

下面是我研究dlopen使用哪个系统调用的另一个示例:https://unix.stackexchange.com/questions/226524/what-system-call-is-used-to-load-libraries-in-linux/462710#462710

在Ubuntu 16.04, GCC 6.4.0, Linux内核4.4.0中测试。

2019-03-28 12:07:01

Strace可以用作调试工具，也可以用作原语分析器。

As a debugger, you can see how given system calls were called, executed and what they return. This is very important, as it allows you to see not only that a program failed, but WHY a program failed. Usually it's just a result of lousy coding not catching all the possible outcomes of a program. Other times it's just hardcoded paths to files. Without strace you get to guess what went wrong where and how. With strace you get a breakdown of a syscall, usually just looking at a return value tells you a lot.

剖析是另一个用途。您可以使用它来分别计时每个系统调用的执行，或者作为一个聚合。虽然这可能不足以解决您的问题，但至少可以大大缩小潜在嫌疑人的范围。如果您在单个文件上看到大量的fopen/close对，那么您可能在每次执行循环时都不必要地打开和关闭文件，而不是在循环之外打开和关闭它。

Ltrace是strace的近亲，也非常有用。你必须学会区分你的瓶颈在哪里。如果执行的总时间是8秒，而你在系统调用上只花了0.05秒，那么对程序进行分段不会有什么好处，问题出在你的代码中，这通常是一个逻辑问题，或者程序实际上需要花那么长时间来运行。

The biggest problem with strace/ltrace is reading their output. If you don't know how the calls are made, or at least the names of syscalls/functions, it's going to be difficult to decipher the meaning. Knowing what the functions return can also be very beneficial, especially for different error codes. While it's a pain to decipher, they sometimes really return a pearl of knowledge; once I saw a situation where I ran out of inodes, but not out of free space, thus all the usual utilities didn't give me any warning, I just couldn't make a new file. Reading the error code from strace's output pointed me in the right direction.

2009-08-03 18:21:55

strace -tfp PID将监控PID进程的系统调用，因此我们可以调试/监控我们的进程/程序状态。

2014-06-19 14:44:31

Strace概述 Strace可以看作是一个轻量级调试器。它允许程序员/用户快速发现程序是如何与操作系统交互的。它通过监控系统调用和信号来做到这一点。

使用当你没有源代码或者不想被打扰去真正浏览它的时候，这很好。此外，如果您不喜欢打开GDB，而只是对理解外部交互感兴趣，那么对于您自己的代码也很有用。

这是一个很好的介绍下面是一个使用strace来调试进程挂起的温和介绍

2008-10-06 16:16:16

我一直使用strace来调试权限问题。技巧是这样的:

$ strace -e trace=open,stat,read,write gnome-calculator

其中gnome-calculator是您想要运行的命令。

2015-05-04 15:33:22

strace应该如何使用?

推荐文章

最新文章

标签