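How do I profile a Python script?

Project Euler and other coding contests often set a maximum run time, or people boast about how fast their particular solution runs. With Python, the approaches are sometimes a bit clumsy, for example adding timing code to __main__.

What is a good way to profile how long a Python program takes to run?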
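For reference, the "timing code in __main__" approach usually looks something like the sketch below. The solve() function is a hypothetical stand-in for the real workload, not something from any particular contest; time.perf_counter() is the standard-library clock intended for measuring short durations.

```python
import time


def solve():
    # Hypothetical workload standing in for a contest solution.
    return sum(i * i for i in range(100_000))


def main():
    # Wrap the call in two clock readings and report the difference.
    start = time.perf_counter()
    result = solve()
    elapsed = time.perf_counter() - start
    print(f"result={result} elapsed={elapsed:.4f}s")
    return result, elapsed


if __name__ == "__main__":
    main()
```

This gives one wall-clock number for the whole run and nothing about where the time went, which is why it feels clumsy.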

Current answer

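I found that cProfile and similar resources are aimed more at optimization than at debugging.

So I made my own testing module for simple speed tests of Python scripts. (In my case, I tested a py file of 1K+ lines with ScriptProfilerPy and made the code 10x faster within minutes.)

The ScriptProfilerPy() module runs your code and adds timestamps to it. I put the module here: https://github.com/Lucas-BLP/ScriptProfilerPy

Usage: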

from speed_testpy import ScriptProfilerPy

ScriptProfilerPy("path_to_your_script_to_test.py").Profiler()

Output:

2022-01-11 10:27:09

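Other answers

Further to Joe Shaw's answer about multi-threaded code not working as expected: as far as I can tell, the runcall method in cProfile merely wraps the profiled function call in self.enable() and self.disable() calls, so you can simply do that yourself and put whatever code you like in between, with minimal interference with existing code.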
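A minimal sketch of doing the enable()/disable() calls yourself, using only the standard-library cProfile and pstats modules. The work() function is a hypothetical placeholder for whatever code you want to measure.

```python
import cProfile
import io
import pstats


def work():
    # Hypothetical workload; replace with the code you want to profile.
    return sorted(range(50_000), key=lambda x: -x)[:3]


profiler = cProfile.Profile()
profiler.enable()       # start collecting
result = work()         # any code can go between enable() and disable()
profiler.disable()      # stop collecting

# Render the collected statistics into a string, sorted by cumulative time.
buffer = io.StringIO()
pstats.Stats(profiler, stream=buffer).sort_stats("cumulative").print_stats(5)
report = buffer.getvalue()
print(report)
```

Only the code between the two calls is measured, so existing code does not have to be restructured around a single entry point.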

2011-11-09 12:59:04

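A nice profiling module is line_profiler (invoked via the kernprof.py script). It can be downloaded here.

My understanding is that cProfile only reports the total time spent in each function, so individual lines of code are not timed. This is a problem in scientific computing, where a single line can often take a lot of time. Also, as I recall, cProfile did not catch the time I was spending in, say, numpy.dot.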
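To illustrate the function-level granularity, here is a small sketch using only the standard library. The mixed() function is hypothetical: it contains one cheap loop and one expensive loop, and cProfile reports both under a single entry for mixed() rather than per line.

```python
import cProfile
import pstats


def mixed():
    total = 0
    for i in range(10):        # cheap loop
        total += i
    for i in range(200_000):   # expensive loop
        total += i * i
    return total


profiler = cProfile.Profile()
result = profiler.runcall(mixed)

stats = pstats.Stats(profiler)
# stats.stats is keyed by (filename, line number, function name),
# so there is one entry per function, not per line.
function_names = {key[2] for key in stats.stats}
print(sorted(function_names))
stats.sort_stats("tottime").print_stats()
```

The report cannot tell you which of the two loops dominated, which is the gap a line-level tool such as line_profiler is meant to fill.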

2011-10-20 16:05:34

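There is also a statistical profiler called statprof. It is a sampling profiler, so it adds minimal overhead to your code and gives line-based (not just function-based) timings. It is better suited to soft real-time applications such as games, but may be less precise than cProfile.

The version on PyPI is a bit old, so you can install it with pip by pointing at the git repository:

pip install git+git://github.com/bos/statprof.py@1a33eba91899afe17a8b752c6dfdec6f05dd0c01

You can run it like this: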

import statprof

with statprof.profile():
    my_questionable_function()

See also https://stackoverflow.com/a/10333592/320036

2016-02-11 22:50:49

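If you want a cumulative profiler, meaning one that runs the function several times in a row and reports the sum of the results, you can use this cumulative_profiler decorator.

It is specific to Python >= 3.6, but you can remove the nonlocal statement to make it work on older versions.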

import cProfile, pstats

class _ProfileFunc:
    def __init__(self, func, sort_stats_by):
        self.func =  func
        self.profile_runs = []
        self.sort_stats_by = sort_stats_by

    def __call__(self, *args, **kwargs):
        pr = cProfile.Profile()
        pr.enable()  # this is the profiling section
        retval = self.func(*args, **kwargs)
        pr.disable()

        self.profile_runs.append(pr)
        ps = pstats.Stats(*self.profile_runs).sort_stats(self.sort_stats_by)
        return retval, ps

def cumulative_profiler(amount_of_times, sort_stats_by='time'):
    def real_decorator(function):
        def wrapper(*args, **kwargs):
            nonlocal function, amount_of_times, sort_stats_by  # for python 2.x remove this row

            profiled_func = _ProfileFunc(function, sort_stats_by)
            for i in range(amount_of_times):
                retval, ps = profiled_func(*args, **kwargs)
            ps.print_stats()
            return retval  # returns the results of the function
        return wrapper

    if callable(amount_of_times):  # in case you don't want to specify the number of runs
        func = amount_of_times  # amount_of_times is the function in here
        amount_of_times = 5  # the default amount
        return real_decorator(func)
    return real_decorator

Example

Profiling the function baz

import time

@cumulative_profiler
def baz():
    time.sleep(1)
    time.sleep(2)
    return 1

baz()

baz ran 5 times and printed the following:

         20 function calls in 15.003 seconds

   Ordered by: internal time

   ncalls  tottime  percall  cumtime  percall filename:lineno(function)
       10   15.003    1.500   15.003    1.500 {built-in method time.sleep}
        5    0.000    0.000   15.003    3.001 <ipython-input-9-c89afe010372>:3(baz)
        5    0.000    0.000    0.000    0.000 {method 'disable' of '_lsprof.Profiler' objects}

Specifying the number of runs

@cumulative_profiler(3)
def baz():
    ...

2019-09-11 19:52:11

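Scalene is a new Python profiler that covers many use cases with minimal performance impact:

https://github.com/plasma-umass/scalene

It can profile CPU, GPU, and memory utilization at a very fine-grained level. It also has specific support for multi-threaded/parallelized Python code.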

2022-10-30 19:22:44
