如何读取文件的前N行?

我们有一个很大的原始数据文件，我们想把它修剪成指定的大小。

如何在python中获取文本文件的前N行?所使用的操作系统对实现有任何影响吗?

当前回答

Python 3:

with open("datafile") as myfile:
    head = [next(myfile) for x in range(N)]
print(head)

Python 2:

with open("datafile") as myfile:
    head = [next(myfile) for x in xrange(N)]
print head

下面是另一种方法(Python 2和3都是):

from itertools import islice

with open("datafile") as myfile:
    head = list(islice(myfile, N))
print(head)

2009-11-20 00:27:18

其他回答

如果你想快速读取第一行并且不关心性能，你可以使用.readlines()返回列表对象，然后对列表进行切片。

例如，前5行:

with open("pathofmyfileandfileandname") as myfile:
    firstNlines=myfile.readlines()[0:5] #put here the interval you want

注意:整个文件是读取的，所以不是最好的从性能的角度来看，但它是易于使用，快速编写和易于记忆，所以如果你只是想执行一些一次性计算非常方便

print firstNlines

与其他答案相比，一个优点是可以轻松地选择行范围，例如跳过前10行[10:30]或最后10行[:-10]或只选择偶数行[::2]。

2013-12-07 12:59:02

我所做的就是用熊猫形来称呼N行。我认为性能不是最好的，但是举个例子，如果N=1000:

import pandas as pd
yourfile = pd.read_csv('path/to/your/file.csv',nrows=1000)

2017-04-11 14:54:59

没有特定的方法来读取文件对象暴露的行数。

我想最简单的方法是:

lines =[]
with open(file_name) as f:
    lines.extend(f.readline() for i in xrange(N))

2009-11-20 00:27:39

从Python 2.6开始，您可以利用IO基类中更复杂的函数。所以上面评分最高的答案可以改写为:

    with open("datafile") as myfile:
       head = myfile.readlines(N)
    print head

(你不必担心你的文件少于N行，因为没有StopIteration异常抛出。)

2012-12-06 18:02:26

#!/usr/bin/python

import subprocess

p = subprocess.Popen(["tail", "-n 3", "passlist"], stdout=subprocess.PIPE)

output, err = p.communicate()

print  output

这个方法对我很有效

2017-07-12 16:25:03

如何读取文件的前N行?

推荐文章

最新文章

标签