我如何计算在熊猫数据框架一列的NaN值?

我想找出我的数据的每一列中NaN的数量。

当前回答

数零:

df[df == 0].count(axis=0)

计算NaN:

df.isnull().sum()

df.isna().sum()

其他回答

https://pandas.pydata.org/pandas-docs/stable/generated/pandas.Series.count.html#pandas.Series.count

pandas.Series.count
Series.count(level=None)[source]

返回系列中非na /null观测值的个数

另一个尚未被建议的简单选项是，为了只计算NaN，将在形状中添加以返回具有NaN的行数。

df[df['col_name'].isnull()]['col_name'].shape

数零:

df[df == 0].count(axis=0)

计算NaN:

df.isnull().sum()

df.isna().sum()

你可以从非nan值的计数中减去总长度:

count_nan = len(df) - df.count()

你应该根据你的数据计算时间。与isnull解相比，小级数的速度提高了3倍。

.sum df.isnull () () 将给出缺失值的列和。

如果你想知道特定列中缺失值的总和，那么以下代码将起作用:

推荐文章