如何添加一个新的列到现有的数据帧?

我有以下索引DataFrame命名列和行不连续的数字:

          a         b         c         d
2  0.671399  0.101208 -0.181532  0.241273
3  0.446172 -0.243316  0.051767  1.577318
5  0.614758  0.075793 -0.451460 -0.012493

我想添加一个新列，'e'，到现有的数据帧，并不想改变数据帧中的任何东西(即，新列始终具有与DataFrame相同的长度)。

0   -0.335485
1   -1.166658
2   -0.385571
dtype: float64

如何将列e添加到上面的例子中?

当前回答

向现有数据框架添加新列的简单方法是:

new_cols = ['a' , 'b' , 'c' , 'd']

for col in new_cols:
    df[f'{col}'] = 0 #assiging 0 for the placeholder

print(df.columns)

2021-09-15 07:54:01

其他回答

如果你只需要创建一个新的空列，那么最短的解决方案是:

df.loc[:, 'e'] = pd.Series()

2020-11-27 08:26:56

如果我们想给df中一个新列的所有行赋一个标量值，例如:10:

df = df.assign(new_col=lambda x:10)  # x is each row passed in to the lambda func

Df现在在所有行中都有值为10的新列'new_col'。

2021-01-24 04:27:37

这是添加新列的简单方法:df['e'] = e

2012-12-12 16:04:31

这是向pandas数据框架添加新列的特殊情况。在这里，我基于数据框架的现有列数据添加了一个新特性/列。

因此，让我们的dataFrame有列'feature_1'， 'feature_2'， 'probability_score'，我们必须根据'probability_score'列中的数据添加一个new_column 'predicted_class'。

我将使用来自python的map()函数，并定义一个我自己的函数，该函数将实现如何给dataFrame中的每一行一个特定的class_label的逻辑。

data = pd.read_csv('data.csv')

def myFunction(x):
   //implement your logic here

   if so and so:
        return a
   return b

variable_1 = data['probability_score']
predicted_class = variable_1.map(myFunction)

data['predicted_class'] = predicted_class

// check dataFrame, new column is included based on an existing column data for each row
data.head()

2020-06-19 12:24:35

如果你得到SettingWithCopyWarning，一个简单的解决方法是复制你想要添加列的数据帧。

df = df.copy()
df['col_name'] = values

2016-03-07 03:28:54

如何添加一个新的列到现有的数据帧?

推荐文章

最新文章

标签