在Pandas中重命名列名

我想从

['$a', '$b', '$c', '$d', '$e']

['a', 'b', 'c', 'd', 'e']

当前回答

Pandas 0.21+答案

0.21版中的列重命名有一些重要更新。

重命名方法添加了可以设置为columns或1的axis参数。此更新使此方法与panda API的其余部分相匹配。它仍然具有索引和列参数，但不再强制您使用它们。intlace设置为False的set_axis方法允许您使用列表重命名所有索引或列标签。

Pandas 0.21示例+

构造示例DataFrame：

df = pd.DataFrame({'$a':[1,2], '$b': [3,4], 
                   '$c':[5,6], '$d':[7,8], 
                   '$e':[9,10]})

   $a  $b  $c  $d  $e
0   1   3   5   7   9
1   2   4   6   8  10

使用axis='columns'或axis=1的重命名

df.rename({'$a':'a', '$b':'b', '$c':'c', '$d':'d', '$e':'e'}, axis='columns')

df.rename({'$a':'a', '$b':'b', '$c':'c', '$d':'d', '$e':'e'}, axis=1)

两者都会导致以下结果：

   a  b  c  d   e
0  1  3  5  7   9
1  2  4  6  8  10

仍然可以使用旧方法签名：

df.rename(columns={'$a':'a', '$b':'b', '$c':'c', '$d':'d', '$e':'e'})

重命名函数还接受将应用于每个列名的函数。

df.rename(lambda x: x[1:], axis='columns')

df.rename(lambda x: x[1:], axis=1)

将set_axis与列表一起使用，inplace=False

可以为set_axis方法提供一个长度等于列数（或索引）的列表。目前，inplace默认为True，但在未来的版本中，inplace将默认为False。

df.set_axis(['a', 'b', 'c', 'd', 'e'], axis='columns', inplace=False)

df.set_axis(['a', 'b', 'c', 'd', 'e'], axis=1, inplace=False)

为什么不使用df.columns=[‘a’，‘b’，‘c’，‘d’，‘e’]？

像这样直接分配列没有错。这是一个非常好的解决方案。

使用set_axis的优点是它可以作为方法链的一部分使用，并返回DataFrame的新副本。如果没有它，在重新分配列之前，必须将链的中间步骤存储到另一个变量。

# new for pandas 0.21+
df.some_method1()
  .some_method2()
  .set_axis()
  .some_method3()

# old way
df1 = df.some_method1()
        .some_method2()
df1.columns = columns
df1.some_method3()

2017-10-24 13:39:15

其他回答

只需将其分配给.columns属性：

>>> df = pd.DataFrame({'$a':[1,2], '$b': [10,20]})
>>> df
   $a  $b
0   1  10
1   2  20

>>> df.columns = ['a', 'b']
>>> df
   a   b
0  1  10
1  2  20

2012-07-05 14:23:27

一个简单而“有趣”（和蟒蛇？）的解决方案：

df.rename(columns={x: x.replace('$', '') for x in df.columns})

哪里：

df = pd.DataFrame(columns=['$a', '$b', '$c', '$d', '$e'])

步骤：

获取DataFrame的列作为列表：

df.columns

在DataFrames中重命名的方法：

df.rename()

属性以指定要重命名列：

columns={}

在字典中，您需要指定要重命名的列（在每个键中）以及它们将获得的新名称（每个值）

{'old_col_name': 'new_col_name', ...}

由于您的更改遵循一种模式，为了删除每列中的$字符，我们可以使用字典理解：

{x: x.replace('$', '') for x in df.columns}

2022-10-29 11:55:27

许多panda函数都有一个就地参数。当设置为True时，转换将直接应用于调用它的数据帧。例如：

df = pd.DataFrame({'$a':[1,2], '$b': [3,4]})
df.rename(columns={'$a': 'a'}, inplace=True)
df.columns

>>> Index(['a', '$b'], dtype='object')

或者，在某些情况下，您希望保留原始数据帧。如果创建数据帧是一项昂贵的任务，我经常看到人们陷入这种情况。例如，如果创建数据帧需要查询雪花数据库。在这种情况下，只需确保将inplace参数设置为False。

df = pd.DataFrame({'$a':[1,2], '$b': [3,4]})
df2 = df.rename(columns={'$a': 'a'}, inplace=False)
df.columns

>>> Index(['$a', '$b'], dtype='object')

df2.columns

>>> Index(['a', '$b'], dtype='object')

如果这些类型的转换是您经常做的，那么您还可以研究一些不同的panda GUI工具。我是一个叫做水户的人的创造者。它是一个电子表格，可以自动将您的编辑转换为python代码。

2021-06-15 00:38:13

可以将lstrip或strip方法与索引一起使用：

df.columns = df.columns.str.lstrip('$')

cols = ['$a', '$b', '$c', '$d', '$e']
pd.Series(cols).str.lstrip('$').tolist()

输出：

['a', 'b', 'c', 'd', 'e']

2022-07-17 09:23:08

重命名特定列

使用df.reame（）函数并引用要重命名的列。并非所有列都必须重命名：

df = df.rename(columns={'oldName1': 'newName1', 'oldName2': 'newName2'})
# Or rename the existing DataFrame (rather than creating a copy) 
df.rename(columns={'oldName1': 'newName1', 'oldName2': 'newName2'}, inplace=True)

最小代码示例

df = pd.DataFrame('x', index=range(3), columns=list('abcde'))
df

   a  b  c  d  e
0  x  x  x  x  x
1  x  x  x  x  x
2  x  x  x  x  x

以下方法都可以工作并产生相同的输出：

df2 = df.rename({'a': 'X', 'b': 'Y'}, axis=1)  # new method
df2 = df.rename({'a': 'X', 'b': 'Y'}, axis='columns')
df2 = df.rename(columns={'a': 'X', 'b': 'Y'})  # old method  

df2

   X  Y  c  d  e
0  x  x  x  x  x
1  x  x  x  x  x
2  x  x  x  x  x

请记住将结果指定回，因为修改不在原位。或者，指定inplace=True：

df.rename({'a': 'X', 'b': 'Y'}, axis=1, inplace=True)
df

   X  Y  c  d  e
0  x  x  x  x  x
1  x  x  x  x  x
2  x  x  x  x  x

在v0.25中，如果指定了要重命名的无效列，还可以指定errors='raise'来引发错误。请参阅v0.25 rename（）文档。

重新分配列标题

使用df.set_axis（），axis=1，inplace=False（返回副本）。

df2 = df.set_axis(['V', 'W', 'X', 'Y', 'Z'], axis=1, inplace=False)
df2

   V  W  X  Y  Z
0  x  x  x  x  x
1  x  x  x  x  x
2  x  x  x  x  x

这将返回一个副本，但您可以通过设置inplace=True来修改DataFrame（这是<=0.24版本的默认行为，但将来可能会更改）。

您也可以直接分配标题：

df.columns = ['V', 'W', 'X', 'Y', 'Z']
df

   V  W  X  Y  Z
0  x  x  x  x  x
1  x  x  x  x  x
2  x  x  x  x  x

2012-07-06 01:48:15

在Pandas中重命名列名

推荐文章

最新文章

标签