在Pandas中重命名列名

我想从

['$a', '$b', '$c', '$d', '$e']

['a', 'b', 'c', 'd', 'e']

当前回答

另一种替换原始列标签的方法是从原始列标签中删除不需要的字符（此处为“$”）。

这可以通过在df.columns上运行for循环并将剥离的列附加到df.column来完成。

相反，我们可以通过使用下面的列表理解在一个语句中巧妙地做到这一点：

df.columns = [col.strip('$') for col in df.columns]

（Python中的strip方法会从字符串的开头和结尾剥离给定的字符。）

2015-11-23 13:56:10

其他回答

如果您已经获得了数据帧，df.columns将所有内容转储到您可以操作的列表中，然后作为列的名称重新分配到数据帧中。。。

columns = df.columns
columns = [row.replace("$", "") for row in columns]
df.rename(columns=dict(zip(columns, things)), inplace=True)
df.head() # To validate the output

最佳方式？我不知道。一种方式——是的。

评估问题答案中提出的所有主要技术的更好方法如下：使用cProfile测量内存和执行时间@kadee、@kaitlyn和@eumiro拥有执行时间最快的函数-尽管这些函数非常快，但我们比较了所有答案的0.000和0.001秒舍入。寓意：我上面的答案可能不是“最好”的方式。

import pandas as pd
import cProfile, pstats, re

old_names = ['$a', '$b', '$c', '$d', '$e']
new_names = ['a', 'b', 'c', 'd', 'e']
col_dict = {'$a': 'a', '$b': 'b', '$c': 'c', '$d': 'd', '$e': 'e'}

df = pd.DataFrame({'$a':[1, 2], '$b': [10, 20], '$c': ['bleep', 'blorp'], '$d': [1, 2], '$e': ['texa$', '']})

df.head()

def eumiro(df, nn):
    df.columns = nn
    # This direct renaming approach is duplicated in methodology in several other answers:
    return df

def lexual1(df):
    return df.rename(columns=col_dict)

def lexual2(df, col_dict):
    return df.rename(columns=col_dict, inplace=True)

def Panda_Master_Hayden(df):
    return df.rename(columns=lambda x: x[1:], inplace=True)

def paulo1(df):
    return df.rename(columns=lambda x: x.replace('$', ''))

def paulo2(df):
    return df.rename(columns=lambda x: x.replace('$', ''), inplace=True)

def migloo(df, on, nn):
    return df.rename(columns=dict(zip(on, nn)), inplace=True)

def kadee(df):
    return df.columns.str.replace('$', '')

def awo(df):
    columns = df.columns
    columns = [row.replace("$", "") for row in columns]
    return df.rename(columns=dict(zip(columns, '')), inplace=True)

def kaitlyn(df):
    df.columns = [col.strip('$') for col in df.columns]
    return df

print 'eumiro'
cProfile.run('eumiro(df, new_names)')
print 'lexual1'
cProfile.run('lexual1(df)')
print 'lexual2'
cProfile.run('lexual2(df, col_dict)')
print 'andy hayden'
cProfile.run('Panda_Master_Hayden(df)')
print 'paulo1'
cProfile.run('paulo1(df)')
print 'paulo2'
cProfile.run('paulo2(df)')
print 'migloo'
cProfile.run('migloo(df, old_names, new_names)')
print 'kadee'
cProfile.run('kadee(df)')
print 'awo'
cProfile.run('awo(df)')
print 'kaitlyn'
cProfile.run('kaitlyn(df)')

2015-09-01 02:24:17

请注意，前面答案中的方法不适用于MultiIndex。对于MultiIndex，您需要执行以下操作：

>>> df = pd.DataFrame({('$a','$x'):[1,2], ('$b','$y'): [3,4], ('e','f'):[5,6]})
>>> df
   $a $b  e
   $x $y  f
0  1  3  5
1  2  4  6
>>> rename = {('$a','$x'):('a','x'), ('$b','$y'):('b','y')}
>>> df.columns = pandas.MultiIndex.from_tuples([
        rename.get(item, item) for item in df.columns.tolist()])
>>> df
   a  b  e
   x  y  f
0  1  3  5
1  2  4  6

2016-08-29 21:27:20

假设这是您的数据帧。

可以使用两种方法重命名列。

使用dataframe.columns=[#list]df.columns=[‘a’，‘b’，‘c’，‘d’，‘e’]此方法的限制是，如果必须更改一列，则必须传递完整的列列表。此外，此方法不适用于索引标签。例如，如果您通过以下步骤：df.columns=[‘a’、‘b’、‘c’、‘d’]这将引发错误。长度不匹配：预期轴有5个元素，新值有4个元素。另一种方法是Pandasrename（）方法，用于重命名任何索引、列或行df=df.rename（列=｛‘$a‘：‘a‘｝）

同样，您可以更改任何行或列。

2019-08-27 08:30:27

由于您只想删除所有列名中的$符号，因此只需执行以下操作：

df = df.rename(columns=lambda x: x.replace('$', ''))

df.rename(columns=lambda x: x.replace('$', ''), inplace=True)

2014-03-26 10:20:45

df = pd.DataFrame({'$a': [1], '$b': [1], '$c': [1], '$d': [1], '$e': [1]})

如果新列列表的顺序与现有列的顺序相同，则分配很简单：

new_cols = ['a', 'b', 'c', 'd', 'e']
df.columns = new_cols
>>> df
   a  b  c  d  e
0  1  1  1  1  1

如果您有一个将旧列名键入到新列名的字典，可以执行以下操作：

d = {'$a': 'a', '$b': 'b', '$c': 'c', '$d': 'd', '$e': 'e'}
df.columns = df.columns.map(lambda col: d[col])  # Or `.map(d.get)` as pointed out by @PiRSquared.
>>> df
   a  b  c  d  e
0  1  1  1  1  1

如果你没有列表或字典映射，你可以通过列表理解去掉前导$符号：

df.columns = [col[1:] if col[0] == '$' else col for col in df]

2016-02-14 00:31:53

在Pandas中重命名列名

推荐文章

最新文章

标签