How to convert a string representation of a list into a list

I was wondering what the simplest way is to convert a string representation of a list, like the following, into a list:

x = '[ "A","B","C" , " D"]'

Even if the user puts spaces between the commas and spaces inside the quotes, I need to handle that as well and convert it to:

x = ["A", "B", "C", "D"]

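I know I can strip the spaces with strip() and split() and check for non-letter characters, but the code was getting very clumsy. Is there a quick function that I'm not aware of?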

Current answer

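There is no need to import anything or to evaluate anything. For most basic use cases, including the one given in the original question, you can do this in one line.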

One-liner

l_x = [i.strip() for i in x[1:-1].replace('"',"").split(',')]

Explanation

x = '[ "A","B","C" , " D"]'
# String indexing to eliminate the brackets.
# Replace, as split will otherwise retain the quotes in the returned list
# Split to convert to a list
l_x = x[1:-1].replace('"',"").split(',')

Output:

for i in range(0, len(l_x)):
    print(l_x[i])
# vvvv output vvvvv
'''
 A
B
C
  D
'''
print(type(l_x)) # out: <class 'list'>
print(len(l_x)) # out: 4

You can parse and clean up this list as needed with a list comprehension.

l_x = [i.strip() for i in l_x] # list comprehension to clean up
for i in range(0, len(l_x)):
    print(l_x[i])
# vvvvv output vvvvv
'''
A
B
C
D
'''
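The slicing, replacing, splitting and stripping steps above can be wrapped in a small helper. The following is only a minimal sketch: the name `parse_flat_list` is hypothetical, and it assumes a flat list of double-quoted items with no commas or quotes inside the items.

```python
def parse_flat_list(text):
    """Parse a flat, double-quoted list string such as '[ "A","B" , " C"]'.

    Hypothetical helper; assumes no commas or quotes inside the items.
    """
    inner = text.strip()[1:-1]      # drop the surrounding brackets
    if not inner.strip():           # '[]' or '[   ]' gives an empty list
        return []
    # Remove the quotes, then strip whitespace around each item
    return [item.replace('"', '').strip() for item in inner.split(',')]


print(parse_flat_list('[ "A","B","C" , " D"]'))  # ['A', 'B', 'C', 'D']
print(parse_flat_list('[]'))                     # []
print(parse_flat_list('["a,b", "c"]'))           # ['a', 'b', 'c']
```

The last call shows the main limitation of this approach: a comma inside an item is treated as a separator.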

Nested lists

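If you have nested lists, it does get a bit more annoying. Without using regular expressions (which would simplify the replacement), and assuming you want a flattened list back (the Zen of Python says flat is better than nested):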

x = '[ "A","B","C" , " D", ["E","F","G"]]'
l_x = x[1:-1].split(',')
l_x = [i
    .replace(']', '')
    .replace('[', '')
    .replace('"', '')
    .strip() for i in l_x
]
# returns ['A', 'B', 'C', 'D', 'E', 'F', 'G']

If you need to keep the nested lists, it gets a bit uglier, but it can still be done with regular expressions and list comprehensions:

import re

x = '[ "A","B","C" , " D", "["E","F","G"]","Z", "Y", "["H","I","J"]", "K", "L"]'
# Clean it up so the regular expression is simpler
x = x.replace('"', '').replace(' ', '')
# Look ahead for the bracketed text that signifies nested list
l_x = re.split(r',(?=\[[A-Za-z0-9\',]+\])|(?<=\]),', x[1:-1])
print(l_x)
# Flatten and split the non nested list items
l_x0 = [item for items in l_x for item in items.split(',') if '[' not in items]
# Convert the nested lists to lists
l_x1 = [
    i[1:-1].split(',') for i in l_x if '[' in i
]
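# Concatenate the two lists: flat items first, then the nested lists
# (the original positions of the nested lists are not preserved)
l_x = l_x0 + l_x1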

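This final solution works on a list stored as a string whether or not it contains nested lists, as long as the nesting is only one level deep and the items are alphanumeric. Note that it puts the nested lists at the end of the result rather than keeping them in their original positions.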
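If the string is a valid Python literal, nesting and item order can be preserved with a different technique from the one used in this answer: `ast.literal_eval` followed by a recursive strip. This is only a sketch under that assumption; the helper name `strip_nested` is hypothetical, and the input below uses properly nested brackets rather than the quoted `"[...]"` form shown earlier.

```python
import ast


def strip_nested(value):
    """Recursively strip whitespace from every string in a nested list."""
    if isinstance(value, list):
        return [strip_nested(v) for v in value]
    if isinstance(value, str):
        return value.strip()
    return value  # leave numbers, None, etc. untouched


x = '[ "A","B","C" , " D", ["E","F","G"], "Z"]'
print(strip_nested(ast.literal_eval(x)))
# ['A', 'B', 'C', 'D', ['E', 'F', 'G'], 'Z']
```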

2021-07-07 21:56:34

Other answers

>>> import ast
>>> x = '[ "A","B","C" , " D"]'
>>> x = ast.literal_eval(x)
>>> x
['A', 'B', 'C', ' D']
>>> x = [n.strip() for n in x]
>>> x
['A', 'B', 'C', 'D']

ast.literal_eval:

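With ast.literal_eval you can safely evaluate an expression node or a string containing a Python literal or container display. The string or node provided may only consist of the following Python literal structures: strings, bytes, numbers, tuples, lists, dicts, booleans, and None.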
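To illustrate what "safely" means here, the sketch below wraps `ast.literal_eval` so that anything other than a list literal is rejected. The wrapper name `parse_list_literal` and its error messages are hypothetical; only `ast.literal_eval` itself comes from the standard library.

```python
import ast


def parse_list_literal(text):
    """Return the list encoded in text, or raise ValueError."""
    try:
        value = ast.literal_eval(text)
    except (ValueError, SyntaxError) as exc:
        # Function calls, names, operators on names, etc. are not literals
        raise ValueError(f"not a valid Python literal: {text!r}") from exc
    if not isinstance(value, list):
        raise ValueError(f"expected a list, got {type(value).__name__}")
    return value


print(parse_list_literal('[ "A","B","C" , " D"]'))  # ['A', 'B', 'C', ' D']

try:
    parse_list_literal("__import__('os').getcwd()")
except ValueError as err:
    print("rejected:", err)
```

The second call is refused because a function call is not a literal, which is exactly the kind of input that plain `eval` would execute.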

2009-12-12 18:30:49

If it is only a one-dimensional list, this can be done without importing anything:

>>> x = u'[ "A","B","C" , " D"]'
>>> ls = x.strip('[]').replace('"', '').replace(' ', '').split(',')
>>> ls
['A', 'B', 'C', 'D']

2018-08-28 13:02:10

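You can save yourself the .strip() call by simply slicing off the first and last characters of the list's string representation (see the third line below):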

>>> mylist=[1,2,3,4,5,'baloney','alfalfa']
>>> strlist=str(mylist)
>>> mylistfromstring=(strlist[1:-1].split(', '))
>>> mylistfromstring[3]
'4'
>>> for entry in mylistfromstring:
...     print(entry)
...     type(entry)
...
1
<class 'str'>
2
<class 'str'>
3
<class 'str'>
4
<class 'str'>
5
<class 'str'>
'baloney'
<class 'str'>
'alfalfa'
<class 'str'>

2019-01-08 23:24:24

eval is dangerous: you shouldn't execute user input.

If you have Python 2.6 or newer, use ast instead of eval:

>>> import ast
>>> ast.literal_eval('["A","B" ,"C" ," D"]')
['A', 'B', 'C', ' D']

Once you have that, strip the strings.

If you are on an older version of Python, you can get what you need with a simple regular expression:

>>> import re
>>> x='[  "A",  " B", "C","D "]'
>>> re.findall(r'"\s*([^"]*?)\s*"', x)
['A', 'B', 'C', 'D']

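This isn't as good as the ast solution; for example, it doesn't correctly handle escaped quotes in strings. But it's simple, it doesn't involve a dangerous eval, and it might be good enough for your purposes if you are on an older Python without ast.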
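The escaped-quote limitation can be seen with a small comparison. This is a sketch with a made-up input string in which one item contains an escaped double quote; the variable names are illustrative only.

```python
import ast
import re

# One item contains an escaped double quote: a"b
x = r'["a\"b", "c"]'

regex_items = re.findall(r'"\s*([^"]*?)\s*"', x)
ast_items = ast.literal_eval(x)

print(regex_items)  # the regex stops at the escaped quote, so the items come out wrong
print(ast_items)    # ['a"b', 'c']
```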

2009-12-12 18:29:08

So, following all the answers, I decided to time the most common methods:

from time import time
import ast
import re
import json

my_str = str(list(range(19)))
print(my_str)

reps = 100000

start = time()
for i in range(0, reps):
    re.findall(r"\w+", my_str)
print("Regex method:\t", (time() - start) / reps)

start = time()
for i in range(0, reps):
    json.loads(my_str)
print("JSON method:\t", (time() - start) / reps)

start = time()
for i in range(0, reps):
    ast.literal_eval(my_str)
print("AST method:\t\t", (time() - start) / reps)

start = time()
for i in range(0, reps):
    [n.strip() for n in my_str]  # note: iterates over the characters of my_str, not over list items
print("strip method:\t", (time() - start) / reps)

    regex method:     6.391477584838867e-07
    json method:     2.535374164581299e-06
    ast method:         2.4425282478332518e-05
    strip method:     4.983267784118653e-06

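So in the end, regex wins, at least on this short list of integers. Note that re.findall returns the items as strings, while json.loads and ast.literal_eval return ints.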
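For a comparison in which every method produces the same result, the timing can be sketched with the standard `timeit` module. This is an assumption-laden variant, not the benchmark above: the function names are hypothetical, each method converts its items to `int`, and the character-by-character strip loop is replaced with a slice-and-split. No timings are claimed here; they depend on the machine and Python version.

```python
import ast
import json
import re
import timeit

my_str = str(list(range(19)))


def with_regex():
    return [int(tok) for tok in re.findall(r"\d+", my_str)]


def with_json():
    return json.loads(my_str)


def with_ast():
    return ast.literal_eval(my_str)


def with_split():
    # int() tolerates the leading space left behind by split(",")
    return [int(tok) for tok in my_str[1:-1].split(",")]


reps = 10_000
for fn in (with_regex, with_json, with_ast, with_split):
    assert fn() == list(range(19))  # all methods must agree before timing
    seconds = timeit.timeit(fn, number=reps)
    print(f"{fn.__name__}: {seconds / reps:.3e} s per call")
```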

2018-08-06 10:54:36
