是否有一个简单的方法来删除字符串中的多个空格?

假设这个字符串:

The   fox jumped   over    the log.

变成:

The fox jumped over the log.

在不分割和进入列表的情况下，最简单的实现方法(1-2行)是什么?

当前回答

我不得不同意Paul McGuire的评论。对我来说,

' '.join(the_string.split())

比快速生成正则表达式要好得多。

我的测量结果(Linux和Python 2.5)显示，先分离后连接的速度几乎比“re.sub(…)”快5倍，如果你一次预编译正则表达式并多次执行该操作，速度仍然快3倍。而且无论从哪方面看，它都更容易理解——更python化。

2009-10-10 02:39:51

其他回答

你也可以在Pandas DataFrame中使用字符串分割技术，而不需要使用.apply(..)，如果你需要对大量字符串快速执行操作，这是非常有用的。这是一行话:

df['message'] = (df['message'].str.split()).str.join(' ')

2018-06-19 00:31:13

我没有深入研究其他示例，但是我刚刚创建了这个方法来合并多个连续的空格字符。

它不使用任何库，虽然它的脚本长度相对较长，但它不是一个复杂的实现:

def spaceMatcher(command):
    """
    Function defined to consolidate multiple whitespace characters in
    strings to a single space
    """
    # Initiate index to flag if more than one consecutive character
    iteration
    space_match = 0
    space_char = ""
    for char in command:
      if char == " ":
          space_match += 1
          space_char += " "
      elif (char != " ") & (space_match > 1):
          new_command = command.replace(space_char, " ")
          space_match = 0
          space_char = ""
      elif char != " ":
          space_match = 0
          space_char = ""
   return new_command

command = None
command = str(input("Please enter a command ->"))
print(spaceMatcher(command))
print(list(spaceMatcher(command)))

2017-12-18 17:29:40

>>> import re
>>> re.sub(' +', ' ', 'The     quick brown    fox')
'The quick brown fox'

2009-10-09 21:52:29

" ".join(foo.split())对于所问的问题不太正确，因为它也完全删除了单个前导和/或尾随空格。所以，如果它们也将被1个空白替换，你应该像下面这样做:

" ".join(('*' + foo + '*').split()) [1:-1]

当然，它没有那么优雅。

2020-07-13 19:14:25

一个简单的灵魂

>>> import re
>>> s="The   fox jumped   over    the log."
>>> print re.sub('\s+',' ', s)
The fox jumped over the log.

2015-11-04 06:11:39

是否有一个简单的方法来删除字符串中的多个空格?

推荐文章

最新文章

标签