In Python, how do I split a string and keep the separators?

Here is the simplest way to explain it. This is what I'm using:

re.split(r'\W', 'foo/bar spam\neggs')
>>> ['foo', 'bar', 'spam', 'eggs']

This is what I want:

someMethod(r'\W', 'foo/bar spam\neggs')
>>> ['foo', '/', 'bar', ' ', 'spam', '\n', 'eggs']

The reason is that I want to split a string into tokens, manipulate them, and then put the string back together again.

Current answer

Replace every separator (\W) with the separator plus a new separator (\W;), then split on the new separator (;).

import re

def split_and_keep(separator, s):
  # Append ';' after every separator match, then split on ';'.
  return re.split(';', re.sub(separator, lambda match: match.group() + ';', s))

print(split_and_keep(r'\W', 'foo/bar spam\neggs'))
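
One caveat with the marker approach: ';' itself matches \W, so a ';' already present in the input cannot be told apart from the inserted marker. On Python 3.7 or later, where re.split accepts patterns that can match the empty string, a zero-width lookbehind may avoid the marker altogether. The sketch below is an illustration rather than the answerer's code: split_after is a hypothetical name, and it assumes the separator pattern has a fixed width, since Python lookbehinds require that.

```python
import re

def split_after(separator, s):
    """Split s right after each match of separator, keeping the separator
    attached to the preceding piece.

    Hypothetical helper: needs Python 3.7+ (splitting on empty matches)
    and a fixed-width separator pattern (lookbehind restriction).
    """
    return re.split(f'(?<={separator})', s)

print(split_after(r'\W', 'foo/bar spam\neggs'))
# ['foo/', 'bar ', 'spam\n', 'eggs']
print(split_after(r'\W', 'a;b'))
# ['a;', 'b']
```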

2020-04-13 01:44:24

Other answers

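I had a similar issue when trying to split a file path and struggled to find a simple answer. This worked for me, and it did not require substituting the delimiters back into the split text: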

import re

my_path = 'folder1/folder2/folder3/file1'

re.findall('[^/]+/|[^/]+', my_path)

['folder1/', 'folder2/', 'folder3/', 'file1']

2018-12-12 15:20:23

I found this generator-based approach more satisfying:

def split_keep(string, sep):
    """Usage:
    >>> list(split_keep("a.b.c.d", "."))
    ['a.', 'b.', 'c.', 'd']
    """
    start = 0
    while True:
        end = string.find(sep, start) + 1
        if end == 0:
            break
        yield string[start:end]
        start = end
    yield string[start:]

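It avoids the need to figure out the correct regex, and in theory it should be fairly cheap. It creates no intermediate strings beyond the slices it yields, and it delegates most of the iteration work to the efficient find method.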

... in Python 3.8 and later, it can be as short as:

def split_keep(string, sep):
    start = 0
    while (end := string.find(sep, start) + 1) > 0:
        yield string[start:end]
        start = end
    yield string[start:]
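
Note that both versions step exactly one character past the position returned by find, which is only correct when sep is a single character. A minimal sketch for longer separators follows, keeping the same calling convention. The name split_keep_multi is hypothetical, and the empty-separator check is an addition made here, because find would otherwise keep matching at the same position forever.

```python
def split_keep_multi(string, sep):
    """Yield pieces of string, each ending with sep (except possibly the last).

    Hypothetical variant of split_keep that supports separators of any
    non-zero length.
    """
    if not sep:
        raise ValueError("empty separator")
    start = 0
    while True:
        idx = string.find(sep, start)
        if idx == -1:
            break
        end = idx + len(sep)  # step past the whole separator, not just one char
        yield string[start:end]
        start = end
    yield string[start:]

print(list(split_keep_multi("a--b--c", "--")))
# ['a--', 'b--', 'c']
```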

2019-11-07 15:04:09

A lazy and simple solution

Assume your regex pattern is split_pattern = r'(!|\?)'

First, append a marker after each match to act as the new separator, such as '[cut]':

new_string = re.sub(split_pattern, '\\1[cut]', your_string)

Then split on the new separator: new_string.split('[cut]')
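
Put together, the two steps might look like the sketch below. The sample value of your_string is an assumption chosen for illustration, and the approach presumes that '[cut]' never occurs in the real input.

```python
import re

split_pattern = r'(!|\?)'
your_string = 'Hello! How are you? Fine'  # sample input (assumption)

# Step 1: re-emit each matched separator (\1) followed by the marker.
new_string = re.sub(split_pattern, r'\1[cut]', your_string)

# Step 2: split on the marker; each separator stays with the piece before it.
parts = new_string.split('[cut]')
print(parts)
# ['Hello!', ' How are you?', ' Fine']
```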

2018-08-22 02:18:35

The documentation of re.split mentions:

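Split string by the occurrences of pattern. If capturing parentheses are used in pattern, then the text of all groups in the pattern are also returned as part of the resulting list.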

So you just need to wrap your separator in a capturing group:

>>> re.split(r'(\W)', 'foo/bar spam\neggs')
['foo', '/', 'bar', ' ', 'spam', '\n', 'eggs']
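
For the use case in the question (split into tokens, manipulate them, put the string back together), a round trip could look like the following sketch. Upper-casing the words is only a stand-in for whatever manipulation is actually needed.

```python
import re

text = 'foo/bar spam\neggs'
tokens = re.split(r'(\W)', text)

# Joining the unmodified tokens reproduces the input exactly.
assert ''.join(tokens) == text

# With a single capturing group, the pieces between separators sit at
# even indices and the captured separators at odd indices.
changed = [tok.upper() if i % 2 == 0 else tok for i, tok in enumerate(tokens)]
result = ''.join(changed)
print(repr(result))
# 'FOO/BAR SPAM\nEGGS'
```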

2010-01-25 23:45:19

Another example: split on non-alphanumeric characters and keep the separators

import re
a = "foo,bar@candy*ice%cream"
re.split('([^a-zA-Z0-9])',a)

Output:

['foo', ',', 'bar', '@', 'candy', '*', 'ice', '%', 'cream']

Explanation

re.split('([^a-zA-Z0-9])',a)

() <- capturing group: keeps the separators in the result
[] <- character class: matches any single character from the set inside
^a-zA-Z0-9 <- negated set: anything except letters (upper or lower case) and digits
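
This character class is not quite the same as the \W used in the other answers: \W treats the underscore as a word character, while [^a-zA-Z0-9] treats it as a separator. A small comparison, with a sample string chosen here purely for illustration:

```python
import re

s = 'snake_case,name'  # sample input (assumption)

# \W does not match '_', so the underscore stays inside the token.
by_non_word = re.split(r'(\W)', s)
print(by_non_word)
# ['snake_case', ',', 'name']

# The explicit class matches '_' too, so it becomes a separator.
by_non_alnum = re.split(r'([^a-zA-Z0-9])', s)
print(by_non_alnum)
# ['snake', '_', 'case', ',', 'name']
```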

2018-05-29 04:05:32
