最终的邮政编码和邮政正则表达式是什么?

我正在寻找最终的邮政编码和邮政编码正则表达式。我正在寻找一种能覆盖世界大部分地区(希望是所有地区)的东西。

当前回答

根据您的应用程序，您可能希望对大多数访问者来自的国家实现正则表达式匹配，而对其余的国家不进行验证(接受任何东西)。

2009-02-23 17:12:28

其他回答

这是一个非常简单的RegEx，用于验证美国邮政编码(而不是邮政编码+ 4):

(?!([089])\1{4})\d{5}

似乎除了00000、88888和99999之外，所有五位数字都是有效的邮政编码。

我已经用http://regexpal.com/测试了这个RegEx

2012-11-13 15:38:10

每个邮政编码系统只使用A-Z和/或0-9，有时还使用空格/破折号并不是每个国家都使用邮政编码(例如都柏林以外的爱尔兰)，但我们在这里忽略它。最短的邮政编码格式是塞拉利昂NN 最长的是美属萨摩亚，有NNNNN-NNNNNN 你应该留出一个空格或破折号。不应该以空格或破折号开始或结束

这应包括上述内容:

(?i)^[a-z0-9][a-z0-9\- ]{0,10}[a-z0-9]$

2013-11-07 19:00:16

考虑到每个国家都有如此多的边缘情况(例如。伦敦地址可能使用与英国其他地区略有不同的格式)我不认为有一个终极正则表达式，除了可能:

[0-9a-zA-Z]+

最好是使用一个相当广泛的模式(好吧，不像上面那么广泛)，或者用自己的特定模式对待每个国家/地区!

更新:然而，它可能动态地构造一个正则表达式基于许多更小的，特定于区域的规则-虽然不确定性能!

在RegExLib网站上可以找到许多特定国家的模式。

2009-02-23 17:06:55

正如其他地方指出的那样，世界各地的差异很大。即使某些东西符合模式也不意味着它存在。

当然，还有很多地方不使用邮政编码(例如much或爱尔兰)。

2009-02-23 17:20:46

我知道这是一个老问题，但我无意中遇到了同样的问题。我有来自100多个国家的发票，并试图得到正确的债权人在zip(如果每其他检查失败)。所以我所做的就是写一个简短的Python脚本，从一个字符串创建一个模式:

class RegexPatternBuilder:
    """
    Builds a regex pattern out of a given string(i.e. --> HM452 AX2155 : [A-Z]{2}\d{3}\s{1}[A-Z]{2}\d{4})
    """
    __is_alpha_count = 0
    __is_numeric_count = 0
    __is_whitespace_count = 0
    __pattern = ""

    # Count: wich character of the string we're locking at right now
    __count = 0

    # Countrys like  Andora starts theire ZIP with the country abbreviation :AD500
    # So check at first if the ZIP starts with the abbreviation and if so, add it to the pattern and increase the count.
    def __init__(self, zip_string, country):
        self.__zip_string = zip_string
        self.__country = country
        if self.__zip_string.startswith(country):
            self.__pattern = f'({self.__country})'
            self.__count += len(self.__country)

    def build_regex(self):
        # Last step ;
        # Add the current alpha_numeric pattern with count
        if len(self.__zip_string) == self.__count:
            if self.__is_alpha_count:
                self.__pattern += f"[A-Z]{{{self.__is_alpha_count}}}"
            if self.__is_numeric_count:
                self.__pattern += f"\d{{{self.__is_numeric_count}}}"
            return f'{self.__pattern}\\b'

        # Case: Whitespace
        # Check if there is a crossing from numeric / alphanumeric to whitespace,
        # if so --> add the alpha_numeric regex to the whole pattern with the
        # count as the number of viable appeaerances.
        # Since there is max 1 whitespace in a ZIP, add the whitespace regex immediately.
        # Every other case is similar to that.
        if self.__zip_string[self.__count].isspace():
            if self.__is_numeric_count:
                self.__pattern += f"\d{{{self.__is_numeric_count}}}"
            if self.__is_alpha_count:
                self.__pattern += f"[A-Z]{{{self.__is_alpha_count}}}"
            self.__pattern += "\s{1}"
            self.__is_whitespace_count += 1
            self.__is_alpha_count = 0
            self.__is_numeric_count = 0

        # Case: Is Alphanumeric
        if self.__zip_string[self.__count].isalpha():
            if self.__is_numeric_count:
                self.__pattern += f"[0-9]{{{self.__is_numeric_count}}}"
            self.__is_whitespace_count = 0
            self.__is_alpha_count += 1
            self.__is_numeric_count = 0

        # Case: Is Numeric
        if self.__zip_string[self.__count].isnumeric():
            if self.__is_alpha_count:
                self.__pattern += f"[A-Z]{{{self.__is_alpha_count}}}"
            self.__is_whitespace_count = 0
            self.__is_alpha_count = 0
            self.__is_numeric_count += 1

        # Case: Special Character (i.e. - )
        # No escaping or count for this so far, because it shouldn't be needed for our zip purposes
        if not self.__zip_string[self.__count].isalpha() \
                and not self.__zip_string[self.__count].isnumeric() \
                and not self.__zip_string[self.__count].isspace():
            self.__pattern += f'{self.__zip_string[self.__count]}{{1}}'
        self.__count += 1
        return self.build_regex()

有了这个，我创建了所有不同的可能的正则表达式的所有拉链(按国家)，我们历史上写回一个db表(即这样的东西在最后: 国家:RE PATTERN:(\d{5})\b[这可能是什么国家;d])

也许它能帮助别人。

2022-01-28 07:07:56

最终的邮政编码和邮政正则表达式是什么?

推荐文章

最新文章

标签