从文本JavaScript中删除HTML

有没有一种简单的方法可以在JavaScript中获取一个html字符串并去掉html？

当前回答

这应该可以在任何Javascript环境（包括NodeJS）上完成工作。

    const text = `
    <html lang="en">
      <head>
        <style type="text/css">*{color:red}</style>
        <script>alert('hello')</script>
      </head>
      <body><b>This is some text</b><br/><body>
    </html>`;
    
    // Remove style tags and content
    text.replace(/<style[^>]*>.*<\/style>/gm, '')
        // Remove script tags and content
        .replace(/<script[^>]*>.*<\/script>/gm, '')
        // Remove all opening, closing and orphan HTML tags
        .replace(/<[^>]+>/gm, '')
        // Remove leading spaces and repeated CR/LF
        .replace(/([\r\n]+ +)+/gm, '');

2017-01-20 05:49:54

其他回答

对于转义字符，也可以使用模式匹配：

myString.replace(/((&lt)|(<)(?:.|\n)*?(&gt)|(>))/gm, '');

2016-11-08 10:44:34

function strip_html_tags(str)
{
   if ((str===null) || (str===''))
       return false;
  else
   str = str.toString();
  return str.replace(/<[^>]*>/g, '');
}

2018-07-04 21:59:23

const-htmlParser=new DOMParser（）.parseFromString（“<h6>用户<p>名称</p></h6>”，'text/html'）；const textString=htmlParser.body.textContent；console.log（textString）

2022-07-14 06:25:29

如果您不想为此创建DOM（可能您不在浏览器上下文中），可以使用striptags npm包。

import striptags from 'striptags'; //ES6 <-- pick one
const striptags = require('striptags'); //ES5 <-- pick one

striptags('<p>An HTML string</p>');

2021-07-05 09:31:20

很多人已经回答了这个问题，但我认为分享我编写的函数可能会有用，该函数可以从字符串中删除HTML标记，但允许您包含一个不希望删除的标记数组。它很短，对我来说一直很好。

function removeTags(string, array){
  return array ? string.split("<").filter(function(val){ return f(array, val); }).map(function(val){ return f(array, val); }).join("") : string.split("<").map(function(d){ return d.split(">").pop(); }).join("");
  function f(array, value){
    return array.map(function(d){ return value.includes(d + ">"); }).indexOf(true) != -1 ? "<" + value : value.split(">")[1];
  }
}

var x = "<span><i>Hello</i> <b>world</b>!</span>";
console.log(removeTags(x)); // Hello world!
console.log(removeTags(x, ["span", "i"])); // <span><i>Hello</i> world!</span>

2017-01-27 06:55:53

从文本JavaScript中删除HTML

推荐文章

最新文章

标签