从文本JavaScript中删除HTML

有没有一种简单的方法可以在JavaScript中获取一个html字符串并去掉html？

当前回答

使用jQuery，您可以使用

$('#elementID').text()

2012-09-03 15:03:35

其他回答

另一个公认不如nickf或Shog9优雅的解决方案是从＜body＞标记开始递归遍历DOM并附加每个文本节点。

var bodyContent = document.getElementsByTagName('body')[0];
var result = appendTextNodes(bodyContent);

function appendTextNodes(element) {
    var text = '';

    // Loop through the childNodes of the passed in element
    for (var i = 0, len = element.childNodes.length; i < len; i++) {
        // Get a reference to the current child
        var node = element.childNodes[i];
        // Append the node's value if it's a text node
        if (node.nodeType == 3) {
            text += node.nodeValue;
        }
        // Recurse through the node's children, if there are any
        if (node.childNodes.length > 0) {
            appendTextNodes(node);
        }
    }
    // Return the final result
    return text;
}

2009-05-04 23:14:30

如果您不想为此创建DOM（可能您不在浏览器上下文中），可以使用striptags npm包。

import striptags from 'striptags'; //ES6 <-- pick one
const striptags = require('striptags'); //ES5 <-- pick one

striptags('<p>An HTML string</p>');

2021-07-05 09:31:20

我想分享一下Shog9批准答案的编辑版本。

正如Mike Samuel在评论中指出的那样，该函数可以执行内联javascript代码。但Shog9说“让浏览器为你做……”是对的

所以…这里是我的编辑版本，使用DOMParser：

function strip(html){
   let doc = new DOMParser().parseFromString(html, 'text/html');
   return doc.body.textContent || "";
}

这里是测试内联javascript的代码：

strip("<img onerror='alert(\"could run arbitrary JS here\")' src=bogus>")

此外，它不会在解析时请求资源（如图像）

strip("Just text <img src='https://assets.rbl.ms/4155638/980x.jpg'>")

2017-11-06 15:46:44

对于转义字符，也可以使用模式匹配：

myString.replace(/((&lt)|(<)(?:.|\n)*?(&gt)|(>))/gm, '');

2016-11-08 10:44:34

var text = html.replace(/<\/?("[^"]*"|'[^']*'|[^>])*(>|$)/g, "");

这是一个正则表达式版本，对格式错误的HTML更具弹性，例如：

未闭合的标记

某些文本<img

标记属性内的“<”，“>”

某些文本<img alt=“x>y”>

换行符

一些<ahref=“http://google.com">

代码

var html = '<br>This <img alt="a>b" \r\n src="a_b.gif" />is > \nmy<>< > <a>"text"</a'
var text = html.replace(/<\/?("[^"]*"|'[^']*'|[^>])*(>|$)/g, "");

2018-07-06 10:39:57

从文本JavaScript中删除HTML

推荐文章

最新文章

标签