从文本JavaScript中删除HTML

有没有一种简单的方法可以在JavaScript中获取一个html字符串并去掉html？

当前回答

还可以使用出色的htmlparser2纯JSHTML解析器。这里是一个工作演示：

var htmlparser = require('htmlparser2');

var body = '<p><div>This is </div>a <span>simple </span> <img src="test"></img>example.</p>';

var result = [];

var parser = new htmlparser.Parser({
    ontext: function(text){
        result.push(text);
    }
}, {decodeEntities: true});

parser.write(body);
parser.end();

result.join('');

输出将是这是一个简单的示例。

请在此处查看实际操作：https://tonicdev.com/jfahrenkrug/extract-text-from-html

如果您使用类似webpack的工具打包web应用程序，则这在节点和浏览器中都有效。

2015-12-29 19:11:59

其他回答

另一个公认不如nickf或Shog9优雅的解决方案是从＜body＞标记开始递归遍历DOM并附加每个文本节点。

var bodyContent = document.getElementsByTagName('body')[0];
var result = appendTextNodes(bodyContent);

function appendTextNodes(element) {
    var text = '';

    // Loop through the childNodes of the passed in element
    for (var i = 0, len = element.childNodes.length; i < len; i++) {
        // Get a reference to the current child
        var node = element.childNodes[i];
        // Append the node's value if it's a text node
        if (node.nodeType == 3) {
            text += node.nodeValue;
        }
        // Recurse through the node's children, if there are any
        if (node.childNodes.length > 0) {
            appendTextNodes(node);
        }
    }
    // Return the final result
    return text;
}

2009-05-04 23:14:30

    (function($){
        $.html2text = function(html) {
            if($('#scratch_pad').length === 0) {
                $('<div id="lh_scratch"></div>').appendTo('body');  
            }
            return $('#scratch_pad').html(html).text();
        };

    })(jQuery);

将其定义为jquery插件，并按如下方式使用：

$.html2text(htmlContent);

2012-03-16 06:25:57

来自CSS技巧：

https://css-tricks.com/snippets/javascript/strip-html-tags-in-javascript/

常量原始字符串=`＜div＞<p>嘿，这是什么东西</p></div>`;conststripedString=originalString.replace（/（<（[^>]+）>）/gi，“”）；console.log（strippedString）；

2020-09-03 15:52:21

使用Jquery：

function stripTags() {
    return $('<p></p>').html(textToEscape).text()
}

2016-12-09 08:41:42

如果您不想为此创建DOM（可能您不在浏览器上下文中），可以使用striptags npm包。

import striptags from 'striptags'; //ES6 <-- pick one
const striptags = require('striptags'); //ES5 <-- pick one

striptags('<p>An HTML string</p>');

2021-07-05 09:31:20

从文本JavaScript中删除HTML

推荐文章

最新文章

标签