The search for a better way to strip HTML

Currently, the removeHTMLTags() function uses a simple regular expression to remove *all* HTML-like tags. I say "HTML-like" because the pattern can also remove string fragments that merely look like tags but are not HTML. One requirement for a better version is that it should accept a whitelist (tags to be excluded from removal), so this might require some work.
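
To make the idea concrete, here is a minimal sketch of regex-based stripping with a whitelist. It is written in Python purely for illustration and is not the existing removeHTMLTags() implementation; the name remove_html_tags, the allowed parameter, and the pattern itself are all assumptions.

```python
import re

# Assumed pattern: "<", an optional "/", a tag name, then anything up to the next ">".
TAG_RE = re.compile(r"</?([A-Za-z][A-Za-z0-9]*)\b[^>]*>")

def remove_html_tags(text, allowed=()):
    """Strip tag-like fragments, keeping tags whose name is in `allowed`."""
    keep = {name.lower() for name in allowed}

    def replace(match):
        # Leave whitelisted tags untouched; drop everything else.
        return match.group(0) if match.group(1).lower() in keep else ""

    return TAG_RE.sub(replace, text)

print(remove_html_tags("<p>Hello <b>world</b></p>"))                 # Hello world
print(remove_html_tags("<p>Hello <b>world</b></p>", allowed=["b"]))  # Hello <b>world</b>
print(remove_html_tags("a<b and c>d"))                               # ad
```

The last call shows the "HTML-like" problem: the fragment `<b and c>` is not markup, but the pattern cannot tell the difference and removes it anyway. A sketch like this also keeps whitelisted tags with all of their attributes intact, so it should not be treated as a sanitizer for untrusted input.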
