August 3, 2007
The search for a better way to strip HTML
Currently, the removeHTMLTags() function uses a simple regular expression to remove *all* HTML-like tags. I say HTML-like because there is a slight possibility that this method might remove some string fragments that are not HTML tags. One premise is that it should be possible to specify a white list to the function (tags to be excluded from removal), so this might require some work.