HTML Parser

  

I have a use case where I have a server action that is given a chunk of HTML (usually an entire HTML page).

I need to be able to easily inspect / parse the page and pull out the content such as the BODY or HEAD or META content.


Is there a utility out there that would help me to inspect/parse HTML?

Hello Bruce.

Depending on what you can expect from this HTML, you can use regular expressions. For example, a very simple regexp to match for a META tag would be:

"<meta name=""([^""]*)"" content=""([^""]*)"" />"

Note that the pair of double quotes is Service Studio's escape sequence for a double quote.


Some regular expression actions are available in the Text extension.