XSLT And XMLTransform() Work Properly With XHTML In ColdFusion

Posted January 12, 2009 at 9:19 AM by Ben Nadel

Tags: ColdFusion

I have done a good amount of experimentation with XSLT and the XMLTransform() method in ColdFusion; but, up until now, that's all pretty much been with strict XML documents. Over the weekend, I was thinking of re-architecting some parts of my blog and I thought XSLT might just be the best approach. But then it occurred to me that I didn't know if XSLT would play nicely with XHTML-style data. To be honest, I am not even sure if XHTML validates as XML. For the most part, the parallel nature of the two types of document are obvious; the one uncertainty for me is the mixture of text nodes and elements within a single parent.

To see how XSLT and XHTML work together, I set up a simple demo that creates a copy of a chunk of XHTML data by copying each node within it separately:

  • <!--- Define XHTML style data. --->
  • <cfsavecontent variable="strData">
  •  
  • <p class="content">
  • This is some text with some other formatted text
  • <strong>contained within in</strong>. While this is
  • valid XHTML, I am wondering how it will hold up
  • when put through <em>XSLT</em> node copying.
  • <img src="about:blank" /> Embedded image.
  • </p>
  •  
  • </cfsavecontent>
  •  
  •  
  • <!--- Define the XSLT. --->
  • <cfsavecontent variable="strXSLT">
  •  
  • <!--- Document type declaration. --->
  • <?xml version="1.0" encoding="ISO-8859-1"?>
  •  
  • <xsl:transform
  • version="1.0"
  • xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
  •  
  • <!--- Match all generic nodes. --->
  • <xsl:template match="*">
  • <!--- Copy this node (non-deep copy). --->
  • <xsl:copy>
  • <!---
  • Make sure that all attributes are copied
  • over for the current node.
  • --->
  • <xsl:copy-of select="@*" />
  •  
  • <!---
  • Apply templates to all of it's child nodes
  • (so that they can be copied).
  • --->
  • <xsl:apply-templates />
  • </xsl:copy>
  • </xsl:template>
  •  
  • </xsl:transform>
  •  
  • </cfsavecontent>
  •  
  •  
  • <!---
  • Transfor the XHTML. Let's see if this creates an accurate
  • copy of the XHTML.
  • --->
  • #HtmlEditFormat(
  • XMLTransform(
  • Trim( strData ),
  • Trim( strXSLT )
  • )
  • )#

As you can see, my XSLT is simply copying each node and then recursively calling Copy on each of its child nodes. When we run this code, we get the following output:

<?xml version="1.0" encoding="UTF-8"?> <p class="content"> This is some text with some other formatted text <strong>contained within in</strong>. While this is valid XHTML, I am wondering how it will hold up when put through <em>XSLT</em> node copying. <img src="about:blank"/> Embedded image. </p>

With the exception of the added XML DocType, I am happy to see that ColdFusion's XMLTransform() and my XSLT work quite well with XHTML style data. This verifies that XHTML does indeed validate as XML. Let's see if I can start putting this new found compatibility insight into use.




Reader Comments

Jan 12, 2009 at 11:46 AM // reply »
7 Comments

One thing to keep in mind is that CF8 still ships with the old Xalan processor from Apache. Unfortunately, that parser only supports xslt 1.0 and hasn't had any active development in years. What Adobe really needs to do is upgrade that library to the Saxon-B parser so that it supports XSLT 2.0, XQuery 1.0, and XPath 2.0.

There is a way to swap out the parser for the updated one, but then you have to remember to do it each time you update CF.


Jan 12, 2009 at 11:49 AM // reply »
10,640 Comments

@Mike,

This is a good point. However, for most of what I will am thinking about doing, this should be good enough for now. I'll have some more blog posts on the related topics to come.


Jan 12, 2009 at 6:19 PM // reply »
22 Comments

"To be honest, I am not even sure if XHTML validates as XML" - XHTML was developed to represent HTML as valid XML - http://www.w3.org/TR/xhtml1/#xhtml


Jan 13, 2009 at 12:30 PM // reply »
10,640 Comments

@Johans,

Awesome. Good to know. And, since I use XStandard as my blog editor, I can rest assured that all of my blog content is actually XHTML / XML compliant. Man, I love XStandard.

http://www.bennadel.com/blog/tags/15-XStandard-WYSIWYG-blog-entries.htm


Jan 13, 2009 at 1:34 PM // reply »
22 Comments

Agree 100% - XStandard is the way to go. It is built into my CMS:

http://www.assetnow.com

It would be great if they developed an ActionScript version for Flex/Air.


Jan 13, 2009 at 1:38 PM // reply »
10,640 Comments

@Johans,

Word up! XStandard is such an awesome editor! I hate when I have to use anything else.


Jan 13, 2009 at 2:29 PM // reply »
22 Comments

BTW - on topic of XS editor - I noticed in your XS posts you were concerned your web services would not work with v2 and was pleasantly surprised when they did.

The changes to the web services were minor and are listed in the XS OEM docs change log. Mainly added ability to order items returned in response and syntax for max upload size parameters.

The main change was the license file/format.


Jan 13, 2009 at 2:55 PM // reply »
10,640 Comments

@Johans,

Yeah, that was pretty exciting :)


Jan 13, 2009 at 5:44 PM // reply »
38 Comments

Too bad XStandard is NOT written in JS, but a browser plug-in.


Jan 13, 2009 at 5:46 PM // reply »
10,640 Comments

@Henry,

Sometimes ultimate power comes with minor compromises :)


Aug 5, 2009 at 4:47 PM // reply »
1 Comments

@Mike Rankin:
Thanks! Your comment spared me a lot of head-banging when I was trying to get some "xsl:character-map" stuff to work in CF 8.

Anyone know what parser CF 9 uses? (yes, I know I could install the beta and find out...)


Aug 5, 2009 at 7:05 PM // reply »
10,640 Comments

@Evan,

I'm not sure what the parse is these days. I assume it is the same one - I haven't seen any XML-based updates.


Feb 17, 2010 at 12:00 PM // reply »
1 Comments

@Mike: Could you explain me the procedure to upgrade the parser to saxon-b for use of xslt 2.0

thx a lot


Feb 17, 2010 at 1:59 PM // reply »
10,640 Comments

@Steve,

@Mike probably replaced his JAR file for the XML parsing engine (Xalan.. or is that Xerces)? I'll let him answer the question, but I figured I would give my best guess.


Sep 24, 2010 at 2:26 PM // reply »
3 Comments

Ben, when I try this approach with a <JOBDESCRIPTION> node that contains CDATA, it strips out the <![CDATA...]]> tag and escapes all of the html characters. When I test it by simply adding the CDATA tag, this is what gets returned. I've been working on this for days now trying everything I can think of and I'm starting to look like I don't know what I'm doing to the bosses here. I'd appreciate any help you can give me. Ray


Sep 24, 2010 at 2:28 PM // reply »
3 Comments

Oops, it removed all of the formatting for my post, guess I should put it in a code tag, huh? Ok here's the output again...I hope...lol.

  • <?xml version="1.0" encoding="UTF-8"?> <JOBDESCRIPTION>&lt;p class="content"&gt;This is some text with some other formatted text &lt;strong&gt;contained within in&lt;/strong&gt;. While this is valid XHTML, I am wondering how it will hold up when put through &lt;em&gt;XSLT&lt;/em&gt; node copying.&lt;img src="about:blank" /&gt; Embedded image.&lt;/p&gt;</JOBDESCRIPTION>


Sep 24, 2010 at 2:55 PM // reply »
10,640 Comments

@Ray,

There's no need to post the same question on different blog posts on my site - I get *all* the comments that come through here.

On an unrelated note, can you tell me what happened when the post stripped out your formatting? I just updated my comment code this morning, so I want to make sure that I didn't screw something up.

If you posted markup in the comment, it should have just escaped it (except for Strong and Em tags which it honors). It looks like everything got stripped out, though.



Post A Comment

Comment Etiquette: Please do not post spam. Please keep the comments on-topic. Please do not post unrelated questions or large chunks of code. And, above all, please be nice to each other - we're trying to have a good conversation here.

Please review the following issues:

Author Name:


Author Email:

Author Website:

Comment:

Supported HTML tags for formatting: <strong>bold</strong>   <em>italic</em>   <code>code</code>







  • Help Wanted - Find Your Next ColdFusion Job
InVision App - Prototyping Made Beautiful With Prototyping Tools Ben Nadel's Company - Epicenter Consulting Recent Blog Comments
Feb 12, 2012 at 3:37 AM
Learning ColdFusion 8: CFImage Part III - Watermarks And Transparency
Hi Ben, Just to ask currently it is placed bottom right corner, if i need to replace the same rendered image on the bottom left side or in the bottom center, how that can be calculated. bottom ce ... read »
Feb 11, 2012 at 9:29 PM
Use jQuery's SlideDown() With Fixed-Width Elements To Prevent Jumping
I can't say how glad I am that I found your post. Thank you very much. ... read »
Feb 10, 2012 at 7:21 PM
jQuery AJAX Strips Script Tags And Inserts Them After Parent-Most Elements
Update! Instead of $(eval(options.insertAfter)).after(data['insertData']); I now use: var ajaxNode = document.createElement('span'); var parent = $(eval(options.insertAfter))[0].parentNode; ... read »
Feb 10, 2012 at 6:18 PM
jQuery AJAX Strips Script Tags And Inserts Them After Parent-Most Elements
encountered this same, what I consider, jQuery bug last week. I'm building a site in which I load some content via AJAX. This content contains Linkedin share button placeholders which Linkedin API ne ... read »
Feb 10, 2012 at 11:30 AM
Cross-Origin Resource Sharing (CORS) AJAX Requests Between jQuery And Node.js
After you understand the concepts here, this is an awesome cheatsheet for enabling CORS in just about anything http://enable-cors.org/ ... read »
JM
Feb 10, 2012 at 9:10 AM
My Safari Browser SQLite Database Hello World Example
@Amy, Here is a very good tutorial on how to use JOIN: http://www.sqltutorial.org/sqljoin-innerjoin.aspx ... read »
Feb 10, 2012 at 4:42 AM
Building A Twitter-Inspired RESTful API Architecture In ColdFusion
This is great, very useful Ben. I spotted a small typo in the api.cgm listing: <cfthrow type="Unauthroized" /> Cheers Stefan ... read »
Feb 9, 2012 at 10:35 PM
CFDirectory Filtering Uses Pipe Character For Multiple Filters (Thanks Steve Withington)
I was wondering if there would be a filter you could apply so that you got everything but what you included in the filter. As in show me all docs that are not a .pdf. ... read »