Skip to main content
Ben Nadel at Scotch On The Rock (SOTR) 2010 (London) with: Claude Englebert and Ciqala Burt and Mark Drew
Ben Nadel at Scotch On The Rock (SOTR) 2010 (London) with: Claude Englebert ( @cfemea ) Ciqala Burt ( @ciqala ) Mark Drew ( @markdrew )

Creating An XML Representation Of A Directory (And Sub-Directories) Using ColdFusion

By on
Tags:

A while back, I had read about replacing CFDirectory functionality with Java over on ColdFusion Muse. This is a cool idea and I have never played around with it, so I figure I might as well sit down and do a little demo for myself.

I happen to Love CFDirectory. I think it's a fabulous tag. The one thing I really hate about it though, is that it is not really structured in any useful way. It returns the whole directory structure in ONE massive query. As part of my File Explorer project (yes, I WILL finish that one day), I would love the ability to get a representation of a directory and sub-directories / files in some sort of structure that made more sense in my head.

To accomplish this, I have created a ColdFusion user defined function, GetDirectoryXML(), that recurses over a directory (the recursion itself is optional) and creates an XML document in this format:

<directory
	name="..."
	path="...">

	<directories>

		<directory
			name="..."
			path="...">

			--[ RECURSIVE AREA ]--

		</directory>

	</directories>

	<files>

		<file
			name="..."
			path="..."
			bytes="..."
			/>

	</files>

</directory>

For things like the Name and Path attributes, I opted for using XML node attributes over text nodes because XPath searches are much easier on attributes (at least at my low-xml-xpath-skill level). I like the above structure because it becomes very clear where all the files and directories belong. Also, If I wanted to render this XML it provides a great structure that has sub-structures that I could pass off to a recursive function for rendering.

Here is the code for the GetDirectoryXml() ColdFusion function:

<cffunction
	name="GetDirectoryXml"
	access="public"
	returntype="string"
	output="false"
	hint="Gets directory information from a directory File object and returns it in XML format.">

	<!--- Define arguments. --->
	<cfargument
		name="Directory"
		type="any"
		required="true"
		hint="The Java FILE object representing a directory."
		/>

	<cfargument
		name="Recurse"
		type="boolean"
		required="false"
		default="false"
		hint="Determines whether we will get child directory information recursively."
		/>

	<cfargument
		name="Buffer"
		type="any"
		required="false"
		default=""
		hint="This is the string buffer to which the result is written before being returned."
		/>

	<cfscript>

		// Set up local scope.
		var LOCAL = StructNew();

		// Create an array for files.
		LOCAL.Files = ArrayNew( 1 );

		// Create an array for directories.
		LOCAL.Directories = ArrayNew( 1 );

		// Get the children for this directory.
		LOCAL.Children = ARGUMENTS.Directory.ListFiles();

		// Set up a flag for returning the buffer. If the buffer
		// is a simple value then we want to return the buffer.
		// If it is a complex value, then either we are part
		// of the recursive call or a buffer was passed in.
		// Either way, don't return anything in that case.
		LOCAL.ReturnBuffer = IsSimpleValue( ARGUMENTS.Buffer );


		// Check to see if we have a buffer yet to which we
		// will be writing the result.
		if (IsSimpleValue( ARGUMENTS.Buffer )){

			// Create the string buffer.
			ARGUMENTS.Buffer = CreateObject(
				"java",
				"java.lang.StringBuffer"
				).Init();

		}


		// Loop over the children objects to figure out which
		// array they should go in: File or Directory.
		for (
			LOCAL.ChildIndex = 1 ;
			LOCAL.ChildIndex LTE ArrayLen( LOCAL.Children ) ;
			LOCAL.ChildIndex = (LOCAL.ChildIndex + 1)
			){

			// Check for file type.
			if (LOCAL.Children[ LOCAL.ChildIndex ].IsDirectory()){

				// Add to directory array.
				ArrayAppend(
					LOCAL.Directories,
					LOCAL.Children[ LOCAL.ChildIndex ]
					);

			} else {

				// Add to directory array.
				ArrayAppend(
					LOCAL.Files,
					LOCAL.Children[ LOCAL.ChildIndex ]
					);

			}

		}


		// ASSERT: At this point, we have two fully populated
		// arrays for Files and Directories. These are populated
		// with actual Java FILE objects, not just paths.


		// Start the directory node.
		ARGUMENTS.Buffer.Append(
			"<directory name=""" &
			XmlFormat( ARGUMENTS.Directory.GetName() ) &
			""" path=""" &
			XmlFormat( ARGUMENTS.Directory.GetPath() ) &
			""">"
			);


		// Start the directory node.
		ARGUMENTS.Buffer.Append(
			"<directories>"
			);

		// Loop over the directories to output xml data.
		for (
			LOCAL.DirectoryIndex = 1 ;
			LOCAL.DirectoryIndex LTE ArrayLen( LOCAL.Directories ) ;
			LOCAL.DirectoryIndex = (LOCAL.DirectoryIndex + 1)
			){

			// Get short hand.
			LOCAL.Directory = LOCAL.Directories[ LOCAL.DirectoryIndex ];

			// Check to see if we are recursing. If we are then
			// we can send this directory object right back to
			// this function (hello recursion). If not, we can
			// just output the basic data.
			if (ARGUMENTS.Recurse){

				// Since we are recursing, send back to self.
				GetDirectoryXml(
					Directory = LOCAL.Directory,
					Recurse = ARGUMENTS.Recurse,
					Buffer = ARGUMENTS.Buffer
					);

			} else {

				// Since we are not recursing, just output
				// simple data.
				ARGUMENTS.Buffer.Append(
					"<directory name=""" &
					XmlFormat( LOCAL.Directory.GetName() ) &
					""" path=""" &
					XmlFormat( LOCAL.Directory.GetPath() ) &
					""">"
					);

			}

		}

		// End the directory node.
		ARGUMENTS.Buffer.Append(
			"</directories>"
			);

		// Start the files node.
		ARGUMENTS.Buffer.Append(
			"<files>"
			);

		// Loop over the files to output xml data.
		for (
			LOCAL.FileIndex = 1 ;
			LOCAL.FileIndex LTE ArrayLen( LOCAL.Files ) ;
			LOCAL.FileIndex = (LOCAL.FileIndex + 1)
			){

			// Get short hand.
			LOCAL.File = LOCAL.Files[ LOCAL.FileIndex ];

			// Write file node.
			ARGUMENTS.Buffer.Append(
				"<file name=""" &
				XmlFormat( LOCAL.File.GetName() ) &
				""" path=""" &
				XmlFormat( LOCAL.File.GetPath() ) &
				""" bytes=""" &
				XmlFormat( LOCAL.File.Length() ) &
				"""/>"
				);

		}

		// Start the files node.
		ARGUMENTS.Buffer.Append(
			"</files>"
			);

		// Close the directory node.
		ARGUMENTS.Buffer.Append(
			"</directory>"
			);


		// Check to see if we should return the buffer.
		if (LOCAL.ReturnBuffer){

			// Return the directory Xml data.
			return( ARGUMENTS.Buffer.ToString() );

		} else {

			// Just return the empty string.
			return( "" );

		}

	</cfscript>
</cffunction>

You might notice that one of the arguments is an optional Java StringBuffer object. Internally, the UDF writes all the directory XML to a String Buffer which it then passes around recursively. This is done to minimize the amount of string concatenation that is being done. If you pass in a String Buffer object, the GetDirectoryXml() UDF will not return anything as you will already have a pointer to the String Buffer object. I like this flexibility, but not sure that it would be useful (other than for performance).

Calling the GetDirectoryXml() UDF on a simple directory might return something like this:

CFDump CFDirectory As XML

Notice the highly structured data. Even though it is structured in this nested way, you can still use XPath to search for files just as you could with a ColdFusion query of queries using CFQuery:

<!--- Search for the file "set.cfm". --->
<cfset arrFind = XmlSearch(
	xmlDir,
	"//file[@name='set.cfm']"
	) />

This will return an array of File XML nodes whose name attribute is "set.cfm" regardless of where in the structure it is.

Now, I mentioned that a highly structured XML directory representation could be easily rendered. This is another ColdFusion UDF that I have created in order to render the XML generated by GetDirectoryXml(). It's called RenderDirectoryXml():

<cffunction
	name="RenderDirectoryXml"
	access="public"
	output="true"
	returntype="void"
	hint="Graphically renders the XML created via GetDirectoryXml().">

	<!--- Define arguments. --->
	<cfargument
		name="Xml"
		type="xml"
		required="true"
		hint="The XML object created via GetDirectoryXml()."
		/>

	<cfargument
		name="Indent"
		type="string"
		required="false"
		default=""
		hint="The left hand buffer (in spaces)"
		/>

	<!--- Define local scope. --->
	<cfset var LOCAL = StructNew() />

	<!---
		Get current directory element. This might be the full
		XML object, or it might be a sub directory.
	--->
	<cfif StructKeyExists( ARGUMENTS.Xml, "XmlRoot" )>

		<!---
			We have a full on ColdFusion XML document object.
			Grab the root and use that.
		--->
		<cfset LOCAL.Root = ARGUMENTS.Xml.XmlRoot />

	<cfelse>

		<!---
			We are dealing with an actual XML directory
			node. Just use that as the root.
		--->
		<cfset LOCAL.Root = ARGUMENTS.Xml />

	</cfif>


	<!--- Get the sub directories. --->
	<cfset LOCAL.Directories = LOCAL.Root.directories.XmlChildren />

	<!--- Get the files. --->
	<cfset LOCAL.Files = LOCAL.Root.files.XmlChildren />


	<!--- Safe guard for indent. --->
	<cfif (Len( ARGUMENTS.Indent ) GT 20)>

		<cfthrow
			type="Too.Much.Recursion"
			message="We have recursed to many times."
			/>

	</cfif>


	<!--- Output the name of the current directory. --->
	<pre>#ARGUMENTS.Indent##LOCAL.Root.XmlAttributes.Name#/</pre>

	<!--- Loop over directories to output recursively. --->
	<cfloop
		index="LOCAL.DirectoryIndex"
		from="1"
		to="#ArrayLen( LOCAL.Directories )#"
		step="1">

		<!---
			Render sub directories. Notice that we are adding
			".." to the indent to show that we are in a sub-
			directory.
		--->
		<cfset RenderDirectoryXml(
			Xml = LOCAL.Directories[ LOCAL.DirectoryIndex ],
			Indent = (ARGUMENTS.Indent & "..")
			) />

	</cfloop>


	<!--- Loop over files to output. --->
	<cfloop
		index="LOCAL.FileIndex"
		from="1"
		to="#ArrayLen( LOCAL.Files )#"
		step="1">

		<pre>#ARGUMENTS.Indent#..#LOCAL.Files[ LOCAL.FileIndex ].XmlAttributes.Name#</pre>

	</cfloop>

	<!--- Return out. --->
	<cfreturn />
</cffunction>

This produces an output that looks like:

CFDump CFDirectory As Rendered XML

I think this is going to come in quite handy for finishing my blasted File Explorer. Thanks Mark Kruger!

Want to use code from this post? Check out the license.

Reader Comments

15,674 Comments

Shuns,

It is hard kicking so much ass? Cause that's probably the cleanest, coolest file explorer I have ever seen. I had a tiny bit of trouble figuring out how to set up the path, but got it going. You might want to throw a comment in there or a little demo snippet so people know what you are talking about. Very cool though!

15,674 Comments

Hmmm. That's odd. You are probably not passing in a File object to the method. The argument [Directory] needs a File object, NOT a file path:

Directory = CreateObject( "java", "java.io.File" ).Init( ExpandPath( "./directory_name_here/") )

You might be passing in just the ExpandPath() stuff in.

Am I close?

3 Comments

just getting back to this now. Yes, that was the problem.

I have one small problem though. When I run it through the renderer, if I have a structure like this in my FS...

Folder_DOCS
--doc1
--doc2
----Folder_DOCS_Sub
------doc1
------doc2
----------Folder_DOCS_Sub_sub
------------doc1
------------doc2

it ends up outputting like this

Folder_DOCS
----Folder_DOCS_Sub
----------Folder_DOCS_Sub_sub
------------doc1
------------doc2
------doc1
------doc2
--doc1
--doc2

how can i make it exactly replicate the FS stucture?

15,674 Comments

Derek,

I will make a folder structure as you describe and try running it. I am sure it's a minor bug. Thanks for the heads up. I will get back to you.

I believe in love. I believe in compassion. I believe in human rights. I believe that we can afford to give more of these gifts to the world around us because it costs us nothing to be decent and kind and understanding. And, I want you to know that when you land on this site, you are accepted for who you are, no matter how you identify, what truths you live, or whatever kind of goofy shit makes you feel alive! Rock on with your bad self!
Ben Nadel