Miss an article? Archives

Feature Article

Sunday, March 01, 2009

Making The Move To Creating Structured XML: An Interview with Thomas Aldous

Scott: Thomas, thanks for agreeing to chat with us a little today about making the move to structured XML. For those readers who don’t yet know who you are, tell us a little about yourself and your experience in the content industry.

Thomas: Thanks, Scott. I am a principal owner of a company called Integrated Technologies, better known as InTech. We began as a UNIX training company back in 1989. Within a few years, we made major inroads in the Engineering, Financial and Telecom industries. One of our first financial clients was Solomon Brothers. You may remember them. Solomon Brothers was a very large investment house that was purchased by Smith Barney, who was then purchased by Citi Bank. We had an exclusive contract to train their employees on how to use their Sun Workstations as a desktop and train their Systems Administrators how to administrate them. Solomon needed a desktop editor that was cross-platform compatible and decided to standardize on FrameMaker. They liked our training style, which is based on a system my partner created that we called “Edutainment”, which has a foundation built on the fact that you can’t teach them if they are asleep. Solomon Brothers asked us to get Certified as an Authorized Training Center and added FrameMaker Training to our contract.

imageAt this time, I was bored with only working on the business development alone and decide to take this FrameMaker project on myself. My first FrameMaker training class was a four day Basic and Advanced, that ended on a Thursday. Now this is where luck comes in. At the same time, we just received a UNIX training contract with the research arm of the baby bells called Bellcore. They were based in NJ. When my class ended at Solomon Brothers, I decided to visit the Bellcore curriculum manager in charger of the UNIX training contract. This curriculum manager introduced me to one of his colleagues who asked me a question that changed my life forever. The question was “I just received a project and have to train thousands of employees on a product called FrameMaker, do you know anyone that does this. Well, needless to say we became the largest FrameMaker trainer in a matter of months. In addition to Solomon Brothers and Bellcore, almost every Unix training contract we had, also picked up our FrameMaker training. At the same time, we developed a reseller network of Unix Hardware Vendors to sell our training offering for a margin. It was a new concept at the time, they were used to just passing the training portion of their deals back to the hardware vendor with no margin.

Let’s say hiring and training new trainers to deliver FrameMaker training to our customers under our “Edutainment” methodology was my new pastime.

Along the way, I listened to our customers. We helped them convert their legacy documents from Troff, Interleaf, Word Perfect and Word—to named a few legcy formats—into Unstructured FrameMaker and FrameBuilder (Structured). Our customers also involved us earlier on with exporting their content to SGML and then XML. We worked with them to integrate their documents into Content Management and publishing systems.

It was exciting, and this industry continues to be so because of the pace at which technology changes. We were getting into real information management. We partnered up with RR Donnelly, a financial reporting firm for all the large public corporations and we started looking at chunking information.

Scott: I know that your firm, Integrated Technologies, is a provider of various types of services designed to help folks make the move from unstructured to structured content. But what exactly does your company do? What types of services do you provide today?

Thomas: Basically, we are everything from a one-stop shop (handling everything from information architecture to implementation) to a provider of individual services that augment an organization’s existing talent pool.  We start with training content managers on the possibilities and go from there. We find most companies are now trying to transition from creating unstructured documentation to creating structured XML, which the goal of integrating their newly application-independent content into a content management system so they can efficiently manage their information assets. Of course, we get excited when they mention DITA specialization, XSLT, DTD and Schemas. Every project is a new challenge and we enjoy creating software tools to automate the conversion process to the structure our clients need. The more we can automate the process, the less time our clients’ documents are off-line. Often they are only off-line for only minutes. If we don’t have the knowledge and skill needed to provide a specific service, we help our clients find someone who can. Doing so allows us to focus on the entire project rather than just pieces.

imageScott: As a content technologist who specializes in XML publishing solutions, what are the main reasons the organizations you serve say they are moving to XML? Give us a few examples (and name drop, if you can - with URLs, where possible)

Thomas: Each organization usually has it’s own unique reasons for moving to XML, but usually it is because of cost savings and compliance.  For instance, TEEX, the Homeland Security Training Arm of Texas A & M wanted to lower costs by eliminating duplication of efforts when creating Homeland Security training manuals. Content reuse was the primary goal. We helped them modularize their information and placed it into a CMS making it easy for them to update and reuse their content.

Another interesting case was a client who wanted to move to XML, so that they could disseminate their content internationally, in any language. Converting their content to XML enabled them to expand their market exponentially.

We are currently working on an interesting project in the Nuclear Industry. This customer chose to migrate to XML from their legacy Interleaf system with specialized DITA structure, so they could separate content from formatting. They use Adobe FrameMaker and they own an Enterprise Level License for Documentum. Because there was no FrameMaker / Documentum / DITA integration, we developed it (with the help of a software business partner). It will be released for sale shortly.

Scott: As you know I speak at a lot of conference and get to talk to people who are trying to understand the rapidly changing publishing landscape, I want to ask you a few questions that I hear commonly. First, what is unstructured legacy content, exactly? And, what do you mean by unstructured?

Thomas: Unstructured legacy content is usually defined as content that has been created using software (authoring tool) that may or may not have used formatting tags, but did not have programmatic validation against a set of rules on how and where these tags could be used. Word users may have used styles or not. FrameMaker users may have used Paragraph and Character tags. Interleaf users may have used components. But these examples are all unstructured content.

In very general terms, the real difference between structured and unstructured content is that in the structured editing environment there are rule sets called DTDs or Schemas that define where we can place a certain piece of content and what metadata information can—and should—be provided. In Structured FrameMaker formatting can be accomplished with an EDD that will both validate the structure and apply formatting based on that structure. In most other major XML editors and also FrameMaker XSLT-FO stylesheets can be written for various outputs like PDF, HTML and mobile devices.

Scott: What is the most common type of content you are asked to convert?

Thomas: MS Word, Unstructured Frame, Interleaf and plain ASCII text are the most popular.

In most cases there is some inherent structure in an unstructured document. What I mean is that when an author created the content, they probably used a style guide that was either written or understood. They may have had rules like Heading1 always goes before Heading 2. They may have only meant to use these tags to create the formatting (or look-and-feel) of the document, but what they were actually doing was enforcing a simple structure. InTech makes software tools to help aid the legacy content conversion process that extend the inherent structure of unstructured documents.

Scott: As one would expect, as a user advocate I often ...

Read more

Filed under: Content ConversionDITAStructured ContentTechnical WritingAuthoring ToolsAdobe FrameMakerUnstructured ContentXMLXSLT

News & Notes
(updated daily. almost.)
News RSS Feed

DITA Workshops in Wisconsin; Outstanding Value!

Monday, June 29, 2009

The STC Wisconsin, Four Lakes and Twin Cities communities are cooperatively presenting three DITA workshops this July. DITA Content Modeling - July 11, Milwaukee; DITA-In-a-Day: Authoring - July 13, Minneapolis (FULL, Waiting list only); DITA-In-a-Day: Authoring - July 15, Madison; DITA Production - July 18, Milwaukee. Cost: $50 for STC members, $85 for non-members, $25 for students. You can’t beat this value anywhere. If you don’t sign up for these classes, you are really missing a BIG opportunity to learn at a deep discount.

Eight Slides Explain DITA Topics, Maps, Specialization

Wednesday, October 29, 2008

From the DITA Newsletter: ”Download a zipped folder with eight explanatory slides that you can use to educate your tech pubs group about DITA. Feel free to use these in your own presentations on DITA. They combine ideas from some of the best slides in use over the past few years by DITA evangelists. Or listen to the 5-minute Flash tutorial that uses animated versions of these slides to describe the core functionality of DITA.”

[Case Study] How Our DITA Conversion Saved Us $100,000, For Starters

Friday, September 26, 2008

Ever wonder how converting to a DITA/XML content management system would play out in real life? What if we added globalization? What if it showed nearly $100,000 savings for the first two deliverables (in 9 languages)? This step-by-step plan, by Jennifer Linton of CaridianBCT (formerly Gambro BCT), tells us exactly how it played out for them. Read the article in Data Conversion Labs newsletter.

DITA Maturity Model Contest - Win an iPod Touch, Free Tickets To DocTrain East

Monday, August 18, 2008

image
Submit your quality contributions to the newly launched DITA Maturity Model Community (MMC) before August 31, 2008, and you could win an 8GB Apple iPod Touch and two tickets to Documentation and Training East.

The DITA MMC (part of the DITA XML.org Wiki Knowledgebase) is an extension of the original white paper developed by Michael Priestley, lead IBM DITA architect, and Amber Swope, principal consultant for JustSystems. The DITA MMC lets you browse, edit, and add to a collaborative knowledgebase of information on using and understanding the DITA Maturity Model.

“The goal of the DITA Maturity Model Community is to elevate the discussion off the page,” said Jake Sorofman, senior vice president of marketing and business development for JustSystems. “Now, practitioners, managers and executives have access to a working forum where they can share knowledge and practical guidance, develop business cases, test out implementation scenarios, and get peer-to-peer advice.”

Subscribe: Direct Inbox Delivery

Get The Content Wrangler Newsletter delivered straight to your home or work Inbox. It's full of content goodness.

sponsors Image Image image Image image Image Image Image Image Image Image Image
Internet Blogs - Blog Top Sites