Articles

Friday, May 18, 2007

Converting Poorly Structured Legacy Content Creates Difficult Challenges When Moving To DITA

Content management and information design expert JoAnn Hackos discusses moving legacy documentation to the Darwin Information Typing Architecture (DITA) in this interview with Data Conversion Laboratory. Hackos defines legacy content, why and when converting content to DITA is desirable, how to decide what to convert (and what not to), and why writing content and designing new structures are a critical success factor when moving to DITA.

Hackos says: “Do not accept the old structures for the future. The new structure not only should be DITA; it’s going to be more structured and more effective. In the process you may find information that can move to DITA and information that could be dropped. You could devise a fairly sophisticated conversion script to do some of the work.”

“You might have content that is already well structured in the original,” says Hackos. “Conversion to DITA is going to give you a lot of value. Then you have some content that is so badly done that you don’t want to use it. This information may have to be completely rewritten. You might even need to start from scratch.”

“You will likely have a considerable middle ground-that’s where most people are. You have valuable information with good content. You want to move to DITA but in the process of getting there you want to make some intelligent decisions about using that content in the future. Put all your decisions about your content in at least three buckets: what to leave back as legacy, what to convert, and what to rewrite. One caution, if you convert information that is badly structured, it becomes even harder to fix later.”

Filed under: DITALegacy Content Conversion

Comments

By Marcus Carr on June 5, 2007 -- 5:35pm

Hackos says: “Do not accept the old structures for the future. The new structure not only should be DITA; it’s going to be more structured and more effective.”

I say: What do you mean by “old structures”? My existing XML? My “old structures” were designed to precisely describe the dataset, so how is a move to a generic schema like DITA going to result in something more structured?

Hackos says: “You might have content that is already well structured in the original,” says Hackos. “Conversion to DITA is going to give you a lot of value.

I say: No it’s not.

Hackos says: Then you have some content that is so badly done that you don’t want to use it. This information may have to be completely rewritten. You might even need to start from scratch.”

I say: In over fifteen years of doing conversions spanning numerous industries, I’ve never met a client that valued the structure of their documents more highly than the content. It’s usually possible to get them to make concessions where you can demonstrate that the data is not consistently organized, but telling them to rewrite it to fit with the structure that you’ve adopted is simply unrealistic.

Documentation problems are hard. Despite hype to the contrary, they don’t just dry up and blow away when they see DITA coming.

By ScottAbel on June 5, 2007 -- 6:57pm

Marcus:

Thanks for your comments. I hear what you’re saying and think that this topic is certainly going to be an important one as the industry increasingly adopts structured XML content management methodologies.

Scott Abel
The Content Wrangler

Leave a comment

Name:

Email:

Location:

URL:

Remember my personal information

Notify me of follow-up comments?

Please enter the word you see in the image below:


Subscribe: Direct Inbox Delivery

Get The Content Wrangler Newsletter delivered straight to your home or work Inbox. It's full of content goodness.

sponsors Image Image Image Image Image Image image Image Image Image Image Image Image Image Image Image