<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: More on Microsoft Word and non-interoperable standards compliance</title>
	<atom:link href="http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm/feed" rel="self" type="application/rss+xml" />
	<link>http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm</link>
	<description>This seems to be a workblog</description>
	<lastBuildDate>Thu, 11 Mar 2010 01:01:38 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Bruce D'Arcus</title>
		<link>http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm/comment-page-1#comment-134</link>
		<dc:creator>Bruce D'Arcus</dc:creator>
		<pubDate>Sun, 22 Mar 2009 01:25:27 +0000</pubDate>
		<guid isPermaLink="false">http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm#comment-134</guid>
		<description>Sigh ... I see Paul is trotting out the same distortions about the ODF metadata process again. I see that again, elsewhere, he&#039;s presenting conspiracy theories about backroom deals and such, suggesting my position on an issue was due to some ethically-dubious quid pro quo. 

Having been involved in all of those conversations (which Paul was not, by his own choice), I cannot recall a single engineer who agreed with Paul and Gary&#039;s view. In fact, the strongest voice against their position in the end was Thomas Zander, who is one of the lead developers of KOffice. Hard to pin that entirely on Sun, then, or &quot;big vendors.&quot;

I&#039;m certainly biased, but I am optimistic that the new RDF support in ODF 1.2 will provide room for enhancing interop. But as I&#039;ve &lt;a href=&quot;http://community.muohio.edu/blogs/darcusb/archives/2009/03/01/the-babel-of-citations&quot; rel=&quot;nofollow&quot;&gt;said recently&lt;/a&gt;, this is about much more than just document format specs. Office 2007/2008 has citation and bibliography support, for example. But AFAIK, no third-party extensions actually use it. There are probably many reasons for this, but one big one is the inadequate API that MS exposes.</description>
		<content:encoded><![CDATA[<p>Sigh &#8230; I see Paul is trotting out the same distortions about the ODF metadata process again. I see that again, elsewhere, he&#8217;s presenting conspiracy theories about backroom deals and such, suggesting my position on an issue was due to some ethically-dubious quid pro quo. </p>
<p>Having been involved in all of those conversations (which Paul was not, by his own choice), I cannot recall a single engineer who agreed with Paul and Gary&#8217;s view. In fact, the strongest voice against their position in the end was Thomas Zander, who is one of the lead developers of KOffice. Hard to pin that entirely on Sun, then, or &#8220;big vendors.&#8221;</p>
<p>I&#8217;m certainly biased, but I am optimistic that the new RDF support in ODF 1.2 will provide room for enhancing interop. But as I&#8217;ve <a href="http://community.muohio.edu/blogs/darcusb/archives/2009/03/01/the-babel-of-citations" rel="nofollow">said recently</a>, this is about much more than just document format specs. Office 2007/2008 has citation and bibliography support, for example. But AFAIK, no third-party extensions actually use it. There are probably many reasons for this, but one big one is the inadequate API that MS exposes.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Doug Mahugh : Miscellaneous links for 03/18/2009</title>
		<link>http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm/comment-page-1#comment-128</link>
		<dc:creator>Doug Mahugh : Miscellaneous links for 03/18/2009</dc:creator>
		<pubDate>Thu, 19 Mar 2009 04:00:49 +0000</pubDate>
		<guid isPermaLink="false">http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm#comment-128</guid>
		<description>[...] Peter Sefton has a thought-provoking post on custom XML and interoperability.&#160; It’s worth reading carefully, and you can follow the [...]</description>
		<content:encoded><![CDATA[<p>[...] Peter Sefton has a thought-provoking post on custom XML and interoperability.&#160; It’s worth reading carefully, and you can follow the [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Doug Mahugh</title>
		<link>http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm/comment-page-1#comment-127</link>
		<dc:creator>Doug Mahugh</dc:creator>
		<pubDate>Wed, 18 Mar 2009 04:12:13 +0000</pubDate>
		<guid isPermaLink="false">http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm#comment-127</guid>
		<description>Marbux, regarding RDF in ODF 1.2, I&#039;m just hoping for the best.  I hear you on the challenges there, and we&#039;ll continue to work with the ODF TC to try to improve interoperability.  We&#039;re in a position where we need to balance our perspective on various issues with the &quot;OMG, Microsoft is trying to dominate things&quot; reaction that we seem to get for just falling out of bed some days.

Peter, I&#039;m with Ian on the fact that IS29500&#039;s custom XML support is every bit as well-defined as OLE embedding is.  I agree with your point that it&#039;s messy for two different editing applications to round-trip documents with custom markup in them, especially if the document is edited in both apps, but that&#039;s not the typical use case I&#039;ve seen in custom XML scenarios.  The tagging of content is usually done immediately before some type of automated processing by a non-desktop app, so the issue of what an editor should do with that markup doesn&#039;t come up very often.

As for whether custom XML is a thinly veiled plot to lock people into Word, I think another way to say it is that Word is the richest implementation of custom XML concepts in a word processing application today.  We would love to have other implementations support these concepts as thoroughly as we do, exactly as they&#039;re specified in IS29500 and ECMA-376, and I think many organizations would get great value out of the resulting interoperability.  We&#039;re the proud owners of the first fax machine, but the protocols have been standardized, wires have been strung between towns, and we&#039;re looking forward to more fax machines coming online soon. :-)</description>
		<content:encoded><![CDATA[<p>Marbux, regarding RDF in ODF 1.2, I&#8217;m just hoping for the best.  I hear you on the challenges there, and we&#8217;ll continue to work with the ODF TC to try to improve interoperability.  We&#8217;re in a position where we need to balance our perspective on various issues with the &#8220;OMG, Microsoft is trying to dominate things&#8221; reaction that we seem to get for just falling out of bed some days.</p>
<p>Peter, I&#8217;m with Ian on the fact that IS29500&#8217;s custom XML support is every bit as well-defined as OLE embedding is.  I agree with your point that it&#8217;s messy for two different editing applications to round-trip documents with custom markup in them, especially if the document is edited in both apps, but that&#8217;s not the typical use case I&#8217;ve seen in custom XML scenarios.  The tagging of content is usually done immediately before some type of automated processing by a non-desktop app, so the issue of what an editor should do with that markup doesn&#8217;t come up very often.</p>
<p>As for whether custom XML is a thinly veiled plot to lock people into Word, I think another way to say it is that Word is the richest implementation of custom XML concepts in a word processing application today.  We would love to have other implementations support these concepts as thoroughly as we do, exactly as they&#8217;re specified in IS29500 and ECMA-376, and I think many organizations would get great value out of the resulting interoperability.  We&#8217;re the proud owners of the first fax machine, but the protocols have been standardized, wires have been strung between towns, and we&#8217;re looking forward to more fax machines coming online soon. <img src='http://ptsefton.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ian Easson</title>
		<link>http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm/comment-page-1#comment-126</link>
		<dc:creator>Ian Easson</dc:creator>
		<pubDate>Tue, 17 Mar 2009 17:04:19 +0000</pubDate>
		<guid isPermaLink="false">http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm#comment-126</guid>
		<description>Minor editing glitch:  Please replace the phrase &quot;If the former were true&quot; by &quot;If that were not the case&quot;.</description>
		<content:encoded><![CDATA[<p>Minor editing glitch:  Please replace the phrase &#8220;If the former were true&#8221; by &#8220;If that were not the case&#8221;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ian Easson</title>
		<link>http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm/comment-page-1#comment-131</link>
		<dc:creator>Ian Easson</dc:creator>
		<pubDate>Tue, 17 Mar 2009 16:59:52 +0000</pubDate>
		<guid isPermaLink="false">http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm#comment-131</guid>
		<description>One more clarification.

You said that:
&quot;That’s because OLE objects are embedded using standard interfaces not vague hand-wavy whatever-you-like Custom XML.&quot;

You seem to think that the following are either undefined or non-standardized:

1) The method of embedding a custom xml schema in the OPC container for an OOXML document
2) The method of tagging words or sections in the body of a document, for the case in which those tags are from the custom XML schema.

Neither of these are true.  They are both clearly documented as part of the IS29500 and ECMA-376 standard.

Those methods are INDEPENDENT (sorry for the shout) of the particular custom XML schema used.  It is not that custom schema that defines the embeddings; it is the IS29500 or ECMA-376 standards.  If the former were true, then you would have reason to worry, because there would indeed been no standardized way of handling the interoperability of such documents.

So, you can see why the use of custom XML in OOXML greatly *increases* the interoperability for applications that consume, produce, or modify files that make use of the hundreds of standardized XML vocabularies that have been defined for specific domains of knowledge.  When you combine this fact with what I showed in my earlier comment about no loss of information by conforming applications that do not understand the specific custom XML vocabulary, then you come to my main conclusion: a big win for interoperability overall.

I hope this helps.</description>
		<content:encoded><![CDATA[<p>One more clarification.</p>
<p>You said that:<br />
&#8220;That’s because OLE objects are embedded using standard interfaces not vague hand-wavy whatever-you-like Custom XML.&#8221;</p>
<p>You seem to think that the following are either undefined or non-standardized:</p>
<p>1) The method of embedding a custom xml schema in the OPC container for an OOXML document<br />
2) The method of tagging words or sections in the body of a document, for the case in which those tags are from the custom XML schema.</p>
<p>Neither of these are true.  They are both clearly documented as part of the IS29500 and ECMA-376 standard.</p>
<p>Those methods are INDEPENDENT (sorry for the shout) of the particular custom XML schema used.  It is not that custom schema that defines the embeddings; it is the IS29500 or ECMA-376 standards.  If the former were true, then you would have reason to worry, because there would indeed been no standardized way of handling the interoperability of such documents.</p>
<p>So, you can see why the use of custom XML in OOXML greatly *increases* the interoperability for applications that consume, produce, or modify files that make use of the hundreds of standardized XML vocabularies that have been defined for specific domains of knowledge.  When you combine this fact with what I showed in my earlier comment about no loss of information by conforming applications that do not understand the specific custom XML vocabulary, then you come to my main conclusion: a big win for interoperability overall.</p>
<p>I hope this helps.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ian Easson</title>
		<link>http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm/comment-page-1#comment-130</link>
		<dc:creator>Ian Easson</dc:creator>
		<pubDate>Tue, 17 Mar 2009 14:13:03 +0000</pubDate>
		<guid isPermaLink="false">http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm#comment-130</guid>
		<description>The essence of your misunderstanding is in the following sentence:

&quot;The information is written out into OOXML then when you open it in an earlier version of Word you can’t see it. I tried this with the NLM plugin and as far as I recall there was no structure apparent if you open the file in an earlier version of Word. The result is that if you try to edit the document you break the structure at which point THE INFORMATION IS LOST.&quot;

Just because no structure is *apparent* in another application doesn&#039;t mean the information is lost.  As long as you don&#039;t inadvertently (or deliberately) destroy the information, it won&#039;t be lost.  To prove that, all you have to do is re-open the edited document in Word 2007, and voila - the schema is intact and all tagged terms are still tagged.

Repeat the following experiment, which I just did a minute ago, and then you will understand:

1.  Install and configure the ontology add-in to Word 2007 on one PC.
2.  Create a Word 2007 document that uses a term in one of the ontologies you said to use during the configuration of the add-in.
    In my case, the entire document was the sentence: &quot;I have potter syndrome.&quot;  (&quot;Potter syndrome&quot; is a term in the Human Diseases ontology.)
3.  Save the document as a Word 2007 file.
4.  On another PC that has an earlier version of Word (in my case, Word 2003) and that of course has the &quot;Microsoft Office Compatibility Pack for Word, Excel, and PowerPoint 2007 File Formats&quot; installed, open the Word 2007 file.
5.  You will see that it doesn&#039;t recognize the term (just as you mentioned).
6.  Edit the document in some way.
In my case, I made 2 copies of the sentence, so I had three copies overall.  The first I left intact.  The second, I changed the words &quot;potter syndrome&quot; to &quot;blah blah&quot;.  The third, I added the word &quot;don&#039;t&quot; to the sentence, so it now read &quot;I don&#039;t have potter syndrome&quot;.  So, my document now read:
I have Potter syndrome.
I have blah blah.
I don&#039;t have Potter syndrome.
7.  In Word 2003, save the edited document in Word 2007 format.
8.  Open the edited document in Word 2007 on the original machine.

According to you, the information is destroyed.  Well, it isn&#039;t.

To be specific:

- The schema is still there, intact, which is easily verifiable if you have the Developer tab installed.
- The first, unedited sentence is still recognized by the ontology add-in as containing a recognized term (potter syndrome) with all its tags and meanings.
- The second sentence (which has &quot;blah blah&quot;) is not recognized as using the custom XML (because it doesn&#039;t).
- The third sentence, which says I don&#039;t have potter syndrome, is recognized as having the &quot;potter syndrome&quot; tag.  Everything is fully functional.

So, the information is NOT DESTROYED (sorry for the shouting!).

This fact shows that the custom XML feature in IS29500 is *not* a trick.

In my humble opinion, it finally realizes the original vision of SGML in the 80&#039;s, in that it provides a way for office documents to contain custom vocabularies (which can, of course, also be ones covered by international standards).  That GREATLY increases interoperability between applications that need to consume, produce, or transform information in those custom vocabularies.  There is no *inherent* information loss in OOXML applications like Word 2003 with the Compatability Pack (i.e., not counting deliberate user destruction of information) that are properly written not to destroy any custom schemas or custom schema tags.

As for OpenOffice, talk to whoever maintains it. From what you have said, it is not a properly conforming OOXML application in that it destroys the information.  But that does not take away anything I have said.</description>
		<content:encoded><![CDATA[<p>The essence of your misunderstanding is in the following sentence:</p>
<p>&#8220;The information is written out into OOXML then when you open it in an earlier version of Word you can’t see it. I tried this with the NLM plugin and as far as I recall there was no structure apparent if you open the file in an earlier version of Word. The result is that if you try to edit the document you break the structure at which point THE INFORMATION IS LOST.&#8221;</p>
<p>Just because no structure is *apparent* in another application doesn&#8217;t mean the information is lost.  As long as you don&#8217;t inadvertently (or deliberately) destroy the information, it won&#8217;t be lost.  To prove that, all you have to do is re-open the edited document in Word 2007, and voila &#8211; the schema is intact and all tagged terms are still tagged.</p>
<p>Repeat the following experiment, which I just did a minute ago, and then you will understand:</p>
<p>1.  Install and configure the ontology add-in to Word 2007 on one PC.<br />
2.  Create a Word 2007 document that uses a term in one of the ontologies you said to use during the configuration of the add-in.<br />
    In my case, the entire document was the sentence: &#8220;I have potter syndrome.&#8221;  (&#8220;Potter syndrome&#8221; is a term in the Human Diseases ontology.)<br />
3.  Save the document as a Word 2007 file.<br />
4.  On another PC that has an earlier version of Word (in my case, Word 2003) and that of course has the &#8220;Microsoft Office Compatibility Pack for Word, Excel, and PowerPoint 2007 File Formats&#8221; installed, open the Word 2007 file.<br />
5.  You will see that it doesn&#8217;t recognize the term (just as you mentioned).<br />
6.  Edit the document in some way.<br />
In my case, I made 2 copies of the sentence, so I had three copies overall.  The first I left intact.  The second, I changed the words &#8220;potter syndrome&#8221; to &#8220;blah blah&#8221;.  The third, I added the word &#8220;don&#8217;t&#8221; to the sentence, so it now read &#8220;I don&#8217;t have potter syndrome&#8221;.  So, my document now read:<br />
I have Potter syndrome.<br />
I have blah blah.<br />
I don&#8217;t have Potter syndrome.<br />
7.  In Word 2003, save the edited document in Word 2007 format.<br />
8.  Open the edited document in Word 2007 on the original machine.</p>
<p>According to you, the information is destroyed.  Well, it isn&#8217;t.</p>
<p>To be specific:</p>
<p>- The schema is still there, intact, which is easily verifiable if you have the Developer tab installed.<br />
- The first, unedited sentence is still recognized by the ontology add-in as containing a recognized term (potter syndrome) with all its tags and meanings.<br />
- The second sentence (which has &#8220;blah blah&#8221;) is not recognized as using the custom XML (because it doesn&#8217;t).<br />
- The third sentence, which says I don&#8217;t have potter syndrome, is recognized as having the &#8220;potter syndrome&#8221; tag.  Everything is fully functional.</p>
<p>So, the information is NOT DESTROYED (sorry for the shouting!).</p>
<p>This fact shows that the custom XML feature in IS29500 is *not* a trick.</p>
<p>In my humble opinion, it finally realizes the original vision of SGML in the 80&#8217;s, in that it provides a way for office documents to contain custom vocabularies (which can, of course, also be ones covered by international standards).  That GREATLY increases interoperability between applications that need to consume, produce, or transform information in those custom vocabularies.  There is no *inherent* information loss in OOXML applications like Word 2003 with the Compatability Pack (i.e., not counting deliberate user destruction of information) that are properly written not to destroy any custom schemas or custom schema tags.</p>
<p>As for OpenOffice, talk to whoever maintains it. From what you have said, it is not a properly conforming OOXML application in that it destroys the information.  But that does not take away anything I have said.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Paul E. Merrell, J.D. (Marbux)</title>
		<link>http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm/comment-page-1#comment-129</link>
		<dc:creator>Paul E. Merrell, J.D. (Marbux)</dc:creator>
		<pubDate>Tue, 17 Mar 2009 08:11:56 +0000</pubDate>
		<guid isPermaLink="false">http://ptsefton.com/2009/03/17/more-on-microsoft-word-and-non-interoperable-standards-compliance.htm#comment-129</guid>
		<description>I can&#039;t share Doug Mahugh&#039;s optimism for the RDFa support coming in ODF 1.2. As originally drafted, the Metadata SC&#039;s work included a provision that mandated xml:id preservation. But Sun launched a successful last-minute proposal to change &quot;shall preserve&quot; to &quot;should preserve,&quot; which grants discretion to destroy xml:ids whilst still claiming conformance.

There was a fiery battle over it and Sun ignored all requests for a use case exposing the need for the change, but the Sun proposal was duly rubber-stamped by the big vendor-dominated ODF TC.

Gary Edwards and I resigned from the TC on that occasion, which was only the latest in a series of TC decisions on ODF 1.2 that broke interoperability with MS Office. Two notable prior examples are the switch from ordered list tuples to list triples, and Sun&#039;s refusal to even place on the TC agenda five Novell proposals aimed at compatibility with MS Office.

So I don&#039;t understand Doug Mahugh&#039;s optimism about RDFa support in ODF 1.2, particularly since the Sun &quot;should preserve&quot; language appeared to be directly aimed at blocking the use of RDFa to work around the destruction of (conformant) foreign elements and attributes by OpenOffice.org in order to create MS Office ODF plug-ins that can interoperate with OOo.

My sense is that these issues will get fixed at ISO/IEC JTC 1 after ODF 1.2 gets there. But between the time it takes to get ODF 1.2 through JTC 1 and the lag time before JTC 1&#039;s changes are implemented, we&#039;re probably talking about a multi-year delay in interoperability.

I think it vitally important that people realize Microsoft is not the only big vendor who misbehaves in regard to document format standards.</description>
		<content:encoded><![CDATA[<p>I can&#8217;t share Doug Mahugh&#8217;s optimism for the RDFa support coming in ODF 1.2. As originally drafted, the Metadata SC&#8217;s work included a provision that mandated xml:id preservation. But Sun launched a successful last-minute proposal to change &#8220;shall preserve&#8221; to &#8220;should preserve,&#8221; which grants discretion to destroy xml:ids whilst still claiming conformance.</p>
<p>There was a fiery battle over it and Sun ignored all requests for a use case exposing the need for the change, but the Sun proposal was duly rubber-stamped by the big vendor-dominated ODF TC.</p>
<p>Gary Edwards and I resigned from the TC on that occasion, which was only the latest in a series of TC decisions on ODF 1.2 that broke interoperability with MS Office. Two notable prior examples are the switch from ordered list tuples to list triples, and Sun&#8217;s refusal to even place on the TC agenda five Novell proposals aimed at compatibility with MS Office.</p>
<p>So I don&#8217;t understand Doug Mahugh&#8217;s optimism about RDFa support in ODF 1.2, particularly since the Sun &#8220;should preserve&#8221; language appeared to be directly aimed at blocking the use of RDFa to work around the destruction of (conformant) foreign elements and attributes by OpenOffice.org in order to create MS Office ODF plug-ins that can interoperate with OOo.</p>
<p>My sense is that these issues will get fixed at ISO/IEC JTC 1 after ODF 1.2 gets there. But between the time it takes to get ODF 1.2 through JTC 1 and the lag time before JTC 1&#8217;s changes are implemented, we&#8217;re probably talking about a multi-year delay in interoperability.</p>
<p>I think it vitally important that people realize Microsoft is not the only big vendor who misbehaves in regard to document format standards.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
