<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>ptsefton &#187; Uncategorized</title>
	<atom:link href="http://ptsefton.com/category/uncategorized/feed" rel="self" type="application/rss+xml" />
	<link>http://ptsefton.com</link>
	<description>This seems to be a workblog</description>
	<lastBuildDate>Thu, 15 Jul 2010 01:10:27 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Open Repositories 2010 &#8211; Learning and Culture</title>
		<link>http://ptsefton.com/2010/07/15/open-repositories-2010-learning-and-culture.htm</link>
		<comments>http://ptsefton.com/2010/07/15/open-repositories-2010-learning-and-culture.htm#comments</comments>
		<pubDate>Thu, 15 Jul 2010 01:10:27 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/2010/07/15/open-repositories-2010-learning-and-culture.htm</guid>
		<description><![CDATA[



Issue: packaging
An idea for USQ



The last couple of days of the Open Repositories conference were devoted to the user-group streams, splitting the community into the Duraspace crowd and the ePrints devotees. You can move between them, of course, but some of the stuff that&#8217;s in those sessions is not really software specific, and I think [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="http://ptsefton.com/2010/07/15/open-repositories-2010-learning-and-culture.htm"><!-- &nbsp; --></abbr>
<div>
<div class="page-toc">
<ul>
<li><a href="#id2"><span>Issue: packaging</span></a></li>
<li><a href="#id3"><span>An idea for USQ</span></a></li>
</ul>
</div>
<div>
<p>The last couple of days of the Open Repositories conference were devoted to the user-group streams, splitting the community into the Duraspace crowd and the ePrints devotees. You can move between them, of course, but some of the stuff that&#8217;s in those sessions is not really software specific, and I think that it would be good to have a bit more cross-fertilisation between the software platforms (assuming they don&#8217;t all end up under one big foundation <img src='http://ptsefton.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> . But I know from talking to a couple of people from the program committee over dinner that streaming this conference is a very difficult balancing act. Some people will only come for &#8216;their&#8217; software sessions, and others only want the general sessions.</p>
<p>I attended a few talks on the use of repositories in eLearning (in Dspace and in ePrints) and on using ePrints for creative-arts outputs. I was interested to see what others are doing, as these are both areas where our team is working very actively at USQ. What I saw really reinforced for me that the work we&#8217;re doing at ADFI is on the right track. </p>
<p>In this post I want to talk briefly about one of the main issues that repository designers are facing as they move away from hosting mainly articles, mainly in PDF, into learning and creative-arts resources; the need for packaging and its impact on repository architecture, and then look to the opportunities that we have at USQ to apply some of our repository expertise <span class="spCh spChx2013">&#8211;</span> which  has mostly been on the research-outputs side <span class="spCh spChx2013">&#8211;</span> to learning resources.</p>
<h1><a id="id2" name="id2"><span /></a>Issue: packaging</h1>
<p><b>Packaging is important</b>; learning resources and creative-arts outputs need to be organised into ordered , hierarchical sets or resources that people can click through in a comfortable way. A screen-full of PDF files to download is not optimal.. In the learning world that&#8217;s why there are standards like IMS content packaging and SCORM. And for the creative arts there are lots of use-cases around organising things into exhibitions and portfolios. Exhibitions might travel around, and have different makeup in different places. Not to mention complexities like a photo in your repository that depicts more than one painting, all of which Duncan Dickinson at ADFI has been looking at modelling using CCO: <span class="spCh spChx201c">&#8220;</span><a href="http://www.vraweb.org/ccoweb/cco/partone.html"><span>Cataloging Cultural Objects</span></a><span class="spCh spChx201d">&#8221;</span>. </p>
<p>The thing is, this business of packaging things is putting pressure on the data models inherent in IR software like Eprints and DSpace which grew up around discrete &#8216;items&#8217; that have a metadata page, with click-to-download files. When we start looking at an exhibition of photographs this model is put under a fair bit of strain. </p>
<p>In the session on <a href="http://kultur.eprints.org/"><span>Kultur </span></a><span class="spCh spChx2013">&#8211;</span> the Eprints extension for creative arts output; Stephanie Meece showed some of the extensions that improve the metadata page in Eprints for exhibitions. While it does provide a basic way to show-off a large number of images, I think her talk showed up some of the architectural issues really well <span class="spCh spChx2013">&#8211;</span> she noted that sometimes the repository manager has to add circa fifty images files to an &#8216;exhibition&#8217; item, using an interface that requires a lot of scrolling backwards and forwards. I talked to her afterwards, and found that there is no real way, yet, to deal with exhibitions that change over time, or to re-use items across portfolios (the Eprints team did say that there is a collections feature now that could be used for this). One thing that Kultur did achieve is to work out the basis for metadata for creative-arts metadata; work that has been re-used in Australia. This is informing our work at USQ, and we have helped disseminate it via CAIRSS.</p>
<p>Back in the day, on the RUBRIC project, I was a vocal critic of the notion of &#8216;hard&#8217; collections I don&#8217;t like the way DSpace was designed around &#8216;communities&#8217; and &#8216;collections&#8217; and I argued that for most of the use-cases  people were talking about then were better served by &#8217;soft&#8217; metadata-driven collections. If you want to look at all your theses, for example that should just be a &#8217;slice&#8217; of the repository based on a query not  some process of hand-curating a list, or having to deposit items into a particular collection.</p>
<p>But as I noted above there are cases where you do need to hand-curate sets of items. </p>
<p>In our current work on The Fascinator, Duncan Dickinson is leading the team in efforts to capture creative-arts outputs, partly for the ERA assessment where exhibitions are particularly important. Bron Chandler is managing the media repository project, where courseware media objects (video, audio that sort of thing)  that need to be grouped into course-materials packages along with the traditional long-form course books for which USQ is famous. This work has led us to create a flexible architecture where you can either:</p>
<ol class="lin" style="list-style: decimal;">
<li>
<p>Add a manifest/table of contents to a single compound repository item containing lots of stuff; for example we have pre-populated a repository with all of USQ&#8217;s current course offerings from ICE and each course can be considered one item.</p>
</li>
<li>
<p>Group several discrete items together into a package.</p>
</li>
</ol>
<p>Oh, and you can package packages too <span class="spCh spChx2013">&#8211;</span> something we&#8217;re going to have to think about very hard when it comes to user-interfaces.</p>
<p>One output of this work is some web-interface code for navigating and managing packages, called Paquete. Paquete can build a table of contents and forward/back links to a set of web resources using a simple JSON table of contents. It&#8217;s like having a pure web-based eBook or IMS package reader. We&#8217;re thinking it could be deployed in all sorts of places, such as on top of ePrints, or in WordPress, and, of course in a learning management system. One use would be in <a href="http://www.jorum.ac.uk/"><span>Jorum</span></a>Open, the DSpace-based OER repository which is in need of a way to view IMS packages. It would be easy to add an IMS organiser support to Paquete, or produce a standalone tool to add a Paquete manifest to an IMS content package.</p>
<p>You can see a simple demo of an early version Paquete at our demo site. Note that each part of the package, which behind the scenes is a separate HTML page,  has a proper URL, so I can send you direct to the<a href="http://demo.adfi.usq.edu.au/paquete/demo/#module01.htm"><span> intro</span></a>, or to the bit about <a href="http://demo.adfi.usq.edu.au/paquete/demo/#module02.htm"><span>JSON</span></a>. The idea is that on an HTML 5 device like an iPhone, you could use &#8217;save as App&#8217; to grab a copy of an entire resource, via the HTML 5 manifest, and the stable URLs mean we can combine it with our annotation software that allows tags and discussion to take place in-line.</p>
<p>Using Paquete, it won&#8217;t matter to users whether the resources on show are spread out across several repository items or all jammed into one big item. So, for an exhibition, you could put ALL the pictures into a single ePrints object, then make multiple new items which consisted of slightly different packages representing variants of the exhibition. Or, each picture could be a first-class item with a number of exhibition items that simply reference them.</p>
<p>There&#8217;s lots more we can do with this viewer with a bit more coding<span class="spCh spChx2013">&#8211;</span> to make it show pictures and videos gracefully, and to work in presentation mode like PowerPoint. Speaking of PPT, why not render each slide as a JPEG and then let people flip through them there in the repository without having to download? (We&#8217;re very close to this in our work on The Fascinator). It could also be made easier to use on mobile devices, mimicking the tap-to-page-turn behaviour of eBook apps like Stanza or Kindle <a href="http://www.russellbeattie.com/blog/making-an-ipad-two-column-magazine-layout-using-jquery"><span>along the lines of this thing that Russell Beattie did</span></a>. Contributions are welcome if any of this sounds like something you can do. I&#8217;d be really happy if someone with some Eprints skills was able to do a plugin that can recognize where there&#8217;s a Pacquet or IMS manifest present and show the HTML resource in-line.</p>
<p>There&#8217;s <b>a lot more to say about this packaging stuff,</b> and how we might be able to use the ePUB format to ship repository content around, I&#8217;ll come back to this in a future series of posts.</p>
<h1><a id="id3" name="id3"><span /></a>An idea for USQ</h1>
<p>One of the repositories on show at OR10 in a session led by Yvonne Howard was an effort at Southampton to provide an open platform for sharing: <a href="http://www.edshare.soton.ac.uk/"><span>EdShare</span></a>. This is an ePrints repository that&#8217;s being built-in behind the Learning management System / Virtual Learning Environment they use at Southampton. I think the idea is that eventually, there will be seamless integration with BlackBoard so that resources can show up as part of the course-experience there, but be sitting in an open content repository behind.</p>
<p>There&#8217;s an interesting design pattern here; a sharing oriented repository that&#8217;s also hooked in to an access-controlled learning environment. At Southampton the emphasis is on low-barrier to participation so that lecturers can get stuff up as quickly as possible, with the downside that some of the metadata provided can be a bit lacking and many of the resources end up being quite fragmented. For example in the <a href="http://humbox.ac.uk/"><span>HumBox</span></a> site for sharing humanities resources last Friday, the latest uploads were a bunch of individual PDF files that didn&#8217;t mean much on their own. Fom my point of view as someone discovering stuff I would have preferred them to be in one bundle so I could begin to make sense of them.  </p>
<p>At USQ, on the other hand, where a large number of our courses go through a production process in which many people collaborate with the course writer to bake-in pedagogy, get the referencing right, and deal with licensing for readings and other supplementary materials, we are well on the way to having v<b>ery well-described well-organised materials already in a repository</b>. It would be so easy to tag a course as &#8216;ok for Open Access&#8217; and have it flow through to an externally-facing site as a high quality Open Educational Resource (OER). </p>
<p>There are lots of reasons that we might want to go open, even just a little bit. I&#8217;ll rehearse them here, yet again:</p>
<ol class="lin" style="list-style: decimal;">
<li>
<p><b>For prospective students: </b>I bet high-quality courseware would bring lots of traffic from people searching for stuff . Some of them might enrol, particularly if we lower the barriers to participation.</p>
</li>
<li>
<p><b>For staff: </b>open resources can be good for a lecturer&#8217;s profile <span class="spCh spChx2013">&#8211;</span> even better if we can work out a way to build in a genuine peer review process that helps some kinds of resources also count as research outputs (which = $$$$ and prestige in a way that learning resources currently just don&#8217;t). OERs would mean that you get to take your work with you to the next institution. </p>
<p>There are some things to think about re copyright, too. At the moment USQ owns the courseware, but in new models there might be ways to let authors keep their copyright, with an agreement that they license it openly.</p>
</li>
<li>
<p><b>For USQ</b> there&#8217;s that thing about <a href="http://en.wikipedia.org/wiki/Linus&apos;_Law"><span>many eyeballs</span></a>. Back when we first looked at open courseware, some senior people wondered if we wanted to expose some of the content. A bit of sunlight should work wonders if there are substandard bits of courseware (I&#8217;m not saying there are). Feedback and re-use has the potential to help us improve our materials. </p>
<p>When we first launched our minimal set of OERs at USQ Prof. Jim Taylor was working on a project to get volunteer tutors (focusing on retired academics) to help learners in an LMS environment. That project didn&#8217;t come off, but I am wondering if there might not be some people willing to review courseware<span class="footnote" style="vertical-align: super;"><a class="footnote" href="#ftn1" name="ftn1-text" title="1I know one retired physics lecturer, for example who can&apos;t resist pointing out, er, issues with the way physics is presented to students. Why in Year 12 he once &apos;marked&apos; an entire textbook I&apos;d been set; the textbook didn&apos;t do very well."><span>1</span></a></span>. </p>
</li>
</ol>
<p>Those are just some of the reasons we might open our courseware, there are other stories about why we might use stuff created elsewhere.</p>
<p>It sounds silly, but at the moment <b>we can&#8217;t even share our own courseware with our own students</b>. As I said not long ago, <a href="http://ptsefton.com/2010/05/17/cheap-laugh-1-suggest-opening-up-courseware-just-a-little.htm"><span>we should at least open up all the resources to our existing students</span></a> so they can (a) go back to prerequisite stuff they missed or forgot and (b) discover new things to enrol in.</p>
<p>To bring this back to Open Repositories <span class="spCh spChx2013">&#8211;</span> there&#8217;s a research project around the idea of having two channels into our course materials, one via the existing institutional systems we use for students and staff; the Moodle LMS and the course production systems, but with a public-facing discovery system that lets us expose at least some of our resources under open licenses so we can see if the potential benefits are realised and work out what the various stakeholders can do with openly licensed quality-controlled learning materials.</p>
<p class="center">Copyright Peter Sefton, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. &lt;<a href="http://creativecommons.org/licenses/by-sa/2.5/au/" onclick="javascript:window.top.open(&quot;http://creativecommons.org/licenses/by-sa/2.5/au/&quot;);return false;"><span>http://creativecommons.org/licenses/by-sa/2.5/au/</span></a>&gt;</p>
<p class="center"><span class="Default_20_Paragraph_20_Font"><span style="country:US; language:en; "><span class="T1"><a name="HTTP:::DBPEDIA.ORG:SNORQL:?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%"><span /></a><img alt="HTTP://DBPEDIA.ORG/SNORQL/?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%" class="fr1" height="31" src="http://ptsefton.com/wp-content/uploads/2010/07/m40ca94ba1.png" style="border:0px; vertical-align: top" width="88" /></span></span></span></p>
<p class="center">This post was written in OpenOffice.org, using templates and tools provided by the <a href="http://ice.usq.edu.au/" onclick="javascript:window.top.open(&quot;http://ice.usq.edu.au/&quot;);return false;"><span>Integrated Content Environment</span></a> project and published to WordPress using <a href="http://fascinator.usq.edu.au/desktop/desktop.htm" onclick="javascript:window.top.open(&quot;http://fascinator.usq.edu.au/desktop/desktop.htm&quot;);return false;"><span>The Fascinator</span></a>.</p>
<hr />
<div style="font-size: .9em;"><span class="footnote-defined"><a href="#ftn1-text" name="ftn1"><span>1</span></a>I know one retired physics lecturer, for example who can&#8217;t resist pointing out, er, issues with the way physics is presented to students. Why in Year 12 he once &#8216;marked&#8217; an entire textbook I&#8217;d been set; the textbook didn&#8217;t do very well.</span></div>
</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2010/07/15/open-repositories-2010-learning-and-culture.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>AWE &#8211; Presentation for Open Repositories 2010</title>
		<link>http://ptsefton.com/2010/07/07/awe-presentation-for-open-repositories-2010.htm</link>
		<comments>http://ptsefton.com/2010/07/07/awe-presentation-for-open-repositories-2010.htm#comments</comments>
		<pubDate>Wed, 07 Jul 2010 10:38:40 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/2010/07/07/awe-presentation-for-open-repositories-2010.htm</guid>
		<description><![CDATA[












Peter Sefton
sefton@usq.edu.au
University of Southern Queensland


Duncan Dickinson
Duncan.Dickinson@usq.edu.au
University of Southern Queensland





[Update: added link to the paper]
The short paper Duncan Dickinson and I put together for this conference is organised around the conference themes, and what our Research and Development group at the Australian Digital Futures Institute is doing about each of them. In this presentation I will [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="http://ptsefton.com/2010/07/07/awe-presentation-for-open-repositories-2010.htm"><!-- &nbsp; --></abbr>
<div>
<div class="page-toc"></div>
<div>
<div class="Table1" style="width: 100%; margin: 0px; padding: 0px; text-align: left;">
<table class="Table1" style="border-spacing: 0pt; empty-cells: show; width: 17cm; border-collapse: collapse;">
<colgroup>
<col style="width: 8.498cm;"></col>
<col style="width: 8.502cm;"></col>
</colgroup>
<tbody>
<tr>
<td class="Table1_A1" style="vertical-align: top; border: none; padding: 0.097cm;">
<p class="meta-author-name" style="margin-bottom: 0cm; margin-top: 0.42cm;"><span class="meta-familyname">Peter Sefton</span></p>
<p class="meta-author-email"><a href="mailto:sefton@usq.edu.au"><span class="Internet_20_link">sefton@usq.edu.au</span></a></p>
<p class="meta-author-affiliation">University of Southern Queensland</p>
</td>
<td class="Table1_A1" style="vertical-align: top; border: none; padding: 0.097cm;">
<p class="meta-author-name" style="margin-bottom: 0cm; margin-top: 0.42cm;"><span class="meta-familyname">Duncan Dickinson</span></p>
<p class="meta-author-email"><a href="mailto:Duncan.Dickinson@usq.edu.au"><span class="Internet_20_link">Duncan.Dickinson@usq.edu.au</span></a></p>
<p class="meta-author-affiliation"><span class="meta-familyname">University of Southern Queensland</span></p>
</td>
</tr>
</tbody>
</table>
</div>
<p>[Update: added link to the paper]</p>
<p><a href="http://ptsefton.com/2010/04/25/repositories-post-2010-embracing-heterogeneity-in-awe-the-academic-working-environment.htm">The short paper Duncan Dickinson and I put together for this conference</a> is organised around the conference themes, and what our Research and Development group at the Australian Digital Futures Institute is doing about each of them. In this presentation I will pick out some of the the work we&#8217;re doing and some of the issues we&#8217;re thinking about, and try to relate this work back to the conference themes.</p>
<p>It&#8217;s gratifying to be able to make this presentation. Some of the core ideas we&#8217;re talking about here were the subject of a proposal I submitted for Open Repositories 2007, but it received mixed reviewer feedback and was relegated to a poster; my message that repositories were stuck in <span class="spCh spChx201c">“</span>Web 0.5<span class="spCh spChx201d">”</span> and needed to be made more webish was not timely.<span class="Footnote_20_anchor"><span class="footnote" style="vertical-align: super;"><a class="footnote" title="1 At dinner on Monday night I was told that the ideas we're pushing here at ADFI are “15 years ahead” of where the users are. Given that this paper was accepted, and doing some quick maths in my head I think that means that we have only 25 months before the things that we're talking about here are considered relevant, given that we're working in Internet time." name="ftn1-text" href="#ftn1"><span>1</span></a></span></span></p>
<div class="slide">
<h1>Conference themes</h1>
<p>Please tick these off as I go.</p>
<ol class="lin" style="list-style: decimal;">
<li><strong>The web and the repository &amp; The cloud and the desktop</strong>*</li>
<li>Knowledge and technology</li>
<li>Wild and curated content</li>
<li>Linked and isolated data &amp; Ad-hoc and long-term access</li>
<li>Disciplinary and institutional systems / Scholars and service providers</li>
<li>Ubiquitous and personalized environments</li>
</ol>
<p>* This is the big one!</p>
</div>
<p>So, what is this AWE thing?</p>
<div class="slide">
<h1><a href="http://adfi.usq.edu.au/blog/2010/03/what-should-we-call-awe/"><span class="Internet_20_link">AWE</span></a>: The Academic Working Environment</h1>
<p><strong>Duncan: </strong><span class="spCh spChx201c">“</span>A set of purposeful technologies brought together by standard interfaces for data exchange?<span class="spCh spChx201d">”</span></p>
<p><strong>Peter:</strong> <span class="spCh spChx201c">“</span>I needed a single cover-all name for all the different projects we were working on in our <a href="http://adfi.usq.edu.au/"><span class="Internet_20_link">institute</span></a>, so that I could try to report to the powers that be in a more efficient way.<span class="spCh spChx201d">”</span></p>
</div>
<div class="slide">
<h1>The Web and the Repositories</h1>
<p>We&#8217;re <strong>on</strong> the web, but are we <strong>of</strong> the web?</p>
</div>
<div class="slide">
<h1>Show and tell</h1>
<ul class="lib">
<li>Repositories need to understand <strong>renditions</strong> and <strong>rendering</strong>
<ul class="lib">
<li>Rendering: The <a href="../../api/convert"><span class="Internet_20_link">ICE service</span></a> <span class="spCh spChx2013">–</span> renders web versions of everything it can and extracts metadata.</li>
<li>Renditions: See our site for<a href="http://demo.adfi.usq.edu.au/library/default/search"><span class="Internet_20_link"> USQ policies</span></a>. HTML all the way.</li>
</ul>
</li>
<li>We&#8217;re working on small pieces of web-infrastructure that should apply across multiple repositories.
<ul class="lib">
<li>Anotar: annotations toolkit designed to be easy to plug in to any web system <span class="spCh spChx2013">–</span> even I can do it. See this <a href="http://demo.adfi.usq.edu.au/blog/2010/05/25/anotar-wordpress-plugin/"><span class="Internet_20_link">WordPress version</span></a>.
<p>(Not as good as <a href="http://digress.it/"><span class="Internet_20_link">digress.it</span></a> but much more portable).</li>
<li><a href="http://demo.adfi.usq.edu.au/paquete/demo/#module01.htm"><span class="Internet_20_link">Paquete</span></a>: An ePUBesque reader you can embed in any web page.
<p>Also a way for us to support compound objects in a repository.</li>
</ul>
</li>
<li>We&#8217;re working on bringing the web to the desktop.
<p>&lt;Insert demo of The Fascinator Desktop&gt;</li>
</ul>
</div>
<div class="slide">
<h1>Screenshot: HTML documents derived from Word documents</h1>
<p class="P8"><a name="graphics1"><span> </span></a><img class="fr1" style="border: 0px; vertical-align: top;" src="http://ptsefton.com/wp-content/uploads/2010/07/m3a411f99_610x406.jpeg" alt="graphics1" width="610" height="406" /></p>
</div>
<div class="slide">
<h1>Screenshot: ICE conversion service</h1>
<p><a name="graphics2"><span> </span></a><img class="fr2" style="border: 0px; vertical-align: top;" src="http://ptsefton.com/wp-content/uploads/2010/07/mfacbaf2.png" alt="graphics2" width="682" height="446" /></p>
</div>
<div class="slide">
<h1>Screenshot: ICE conversion options for word processing</h1>
<p class="P8"><a name="graphics3"><span> </span></a><img class="fr2" style="border: 0px; vertical-align: top;" src="http://ptsefton.com/wp-content/uploads/2010/07/4731ccff_426x526.jpeg" alt="graphics3" width="426" height="526" /></p>
</div>
<div class="slide">
<h1>Screenshot: Annotations on a (this) document</h1>
<p><a name="graphics4"><span> </span></a><img class="fr2" style="border: 0px; vertical-align: top;" src="http://ptsefton.com/wp-content/uploads/2010/07/m5208af63_643x209.jpeg" alt="graphics4" width="643" height="209" /></p>
</div>
<div class="slide">
<h1>Screenshot: Paquete stand-alone demo</h1>
<p class="P8"><a name="graphics5"><span> </span></a><img class="fr2" style="border: 0px; vertical-align: top;" src="http://ptsefton.com/wp-content/uploads/2010/07/m299e55db_611x209.jpeg" alt="graphics5" width="611" height="209" /></p>
</div>
<div class="slide">
<h1>Issue: What will happen when vertically controlled computing platforms are the norm?</h1>
<p class="center" style="font-weight: bold;">Does the desktop/lab/home PC become a server? Or will it live in the cloud?</p>
</div>
<p>I brought an iPad with me on this trip, but I have found that I can&#8217;t put music on it or look at the books I have bought across <strong>four</strong> <span class="spCh spChx2013">–</span> yes, four <span class="spCh spChx2013">–</span> different reader applications using standard file-operations. Imagine the problems if we have to deal with valuable research data which lives on these controlled devices. A whole new era of format lock-in that makes Microsoft look like a free-software hippy.</p>
<p>This is one reason why the web, and delivering stuff in web formats is important.</p>
<div class="slide">
<h1>Idea: How about ePUB as a repository packaging format?</h1>
<p class="P5">(In my reading of the specs) ePUB is engineered for overloading:</p>
<p>A  zip file containing:</p>
<ul class="lib">
<li>(At least) XHTML, with optional extra elements &amp; a flat table of contents.
<ul class="lib">
<li>Can include video, chemistry, whatever if you provide fallback image/text.</li>
</ul>
</li>
<li>Allows for alternate renditions such as PDF, docx or odt originals.</li>
<li>We could include an HTML 5 version (using something like Paquete) for modern browsers/devices.
<ul class="lib">
<li>Javascript is allowed <span class="spCh spChx2013">–</span> but must be ignored by ePUB readers but could be used by web apps.</li>
</ul>
</li>
</ul>
</div>
<p class="center">
<div class="slide">
<h1>Idea: <a href="http://www.openebook.org/"><span class="Internet_20_link">ePUB</span></a> use-cases</h1>
<p><strong>Repository:</strong><span style="font-weight: normal;"><span class="T2"> (SIP/AIP/DIP?)</span></span></p>
<ul class="lib">
<li>Serve the content using an in-page eReader (Paquete).</li>
<li>Let the package handle package semantics (pre-print, published version, presentation), repository can continue to handle streams.</li>
<li>Support viewers for data types for defined periods, such as JMOL for chemistry <span style="font-weight: normal;"><span class="T2">using something like ePUB&#8217;s fall-back mechanism and oEmbed.</span></span></li>
</ul>
<p class="P6">Users:</p>
<ul class="lib">
<li>If all else fails: unzip the package and click &#8216;index.html&#8217;</li>
<li>Use with eBook reader software/hardware.</li>
</ul>
<p class="P6">Developer / repository manager:</p>
<ul class="lib">
<li>Add more packaging info <span class="spCh spChx2013">–</span> ORE, METS etc.</li>
</ul>
<p class="P5">
</div>
<p>One of the other things were look at at ADFI is ways of bootstrapping the Linked-Data/Semantic Web. I (Peter) have proposed a way of embedding RDF statements in documents using simple interoperable URIs. Duncan has taken this work further with a system that can serve &#8216;proper&#8217; RDF. The demo here shows this technique for metadata, but it could also be applied for other kinds of semantics, when you are talking about someone, for example rather than asserting that they are an author or an editor.</p>
<div class="slide">
<h1>Idea: Making linked data, well, links</h1>
<ul class="lib">
<li>Step 1: Approach your repository or ID provider, search for self
<p><a href="http://nla.gov.au/nla.party-541658"><span class="Internet_20_link"><span style="font-size: 10.5pt; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; text-transform: none;"><span class="T6">http://nla.gov.au/nla.party-541658</span></span></span></a><span style="color: #333333; font-size: 10.5pt; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; text-transform: none;"><span class="T4"> </span></span></p>
<p>Note: wrapping this link around some text is essentially meaningless. It&#8217;s not the semantic web it&#8217;s the old-web.</li>
<li>Step 2: Copy the link labelled <span class="spCh spChx201c">“</span>Assert Authorship<span class="spCh spChx201d">”</span>
<p><code>http://ontologize.me/meta/?r=http://purl.org/dc/terms/creator&amp;o=http://nla.gov.au/nla.party-541658</code><code><span style="color: #333333; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; text-transform: none;"><span class="T5"> </span></span></code></li>
</ul>
</div>
<div class="slide">
<h1>Word processor-proof linked data <span class="spCh spChx2013">–</span> part 2</h1>
<ul class="lib">
<li>Step 3: Paste onto document text
<p>Author: <a title="This is the author, ID: [http://nla.gov.au/nla.party-541658]" href="http://ontologize.me/meta/?r=http://purl.org/dc/terms/creator&amp;o=http://nla.gov.au/nla.party-541658"><span class="Internet_20_link">Peter Sefton</span></a></li>
<li>Step 4: Deposit somewhere that understands
<p><a name="graphics6"><span> </span></a><img class="fr2" style="border: 0px; vertical-align: top;" src="http://ptsefton.com/wp-content/uploads/2010/07/11a98ed3.png" alt="graphics6" width="559" height="75" /></p>
<p><code>&lt;oai_dc:dc&gt;</code></p>
<p><code>&lt;dc:title&gt;AWE - Presentation for Open Repositories 2010&lt;/dc:title&gt;</code></p>
<p><code>...</code></p>
<p><code>&lt;dc:creator&gt;http://nla.gov.au/nla.party-541658&lt;/dc:creator&gt;</code></p>
<p><code>&lt;/oai_dc:dc&gt;</code></li>
</ul>
</div>
<div class="slide">
<h1>Conclusion</h1>
<p>We (like many others) are building a toolkit for web/repository construction. The key reasons we rolled our own are:</p>
<ul class="lib">
<li>We care about having web-resources not just PDF.</li>
<li>We wanted to be able to deploy the application to the desktop (hence a Java app that can be deployed with Apache Solr).</li>
</ul>
</div>
<p class="center">Copyright Peter Sefton &amp; Duncan Dickinson, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. &lt;<a onclick="javascript:window.open(&quot;http://creativecommons.org/licenses/by-sa/2.5/au/&quot;);return false;" href="http://creativecommons.org/licenses/by-sa/2.5/au/"><span class="Internet_20_link">http://creativecommons.org/licenses/by-sa/2.5/au/</span></a>&gt;</p>
<p class="center"><span class="Default_20_Paragraph_20_Font"><span style="country: US; language: en;"><span class="T7"><a name="graphics7"><span> </span></a><img class="fr2" style="border: 0px; vertical-align: top;" src="http://ptsefton.com/wp-content/uploads/2010/07/m40ca94ba.png" alt="graphics7" width="88" height="31" /></span></span></span></p>
<p class="center">This post was written in OpenOffice.org, using templates and tools provided by the <a onclick="javascript:window.open(&quot;http://ice.usq.edu.au/&quot;);return false;" href="http://ice.usq.edu.au/"><span class="Internet_20_link">Integrated Content Environment</span></a> project and published to WordPress using <a onclick="javascript:window.open(&quot;http://fascinator.usq.edu.au/desktop/desktop.htm&quot;);return false;" href="http://fascinator.usq.edu.au/desktop/desktop.htm"><span class="Internet_20_link">The Fascinator</span></a>.</p>
<hr />
<div style="font-size: .9em;"><span class="footnote-defined"><a name="ftn1" href="#ftn1-text"><span>1</span></a> At dinner on Monday night I was told that the ideas we&#8217;re pushing here at ADFI are <span class="spCh spChx201c">“</span>15 years ahead<span class="spCh spChx201d">”</span> of where the users are. Given that this paper was accepted, and doing some quick maths in my head I think that means that we have only 25 months before the things that we&#8217;re talking about here are considered relevant, given that we&#8217;re working in Internet time.</span></div>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2010/07/07/awe-presentation-for-open-repositories-2010.htm/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Names (but not IDs) for Name Authority Services</title>
		<link>http://ptsefton.com/2010/06/22/names-but-not-ids-for-name-authority-services.htm</link>
		<comments>http://ptsefton.com/2010/06/22/names-but-not-ids-for-name-authority-services.htm#comments</comments>
		<pubDate>Tue, 22 Jun 2010 06:16:49 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/?p=557</guid>
		<description><![CDATA[


In my last post I asked “What should we call this name-authority, vocabulary-server, linked-data URI factory service we are building for ANDS?” I got a few answers, one of which I really like, but they&#8217;re really suitable as names for a particular service rather than the class of services.

MINT
“Mashing Identifiers and Names Together” – Debbie [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="http://ptsefton.com/?p=557"><!-- &nbsp; --></abbr>
<div>
<div>
<p>In my last post I asked <span class="spCh spChx201c">“</span><a href="http://ptsefton.com/2010/06/18/what-should-we-call-this-name-authority-vocabulary-server-linked-data-uri-factory-service-we-are-building-for-ands-2.htm">What should we call this name-authority, vocabulary-server, linked-data URI factory service we are building for ANDS?</a><span class="spCh spChx201d">”</span> I got a few answers, one of which I really like, but they&#8217;re really suitable as names for a particular service rather than the <strong>class</strong> of services.</p>
<dl>
<dt>MINT</dt>
<dd><span class="spCh spChx201c">“</span>Mashing Identifiers and Names Together<span class="spCh spChx201d">”</span> <span class="spCh spChx2013">–</span> Debbie Campbell of the NLA.</dd>
<dt>AWESOME</dt>
<dd>Dorothea Salo suggested this without a gloss. I&#8217;m not sure if she realised the tie-in with AWE <span class="spCh spChx2013">–</span> <a href="http://ptsefton.com/2010/04/25/repositories-post-2010-embracing-heterogeneity-in-awe-the-academic-working-environment.htm">The Academic Work(ing) Environment</a> which is a working title for the technology stack we use, and which is the theme of my talk for Open Repositories 2010. So with my, um, awesome, DIY backronym skillz it&#8217;s:</dd>
<dd><span class="spCh spChx201c">“</span>Academic Work Environment Semantics for Open Materials Exchange<span class="spCh spChx201d">”</span></dd>
<dt>CATNIP</dt>
<dd><span class="spCh spChx201c">“</span>Collating All Terms Name Identifier Project<span class="spCh spChx201d">”</span>. This is Andew Treloar&#8217;s [update: fixed spelling] attempt. Not really in the same class as ARROW, DART and ARCHER. </dd>
</dl>
<p>But still, the <strong>class</strong> of thing I was describing doesn&#8217;t seem to have a name, and nobody at the round table had a suggestion. I am now thinking that it should be called an <strong>Linked Authority Control Service</strong>. That is, it&#8217;s a thing to do <a href="http://en.wikipedia.org/wiki/Authority_control">authority control,</a> in a linked-data, semantic-web way.</p>
<p>But what call the instance of the service?</p>
<p>Well, the naming convention we&#8217;re developing for <a href="http://fascinator.usq.edu.au/">The Fascinator</a>, the software construction-set which will power the service is to have <span class="spCh spChx201c">“</span>editions<span class="spCh spChx201d">”</span>, so the thing you would download and install would be called something like <span class="spCh spChx201c">“</span>The Fascinator, Linked Authority Control Edition<span class="spCh spChx201d">”</span> which you might install alongside <span class="spCh spChx201c">“</span>The (forthcoming) Fascinator, Institutional Repository Edition<span class="spCh spChx201d">”</span> and set up to integrate with instances of <span class="spCh spChx201c">“</span>The Fascinator, Desktop Edition<span class="spCh spChx201d">”</span> sprinkled around the institution, picking up data and related publications at the source, from people&#8217;s local drives, labs, instruments etc.</p>
<p>Once you&#8217;ve installed it and you need a name to call it by, I don&#8217;t think you can go past <span class="spCh spChx201c">“</span><strong>The Mint</strong><span class="spCh spChx201d">”</span>, so thanks Debbie, I&#8217;ll get you something minty next time I see you.</p>
<p>Just think what we can do with our own linked-data mint. We can control the linked-data economy. If we wanted to cause inflation by minting new URI&#8217;s for John Smiths without having enough actual John Smiths to back them up then we, like, could. Not that we&#8217;d use our power for evil or nothing.</p>
<p>Come to think of it, if I could mint another <em>Alan</em> <em>James</em> Smith I would.</p>
<p class="center">Copyright Peter Sefton, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. &lt;<a onclick="javascript:window.top.open(&quot;http://creativecommons.org/licenses/by-sa/2.5/au/&quot;);return false;" href="http://creativecommons.org/licenses/by-sa/2.5/au/">http://creativecommons.org/licenses/by-sa/2.5/au/</a>&gt;</p>
<p class="center"><span class="Default_20_Paragraph_20_Font"><span style="country: US; language: en;"><span class="T1"><a name="HTTP:::DBPEDIA.ORG:SNORQL:?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%"></a><img class="fr1" style="border: 0px; vertical-align: top;" src="http://ptsefton.com/wp-content/uploads/2010/06/m40ca94ba.png" alt="HTTP://DBPEDIA.ORG/SNORQL/?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%" width="88" height="31" /></span></span></span></p>
<p class="center">This post was written in OpenOffice.org, using templates and tools provided by the <a onclick="javascript:window.top.open(&quot;http://ice.usq.edu.au/&quot;);return false;" href="http://ice.usq.edu.au/">Integrated Content Environment</a> project and published to WordPress using <a onclick="javascript:window.top.open(&quot;http://fascinator.usq.edu.au/desktop/desktop.htm&quot;);return false;" href="http://fascinator.usq.edu.au/desktop/desktop.htm">The Fascinator</a>.</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2010/06/22/names-but-not-ids-for-name-authority-services.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What should we call this name-authority, vocabulary-server, linked-data URI factory service we are building for ANDS?</title>
		<link>http://ptsefton.com/2010/06/18/what-should-we-call-this-name-authority-vocabulary-server-linked-data-uri-factory-service-we-are-building-for-ands-2.htm</link>
		<comments>http://ptsefton.com/2010/06/18/what-should-we-call-this-name-authority-vocabulary-server-linked-data-uri-factory-service-we-are-building-for-ands-2.htm#comments</comments>
		<pubDate>Fri, 18 Jun 2010 02:31:07 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/2010/06/18/what-should-we-call-this-name-authority-vocabulary-server-linked-data-uri-factory-service-we-are-building-for-ands-2.htm</guid>
		<description><![CDATA[

  

Today I am at the National Library, at the &#8220;Names Round Table&#8221;. I&#8217;m part of the ARDCPIPAG, which stands for &#8220;Australian Research Data Commons Party Identifier Advisory Group&#8221;. We&#8217;re advising a team at the library who are building party-identifier services for researchers and research institutions in Australia.
At my work, at ADFI we&#8217;re working [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="http://ptsefton.com/2010/06/18/what-should-we-call-this-name-authority-vocabulary-server-linked-data-uri-factory-service-we-are-building-for-ands-2.htm"><!-- &nbsp; --></abbr>
<div>
<div class="page-toc">  </div>
<div>
<p>Today I am at the National Library, at the <span class="spCh spChx201c">&#8220;</span>Names Round Table<span class="spCh spChx201d">&#8221;</span>. I&#8217;m part of the ARDCPIPAG, which stands for <span class="spCh spChx201c">&#8220;</span>Australian Research Data Commons Party Identifier Advisory Group<span class="spCh spChx201d">&#8221;</span>. We&#8217;re advising a team at the library who are <a href="https://wiki.nla.gov.au/pages/viewpage.action?pageId=17039683">building party-identifier services for researchers and research institutions in Australia</a>.</p>
<p>At my work, at <a href="http://usq.edu.au/adfi">ADFI</a> we&#8217;re working on developing specifications for Metadata stores for ANDS. The first cab off the rank is an application architecture developed with The University of Newcastle and Swinburne University. I posted about this earlier this year. ANDS agreed that it would be a good idea to take a linked data approach to the design of the application. Linked Data is a non-threatening way of talking about the The Semantic Web, which is definitely coming real soon now. No, really, 2010 will be the year, or at the very outside 2012.</p>
<p>Or 2013.</p>
<div class="slide">
<h1>The linked data rules (from Tim Berners-Lee)</h1>
<blockquote class="bq"><p>Like the web of hypertext, the web of data is constructed with documents on the web. However, &#160;unlike the web of hypertext, &#160;where&#160;links are relationships&#160;anchors in hypertext documents written in&#160;<span style="font-size:10pt; "><span class="T1">HTML</span></span>, for data they links&#160; between arbitrary things described by&#160;<span style="font-size:10pt; "><span class="T1">RDF</span></span>,. &#160;The&#160;<span style="font-size:10pt; "><span class="T1">URI</span></span>s identify any kind of object or&#160; concept. &#160; But for&#160;<span style="font-size:10pt; "><span class="T1">HTML</span></span>&#160;or&#160;<span style="font-size:10pt; "><span class="T1">RDF</span></span>, the same expectations apply to make the web grow:</p>
<p>1. <b>Use&#160;URIs</b> as names for things</p>
<p>2. <b>Use&#160;HTTP&#160;URIs</b> so that people can look up those names.</p>
<p>3. When someone looks up a&#160;<span style="font-size:10pt; "><span class="T1">URI</span></span>, <b>provide useful information, using the standards (RDF, SPARQL)</b></p>
<p>4. <b>Include links to other&#160;</b><b style="font-size:10pt; "><span>URIs</span></b>. so that they can discover more things.</p>
<p>Simple. &#160;In fact, though, a surprising amount of data isn&#8217;t linked in 2006, because of problems with one or more of the steps. &#160;This article discusses solutions to these problems, details of implementation, and factors affecting choices about how you publish your data.</p>
<p><a href="http://www.w3.org/DesignIssues/LinkedData.html">http://www.w3.org/DesignIssues/LinkedData.html</a></p>
</blockquote>
</div>
<p>Our metadata stores work is covered in <a href="http://delicious.com/ptsefton/andsmetadatastores">blog posts</a> here. </p>
<p>My presentation today focussed on the name-authority part of the architecture and looked at the process of establishing name-identities at an institutional level <b>before</b> joining in a broader federation. I asked the audience, what is the name we should use for this class of service? (We&#8217;re still looking for a name for the metadata-stores app as well, something better than <span class="spCh spChx201c">&#8220;</span>The Fascinator, Research Metadata Store Edition, Pro<span class="spCh spChx201d">&#8221;</span> or the current working title of <span class="spCh spChx201c">&#8220;</span>Ingect<span class="spCh spChx201d">&#8221;</span>.</p>
<div class="slide">
<h1>Architecture &#8211; <span class="spCh spChx201c">&#8220;</span>Ingect<span class="spCh spChx201d">&#8221;</span> (working title)</h1>
<p class="P1"><a name="Picture_8" /><img alt="Picture 8" class="fr1" height="392" src="http://ptsefton.com/wp-content/uploads/2010/06/m74901623_554x3921.jpeg" style="border:0px; vertical-align: top" width="554" /></p>
</div>
<p />
<div class="slide">
<h1>What does a Linked Data Approach mean for a metadata stores project?</h1>
<ol class="lin" style="list-style: decimal;">
<li>
<p><b>No more typing name-strings</b> into web forms.</p>
</li>
<li>
<p><b>Agreed names/URIs </b>for things like resource-types. </p>
</li>
<li>
<p><b>Sorting out URIs</b> for things so that (unlike with Institutional Repositories) we can:</p>
<ol class="li-lower-roman" style="list-style: lower-roman;">
<li>
<p><b>Agree on terms</b> before we start.</p>
</li>
<li>
<p><b>Match-up terms later</b> if we don&#8217;t reach agreement.</p>
</li>
</ol>
</li>
</ol>
</div>
<p />
<p>In our forthcoming collaboration with the UoX we wanted to make sure that when we named the things in the institution with the role of &#8216;research&#8217; or &#8216;owner&#8217; of data we used URIs. In ANDS speak, these things are &#8216;parties&#8217;. Some of the are people, some are institutions, organisations, or organisational units.</p>
<p />
<div class="slide">
<h1>Goal: Establish URIs for person-parties at UoX before we start</h1>
<p>So, a simple matter of:</p>
<ul class="lib">
<li>
<p><b>Extracting name data</b> from the existing institutional repository<br />(the UoX model is to build their metadata store so collections are described in the IR)</p>
</li>
<li>
<p><b>Disambiguating name-strings</b>, for the usual reasons; there is typically more than one string used to refer to the same person and often the same string used to refer to more than one person. Sorry, party.</p>
</li>
<li>
<p><b>Establishing new URIs</b> for each party that the UoX cares about (they don&#8217;t care about parties from UoY). </p>
</li>
<li>
<p><b>Injecting the URIs back in to the IR</b>.</p>
</li>
</ul>
<p class="P1" /></div>
<p />
<div class="slide">
<h1>Establishing IDs? How?</h1>
<p>We considered these options.</p>
<ol class="lin" style="list-style: decimal;">
<li>
<p><b>Use People Australia</b> / ARDC-PIP services. (It was a bit early)</p>
</li>
<li>
<p><b>Use Nicnames</b>  (plus).</p>
</li>
<li>
<p><b>Use a combination</b> of the above.</p>
</li>
</ol>
</div>
<p />
<p />
<div class="slide">
<h1>We chose option 4</h1>
<p>Build a name-authority server inspired by  and informed by the NicNames work.</p>
<p>Why?</p>
<ol class="lin" style="list-style: decimal;">
<li>
<p>Can ship as an integrated part of the broader application we are building, in the same language (Java).</p>
<ol class="li-lower-roman" style="list-style: lower-roman;">
<li>
<p><b>Easy-install</b> under the same web container as Fedora and Solr.</p>
</li>
<li>
<p><b>Same configuration files</b> as the <span class="spCh spChx201c">&#8220;</span>Ingect<span class="spCh spChx201d">&#8221;</span> application.</p>
</li>
</ol>
</li>
<li>
<p>Having a local service means we can use:</p>
<ol class="li-lower-roman" style="list-style: lower-roman;">
<li>
<p><b>Private data such as staff-IDs</b> internal to the application.</p>
</li>
<li>
<p><b>Local URI-schemes</b> for local things, such as internal projects.</p>
</li>
</ol>
</li>
<li>
<p>Our service will <b>deal with JSON formatted , not RDF</b> to make it easy for web-interface designers.</p>
</li>
</ol>
</div>
<p />
<div class="slide">
<h1>The process <span class="spCh spChx2013">&#8211;</span> disambiguating names</h1>
<ol class="lin" style="list-style: decimal;">
<li>
<p>Import all the name-string/publication pairs from the IR into a new repository (in EAC format) .</p>
<p>That&#8217;s one for each name on a paper. Records will be presented as citations keyed by a name.</p>
<p><code>name:&lt;name-string&gt; id:&lt;pID&gt; title:&lt;dc:title&gt; (subject:&lt;dc:subject&gt;)*</code></p>
</li>
<li>
<p>Import a set of canonical names we care about into the new repository.</p>
</li>
<li>
<p>Turn all the canonical names from a local directory into master-records/name-packages.</p>
</li>
<li>
<p>Allow a data librarian to drag all the name-strings into the name-packages.</p>
<blockquote class="bq"><p><code>[Jane Hunter CONTAINER RECORD</code></p>
<p><code>Hunter, J: </code><a href="../../../../../../list/author_id/10933/"><code><span style="color:#b712aa; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-decoration: line-through; text-decoration: underline; text-transform:none; "><span class="T18">Gerber, A.</span></span></code></a><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T12">&#160;and&#160;</span></span></code><a href="../../../../../../list/author_id/3813/"><code><span style="color:#1e88ce; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-decoration: line-through; text-decoration: underline; text-transform:none; "><span class="T23">Hunter, J.</span></span></code></a><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T12">&#160;(</span></span></code><a href="../../../../../../list/year/2008/"><code><span style="color:#1e88ce; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-decoration: line-through; text-decoration: underline; text-transform:none; "><span class="T23">2008</span></span></code></a><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T12">).&#160;</span></span></code><a href="../../../../../../view/UQ:187804"><code><span style="color:#1e88ce; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-decoration: line-through; text-decoration: underline; text-transform:none; "><span class="T23">A compound object authoring and publishing tool for literary scholars based on the IFLA-FRBR</span></span></code></a><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T12">. In: C. Rusbridge, A. Trefethen and D. Berry, Proceedings of: 4th International Digital Curation Conference &quot;Radical Sharing: Transforming Science?&quot;.&#160;</span></span></code><code><i style="color:#000000; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span>4th International Digital Curation Conference (IDCC 2009)</span></i></code><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T12">, Edinburgh, Scotland, (1-10). 1-3 December 2008.</span></span></code></p>
<p><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-decoration: line-through; text-decoration: underline; text-transform:none; "><span class="T7">Hunter, Jane: </span></span></code><a href="../../../../../../list/author_id/3813/"><code><span style="color:#b712aa; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-decoration: line-through; text-decoration: underline; text-transform:none; "><span class="T18">Hunter, Jane</span></span></code></a><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T12">&#160;(</span></span></code><a href="../../../../../../list/year/2001/"><code><span style="color:#1e88ce; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-decoration: line-through; text-decoration: underline; text-transform:none; "><span class="T23">2001</span></span></code></a><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T12">).&#160;</span></span></code><a href="../../../../../../view/UQ:7845"><code><span style="color:#1e88ce; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-decoration: line-through; text-decoration: underline; text-transform:none; "><span class="T23">Adding Multimedia to the Semantic Web: Building an MPEG-7 Ontology</span></span></code></a><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T12">. In:</span></span></code><code><i style="color:#000000; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span>International Semantic Web Working Symposium (SWWS)</span></i></code><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T12">, Stanford University, California, (). July, 2001.</span></span></code></p>
<p><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-decoration: line-through; text-decoration: underline; text-transform:none; "><span class="T7">Hunter, J.: </span></span></code><a href="../../../../../../list/author_id/3813/"><code><span style="color:#b712aa; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-decoration: line-through; text-decoration: underline; text-transform:none; "><span class="T18">Hunter J.</span></span></code></a><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T12">&#160;(</span></span></code><a href="../../../../../../list/year/1999/"><code><span style="color:#1e88ce; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-decoration: line-through; text-decoration: underline; text-transform:none; "><span class="T23">1999</span></span></code></a><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T12">).&#160;</span></span></code><a href="../../../../../../view/UQ:151497"><code><i style="color:#1e88ce; font-variant:normal; font-weight:normal; letter-spacing:normal; text-decoration: line-through; text-decoration: underline; text-transform:none; "><span>An "Improved" Proposal for an MPEG-7 DDL</span></i></code></a><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T12">. ISO/IEC JTC1/SC29/WG11, 47th MPEG Meeting M4518, .</span></span></code></p>
<p><code><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T12">...</span></span></code></p>
<p><code>]</code></p>
</blockquote>
</li>
</ol>
<h1><code /></h1>
</div>
<div class="slide" />
<p />
<div class="slide">
<h1>Process</h1>
<p><span style="display: block"><a name="graphics1" /><img alt="graphics1" class="fr2" height="899" src="http://ptsefton.com/wp-content/uploads/2010/06/m13df4043_635x8991.jpeg" style="border:0px; vertical-align: top" width="635" /></span></p>
<p />
<p /></div>
<p />
<p />
<div class="slide">
<h1>But wait! There's more!</h1>
<p>This new module, which is called, um <span class="spCh spChx2026">&#8230;</span> will:</p>
<ul class="lib">
<li>
<p>'Mint' a new URI whenever someone types a string (in desperation).</p>
</li>
<li>
<p>Provide locally available, fast services for accessing any ontology. Eg:</p>
<ul class="lib">
<li>
<p>FOR codes.</p>
</li>
<li>
<p><a href="http://www.geonames.org/">Geonames</a>.</p>
</li>
</ul>
</li>
</ul>
</div>
<p />
<p />
<div class="slide">
<h1>One final thing</h1>
<p>(And this bit is not ANDS funded)</p>
<p>We want to make it possible to encode and decode web-semantics in a URI. Eg:</p>
<p><a href="http://ontologize.me/meta/?r=http://purl.org/dc/terms/creator&amp;o=http://nla.gov.au/nla.party-541658">http://ontologize.me/meta/?r=http://purl.org/dc/terms/creator&amp;o=http://nla.gov.au/nla.party-541658</a> </p>
<p class="P2"><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T4">I can use this to assert that <span class="spCh spChx201c">&#8220;</span></span></span><a href="http://ontologize.me/meta/?r=http://purl.org/dc/terms/creator&amp;o=http://nla.gov.au/nla.party-541658%20">ptsefton</a><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T4"><span class="spCh spChx201d">&#8221;</span> is the author of this resource.</span></span></p>
<p class="P2"><span style="color:#000000; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; text-transform:none; "><span class="T4">Would ANDS and/or the NLA consider supporting services like this?</span></span></p>
</div>
<p />
<p>Oh, and nobody at the ARDCPIP round table had an answer for me about what we call this class of name-authority linked-data-endpoint-factory application.</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2010/06/18/what-should-we-call-this-name-authority-vocabulary-server-linked-data-uri-factory-service-we-are-building-for-ands-2.htm/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Cheap laugh #1. Suggest opening up courseware just a little.</title>
		<link>http://ptsefton.com/2010/05/17/cheap-laugh-1-suggest-opening-up-courseware-just-a-little.htm</link>
		<comments>http://ptsefton.com/2010/05/17/cheap-laugh-1-suggest-opening-up-courseware-just-a-little.htm#comments</comments>
		<pubDate>Mon, 17 May 2010 04:33:22 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/2010/05/17/cheap-laugh-1-suggest-opening-up-courseware-just-a-little.htm</guid>
		<description><![CDATA[



The other day I attended a Moodle developers&#8217; co-op group at USQ.  Over lunch we were kicking around ideas for what USQ could do to improve our learning and teaching.
Two things I said sent one long-standing member of staff into gales of laughter.
First, I said, we should let all the students access and search [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="http://ptsefton.com/2010/05/17/cheap-laugh-1-suggest-opening-up-courseware-just-a-little.htm"><!-- &nbsp; --></abbr>
<div>
<div class="page-toc" />
<div>
<p>The other day I attended a Moodle developers&#8217; co-op group at USQ.  Over lunch we were kicking around ideas for what USQ could do to improve our learning and teaching.</p>
<p>Two things I said sent one long-standing member of staff into gales of laughter.</p>
<p>First, I said, we should let all the students <b>access and search all the courseware</b> (with the exception of course readings that are licensed for enrolled students only). That way they (a) might find answers to their questions (b) catch up and revise, even if they didn&#8217;t study the prerequisite subject with us and (c) find new courses they&#8217;d like to do. Hilarious, apparently. (This was in the context of people wanting to add access to pre-requisite materials into courses, for students who needed to revise or catch-up, which apparently involves lecturers bootlegging course materials instead of the rational approach, which would be to just link to the other course).</p>
<p>Second, encouraged by this comedic success, I said that we should also offer a bonus to any staff member who could <b>source and adapt open courseware</b> instead of writing and/or maintaining a USQ study book, thus saving USQ money. Same reaction.</p>
<p>Getting a few laughs is good, as I play to a very tough audience at home, and there&#8217;s very little levity in the serious business of writing software<span class="footnote" style="vertical-align: super;"><a class="footnote" href="#ftn1" name="ftn1-text" title="1Duncan Dickinson has recently introduced &apos;stand up&apos; meetings to the ADFI Software R &amp; D lab. I was hoping that would turn out to be something Seinfeldish, but sadly I was thinking of the wrong kind of standup, and the meetings aren&apos;t about nothing, either.">1</a></span>. </p>
<p>I&#8217;m interested if others think this is funny. Here it is once more:</p>
<ol class="lin" style="list-style: decimal;">
<li>
<p>I reckon if the <b>default search </b>in our Moodle learning management system <b>showed materials from other courses</b> than the ones you happen to be enrolled in then that should help drive return business (and it if helped a student to decide NOT to enrol then that&#8217;s probably better than having them show up and discontinue or fail). This would be dead simple to implement, and would have the added benefit that our own staff might be able to areas of overlap or synergies between courses. (My team is working on a proof-of-concept repository which would be able to support this).</p>
</li>
<li>
<p>And while we have been very slow as an organisation to publish our own materials under open licenses, surely the<b> benefits of cutting course development costs</b> would be worth looking at. After all, our main business is not selling courseware. USQ has (or should that be &#8216;is&#8217;?) a &#8216;relationship brand&#8217;, Yes we&#8217;re famous for our courseware, but more famous for <a href="http://www.usq.edu.au/aboutusq/strategy">Providing the highest quality educational experiences to students irrespective of their location or lifestyle</a>. It&#8217;s the <b>experiences that matter,</b> and the less we spend writing the same Basket Weaving 101 course as every other distance educator on the planet the more we could devote to the experiential side.</p>
</li>
</ol>
<p>(Depending on how this post goes I might see if  I can work up a one-man show on USQ and Open Courseware).</p>
<p />
<p />
<p class="center">Copyright Peter Sefton, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. &lt;<a href="http://creativecommons.org/licenses/by-sa/2.5/au/" onclick="javascript:window.top.open(&quot;http://creativecommons.org/licenses/by-sa/2.5/au/&quot;);return false;">http://creativecommons.org/licenses/by-sa/2.5/au/</a>&gt;</p>
<p class="center"><span class="Default_20_Paragraph_20_Font"><span style="country:US; language:en; "><span class="T1"><a name="HTTP:::DBPEDIA.ORG:SNORQL:?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%" /><img alt="HTTP://DBPEDIA.ORG/SNORQL/?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%" class="fr1" height="31" src="http://ptsefton.com/wp-content/uploads/2010/05/m40ca94ba2.png" style="border:0px; vertical-align: top" width="88" /></span></span></span></p>
<p class="center">This post was written in OpenOffice.org, using templates and tools provided by the <a href="http://ice.usq.edu.au/" onclick="javascript:window.top.open(&quot;http://ice.usq.edu.au/&quot;);return false;">Integrated Content Environment</a> project and published to WordPress using <a href="http://fascinator.usq.edu.au/desktop/desktop.htm" onclick="javascript:window.top.open(&quot;http://fascinator.usq.edu.au/desktop/desktop.htm&quot;);return false;">The Fascinator</a>.</p>
<hr />
<div style="font-size: .9em;"><span class="footnote-defined"><a href="#ftn1-text" name="ftn1">1</a>Duncan Dickinson has recently introduced &#8217;stand up&#8217; meetings to the ADFI Software R &amp; D lab. I was hoping that would turn out to be something Seinfeldish, but sadly I was thinking of the wrong kind of standup, and the meetings aren&#8217;t<a href="http://en.wikipedia.org/wiki/Seinfeld#Theme"> about nothing</a>, either.</span></div>
</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2010/05/17/cheap-laugh-1-suggest-opening-up-courseware-just-a-little.htm/feed</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>ICE and word processor / HTML interop, the ugly, uglier, ugliest</title>
		<link>http://ptsefton.com/2010/05/14/ice-and-word-processor-html-interop-the-ugly-uglier-ugliest-2.htm</link>
		<comments>http://ptsefton.com/2010/05/14/ice-and-word-processor-html-interop-the-ugly-uglier-ugliest-2.htm#comments</comments>
		<pubDate>Fri, 14 May 2010 03:52:45 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/2010/05/14/ice-and-word-processor-html-interop-the-ugly-uglier-ugliest-2.htm</guid>
		<description><![CDATA[We have started having post-Friday-morning-tea information sessions in the ADIF technical team. Ron Ward kicked the series off with a great overview of Javascript which is both the world’s most widely deployed and worst misunderstood language and Greg Pendlebury introduced us to some of the wonders of spelling suggestions in Apache Solr indexes (he works [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="http://ptsefton.com/2010/05/14/ice-and-word-processor-html-interop-the-ugly-uglier-ugliest-2.htm"><!-- &nbsp; --></abbr>
<div>
<div class="page-toc" />
<div>
<p>[Update: finished an unfinished sentence, added a link]</p>
<p>We have started having post-Friday-morning-tea information sessions in the ADIF technical team. Ron Ward kicked the series off with a great overview of Javascript which is both the world&#8217;s most widely deployed and worst misunderstood language and Greg Pendlebury introduced us to some of the wonders of spelling suggestions in Apache Solr indexes (he works nights on the VuFind project).</p>
<p>My contribution is this presentation on ICE, below, looking at how it converts word processing documents into HTML, and some of the complexities that the ICE code has to deal with. These complexities are technical, horribly so in some cases. Many of the complexities are present  not for good technical reasons but because of business and politics; competing vendors and competing standards have left us a really nasty mess to deal with. </p>
<p>My talk started with some history. When I first got my hands on a WYSIWYG word processor, around 1988 or so, the first thing I did was go through all the menus and work out what everything was <i>for</i>. For my honours  thesis in 1990 I naturally used styles for the headings, so I could compile a table of context and the automatic numbering feature for figures, EndNote for references and so on. I was far too lazy to do any of that by hand. </p>
<p>Twenty years later, those features are still in the word processors we all use (although Google Docs is a disappointment), and we now have<a href="http://zotero.org/"> Zotero</a> for reference management, to which we in ADFI made a modest contribution. But I still meet senior academics who have no idea how to use styles to make a table of contents, don&#8217;t use reference management systems and consequently are unable to help their students to work efficiently. </p>
<p>There a couple of interesting questions, which I&#8217;m not going to answer here, but which are important to the research planning process at ADFI:</p>
<ol class="lin" style="list-style: decimal;">
<li>
<p>What are the word processors of the future going to look like? It&#8217;s clear that we will still need to &#8216;process words&#8217; and some of them will still be strung together in long sequences, like books and theses, but other more dynamic genres are emerging. I&#8217;ve written here about <a href="http://ptsefton.com/2009/11/02/a-bit-more-on-wave-as-a-scholarly-html-editor.htm">Google Wave</a> (which seems to have receded leaving behind only plastic bottles and dead fish) and various other new approaches to editing. I don&#8217;t know what&#8217;s coming but I am sure it will be collaborative.</p>
</li>
<li>
<p>How can we help people to use these programs for processing words?</p>
</li>
<li>
<p>Shouldn&#8217;t our training programmes for academics be more like <span class="spCh spChx201c">&#8220;</span>Word processing techniques for academic writing, using Word 2007<span class="spCh spChx201d">&#8221;</span>  than the current <span class="spCh spChx201c">&#8220;</span>Word 2007 basics<span class="spCh spChx201d">&#8221;</span>? How would we go about making that change?</p>
</li>
<li>
<p>More generally, how can we encourage the kind of laziness that prompts some of us to discover the bleeding obvious things that are under out noses, by doing things like RTFM? And how do we create a culture where that know-how can spread? </p>
</li>
<li>
<p>And what do we do when what we need is not there right under our noses? For example we need the ability to produce Scholarly HTML from our documents but current tools just don&#8217;t do that. How can we get the academic community and software vendors, including us in the open source world, in-sync to fill in the gaps? </p>
<p>Next week I have a call with Microsoft Research, and I&#8217;ll be talking about getting Word to be able to produce web-ready, journal ready, semantically rich documents, with useful metadata and seamless links to data and interop with other programs like OpenOffice.org. Note, I&#8217;m not looking at this from the point of view of a &#8217;standards war&#8217;; I&#8217;m trying to get thing<b> done</b>.</p>
</li>
</ol>
<p>When I ran through the slides below today we came back to these questions. We didn&#8217;t get any answers in half an hour, but my summary of what we should be doing is:</p>
<ol class="lin" style="list-style: decimal;">
<li>
<p>Finally get around to setting up one or more cohorts of postgrad students with an end-to end system for managing their thesis starting from day one, and including a facilty for annotation/comment by supervisor, cohort and finally examiners.</p>
</li>
<li>
<p>Work out how to educate (not train) the trainers about some of the principles behind academic use of tools.</p>
</li>
<li>
<p>Work with our Learning and Teaching Support Unit to start changing the corporate training culture away from teaching specific tools with a  generic focus to empowering academic users to do academic things.</p>
</li>
</ol>
<p />
<div class="slide">
<h1>What and why?</h1>
<p><a href="http://ice.usq.edu.au/">ICE </a>is, at it&#8217;s core a way of turning word processing documents into <b>proper </b>web pages.  </p>
<p>Word processors are:     </p>
<ol class="lin" style="list-style: decimal;">
<li>
<p><b>Ubiquitous in academia</b>. Still. Even with the advent of Moodle&#8217;s built in HTML editor and Google Wave.</p>
</li>
<li>
<p><b>Much maligned</b>, particularly by computer scientists, engineers and expert tinkerers who have much better ways of making web pages and printed documents from the same source (until you ask them to show you).</p>
</li>
<li>
<p><b>Full of useful features</b> for long documents, including:</p>
<ol class="li-lower-roman" style="list-style: lower-roman;">
<li>
<p>Drawing tools  </p>
</li>
<li>
<p>Data visualisation tools (charting)</p>
</li>
<li>
<p>Reference and bibliography management (usually best with a plugin)</p>
</li>
<li>
<p>Change-tracking (and some have  synchronous collaboration)</p>
</li>
<li>
<p>HTML export&#8230;</p>
</li>
</ol>
</li>
</ol>
</div>
<p />
<p />
<div class="slide">
<h1>HTML export?</h1>
<p>OK, so the HTML export in word processors is universally crappy.</p>
<p>This is one of the great mysteries of the universe.</p>
<p>Actually it&#8217;s not <span class="spCh spChx2013">&#8211;</span> Microsoft made a concerted attempt in the late nineties to make their office suite part of the web, and to add their own extensions to HTML By 2000 they were the only major player left in the word processing world and I guess they thought they had a shot at owning the web. It has taken nearly 10 years for us to get back to something like a multiple-horse race. Ironically the widely ridiculed Word 2000 save as web format was actually very easy to process into XML and to use to produce good quality web pages; it took years to generate the current OOXML approach which is much less useful. (<a href="http://www.xml.com/pub/a/2004/12/08/word-to-xml.html">I wrote about his for XML.com</a>).</p>
<p>It is a mystery why Google Docs is so bad, though. I had corresponded with one of the people from Google Docs about <a href="http://ptsefton.com/blog/2007/10/03/10-01-45.649977/">making it better</a>, but he lost interest.</p>
</div>
<p />
<p />
<div class="slide">
<h1>In 1995&#8230; </h1>
<p>The technical writing team at TAFE NSW information systems division decided to put our technical bulletins on the web. (We started this work before HTML even had tables, BTW).</p>
<ul class="lib">
<li>
<p>We <a href="http://delicious.com/ptsefton/ptsefton+usestyles">used word processing styles</a>, because that was (and still is) what sensible, rational people do.</p>
</li>
<li>
<p>I rolled my own process based on: RTF via Rainbow maker to SGML, to ESIS via nsgmls to HTML via Perl + SGMLSpm.</p>
<p>The current ICE conversion system is quite similar <span class="spCh spChx2013">&#8211;</span> uses an event driven XML (SAX process and a scripting language, it&#8217;s just that word processors now produce their own XML.</p>
<p>In between I have used <a href="http://en.wikipedia.org/wiki/OmniMark">Omnimark</a> (which was just plain weird), <a href="http://www.jclark.com/dsssl/">DSSSL</a> (which was Scheme, which is a kind of Lisp), XSLT (dumbed-down DSSSL with XML syntax) which we dumped in ICE for performance/maintainability reasons, and the free-as-in-beer <a href="http://99-bottles-of-beer.net/language-ace-15.html">ACE</a> language that came with TeraText (from RMIT).</p>
</li>
</ul>
<p>In those days:</p>
<ul class="lib">
<li>
<p>Image handling was tricky <span class="spCh spChx2013">&#8211;</span> it was hard to convert embedded objects to web formats.</p>
</li>
<li>
<p>I thought that the next version of Word would ship with syle-to-web mapping and I could stop working on this distraction<span class="footnote" style="vertical-align: super;"><a class="footnote" href="#ftn1" name="ftn1-text" title="* I was just taking a short break from computational linguistics which turned out to be easier than making web pages, in retrospect.">*</a></span>.</p>
</li>
</ul>
</div>
<p />
<p />
<div class="slide">
<h1>Styles</h1>
<p>In 2010, we&#8217;re writing software at ADFI that uses very similar styles to the ones used in 1995 at TAFE, which I refined for use at Standards Australia then NextEd before starting the ICE project at USQ.  The ICE style-set is <a href="http://eprints.usq.edu.au/697/">defined in my 2006 paper</a>.</p>
</div>
<p />
<p />
<p />
<div class="slide">
<h1>I promised you ugliness</h1>
<p>Styles are sensible and useful. But:</p>
<ul class="lib">
<li>
<p><b>There are no standards</b>. The only thing close to a standard is the use of Title, Heading 1 <span class="spCh spChx2026">&#8230;</span> Heading n. How do you handle a mixture of numbered headings and non-numbered headings. What about chapter numbering?</p>
</li>
<li>
<p><b>There is no standard even for a single vendor for an ordinary paragraph</b>: Normal, Default Default Text etc.</p>
</li>
<li>
<p>Even individual <b>vendors don&#8217;t stick to or ship a set of styles</b> for things like lists or quotes in their sample templates.</p>
</li>
<li>
<p>Word and Open Office and Google Docs <b>all</b> produce <b>different</b> rubbish when asked to make web pages and make almost no use of styles in doing so.</p>
</li>
</ul>
</div>
<p class="center" />
<div class="slide">
<h1>And worse&#8230;</h1>
<p>Like here are some of the favourites from the team:</p>
<ul class="lib">
<li>
<p>There are some circumstances under which Word 2007 does not export graphics when saving as HTML (happened to Ron <span class="spCh spChx2013">&#8211;</span> not sure if it is reproducible).</p>
</li>
<li>
<p>Page breaks and section breaks probably cost society a lot more than terrorism and smoking combined.</p>
</li>
<li>
<p>We  have not forgotten or forgiven the paper-clip; have they apologised yet?</p>
</li>
<li>
<p>Word used to (and still does) get into states where things fly around randomly. </p>
</li>
</ul>
<p>Look, if O&#8217;Reilly publishes a book entitled <span class="spCh spChx201c">&#8220;</span>&lt;name-of-your-product&gt; Annoyances<span class="spCh spChx201d">&#8221;</span> you have a problem. If nobody bothers, as with OpenOffice,org then you probably have a bigger problem.</p>
<p /></div>
<p />
<p />
<div class="slide">
<h1>And worse&#8230;</h1>
<p>Here&#8217;s a list I created using the bullets and numbering toolbar in Writer:</p>
<p><a name="graphics1" /><img alt="graphics1" class="fr1" height="90" src="http://ptsefton.com/wp-content/uploads/2010/05/6f671eae1.png" style="border:0px; vertical-align: middle" width="200" /></p>
<p>ODF has hierarchical list structures (one of the worst decisions ever). Now, you&#8217;d think that if we were going to have hierarchical list structures this might be, you know hierarchical, like one list embedded in another. Instead we get this :</p>
<blockquote class="bq"><p>&lt;<code>text:list xml:id="list1916064342" text:style-name="L1"&gt;</code></p>
<p><code>&lt;text:list-item&gt;&lt;text:p text:style-name="P1"&gt;Bullet list &lt;/text:p&gt;&lt;/text:list-item&gt;</code></p>
<p><code>&lt;/text:list&gt;</code></p>
<p><code>&lt;text:list xml:id="list1908078027" text:style-name="L2"&gt;</code></p>
<p><code>&lt;text:list-item&gt;</code></p>
<p><code>&lt;text:list&gt;</code></p>
<p><code>&lt;text:list-item&gt;</code></p>
<p><code>&lt;text:p text:style-name="P2"&gt;Number list &lt;/text:p&gt;&lt;/text:list-item&gt;</code></p>
<p><code>&lt;text:list-item&gt;</code></p>
<p><code>&lt;text:p text:style-name="P2"&gt;Number list&lt;/text:p&gt;&lt;/text:list-item&gt;&lt;/text:list&gt;</code></p>
<p><code>&lt;/text:list-item&gt;&lt;/text:list&gt;</code></p>
<p><code>&lt;text:list xml:id="list322315948" text:style-name="L3"&gt;&lt;text:list-item&gt;&lt;text:p text:style-name="P3"&gt;Bullet list&lt;/text:p&gt;&lt;/text:list-item&gt;&lt;/text:list&gt;</code></p>
</blockquote>
<p>Bizarre. Truly. The &#8216;main&#8217; list is three lists. That  &#8216;middle&#8217; numbered list is a single element list with no text with a two element list embedded in it. </p>
<p>I promise all I did was push the buttons on the toolbar. Pushing them in a different order gets you different craziness.</p>
</div>
<div class="slide">
<h1>Word&#8217;s OOXML has a more rational approach</h1>
<p>OOXML is flat &#8211; the list formatting is implied. Each paragraph is a paragraph and the fact that they belong in a list structure is indicated with attributes. Given that word processors have a paragraph based interface this is rational.</p>
<p><a name="graphics3" /><img alt="graphics3" class="fr1" height="93" src="http://ptsefton.com/wp-content/uploads/2010/05/201d28ad1.png" style="border:0px; vertical-align: middle" width="200" /></p>
<p>This kind of markup scares some people, but it&#8217;s very efficient:</p>
<blockquote class="bqs"><p>&lt;w:p w:rsidR=&#8221;009B3D58&#8243; w:rsidRDefault=&#8221;002D3C0B&#8221; w:rsidP=&#8221;002D3C0B&#8221;&gt;&lt;w:pPr&gt;&lt;w:pStyle w:val=&#8221;ListParagraph&#8221;/&gt;&lt;w:numPr&gt;&lt;w:ilvl w:val=&#8221;0&#8243;/&gt;&lt;w:numId w:val=&#8221;1&#8243;/&gt;&lt;/w:numPr&gt;&lt;/w:pPr&gt;&lt;w:r&gt;&lt;w:t&gt;Bullet list&lt;/w:t&gt;&lt;/w:r&gt;&lt;/w:p&gt;</p>
<p>&lt;w:p w:rsidR=&#8221;002D3C0B&#8221; w:rsidRDefault=&#8221;002D3C0B&#8221; w:rsidP=&#8221;002D3C0B&#8221;&gt;&lt;w:pPr&gt;&lt;w:pStyle w:val=&#8221;ListParagraph&#8221;/&gt;&lt;w:numPr&gt;&lt;w:ilvl w:val=&#8221;1&#8243;/&gt;&lt;w:numId w:val=&#8221;2&#8243;/&gt;&lt;/w:numPr&gt;&lt;/w:pPr&gt;&lt;w:r&gt;&lt;w:t&gt;Numbered list&lt;/w:t&gt;&lt;/w:r&gt;&lt;/w:p&gt;</p>
<p>&lt;w:p w:rsidR=&#8221;002D3C0B&#8221; w:rsidRDefault=&#8221;002D3C0B&#8221; w:rsidP=&#8221;002D3C0B&#8221;&gt;&lt;w:pPr&gt;&lt;w:pStyle w:val=&#8221;ListParagraph&#8221;/&gt;&lt;w:numPr&gt;&lt;w:ilvl w:val=&#8221;1&#8243;/&gt;&lt;w:numId w:val=&#8221;2&#8243;/&gt;&lt;/w:numPr&gt;&lt;/w:pPr&gt;&lt;w:r&gt;&lt;w:t&gt;Numbered list&lt;/w:t&gt;&lt;/w:r&gt;&lt;/w:p&gt;</p>
<p>&lt;w:p w:rsidR=&#8221;002D3C0B&#8221; w:rsidRDefault=&#8221;002D3C0B&#8221; w:rsidP=&#8221;002D3C0B&#8221;&gt;&lt;w:pPr&gt;&lt;w:pStyle w:val=&#8221;ListParagraph&#8221;/&gt;&lt;w:numPr&gt;&lt;w:ilvl w:val=&#8221;0&#8243;/&gt;&lt;w:numId w:val=&#8221;1&#8243;/&gt;&lt;/w:numPr&gt;&lt;/w:pPr&gt;&lt;w:r&gt;&lt;w:t&gt;Bullet list&lt;/w:t&gt;&lt;/w:r&gt;&lt;/w:p</p>
</blockquote>
</div>
<p />
<p class="center" />
<div class="slide">
<h1>But not that rational</h1>
<p>So <span class="spCh spChx2013">&#8211;</span> Word&#8217;s better, right? Not really. There are several ways to make lists in Word:</p>
<ul class="lib">
<li>
<p><b>Ad hoc lists</b> using the toolbar buttons which is likely the only thing casual users will discover in Word 2007 (there are no list-type styles showing in any of the ribbon galleries).</p>
</li>
<li>
<p>Using <b>anonymous multi-level list structures</b> <span class="spCh spChx2013">&#8211;</span> you can define these via the styles part of the ribbon, but not name them.</p>
</li>
<li>
<p>Using <b>named list outlines</b> which you get to from a DIFFERENT place on the ribbon (not the styles part) but which point back to styles.</p>
</li>
<li>
<p>Using <b>list styles </b><span class="spCh spChx2013">&#8211;</span> which were introduced in Word 2003 <span class="spCh spChx2013">&#8211;</span> but you know, I can&#8217;t find them in Word 2007 although apparently they&#8217;re there somewhere.</p>
</li>
</ul>
<p>Here&#8217;s the <a href="http://blogs.msdn.com/microsoft_office_word/archive/2010/01/28/the-why-behind-our-styles-and-lists-designs.aspx">attempted explanation on the Word blog</a>. Read that and I&#8217;m sure you&#8217;ll feel much better about the ribbon in Word 2007. It was<b> for your own good</b>. </p>
<p /></div>
<p />
<p />
<p />
<div class="slide">
<h1>So in ICE we try to do the right thing</h1>
<p>We try to:</p>
<ul class="lib">
<li>
<p>Produce sensible HTML using whatever cues the document contains:</p>
<ul class="lib">
<li>
<p>Styles, obviously make that very clear, but even with styles if someone has a third level list style following an ordinary paragraph ICE will not try to create some kind nested list structure or add a big margin it will produce sensible structured HTML.</p>
</li>
<li>
<p>Lately, we have been adding code to try to interpret direct formatting but this approach is never going to be as robust as using styles and it is very hard when using, say Word and Writer on the same document.</p>
</li>
</ul>
</li>
<li>
<p>Provide a toolbar which tries to generate sensible structure <span class="spCh spChx2013">&#8211;</span> eg if you hit the block-quote button in a paragraph under bullet list it will indent the quote under the list.</p>
</li>
</ul>
</div>
<p />
<p class="center" />
<p class="center">Copyright Peter Sefton, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. &lt;<a href="http://creativecommons.org/licenses/by-sa/2.5/au/" onclick="javascript:window.top.open(&quot;http://creativecommons.org/licenses/by-sa/2.5/au/&quot;);return false;">http://creativecommons.org/licenses/by-sa/2.5/au/</a>&gt;</p>
<p class="center"><span class="Default_20_Paragraph_20_Font"><span style="country:US; language:en; "><span class="T1"><a name="HTTP:::DBPEDIA.ORG:SNORQL:?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%" /><img alt="HTTP://DBPEDIA.ORG/SNORQL/?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%" class="fr2" height="31" src="http://ptsefton.com/wp-content/uploads/2010/05/m40ca94ba1.png" style="border:0px; vertical-align: top" width="88" /></span></span></span></p>
<p class="center">This post was written in OpenOffice.org, using templates and tools provided by the <a href="http://ice.usq.edu.au/" onclick="javascript:window.top.open(&quot;http://ice.usq.edu.au/&quot;);return false;">Integrated Content Environment</a> project and published to WordPress using <a href="http://fascinator.usq.edu.au/desktop/desktop.htm" onclick="javascript:window.top.open(&quot;http://fascinator.usq.edu.au/desktop/desktop.htm&quot;);return false;">The Fascinator</a>.</p>
<hr />
<div style="font-size: .9em;"><span class="footnote-defined"><a href="#ftn1-text" name="ftn1">*</a> I was just taking a short break from computational linguistics which turned out to be easier than making web pages, in retrospect.</span></div>
</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2010/05/14/ice-and-word-processor-html-interop-the-ugly-uglier-ugliest-2.htm/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>ICE and word processor / HTML interop, the ugly, uglier, ugliest</title>
		<link>http://ptsefton.com/2010/05/14/ice-and-word-processor-html-interop-the-ugly-uglier-ugliest.htm</link>
		<comments>http://ptsefton.com/2010/05/14/ice-and-word-processor-html-interop-the-ugly-uglier-ugliest.htm#comments</comments>
		<pubDate>Fri, 14 May 2010 03:46:12 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/2010/05/14/ice-and-word-processor-html-interop-the-ugly-uglier-ugliest.htm</guid>
		<description><![CDATA[
Something went wrong and I got a duplicate post when I tried to make a correction here. See the new version.
]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="http://ptsefton.com/2010/05/14/ice-and-word-processor-html-interop-the-ugly-uglier-ugliest.htm"><!-- &nbsp; --></abbr>
<p>Something went wrong and I got a duplicate post when I tried to make a correction here.<a href="http://ptsefton.com/2010/05/14/ice-and-word-processor-html-interop-the-ugly-uglier-ugliest-2.htm"> See the new version</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2010/05/14/ice-and-word-processor-html-interop-the-ugly-uglier-ugliest.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Repositories post 2010: embracing heterogeneity in AWE, the Academic Working Environment</title>
		<link>http://ptsefton.com/2010/04/25/repositories-post-2010-embracing-heterogeneity-in-awe-the-academic-working-environment.htm</link>
		<comments>http://ptsefton.com/2010/04/25/repositories-post-2010-embracing-heterogeneity-in-awe-the-academic-working-environment.htm#comments</comments>
		<pubDate>Sun, 25 Apr 2010 01:58:38 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/2010/04/25/repositories-post-2010-embracing-heterogeneity-in-awe-the-academic-working-environment.htm</guid>
		<description><![CDATA[

[update remove reference to a project which is not yet approved]


1   Introduction
2   AWE: Meeting the grand challenges for repositories

2.1   The web and the repository &#38; The cloud and the desktop
2.2   Knowledge and technology
2.3   Wild and curated content
2.4   Linked and isolated data &#38; Ad-hoc [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="http://ptsefton.com/2010/04/25/repositories-post-2010-embracing-heterogeneity-in-awe-the-academic-working-environment.htm"><!-- &nbsp; --></abbr>
<div>
[update remove reference to a project which is not yet approved]</p>
<div class="page-toc">
<ul>
<li><a href="#id2" mce_href="#id2">1   Introduction</a></li>
<li><a href="#id3" mce_href="#id3">2   AWE: Meeting the grand challenges for repositories</a>
<ul>
<li><a href="#id4" mce_href="#id4">2.1   The web and the repository &amp; The cloud and the desktop</a></li>
<li><a href="#id5" mce_href="#id5">2.2   Knowledge and technology</a></li>
<li><a href="#id6" mce_href="#id6">2.3   Wild and curated content</a></li>
<li><a href="#id8" mce_href="#id8">2.4   Linked and isolated data &amp; Ad-hoc and long-term access</a></li>
<li><a href="#id9" mce_href="#id9">2.5   Disciplinary and institutional systems / Scholars and service providers</a></li>
<li><a href="#id10" mce_href="#id10">2.6   Ubiquitous and personalized environments</a></li>
</ul>
</li>
<li><a href="#id11" mce_href="#id11">3   Conclusion</a></li>
<li><a href="#id12" mce_href="#id12">4   References</a></li>
</ul>
</div>
<div>
<p class="P4">[This paper has been accepted for the main track of the Open Repositories Conference with very strong reviewer feedback. I'll be there.]</p>
<div class="Table16" style="width: 100%; margin: 0px; padding: 0px; text-align: left;">
<table class="Table16 mceItemTable" style="border-spacing: 0; empty-cells: show; width: 18.611cm; border-collapse: collapse;">
<colgroup>
<col style="width: 9.305cm;">
<col style="width: 9.306cm;">
</colgroup>
<tbody>
<tr>
<td class="Table16_A1" style="vertical-align: top; background-color: transparent; border: none; padding: 0.097cm;" mce_style="vertical-align: top; background-color: transparent; border: none; padding: 0.097cm;">
<p class="meta-author-name"><span class="meta-familyname">Peter Sefton</span></p>
<p class="meta-author-email"><a href="mailto:sefton@usq.edu.au" mce_href="mailto:sefton@usq.edu.au">sefton@usq.edu.au</a></p>
<p class="meta-author-affiliation">University of Southern Queensland</p>
</td>
<td class="Table16_A1" style="vertical-align: top; background-color: transparent; border: none; padding: 0.097cm;" mce_style="vertical-align: top; background-color: transparent; border: none; padding: 0.097cm;">
<p class="meta-author-name"><span class="meta-familyname">Duncan Dickinson</span></p>
<p class="meta-author-email"><span class="meta-familyname">Duncan.Dickinson@usq.edu,au</span></p>
<p class="meta-author-affiliation"><span class="meta-familyname">University of Southern Queensland</span></p>
</td>
</tr>
</tbody>
</table>
</div>
<p class="meta-conference"><span class="meta-familyname">Open Repositories July 2010, Madrid, Spain</span></p>
<p><span mce_name="strong" mce_style="font-weight: bold;" style="font-weight: bold;" class="Apple-style-span">Abstract:</span> The organizers of the fifth international conference on Open Repositories list nine polar dichotomies that represent <span class="spCh spChx201c">“</span>The Grand Integration Challenge<span class="spCh spChx201d">”</span> for the repository community/movement. In this paper we take up the challenge. We do so in the context of a program of work being undertaken at our institution to build infrastructure for the academy in general, working towards a modular &#8216;Academic Working Environment&#8217; (AWE) which encompasses both teaching and learning on one hand and research on the other. Repositories and the ecosystem of services and workflows that surround them play a key role in this emerging system.</p>
<h1><img mce_name="a" name="id2" class="mceItemAnchor">1   Introduction</h1>
<p>This presentation uses the conference organizers&#8217; description of current issues in repositories as a way to structure discussion about a program being undertaken at the Australian Digital Futures Institute; The Academic Work Environment (AWE). The presentation will look at each of the integration challenges listed on the call for papers: the web and the repository, knowledge and technology,wild and curated content, linked and isolated data, disciplinary and institutional systems, scholars and service providers, ad-hoc and long-term access, ubiquitous and personalized environments, the cloud and the desktop.</p>
<h1><img mce_name="a" name="id3" class="mceItemAnchor">2   AWE: Meeting the grand challenges for repositories</h1>
<p>The Academic Work Environment (AWE) is a label for a set of services and computational systems  that support the academic enterprise. The acronym was coined to capture under a single identifier a range of research and development work going on in a group tasked with pragmatic, practical work on workflows and computer systems for eLearning and eResearch.</p>
<h2><img mce_name="a" name="id4" class="mceItemAnchor">2.1   The web and the repository &amp; The cloud and the desktop</h2>
<p>It is telling that <span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span"><span>the web vs the repository</span></span> is posed as a dichotomy or a challenge. While repositories are, by default, web-based systems the vast majority of document content housed in repositories is in a non-web format, PDF. Furthermore, vast amounts of potential repository content such as data-files are excluded from the repository, and from the web/cloud because of the lack of services that assist academic users in making their content available on the web. The Academic Work Environment addresses both of the issues identified above with systems to:</p>
<ol class="lin" style="list-style: decimal;" mce_style="list-style: decimal;">
<li>Allow academic documents to be made available in HTML as well as PDF. This work, based on The Integrated Content Environment (<a href="http://ice.usq.edu.au/" mce_href="http://ice.usq.edu.au/">ICE</a>) was presented at Open Repostiories 2009 (Sefton, Downing, &amp; Day, 2009). Uptake on this has been very slow, but repositories need to start using web formats if they are to be part of the web and fulfill the promise of integrated documents and data as envisaged by Murray Rust and Rzepa (Murray-Rust &amp; Rzepa, 2004).</li>
<li>Begin to close the gap between the desktop work environment and the repository via an application which bring a web-based repository view to all the files that a researcher/educator is using; allowing them to describe them, back them up and have them routed to appropriate repository for works-in-progress and completed research outputs as appropriate. (Dickinson &amp; Sefton, 2009; Sefton, 2009)</li>
</ol>
<p>Other work in this area includes Microsoft Research&#8217;s work on embedding SWORD <span class="spCh spChx2013">–</span> named OfficeSWORD<span class="footnote" style="vertical-align: super;" mce_style="vertical-align: super;"><img mce_name="a" name="ftn1-text" class="mceItemAnchor"></span> <span class="spCh spChx2013">–</span> repository deposit into their word processing product as well as some attempts to allow structured authoring and semantic web-authoring in the browser (Fernicola, 2009; Fink et al., 2010).</p>
<h2><img mce_name="a" name="id5" class="mceItemAnchor">2.2   Knowledge and technology</h2>
<p>Whilst individual web-pages may contain information that people can use to construct knowledge, the technology of the web itself is largely ignorant to the concept of information and the notion of a body of information being spread across the network. Authors may hyperlink to provide readers with further context but the web browser is a dumb-terminal that understands nothing about the actual information embedded in the content. Search engines provide a basic entry point to the information network but their reliance is on key phrases and not the context and meaning understood by a community of practice (Neumann &amp; Prusak, 2007). The challenge now is to open up the information existing in online forms <span class="spCh spChx2013">–</span> including documents and data <span class="spCh spChx2013">–</span> so that it is findable<span class="footnote" style="vertical-align: super;" mce_style="vertical-align: super;"><img mce_name="a" name="ftn2-text" class="mceItemAnchor"></span>, a concept that spans more than just matching keywords:</p>
<blockquote class="bq"><p>Any system aiming to integrate heterogeneous data on an ad hoc basis and present this to users will need to adopt sophisticated models of relevance, quality, and trust that are sensitive to the user<span class="spCh spChx2019">’</span>s current task and its context. (Heath Heath, 2008 p. 91)</p>
</blockquote>
<p>Heath&#8217;s discussion here is focused on the Semantic Web. Based on technologies from groups such as the W3C<span class="footnote" style="vertical-align: super;" mce_style="vertical-align: super;"><img mce_name="a" name="ftn3-text" class="mceItemAnchor"></span>, the Semantic Web provides a framework within the Web that gives web browsers and search engines the ability to interact with information. Navigation in this model is based on assembling meaning rather than merely providing presentation services (Heath, 2008). The challenge, then, is to get the semantics into the Semantic Web. Within the AWE we are looking at methods that allow researchers to easily provide semantic information in their data and documents. The Anotar<span class="footnote" style="vertical-align: super;" mce_style="vertical-align: super;"><img mce_name="a" name="ftn4-text" class="mceItemAnchor"></span> project presented a framework for adding semantics through the well understood concept of tagging and extending this by allowing taxonomies to be utilised. Created as an easy to adopt toolkit, Anotar is being adapted to systems such as The Fascinator, Moodle and WordPress. Initial pilot work has also been undertaken to provide facilities for adding semantic information to word processed documents so that semantic <span class="spCh spChx201c">“</span>mark up<span class="spCh spChx201d">”</span> can be created with the document rather as a deferred technical effort (Sefton, 2009).</p>
<h2><img mce_name="a" name="id6" class="mceItemAnchor">2.3   Wild and curated content</h2>
<p>The AWE approach to wild and curated content is to gradually tame and domesticate the content, by allowing it to be husbanded by a series of &#8216;curation events&#8217;. Using the Fascinator Desktop, The initial creator may label data items or sets in simple ad-hoc terms using tags such as <span class="spCh spChx201c">“</span>My Thesis<span class="spCh spChx201d">”</span> or <span class="spCh spChx201c">“</span>Anthropology 101 Course Notes<span class="spCh spChx201d">”</span>. The incentive to do this is that when they do so, the items will be (a) backed up appropriately and (b) routed to collaborators automatically. This represents what we might call an emergent workflow; where object state changes result in items making progress through required stages.</p>
<p>We also provide a more intentional kind of curation via &#8216;acts of publishing&#8217; where a data owner can push content across curation boundaries (the term coined by ARROW project members (Treloar, Groenewegen, &amp; Harboe-Ree, 2007)) for various reasons;</p>
<ul class="lib">
<li>a draft can be pushed to a blog (remember that, using ICE content services, all word processing documents in the Academic Working Environment are web-ready and can be delivered as HTML)</li>
<li>a finished paper or approved draft can be pushed to the institutional repository. Work is under way to enable this to happen in a way that (a) data is also deposited and (b) both document and data are web-ready and can be viewed in-browser as native web content, as well as being available for print .</li>
</ul>
<h2><img mce_name="a" name="id8" class="mceItemAnchor">2.4   Linked and isolated data &amp; Ad-hoc and long-term access</h2>
<p>Bootstrapping the linked-data web remains a grand challenge, but we are attempting to address it in work on The Fascinator Desktop by providing URIs (the formal name for links) for data while it is still in isolation in a lab or on a laptop computer for example. Managing identifiers through the lifecycle of a digital object is not easy when you consider that a researcher may have their digital files spread across multiple desktop, portable and mobile devices and that, in the messy landscape of desktop filesystems, filenames are changed at a whim and multiple versions may exist throughout a system.</p>
<p>Within The Fascinator, every item will have a URI from the moment it is discovered on the user&#8217;s desktop.</p>
<p>Creating a URI for desktop files is a similar approach to the SemDeskURI Scheme suggested by members of the Nepomuk<span class="footnote" style="vertical-align: super;" mce_style="vertical-align: super;"><img mce_name="a" name="ftn5-text" class="mceItemAnchor"></span> project (Sauermann, 2008). However, instead of relying on a new protocol (in the case of SemDeskUri this is <span class="spCh spChx201c">“</span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">desktop://<span class="spCh spChx201d">”</span></span>), URIs in The Fascinator will utilise <span class="spCh spChx201c">“</span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">http://</span><span class="spCh spChx201d">”</span>. For example, whilst <span class="spCh spChx201c">“</span>http://localhost/dickinso@usq.edu.au/research/data<span class="spCh spChx201d">”</span> will not allow remote users to access the resource, it does provide for contextual identification.</p>
<p>In terms of ad-hoc vs long-term access, under this scheme we would expect resources to move from an isolated desktop-web view of digital objects (remembering that our systems give people a web view from the very creation of the object) to ad-hoc team views where the provenance of items is preserved by keeping both desktop and team-URIs, through to more formally created views. As content is routed from the desktop to shared repositories the plan is to keep the URIs, so that the metadata for an item contains all of the known identifiers that we have for it.</p>
<h2><img mce_name="a" name="id9" class="mceItemAnchor">2.5   Disciplinary and institutional systems / Scholars and service providers</h2>
<p>Whilst much of this presentation has focused on the technical aspects of <span class="spCh spChx201c">“</span>The Grand Integration Challenge<span class="spCh spChx201d">”</span>, the complexity of institutional and individual engagement  must be considered. This is a challenge that threads through numerous areas of academia, including postgraduate researcher skills, central ICT provision, Government reporting requirements, Library systems, research project management etc. Indeed, it presents a <span class="spCh spChx201c">“</span>wicked problem<span class="spCh spChx201d">”</span><span class="footnote" style="vertical-align: super;" mce_style="vertical-align: super;"><img mce_name="a" name="ftn6-text" class="mceItemAnchor"></span> for the academic research sector and is one that various stakeholders approach with a carrot and/or stick approach.</p>
<p>From the carrot side, we are working with motivated pilot users and faculties to build the AWE to meet their needs. Another carrot, of course, is grants &#8211; such as those offered by the Australian National Data Service <span class="spCh spChx2013">–</span> to teams developing solutions. The stick often presents itself as institutionalised mandates  (<span class="spCh spChx201c">“</span>Data-sharing culture has changed,<span class="spCh spChx201d">”</span> 2009; Australian Government, 2007) but for many researchers, the technical complexities around data management are an intrusion into their time (Henty, Weaver, Bradbury, &amp; Porter, 2008). As mentioned earlier, a central goal of the AWE is to provide services that meet the various stakeholder demands but don&#8217;t interfere with the researcher&#8217;s core tasks.</p>
<h2><img mce_name="a" name="id10" class="mceItemAnchor">2.6   Ubiquitous and personalized environments</h2>
<p>Two central goals of the AWE are to hide the technicalities around data management and the semantic web and to provide services that meet various stakeholder demands. These goals are both aimed at allowing the researcher to focus on their research. We envisage a mesh of repository services, building on the existing standards for linking repository content; the Fascinator Desktop work (and before it The Integrated Content Environment) introduces the idea of a personal desktop web; with services such as tagging and note-taking acting not only in their traditional role of assisting in the research process, but as triggers in emergent workflows.</p>
<p>A good example to illustrate how the personal and ubiquitous can meet and <span class="spCh spChx201c">“</span>intermesh<span class="spCh spChx201d">”</span> is our work on a general framework for annotations, Anotar. In work extending from that described in Dickinson and Sefton (2009), we are adding Anotar annotation services to The Fascinator. In the case of our work with USQ&#8217;s Public Memory Research Centre, this facility will allow the researcher to tag photos in their desktop repository for release to an online repository for viewing. Research participants can log onto this repository and provide the researcher with essential information regarding the people and locations in various photos and movies through the use of taxonomy-based tagging. Furthermore, the open-ended nature of the annotations will provide an online forum for the participants to share their memories and even debate points of view <span class="spCh spChx2013">–</span> providing a rich data-set for the researcher as well as a first-person public memory archive.</p>
<p>Whilst tagging and annotation services are provided (to some extent) by various online Web 2.0 systems,  the AWE solution meets research-specific data management requirements by keeping research data in a way that adheres to the University&#8217;s ethical clearance requirements. For the researcher and their community of participants, it provides a personalized environment that lets them focus on their own goals rather than the technical infrastructure.</p>
<h1><img mce_name="a" name="id11" class="mceItemAnchor">3   Conclusion</h1>
<p>The Academic Working Environment is not a single monolithic application with a <span class="spCh spChx201c">“</span>one size fits all<span class="spCh spChx201d">”</span> approach. Its name defines its structure: an environment of interoperable services that works with the researcher and doesn&#8217;t hamper their efforts.</p>
<h1><img mce_name="a" name="id12" class="mceItemAnchor">4   References</h1>
<p class="P2">Australian Government. (2007). <span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">Australian code for the responsible conduct of research</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">. Canberra, Australia: Australian Government. Retrieved from http://www.nhmrc.gov.au/_files_nhmrc/file/publications/synopses/r39.pdf</span></span></p>
<p class="P2"><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">Data-sharing culture has changed. (2009, December 11). </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">Research Information</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">. Retrieved November 19, 2009, from http://www.researchinformation.info/news/news_story.php?news_id=553</span></span></p>
<p class="P3">Dickinson, D., &amp; Sefton, P. (2009). Creating an eResearch desktop for the Humanities. Presented at the eResearch Australasia 2009, Manly, Australia. Retrieved from http://eprints.usq.edu.au/6090/</p>
<p class="P3">Fernicola, P. F. (2009). Incorporating Semantics and Metadata as Part of the Article Authoring Process. Retrieved March 1, 2010, from http://elpub.scix.net/cgi-bin/works/Show?152_elpub2009</p>
<p class="P2"><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">Fink, J. L., Fernicola, P., Chandran, R., Parastitidas, S., Wade, A., Naim, O., Quinn, G., et al. (2010). Word add-in for ontology recognition: semantic enrichment of scientific literature. </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">BMC Bioinformatics</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">, </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">11</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">(1), 103. doi:10.1186/1471-2105-11-103</span></span></p>
<p class="P2"><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">Heath, T. (2008). How Will We Interact with the Web of Data? </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">Internet Computing, IEEE</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">, </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">12</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">(5), 88-91.</span></span></p>
<p class="P2"><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">Henty, M., Weaver, B., Bradbury, S. J., &amp; Porter, S. (2008). </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">Investigating Data Management Practices in Australian Universities</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">. Retrieved from http://eprints.qut.edu.au/14549/</span></span></p>
<p class="P2"><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">Murray-Rust, P., &amp; Rzepa, H. S. (2004). The Next Big Thing: From Hypermedia to Datuments. </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">Journal of Digital Information</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">, </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">5</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">(1), 248.</span></span></p>
<p class="P2"><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">Neumann, E., &amp; Prusak, L. (2007). Knowledge networks in the age of the Semantic Web. </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">Briefings in Bioinformatics</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">, </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">8</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">(3), 141-149. doi:10.1093/bib/bbm013</span></span></p>
<p class="P3">Sauermann, L. (2008, October 27). RFC-draft: SemDesk URI Scheme. Retrieved February 25, 2010, from http://dev.nepomuk.semanticdesktop.org/repos/trunk/doc/2008_09_semdeskurischeme/index.html</p>
<p class="P2"><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">Sefton, P. (2009, April 8). Journal 2.0: Embedding semantics in documents. </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">ptsefton</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">. Retrieved February 26, 2010, from http://ptsefton.com/2009/04/08/journal-20-embedding-semantics-in-documents.htm</span></span></p>
<p class="P2"><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">Sefton, P., Downing, J., &amp; Day, N. (2009). ICE-theorem &#8211; end to end semantically aware eResearch infrastructure for theses. </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">University of Southern Queensland</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">. Retrieved August 24, 2009, from http://eprints.usq.edu.au/5248/1/ice-theorem-paper-OR09.htm</span></span></p>
<p class="P3">Sefton, P. M. (2009). The Fascinator &#8211; Desktop eResearch and Flexible Portals. Presented at the 4th International Conference on Open Repositories, Georgia Institute of Technology. Retrieved from http://smartech.gatech.edu/handle/1853/28483</p>
<p class="P2"><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">Treloar, A., Groenewegen, D., &amp; Harboe-Ree, C. (2007). The Data Curation Continuum. </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">D-Lib Magazine</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">, </span></span><span mce_name="em" mce_style="font-style: italic;" style="font-style: italic;" class="Apple-style-span">13</span><span style="font-style: normal;" mce_style="font-style: normal;"><span class="T5">(9/10). doi:10.1045/september2007-treloar</span></span></p>
<p class="center">Copyright Peter Sefton and Duncan Dickinson, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. &lt;<a onclick="javascript:window.top.open(&quot;http://creativecommons.org/licenses/by-sa/2.5/au/&quot;);return false;" href="http://creativecommons.org/licenses/by-sa/2.5/au/" mce_href="http://creativecommons.org/licenses/by-sa/2.5/au/">http://creativecommons.org/licenses/by-sa/2.5/au/</a>&gt;</p>
<p class="center" style="font-style: normal;" mce_style="font-style: normal;"><span class="Default_20_Paragraph_20_Font"><span style="country: US; language: en;" mce_style="country: US; language: en;"><span class="T6"><img mce_name="a" name="HTTP:::DBPEDIA.ORG:SNORQL:?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%" class="mceItemAnchor"><img class="fr1" style="border: 0px; vertical-align: top;" mce_style="border: 0px; vertical-align: top;" src="http://ptsefton.com/wp-content/uploads/2010/04/m40ca94ba.png" mce_src="http://ptsefton.com/wp-content/uploads/2010/04/m40ca94ba.png" alt="HTTP://DBPEDIA.ORG/SNORQL/?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%" width="88" height="31"></span></span></span></p>
</p>
<hr />
<div style="font-size: .9em;" mce_style="font-size: .9em;"><span class="footnote-defined"><img mce_name="a" name="ftn1" class="mceItemAnchor"><a href="http://officesword.codeplex.com/" mce_href="http://officesword.codeplex.com/">http://officesword.codeplex.com/</a> </span></div>
<div style="font-size: .9em;" mce_style="font-size: .9em;"><span class="footnote-defined"><img mce_name="a" name="ftn2" class="mceItemAnchor"><a href="http://en.wikipedia.org/wiki/Findable" mce_href="http://en.wikipedia.org/wiki/Findable">http://en.wikipedia.org/wiki/Findable</a> </span></div>
<div style="font-size: .9em;" mce_style="font-size: .9em;"><span class="footnote-defined"><img mce_name="a" name="ftn3" class="mceItemAnchor"><a href="http://www.w3.org/2001/sw/" mce_href="http://www.w3.org/2001/sw/">http://www.w3.org/2001/sw/</a> </span></div>
<div style="font-size: .9em;" mce_style="font-size: .9em;"><span class="footnote-defined"><img mce_name="a" name="ftn4" class="mceItemAnchor"><a href="http://www.purl.org/anotar/" mce_href="http://www.purl.org/anotar/">http://www.purl.org/anotar/</a> </span></div>
<div style="font-size: .9em;" mce_style="font-size: .9em;"><span class="footnote-defined"><img mce_name="a" name="ftn5" class="mceItemAnchor"><a href="http://nepomuk.semanticdesktop.org/xwiki/bin/view/Main1/" mce_href="http://nepomuk.semanticdesktop.org/xwiki/bin/view/Main1/">http://nepomuk.semanticdesktop.org/xwiki/bin/view/Main1/</a> </span></div>
<div style="font-size: .9em;" mce_style="font-size: .9em;"><span class="footnote-defined"><img mce_name="a" name="ftn6" class="mceItemAnchor"><a href="http://en.wikipedia.org/wiki/Wicked_problem" mce_href="http://en.wikipedia.org/wiki/Wicked_problem">http://en.wikipedia.org/wiki/Wicked_problem</a> </span></div>
</p>
</div>
<p><span mce_name="strong" mce_style="font-weight: bold;" style="font-weight: bold;" class="Apple-style-span"> </span></p>
</div>
<p><span mce_name="strong" mce_style="font-weight: bold;" style="font-weight: bold;" class="Apple-style-span"> </span></p>
]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2010/04/25/repositories-post-2010-embracing-heterogeneity-in-awe-the-academic-working-environment.htm/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Making things happen to the book</title>
		<link>http://ptsefton.com/2010/04/23/making-things-happen-to-the-book.htm</link>
		<comments>http://ptsefton.com/2010/04/23/making-things-happen-to-the-book.htm#comments</comments>
		<pubDate>Fri, 23 Apr 2010 05:39:59 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/2010/04/23/making-things-happen-to-the-book.htm</guid>
		<description><![CDATA[



A week ago I was at the National Library&#8217;s Innovative Ideas Forum in Canberra &#8211; that&#8217;s #iif2010. I posted something about Kent Fitch&#8217;s take on innovation in libraries over on the CAIRSS blog.
One of the other speakers was Mark Pesce, asking What Ever Happened to the Book. This looks at how books are evolving, and [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="http://ptsefton.com/2010/04/23/making-things-happen-to-the-book.htm"><!-- &nbsp; --></abbr>
<div>
<div class="page-toc" />
<div>
<p>A week ago I was at the <a href="http://www.nla.gov.au/initiatives/meetings/innovative-ideas-forum/2010/program.html">National Library&#8217;s Innovative Ideas Forum in Canberra</a> <span class="spCh spChx2013">&#8211;</span> that&#8217;s <a href="http://twitter.com/">#iif2010</a>. I posted <a href="http://cairss.caul.edu.au/blog/2010/04/21/innovative-ideas-for-cairss/">something about Kent Fitch&#8217;s take on innovation in libraries over on the CAIRSS blog</a>.</p>
<p>One of the other speakers was Mark Pesce, asking <a href="http://blog.futurestreetconsulting.com/?p=282">What Ever Happened to the Book</a>. This looks at how books are evolving, and how they aren&#8217;t, as they go electronic. Tom Worthington had a rather harsh summary: <a href="http://www.tomw.net.au/blog/2010/04/whatever-happened-to-plain-english.html">Whatever happened to plain English?</a> Well, OK, so it is a bit long winded. I think the main thing it offered was a reminder that we don&#8217;t yet know what &#8216;the book&#8217; will become and whether we will call it &#8216;the book&#8217;. That&#8217;s an important reminder and it was an interesting tour of some of the issues.</p>
<p>One of Mark&#8217;s gripes was that a lot of electronic publishing is merely <span class="spCh spChx201c">&#8220;</span>Publishing in light<span class="spCh spChx201d">&#8221;</span> and publishers are not embracing the possibilities of a new medium; they don&#8217;t even put links in their material in a lot of cases resisting the <span class="spCh spChx201c">&#8220;</span>centrifugal force<span class="spCh spChx201d">&#8221;</span> that pulls readers away. I tweeted:</p>
<blockquote class="bq"><p><a href="../search%3Fq=%2523iif2010"><span class="T1">#iif2010</span></a> I think electronic books (portable, cheap) ARE compelling, sure hypertext is nice to</p>
</blockquote>
<p>To expand on that <span class="spCh spChx2013">&#8211;</span> there are substantial benefits to having books published in light; the main one is that, well, they&#8217;re light. If I can read a book on my Android phone I will, publishing in light makes the book weightless, lit for reading at night and always with me <span class="spCh spChx2013">&#8211;</span> even without reading glasses, &#8216;cos I can make the words big.</p>
<p>The other point I want to make is about the nature of the links that Mark Pesce puts so much emphasis on. As I said, he talked about a tendency for some commercial publishers to resist linking. Later on he talks about the way links distract and how this means that fewer long texts are being consumed. Coincidentally, the same morning there was a post on the O&#8217;Reilly radar blog: <a href="http://oreil.ly/9u1pox">Ebook annotations, links and notes: Must-haves or distractions? &#8211; O&#8217;Reilly Radar</a> (it&#8217;s short so you can afford to go and read it). This looked at the same issue <span class="spCh spChx2013">&#8211;</span> the author&#8217;s conclusion is that our reading tools should allow us to turn links, annotations, footnotes, marginalia etc on and off so we can be distracted, and drift with the links or not as we choose. I agree <span class="spCh spChx2013">&#8211;</span> sounds like it must become part of our reading experience.</p>
<p>Now, to the main point of this post. The bit about turning on and off links and annotations reminded me of the work we&#8217;ve been doing on our web annotation toolkit at ADFI. And I got to thinking that it would be good to go beyond the author/publisher stuffing a text with links and let third parties to add their own distractions. There are lots of uses cases here; allowing users to make their own notes, in-text discussions, letting a lecturer annotate a text book with notes for their students, a toolbox for the Joycean Scholars to fiddle yet more with Finnegans Wake.</p>
<p>So, as a kind of experiment in Hypertext and in the spirit of WriteToReply.org who do this with all kinds of documents, I have <a href="http://anotar.ptsefton.com/?p=15">reproduced Mark Persce&#8217;s script on my Anotar server</a> for you all (and me, and him, too if he wants) to interact with, using the first tentative release of the WordPress version of Anotar.. To comment, hover over a paragraph and click the [Comment] link.</p>
<p>I&#8217;m sure there are lots of suggestions you can make about the interface. We have not yet got the ability to highlight terms and add your own links. Other obvious things would be a floating button to show/hide all annotations, and the option to have stuff appear in the margin rather than in-line. It&#8217;s open source. Send us a patch.</p>
<p>And let me know if there are any bugs via ptsefton, at you know, the URL for google&#8217;s mail system.</p>
<p class="center">Copyright Peter Sefton, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. &lt;http://creativecommons.org/licenses/by-sa/2.5/au/&gt;</p>
<p class="center">This post was written Microsoft Word (which turns out to be much better than OpenOffice at doing sensible things when you paste HTML into it), using templates and tools provided by the Integrated Content Environment .</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2010/04/23/making-things-happen-to-the-book.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>My obsession with WordPress</title>
		<link>http://ptsefton.com/2010/03/30/my-obsession-with-wordpress.htm</link>
		<comments>http://ptsefton.com/2010/03/30/my-obsession-with-wordpress.htm#comments</comments>
		<pubDate>Tue, 30 Mar 2010 06:34:57 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/2010/03/30/my-obsession-with-wordpress.htm</guid>
		<description><![CDATA[



Why WordPress in particular
What&#8217;s not to like?



My team received some interesting, thoughtful and detailed feedback this week from one of the computer programmers in one of our sister-teams at USQ. One thread in this missive was to do with the superiority of general purpose content management applications,and Joomla! In particular over mere blogs. Oh, and [...]]]></description>
			<content:encoded><![CDATA[<abbr class="unapi-id" title="http://ptsefton.com/2010/03/30/my-obsession-with-wordpress.htm"><!-- &nbsp; --></abbr>
<div>
<div class="page-toc">
<ul>
<li><a href="#id2">Why WordPress in particular</a></li>
<li><a href="#id3">What&#8217;s not to like?</a></li>
</ul>
</div>
<div>
<p>My team received some interesting, thoughtful and detailed feedback this week from one of the computer programmers in one of our sister-teams at USQ. One thread in this missive was to do with the superiority of general purpose content management applications,and Joomla! In particular over mere blogs. Oh, and the email contained the phrase <span class="spCh spChx201c">&#8220;</span>I don<span class="spCh spChx2019">&#8217;</span>t understand your obsession with WordPress<span class="spCh spChx201d">&#8221;</span>. I wasn&#8217;t the only recipient, but I thought that now might be a good time to explain <b>my own</b> supposed obsession with WordPress, and why, as the head of the team that produces the <a href="http://ice.usq.edu.au/">Integrated Content Environment</a> and is building our repository toolkit <a href="http://fascinator.usq.edu.au/">The Fascinator</a> I have made sure that our software are able to interoperate with WordPress in particular and blogging software in general.</p>
<p>Actually, I&#8217;m not obsessed. I do use WordPress  to run this blog<a href="#ftn2" name="ftn2-text"><span style="vertical-align: super;"><span class="endnote">i</span></span></a> and the reason for that is pretty much the same reason we support it and use it at ADFI (for the <a href="http://adfi.usq.edu.au/blog/">main blog</a> and for a more technical <a href="http://intranet.adfi.usq.edu.au/dev_blog/">developers blog</a>). It&#8217;s one of the most commonly used web publishing applications. And it&#8217;s big in academia. Our job is to build tools for academics and we use it for the same reason they do. It&#8217;s like a Commodore or a Falcon or a Camry, well understood by mechanics, cheap, and easy to find spare parts, even if it&#8217;s not the best bit of engineering or it uses more fuel than we&#8217;d like. As with such utilitarian transport WordPress is not the interesting bit, it&#8217;s where it can take you that counts, and academic users are taking themselves all sorts of places with it.</p>
<p>There has been a lot written on this topic, and I&#8217;m not an expert, but my impression is that one reason  that blogging has worked well in the academy is that the basic model of blog posts that don&#8217;t change much is very similar to journal articles. You publish, others comment on or reference what you published. This used to happen at glacier-speed via journal processes and letters and reviews and so on and now it an happens fast, but it&#8217;s essentially the same kind of discourse as we&#8217;ve had for some centuries. </p>
<h1><a id="id2" name="id2" />Why WordPress in particular</h1>
<p>One of the main reasons we chose to use and extend WordPress a few years ago is that it has AtomPub support that works. This means that you can post a document, like, say, this one, along with images and maybe other stuff such as alternative renditions, using a standard protocol. Other blogging software we tried a couple of years ago Just Didn&#8217;t Work. We have a few bugs in our AtomPub support, which we&#8217;ll look at. </p>
<p>And it&#8217;s obvious that WordPress is really widely used in academia. Off the top of my head, here are just a few of the many, mostly recent things I know are going on in the crowd that I follow in Google Reader <span class="spCh spChx2013">&#8211;</span> I didn&#8217;t subscribe to any of these  because of some predilection for WP, I follow this stuff because it&#8217;s interesting:</p>
<ul class="lib">
<li>
<p>JISC in the UK have funded <a href="http://jiscpress.blogs.lincoln.ac.uk/2009/11/18/digress-it-version-2-3-was-released-last/">JISCPress which produced the Digress.it</a> system for turning WordPress into a paragraph-level discussion system. Very important work for public review of all kinds of documents. </p>
</li>
<li>
<p>Joss Winn <a href="http://joss.blogs.lincoln.ac.uk/2010/02/22/wordpress-beyond-blogging/">squeezes all sorts of stuff out of WordPress<a href="#ftn0" name="ftn0-text"><span style="vertical-align: super;"><span class="endnote">ii</span></span></a></a>.</p>
</li>
<li>
<p>The <a href="http://journal.code4lib.org/process-and-structure">Code4Lib journal</a> is powered by WordPress.</p>
</li>
<li>
<p>Tony Hirst is <a href="http://ouseful.wordpress.com/2010/03/25/viewing-wordpress-posts-in-chronological-order/">full of useful tips for using WP</a>. Tony pointed out to me that you can extract the content from a WP page using a single-page feed, making WP blogs ideal for integration with other services.</p>
</li>
<li>
<p>Peter Murray-Rust <a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=896">has written about the shortcomings of WordPress</a> as he tries to force it to do data-rich publishing and was <a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=1720">trying our ICE templates with it last year</a>.</p>
</li>
</ul>
<p>And:</p>
<ul class="lib">
<li>
<p>When I was looking at referencing systems I was able to <a href="http://www.zotero.org/support/dev/wordpress">make my blog Zotero compatible in three minutes flat</a>.</p>
</li>
<li>
<p><a href="http://lackoftalent.org/michael/blog/ore-wordpress-plug-in/">Ditto</a> for when <a href="http://ptsefton.com/2008/10/14/what-the-oai-ore-protocol-can-do-for-you.htm">I was exploring ORE</a>.</p>
</li>
</ul>
<p>Point is, there are so many plugins that when you need one, someone has usually done it for you.</p>
<p>Look, I don&#8217;t have stats on this, but it is clear that this is the system to support first. I don&#8217;t see all this activity around other systems in the stuff I follow, which is biased towards the Open Access Scholarly Publishing and  repository communities. </p>
<p>So, we&#8217;re making a WordPress plugin for our Anotar discussion/annotation system; there&#8217;s a huge audience who might (a) like to use it to allow paragraph-anchored discussion, and eventually  image-region anchored discussion etc and (b) if enough people adopt it, then they might help us build Anotar by extending the Anotar client, which we will be using in other systems as well, such as Moodle.</p>
<h1><a id="id3" name="id3" />What&#8217;s not to like?</h1>
<p>To finish up, there <b>are</b> some aspects of WordPress that I really, really hate<a href="#ftn1" name="ftn1-text"><span style="vertical-align: super;"><span class="endnote">iii</span></span></a></p>
<ul class="lib">
<li>
<p><b>The key one is this &#8216;river of news&#8217; format</b> which is the default. Go to a default WordPress home page and you get the latest article, and you have to scroll and scroll to see the next article. I reckon that for all but very specialised link blogs or twitter-like blogs this is incredibly poor usability <span class="spCh spChx2013">&#8211;</span> how am I supposed to know what&#8217;s actually there?</p>
</li>
<li>
<p><b>The default search behaviour is the same.</b> Look at <a href="http://ouseful.wordpress.com/?s=word+press+single+post+rss">this search on Tony&#8217;s blog</a>. There&#8217;s a reason Google doesn&#8217;t do this. I was trying to find something that I know he wrote about getting the content of a post via an RSS feed but I get an incredibly long page containing several posts.</p>
</li>
<li>
<p>The <b>default theme mangles basic HTML</b> like bullet points for no apparent reason.</p>
</li>
<li>
<p>The <b>default behaviour of making the title of a post link to the &#8216;permalink&#8217; is silly</b> <span class="spCh spChx2013">&#8211;</span> I particularly loathe pages that link to themselves (not that I have bothered to change this on my site).</p>
</li>
<li>
<p>There&#8217;s this fundamental <b>hard-wired distinction between &#8216;posts&#8217; and &#8216;pages&#8217;</b>. Ugh. It would be so much nicer if a document was a document, and it had metadata, such as &#8216;document-type&#8217; and then you could configure how you wanted things displayed <span class="spCh spChx201c">&#8220;</span>on the home page give me abstracts of the last 6 posts, most recent first<span class="spCh spChx201d">&#8221;</span>. But then, you know it would be Drupal, not good ole WordPress.</p>
</li>
</ul>
<p />
<p class="center">Copyright Peter Sefton, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. &lt;<a href="http://creativecommons.org/licenses/by-sa/2.5/au/" onclick="javascript:window.top.open(&quot;http://creativecommons.org/licenses/by-sa/2.5/au/&quot;);return false;">http://creativecommons.org/licenses/by-sa/2.5/au/</a>&gt;</p>
<p class="center"><span class="Default_20_Paragraph_20_Font"><span style="country:US; language:en; "><span class="T1"><a name="HTTP:::DBPEDIA.ORG:SNORQL:?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%" /><img alt="HTTP://DBPEDIA.ORG/SNORQL/?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%" class="fr1" height="31" src="http://ptsefton.com/wp-content/uploads/2010/03/m40ca94ba4.png" style="border:0px; vertical-align: top" width="88" /></span></span></span></p>
<p class="center">This post was written in OpenOffice.org, using templates and tools provided by the <a href="http://ice.usq.edu.au/" onclick="javascript:window.top.open(&quot;http://ice.usq.edu.au/&quot;);return false;">Integrated Content Environment</a> project and published to WordPress using <a href="http://fascinator.usq.edu.au/desktop/desktop.htm" onclick="javascript:window.top.open(&quot;http://fascinator.usq.edu.au/desktop/desktop.htm&quot;);return false;">The Fascinator</a>.</p>
<hr />
<div style="font-size: .9em;"><span class="endnote"><a href="#ftn2-text" name="ftn2">i</a> When PHP was born I hated it, the way it mixed code and output formatting, but you know, modern PHP apps have grown up a lot, and WordPress has a reasonably respectable, if extremely poorly documented, way to write extensions. I avoided using it for years, but it&#8217;s actually OK, if you don&#8217;t try to edit using the stupid web-based editor.</span></div>
</p>
<p>
<div style="font-size: .9em;"><span class="endnote"><a href="#ftn0-text" name="ftn0">ii</a>I am still trying to get over the way he deposited a thesis in ePrints via an RSS feed.</span></div>
</p>
<p>
<div style="font-size: .9em;"><span class="endnote"><a href="#ftn1-text" name="ftn1">iii</a> I&#8217;m not sounding obsessive am I?</span></div>
</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2010/03/30/my-obsession-with-wordpress.htm/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>
