<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>ptsefton</title>
	<atom:link href="http://ptsefton.com/feed" rel="self" type="application/rss+xml" />
	<link>http://ptsefton.com</link>
	<description></description>
	<lastBuildDate>Tue, 21 May 2013 23:34:11 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.4.1</generator>
		<item>
		<title>Research Data @ the University of Western Sydney (Introducing a data deposit management plan to the research community at UWS)</title>
		<link>http://ptsefton.com/2013/05/22/research-data-the-university-of-western-sydney-introducing-a-data-deposit-management-plan-to-the-research-community-at-uws.htm</link>
		<comments>http://ptsefton.com/2013/05/22/research-data-the-university-of-western-sydney-introducing-a-data-deposit-management-plan-to-the-research-community-at-uws.htm#comments</comments>
		<pubDate>Tue, 21 May 2013 23:34:11 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/?p=1705</guid>
		<description><![CDATA[I was invited to speak at the National Higher Education Faculty Research Summit in Sydney on May 22 about our Research Data Repository project. The conference promises to provide a forum for exploration. Explore Sourcing extra grant funding and increasing revenue streams Fostering collaboration and building successful relationships Emerging tools and efficient practices for maintaining [...]]]></description>
			<content:encoded><![CDATA[	<div>
	<span property="http://purl.org/dc/terms/title" content="Research Data @ the University of Western Sydney (Introducing a data deposit management plan to the research community at UWS)"><!--  --></span>
			<span property='http://purl.org/dc/terms/creator' content='ptsefton'><!-- --></span>
	
	
	
	
	
	
	
	
	
	
<article><section><h1 class="Title">
<b><span lang="en-US"></b></h1><p class="Standard">
<span lang="en-US">I was invited to speak at the </span><a href="http://apo.org.au/content/national-higher-education-faculty-research-summit-2013" onclick="pageTracker._trackPageview('/outgoing/apo.org.au/content/national-higher-education-faculty-research-summit-2013?referer=');">National
Higher Education Faculty Research Summit in Sydney on May 22</a><span lang="en-US">
about our Research Data Repository project. The conference promises
to provide a forum for exploration. </span>
</p><blockquote><p class="Standard">
<b>Explore</b></p><ul><li><p class="List Paragraph">
	Sourcing extra grant funding and increasing revenue streams</p></li><li><p class="List Paragraph">
	Fostering collaboration and building successful relationships</p></li><li><p class="List Paragraph">
	<b>Emerging tools and efficient practices for maintaining research
	efficacy and integrity</b></p></li><li><p class="List Paragraph">
	Improving your University’s research performance, skills and
	culture to enable academic excellence</p></li></ul></blockquote><p class="Standard">
<span lang="en-US">My topic </span>is “Introducing a data deposit
management plan to the research community at UWS”. This relates
directly to the conference theme I have highlighted, on emerging
tools and practice. My strategy for this presentation, given that
we’re at a summit, is to stay above 8000m, use a few metaphors, and
discuss the strategy we’re taking at UWS rather than dive too
deeply into the sordid details of projects. As usual, these are my
notes; I hope these few paragraphs will be more useful than just a
slide deck, but this is not a fully developed essay.</p><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			There are two kinds of data: Working and Archival/Published</h2><p class="Standard">
			<img src="http://ptsefton.com/wp-content/uploads/2013/05/wpid-Research-Data.html_Research-Data_html_1e9e85bc.png" name="Picture 3" align="BOTTOM" width="692" height="361" border="0"></p></section><p class="Standard">
In very general terms, we have divided our data storage into two
parts: the working Research Data Storage service where people get
things done, collect data and work with it and the archival Research
Data Repository part where stable, citable published data sets are
looked after (by the library) for the long term. 
</p><p class="Standard">
This talk is not going to be all about architecture diagrams but
here’s one more, from a <a href="http://eresearch.uws.edu.au/blog/2013/05/09/research-data-repository-rdr-progress-report-may-2013/" onclick="pageTracker._trackPageview('/outgoing/eresearch.uws.edu.au/blog/2013/05/09/research-data-repository-rdr-progress-report-may-2013/?referer=');">recent
project update</a> showing two examples of applications that will
assist researchers in  working with data. One very important
application is <a href="http://eresearch.uws.edu.au/blog/projects/data-capture-for-climate-change-and-energy-research/" onclick="pageTracker._trackPageview('/outgoing/eresearch.uws.edu.au/blog/projects/data-capture-for-climate-change-and-energy-research/?referer=');">HIEv,
the central data capture/management platform for the Hawkesbury
Institute for the Environment</a>. This is where research teams
capture sensor data, research support staff work to clean and package
the data, researchers develop models and produce derived data and
visualisations. We’re still working out exactly how this will work
as publications using the data start to flow, but right now data
moves from the working space to the archival space, and thence to the
national data discovery service, see this example of <a href="http://researchdata.ands.org.au/automatic-weather-station-data-collected-at-the-university-of-western-sydneys-hawkesbury-institute-for-the-environment-field-facility-based-in-richmond-nsw-during-the-period-june-2011-to-march-2013" onclick="pageTracker._trackPageview('/outgoing/researchdata.ands.org.au/automatic-weather-station-data-collected-at-the-university-of-western-sydneys-hawkesbury-institute-for-the-environment-field-facility-based-in-richmond-nsw-during-the-period-june-2011-to-march-2013?referer=');">weather
data</a> – (unfortunately the data set is not yet openly available
for this one, I think it should be, and I’ll be doing what I can to
make it so).</p><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			Data wrangling services</h2><p class="Standard">
			<img src="http://ptsefton.com/wp-content/uploads/2013/05/wpid-Research-Data.html_Research-Data_html_6478c395.png" name="Picture 5" align="BOTTOM" width="553" height="497" border="0"></p></section><p class="Standard">
The other service shown on this diagram is Dropbox.com. We’d be
hard pressed to stop researchers from using this service – it comes
up in just about every consultation meeting. Researchers themselves
must take responsibility for making sure that services like this are
appropriate given their data management obligations under funder
agreements and codes of practice. For those projects where
Dropbox.com <i>is</i> appropriate we plan to let researchers invite
the Research Data Store to share their stuff, thus creating a
managed, backed-up copy at the university, and opening the way for us
to provide useful services over the data (coming soon).</p><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			Data <a href="http://www.uws.edu.au/research/researchers/data_management" onclick="pageTracker._trackPageview('/outgoing/www.uws.edu.au/research/researchers/data_management?referer=');">management</a></h2><p class="Standard">
			<img src="http://ptsefton.com/wp-content/uploads/2013/05/wpid-Research-Data.html_Research-Data_html_1ebe2c00.png" name="Picture 1" align="BOTTOM" width="553" height="299" border="0"></p></section><p class="Standard">
Yes, we have a web page about research data management, with some
basic advice and links to more resources, but putting up web pages
does not effect the kind of culture change needed to establish
research data management, data re-use and data citation. As our
Research Office head, Gar Jones, says this will be a change similar
to the introduction of Human and Animal ethics management which will
take several years to roll out. 
</p><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			Some key points for this presentation</h2><p class="Standard">
			I want to talk about:</p><ul><li><p lang="en-US" class="List Paragraph">
				Governance, open access, metadata, identifiers</p></li><li><p lang="en-US" class="List Paragraph">
				The importance of the (administrative) research lifecycle</p></li><li><p lang="en-US" class="List Paragraph">
				Policy supported by services rather than aspirations</p></li></ul></section><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			eResearch = goat tracks</h2><h2 class="Slide">
			<img src="http://ptsefton.com/wp-content/uploads/2013/05/wpid-Research-Data.html_Research-Data_html_30963b55.jpg" name="Picture 2" align="BOTTOM" width="617" height="617" border="0"></h2></section><p class="Standard">
This is a concrete path on the Werrington South (Penrith) campus of
the University of Western Sydney. The path is there because people
kept walking through the garden bed, which was in between where the
shuttle bus stops and where they wanted to be, at the library.  As <a href="http://ptsefton.com/2012/09/12/virtual-infrastructure-and-research-support-fostering-collaboration-across-institutions.htm">I
said at a similar conference for IT-types last year</a>:</p><blockquote><p class="Standard">
Groups
like mine work in the gap between the concrete and the goat track, my
job is to encourage the goats.</p></blockquote><p class="Standard">
And once we’ve encouraged the goats to make new paths, we need to
get the university infrastructure people to come and pave the paths.</p><p class="Standard">
 
</p><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			What’s over the horizon?</h2><p class="Standard">
			<img src="http://ptsefton.com/wp-content/uploads/2013/05/wpid-Research-Data.html_Research-Data_html_66d08499.jpg" name="graphics1" align="BOTTOM" width="624" height="468" border="0"></p></section><p class="Standard">
What do research administrators and IT directors need to be thinking
about? 
</p><ul><li><p class="List Paragraph">
	Changes in the research landscape – more emphasis on data reuse
	and citation, increasing emphasis on defensible research mean data
	will become as important as citations</p></li><li><p class="List Paragraph">
	Providing access to publications and data so it can be reused.</p></li><li><p class="List Paragraph">
	(e)Research infrastructure in general, where collaboration must not
	be constrained by the boundaries of individual institutional
	networks and firewalls.</p></li></ul><p class="Standard">
Any others?</p><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			<a href="http://researchdata.ands.org.au/" onclick="pageTracker._trackPageview('/outgoing/researchdata.ands.org.au/?referer=');">Research data</a>, Next
			Big Thing?</h2><p class="Standard">
			<img src="http://ptsefton.com/wp-content/uploads/2013/05/wpid-Research-Data.html_Research-Data_html_m4335d93f.png" name="graphics2" align="BOTTOM" width="553" height="308" border="0"></p></section><p class="Standard">
The Australian National Data Service runs a data-discovery service
designed to advertise data for reuse. 
</p><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			Governments are <a href="http://data.gov.au/" onclick="pageTracker._trackPageview('/outgoing/data.gov.au/?referer=');">joining in</a></h2><p class="Standard">
			<img src="http://ptsefton.com/wp-content/uploads/2013/05/wpid-Research-Data.html_Research-Data_html_5861e9e3.png" name="graphics3" align="BOTTOM" width="553" height="355" border="0"></p></section><p class="Standard">
As research organisations, we want to have infrastructure for data
management, and a culture of data management that involves forward
planning, and data re-use. So the next section of the talk is about
how we need to:</p><ul><li><p class="List Paragraph">
	Stop the fat multinational-publisher tail from wagging the starving
	research dog. Ensure research funded by us is accessible and usable
	by us.</p></li><li><p class="List Paragraph">
	Understand our researchers and their habits, so we can help them
	take on this new data management responsibility (actually it’s not
	a new responsibility, but many have simply been paying no attention
	to it, in the absence of any obvious reason to do so).</p></li><li><p class="List Paragraph">
	Sort out the metadata mess most universities are swimming in.</p></li></ul><blockquote><p class="Standard">
Now for the big picture stuff.</p><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			<strike>Open</strike> Free scholarship is coming? (Just beyond
			that ridge)</h2><p class="Standard">
			<img src="http://ptsefton.com/wp-content/uploads/2013/05/wpid-Research-Data.html_Research-Data_html_287aa0ee.jpg" name="Picture 4" align="BOTTOM" width="624" height="468" border="0"></p></section><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			OA is a Good Thing, 
			</h2><p class="Standard">
			Which will:</p><ul><li><p class="List Paragraph">
				Reduce extortionate journal pricing.</p></li><li><p class="List Paragraph">
				Provide equitable access to research outputs to the whole world.</p></li><li><p class="List Paragraph">
				Open Access to publications and Coming Soon: Open Access to data.</p></li><li><p class="List Paragraph">
				Promote Open Science and Open Research.</p></li><li><p class="List Paragraph">
				Drive huge demand for data management, cataloguing, archiving,
				publishing services</p></li></ul></section><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			http://aoasg.org.au/</h2><p class="Standard">
			<img src="http://ptsefton.com/wp-content/uploads/2013/05/wpid-Research-Data.html_Research-Data_html_m17e13dab.png" name="graphics4" align="BOTTOM" width="553" height="339" border="0"></p></section></blockquote><p class="Standard">
There are competing models for open access. Bizarrely the discussion
is often framed as a contest between  ‘Green’ and ‘Gold’.
It’s a lot like the State of Origin Rugby League, a contrived but
popular-in-obscure-corners of the world contest where the ‘Blues’
and ‘Maroons’ run repeatedly into each other. In both State of
Origin and Open Access, the current winners are large media
companies. Add least being an Open Access advocate doesn’t give you
head injuries.</p><p class="Standard">
<i>Green</i> OA refers to author-deposited pre-publication versions
of research articles. <i>Gold</i> means that the published version
itself is ‘Open’ for some ill-defined definition of open, often
at a cost of thousands of dollars, out of the researcher’s budget.
Green or Gold, a lot of so-called Open Access publishing operates
with no formal legal underpinnings, that is, without copyright-based
licenses. For example when I deposited a Green version of a paper I
had written here, and wrote to the publisher <a href="http://ptsefton.com/2009/08/19/towards-scholarly-html.htm">asking
them to clarify copyright and licensing issues I got no reply</a>.</p><p class="Standard">
<br />We have a brief window now to try to build services for research
data management that do have a solid legal basis and avoid following
some of the OA movements missteps but this is not trivial (1). 
</p><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			Identity management is crucial</h2><p class="Standard">
			<img src="http://ptsefton.com/wp-content/uploads/2013/05/wpid-Research-Data.html_Research-Data_html_20cb366a.png" name="Picture 10" align="BOTTOM" width="648" height="697" border="0"></p></section><p class="Standard">
I have used a variant of the above dog picture before to talk <a href="http://www.slideshare.net/ptsefton1/a-hint-ofmint/8" onclick="pageTracker._trackPageview('/outgoing/www.slideshare.net/ptsefton1/a-hint-ofmint/8?referer=');">about
identity management</a>. This dog has a name but it’s a terrible
way to find out about him as he has a <a href="http://en.wikipedia.org/wiki/Bootsy_Collins" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Bootsy_Collins?referer=');">much
more famous namesake</a>. 
</p><p class="Standard">
Like the rest of us, this dog has all sorts of identifying names and
numbers – a microchip number linked to a database, an ID assigned
by the RSPCA, patient numbers at veterinary practices, which may be
linked to more than one human, phone numbers on his tag etc. Point
is, it’s much worse for researchers than for dogs – identities
are maintained all over the place. Foley and <span lang="en-US">Kochalko
put it like this:</span></p><blockquote><p lang="en-US" class="Standard">
While much has changed since the days of David Livingstone, we
continue to struggle with associating individuals with their works
accurately and unambiguously. Author name ambiguity plagues science
and scholarship: when researchers are not properly identified and
credited for their work, dead-ends and information gaps emerge. The
impact ripples throughout the ecosystem, compromising collaboration
networks, impact metrics, “smarter” research allocations, and the
overall discovery process. Name ambiguity also weighs on the system
by creating significant hidden costs for all stakeholders. (2)</p></blockquote><p class="Standard">
<br />To do metadata management well we need to make sure that we sort
out all sorts of naming and identifying issues, dealing correctly
with potential causes of confusion, multiple people with the same
name, people with multiple names over time, and simultaneously, name
variants. Even where there are agreed subject codes like the Field of
Research codes that are heavily used in research measurement
exercises they can get mixed us as different databases use different
variants.</p><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			We try to work out how to fit new processes into existing
			workflows</h2><p class="Standard">
			<img src="http://ptsefton.com/wp-content/uploads/2013/05/wpid-Research-Data.html_Research-Data_html_7e60e83b.png" name="Picture 9" align="BOTTOM" width="789" height="462" border="0"></p></section><p class="Standard">
At Rochester university, when they installed an institutional
repository the team conducted ethnographic research on their research
community (3). We have not gone that far, but our Research Data
Repository project does try to pay attention to what researchers <i>do</i>
as part of their current work, and to fit new processes into existing
ones.</p><p class="Standard">
<br />For example, the above scenario tries to capture the interactions
that would happen when a researcher is required by a journal to
deposit data before publication. We spend a lot of time talking to
the Office of Research Services (ORS) and research librarian team
about how we can fit in with their existing processes, and how to
minimise negative impacts on research groups. Research Offices are
used to responding to changing regulatory environments so adding new
fields to forms etc is straightforward. Changing IT services is much
harder; the ITS is much bigger than ORS, new services need to be
acquired, provisioned and documented, and the service desk team has
to be taught new processes.   
</p><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			Challenge: how to stop the corporate publishing tail from wagging
			the scholarly dog</h2><p class="Standard">
			<img src="http://ptsefton.com/wp-content/uploads/2013/05/wpid-Research-Data.html_Research-Data_html_m3505a486.jpg" name="Picture 6" align="BOTTOM" width="636" height="477" border="0"></p></section><p class="Standard">
This is a rather a substantial issue to try to talk about in a
discussion about research data management and repositories, but it’s
essential to keep an eye on the big picture. We know that scholarship
has to change, publishing has to change, but we don’t know how. We
need to develop strategies for how we want it to change. Some
examples of where this is important:</p><ul><li><p class="List Paragraph">
	Policy on ‘ownership’ of intellectual property rights over data
	needs to be established. This is not as simple as it is for
	publications, as data are not always subject to copyright (1).</p></li><li><p class="List Paragraph">
	Data citation is going to be an important metric.</p></li></ul><p class="Standard">
New models are needed. People like Alex Holcombe from Sydney uni are
developing them:</p><blockquote><p class="Normal (Web)">
Science
is broken; let’s fix it.&nbsp;This
has been my mantra for some years now, and today we are launching an
initiative aimed squarely at one of science’s biggest problems. The
problem is called publication bias or the file-drawer problem and
it’s resulted in what some&nbsp;<a href="http://pps.sagepub.com/content/7/6/528.short" onclick="pageTracker._trackPageview('/outgoing/pps.sagepub.com/content/7/6/528.short?referer=');">have
called a replicability crisis</a>.</p><p class="Normal (Web)">
When
researchers do a study and get negative or inconclusive results,
those results usually end up in file drawers rather than published.
When this is true for studies attempting to replicate
already-published findings, we end up with a replicability crisis
where people don’t know which published findings can be trusted.</p><p class="Normal (Web)">
To
address the problem,&nbsp;<a href="http://www.dansimons.com/" onclick="pageTracker._trackPageview('/outgoing/www.dansimons.com/?referer=');">Dan
Simons</a>&nbsp;and
I are introducing a new article format at the journal&nbsp;<i>Perspectives
on Psychological Science (PoPS)</i>.
The new article format is called&nbsp;<a href="http://www.psychologicalscience.org/index.php/replication" onclick="pageTracker._trackPageview('/outgoing/www.psychologicalscience.org/index.php/replication?referer=');">Registered
Replication Reports</a>&nbsp;(RRR).
&nbsp;The process will begin with a psychological scientist
interested in replicating an already-published finding. They will
explain to we editors why they think replicating the study would be
worthwhile (perhaps it has been widely influential but had few or no
published replications). If we agree with them, they will be invited
to submit a methods section and analysis plan and submit it to we
editors.&nbsp;The submission will be sent to reviewers, preferably
the authors of the original article that was proposed to be
replicated. These reviewers will be asked to help the replicating
authors ensure their method is nearly identical to the original
study. &nbsp;The submission will at that point be accepted or
rejected, and the authors will be told to report back when the data
comes in. &nbsp;The methods will also be made public and other
laboratories will be invited to join the replication attempt. &nbsp;All
the results will be posted in the end, with a meta-analytic estimate
of the effect size combining all the data sets (including the
original study’s data if it is available). The Open Science
Framework&nbsp;<a href="http://openscienceframework.org/" onclick="pageTracker._trackPageview('/outgoing/openscienceframework.org/?referer=');">website</a>&nbsp;will
be used to post some of this. The press release is&nbsp;<a href="http://www.psychologicalscience.org/index.php/news/releases/initiative-on-research-replication.html" onclick="pageTracker._trackPageview('/outgoing/www.psychologicalscience.org/index.php/news/releases/initiative-on-research-replication.html?referer=');">here</a>,
and the details can be found&nbsp;<a href="http://www.psychologicalscience.org/index.php/replication" onclick="pageTracker._trackPageview('/outgoing/www.psychologicalscience.org/index.php/replication?referer=');">at
the PoPS website</a>.</p><p class="Standard">
<a href="http://alexholcombe.wordpress.com/2013/03/03/registered-replication-reports-are-open-for-submissions/" onclick="pageTracker._trackPageview('/outgoing/alexholcombe.wordpress.com/2013/03/03/registered-replication-reports-are-open-for-submissions/?referer=');">http://alexholcombe.wordpress.com/2013/03/03/registered-replication-reports-are-open-for-submissions/</a>
</p></blockquote><p class="Standard"><a name="_GoBack"></a>
This seems like a positive note on which to end. Hundreds of
researchers are trying to fix scholarship, they’re the ones we need
to talk to about what a data repository or a data management plan
should be.</p><section typeof="http://purl.org/ontology/bibo/Slide"><h2 class="Slide">
			Science is broken let’s fix it</h2><p class="Standard">
			<img src="http://ptsefton.com/wp-content/uploads/2013/05/wpid-Research-Data.html_Research-Data_html_25f6648f.png" name="Picture 7" align="BOTTOM" width="552" height="311" border="0"></p></section><blockquote><p lang="en-US" class="Bibliography">
1. 	Stodden V. The Legal Framework for
Reproducible Scientific Research: Licensing and Copyright. Computing
in Science Engineering. 2009;11(1):35–40. 
</p><p lang="en-US" class="Bibliography">
2. 	Foley MJ, Kochalko DL. Open
Researcher and Contributor Identification (ORCID). 2012 [cited 2013
May 21]; Available from:
http://docs.lib.purdue.edu/cgi/viewcontent.cgi?article=1133&amp;context=charleston</p><p lang="en-US" class="Bibliography">
3. 	Lindahl D, Bell S, Gibbons S, Foster
NF. Institutional Repositories, Policies, and Disruption. 2007 [cited
2013 May 21]; Available from:

http://open.bu.edu/xmlui/handle/2144/919</p></blockquote></section>

<p><a rel="license" href="http://creativecommons.org/licenses/by/3.0/deed.en_US" onclick="pageTracker._trackPageview('/outgoing/creativecommons.org/licenses/by/3.0/deed.en_US?referer=');"><img alt="Creative Commons License" style="border-width:0" src="http://i.creativecommons.org/l/by/3.0/88x31.png" /></a><br /><span xmlns:dct="http://purl.org/dc/terms/" property="dct:title">Research Data @ the University of Western Sydney (Introducing a data deposit management plan to the research community at UWS)</span> by <span xmlns:cc="http://creativecommons.org/ns#" property="cc:attributionName">Peter Sefton</span> is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by/3.0/deed.en_US" onclick="pageTracker._trackPageview('/outgoing/creativecommons.org/licenses/by/3.0/deed.en_US?referer=');">Creative Commons Attribution 3.0 Unported License</a>.</p></article>
	</div>]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2013/05/22/research-data-the-university-of-western-sydney-introducing-a-data-deposit-management-plan-to-the-research-community-at-uws.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Running an Open Source project from a university dev team</title>
		<link>http://ptsefton.com/2013/03/25/running-an-open-source-project-from-a-university-dev-team.htm</link>
		<comments>http://ptsefton.com/2013/03/25/running-an-open-source-project-from-a-university-dev-team.htm#comments</comments>
		<pubDate>Mon, 25 Mar 2013 03:59:11 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/?p=1682</guid>
		<description><![CDATA[Steven Hayes from Arts eResearch at the University of Sydney invited me to visit their group and talk about running open source software projects, as they are making their Heurist (semantic database-of-everything) software open source. This was more of a conversation than a presentation, but I prepared a few ‘slides’ to remind me of which [...]]]></description>
			<content:encoded><![CDATA[	<div>
	<span property="http://purl.org/dc/terms/title" content="Running an Open Source project from a university dev team"><!--  --></span>
			<span property='http://purl.org/dc/terms/creator' content='ptsefton'><!-- --></span>
	<article>  <section><p>Steven Hayes from Arts eResearch at the University of Sydney
invited me to visit their group and talk about running open source software
projects, as they are making their <a href="http://heuristscholar.org/heurist/help/tour.html" onclick="pageTracker._trackPageview('/outgoing/heuristscholar.org/heurist/help/tour.html?referer=');">Heurist</a> (semantic
database-of-everything) software open source. This was more of a conversation
than a presentation, but I prepared a few ‘slides’ to remind me of which points
to hit. Here are my notes. The focus here was not on why go open source, or
open source in general, it was about doing it in a
small university-based team. Comments about how various uni
open source projects run would be appreciated.</p><p>I have been involved in creating two sizeable code-bases
both released by the University of Southern Queensland as open source. They had
very different histories. I’ll talk about both and how they run, although
actually one of them doesn’t run any more in any meaningful way. </p><section typeof="http://purl.org/ontology/bibo/Slide"><h2>Two projects I started…</h2><p>… on which other people* did most
  of the work</p><ul><li><p><a href="https://code.google.com/p/integrated-content-environment/" onclick="pageTracker._trackPageview('/outgoing/code.google.com/p/integrated-content-environment/?referer=');">ICE</a>
  – the Integrated Content Environment. Used at USQ for creating course
  materials for delivery online and in print. Almost no activity on this outside
  of USQ these days. Inside USQ? I don’t know for certain, but I think it is
  still in use, and finding a replacement has proven difficult, which doesn’t
  surprise me as that was the reason we built it in the first place).</p></li><li><p><a href="http://www.redboxresearchdata.com.au/" onclick="pageTracker._trackPageview('/outgoing/www.redboxresearchdata.com.au/?referer=');">ReDBOX</a> – the Research Data Box (and <a href="https://sites.google.com/site/fascinatorhome/" onclick="pageTracker._trackPageview('/outgoing/sites.google.com/site/fascinatorhome/?referer=');">The Fascinator</a>, the
  underlying toolkit).</p></li></ul></section><p>*Thanks to Ron Ward, Oliver Lucido,
Linda Octalina, Duncan Dickinson, Greg Pendlebury, Daniel de Byl, Bron Chandler, Tim McCallum, Cynthia Wong, Jason Zejfert, Sally MacFarlane, Caroline Drury, Pamela Glossop,
Warwick Milne, Sue Craig, Vicki Picasso, Dave Huthnance,
Shirley Reushle and the late Alan Smith who made,
tested, championed and supported these projects. Thanks also to funding from
the Australian government via ANDS, ARROW and other streams. Sorry if I forgot
anyone.</p><p>(At this point I wanted to check that everyone knows what
Open Source means, making sure that we all understand how Richard Stallman made
software free using copyright law. Whoever holds the copyright in a bit of
software, which is likely to be whoever wrote it, or their employer can control
distribution by using a licence, a legal instrument. Stallman’s insight was
that a licence could be used to enforce sharing, openness and freedom: you can
use this stuff I created provided you promise to share it with other people
(that’s not a quote). Oh, and people working in this space should also
understand the difference between <a href="http://producingoss.com/en/producingoss.html#free-vs-open-source" onclick="pageTracker._trackPageview('/outgoing/producingoss.com/en/producingoss.html_free-vs-open-source?referer=');">Free
and Open Source</a> <span property="cites" typeof="http://schema.org/ScholarlyArticle"><link itemprop="url" href="data:application/json,%A0%20%7B%22citationID%22%3A%22TmTEoEWN%22%2C%22properties%22%3A%7B%22formattedCitation%22%3A%22%5B1%5D%22%2C%22plainCitation%22%3A%22%5B1%5D%22%7D%2C%22citationItems%22%3A%5B%7B%22id%22%3A1804%2C%22uris%22%3A%5B%22http%3A//zotero.org/users/568/items/P3UUVZGT%22%5D%2C%22uri%22%3A%5B%22http%3A//zotero.org/users/568/items/P3UUVZGT%22%5D%2C%22itemData%22%3A%7B%22id%22%3A1804%2C%22type%22%3A%22book%22%2C%22title%22%3A%22Producing%20open%20source%20software%3A%20How%20to%20run%20a%20successful%20free%20software%20project%22%2C%22publisher%22%3A%22O%27Reilly%20Media%2C%20Inc.%22%2C%22URL%22%3A%22http%3A//dl.acm.org/citation.cfm%3Fid%3D1121560%22%2C%22shortTitle%22%3A%22Producing%20open%20source%20software%22%2C%22author%22%3A%5B%7B%22family%22%3A%22Fogel%22%2C%22given%22%3A%22Karl%22%7D%5D%2C%22issued%22%3A%7B%22year%22%3A2005%7D%2C%22accessed%22%3A%7B%22year%22%3A2013%2C%22month%22%3A3%2C%22day%22%3A17%7D%7D%7D%5D%2C%22schema%22%3A%22https%3A//github.com/citation-style-language/schema/raw/master/csl-citation.json%22%7D%20"><span itemprop="label">[1]</span><span property="cites" typeof="http://schema.org/ScholarlyArticle"><meta property="id" content="1804"><meta property="type" content="book"><meta property="title" content="Producing open source software: How to run a successful free software project"><meta property="publisher" content="O" reilly="" media,="" inc.'=""><meta property="URL" content="http://dl.acm.org/citation.cfm?id=1121560"><meta property="shortTitle" content="Producing open source software"><span property="author"><meta property="family" content="Fogel"><meta property="given" content="Karl"></span><span property="issued"><meta property="year" content="2005"></span><span property="accessed"><meta property="year" content="2013"><meta property="month" content="3"><meta property="day" content="17"></span><link itemprop="uri" href="http://zotero.org/users/568/items/P3UUVZGT"></span></span>. </p><p>But I forgot.)</p><section typeof="http://purl.org/ontology/bibo/Slide"><h2>RTFM</h2><p>Above, I linked to a <a href="http://producingoss.com/en/producingoss.html" onclick="pageTracker._trackPageview('/outgoing/producingoss.com/en/producingoss.html?referer=');">free book on producing Open Source software</a> <span property="cites" typeof="http://schema.org/ScholarlyArticle"><link itemprop="url" href="data:application/json,%A0%20%20%20%7B%22citationID%22%3A%221ajbns4ast%22%2C%22properties%22%3A%7B%22formattedCitation%22%3A%22%5B1%5D%22%2C%22plainCitation%22%3A%22%5B1%5D%22%7D%2C%22citationItems%22%3A%5B%7B%22id%22%3A1804%2C%22uris%22%3A%5B%22http%3A//zotero.org/users/568/items/P3UUVZGT%22%5D%2C%22uri%22%3A%5B%22http%3A//zotero.org/users/568/items/P3UUVZGT%22%5D%2C%22itemData%22%3A%7B%22id%22%3A1804%2C%22type%22%3A%22book%22%2C%22title%22%3A%22Producing%20%20%20open%20source%20software%3A%20How%20to%20run%20a%20successful%20free%20software%20%20%20project%22%2C%22publisher%22%3A%22O%27Reilly%20Media%2C%20%20%20Inc.%22%2C%22URL%22%3A%22http%3A//dl.acm.org/citation.cfm%3Fid%3D1121560%22%2C%22shortTitle%22%3A%22Producing%20%20%20open%20source%20%20%20software%22%2C%22author%22%3A%5B%7B%22family%22%3A%22Fogel%22%2C%22given%22%3A%22Karl%22%7D%5D%2C%22issued%22%3A%7B%22year%22%3A2005%7D%2C%22accessed%22%3A%7B%22year%22%3A2013%2C%22month%22%3A3%2C%22day%22%3A17%7D%7D%7D%5D%2C%22schema%22%3A%22https%3A//github.com/citation-style-language/schema/raw/master/csl-citation.json%22%7D%20%20%20"><span itemprop="label">[1]</span><span property="cites" typeof="http://schema.org/ScholarlyArticle"><meta property="id" content="1804"><meta property="type" content="book"><meta property="title" content="Producing   open source software: How to run a successful free software   project"><meta property="publisher" content="O" reilly="" media,="" inc.'=""><meta property="URL" content="http://dl.acm.org/citation.cfm?id=1121560"><meta property="shortTitle" content="Producing   open source   software"><span property="author"><meta property="family" content="Fogel"><meta property="given" content="Karl"></span><span property="issued"><meta property="year" content="2005"></span><span property="accessed"><meta property="year" content="2013"><meta property="month" content="3"><meta property="day" content="17"></span><link itemprop="uri" href="http://zotero.org/users/568/items/P3UUVZGT"></span></span> by <i>Karl&nbsp;Fogel<b> </b></i>which<b><i>
  </i></b>seems to cover most of what you’d need to know. I haven’t read
  it all, looks useful.</p><p><img border="0" width="417" height="546" src="http://ptsefton.com/wp-content/uploads/2013/03/wpid-OSS.docx.html_OSS.docx_filesimage002.png" v:shapes="_x0000_i1026"></p></section><p></p><p></p><section typeof="http://purl.org/ontology/bibo/Slide"><h2>But I don’t like this</h2><p>The book begins:</p><blockquote><p>Most
  free software projects <b>fail</b>.</p></blockquote><p>I think that’s silly,
  talking about failure without first defining success.</p></section><p>Me, I’m not sure that all
these scenarios Fogel lists are failures at all,
there are lots of reasons to release code and they are not all necessarily
about building a substantial community:</p><blockquote><p>We tend
not to hear very much about the failures. Only successful projects attract
attention, and there are so many free software projects in total<sup>[</sup><a name="idp5223728"></a><a href="http://producingoss.com/en/producingoss.html#ftn.idp5223728" onclick="pageTracker._trackPageview('/outgoing/producingoss.com/en/producingoss.html_ftn.idp5223728?referer=');"><sup>2</sup></a><sup>]</sup><span class="apple-converted-space">&nbsp;</span>that even though only a small
percentage succeed, the result is still a lot of visible projects. We also
don&#8217;t hear about the failures because failure is not an event. There is no single
moment when a project ceases to be viable; people just sort of drift away and
stop working on it. There may be a moment when a final change is made to the
project, but those who made it usually didn&#8217;t know at the time that it was the
last one. There is not even a clear definition of when a project is expired. Is
it when it hasn&#8217;t been actively worked on for six months? When its user base
stops growing, without having exceeded the developer base? What if the
developers of one project abandon it because they realized they were
duplicating the work of another—and what if they join that other project,
then expand it to include much of their earlier
effort? Did the first project end, or just change homes?</p>

</blockquote>
<section typeof="http://purl.org/ontology/bibo/Slide"><h2>What’s the first thing that comes to mind when you think of
  Open Source?</h2><p>Linux? Apache? WordPress?&nbsp; Firefox?</p><p>The <b>hits</b>. The
  stadium-filling rock-star projects?</p></section><section typeof="http://purl.org/ontology/bibo/Slide"><h2>Your band has 99.9% probability of staying in the garage</h2><h2><img border="0" width="417" height="556" src="http://ptsefton.com/wp-content/uploads/2013/03/wpid-OSS.docx.html_OSS.docx_filesimage004.png" v:shapes="Picture_x0020_1"></h2><p>Figure 1 Me (the
  good looking one) and cousin Tim at the Springwood Sports club, about to
  perform with a community uke-group. No plans
  for world-domination, playing for family, who are obliged to attend and even
  some people who , for some reason, <i>choose</i> to come. #Notfailure.</p></section><p></p><p></p><p>It’s important to
work out why you are going to release software as Open Source – think
about the audience. One very important audience is you, yourself. If you work
on code as part of your job, then your employment contract may well mean that
your employer owns the copyright. Do you want to be able to continue using it
in your next job? Show potential employers? Making it open source helps your
future self.</p><p></p><p>I know this first
hand.</p><p></p><p>Universities are not as stable as they seem, or you may hope.
At the Australian Digital Futures Institute at USQ we began by hosting code
repositories and websites internally. I reasoned that the university would be a
good bet for maintaining persistence of these resources. </p><p>But then one <a href="http://www.swinburne.edu.au/hed/pvc_learn_transform/gillysalmon.html" onclick="pageTracker._trackPageview('/outgoing/www.swinburne.edu.au/hed/pvc_learn_transform/gillysalmon.html?referer=');">Gilly Salmon</a> came to our institute to be the new
professor, decided, along with the rest of the senior leadership team that
there was altogether too much <i>making</i> the
digital future going on in the Australian Digital Futures Institute, too much <i>technology</i>. They let just about all the
technical staff go, no matter how useful they were to the organisation, or how
pregnant they happened to be (we’re a relationship brand, the director of
marketing told me, so we shouldn’t be continuing to develop software to deliver
award-winning distance-ed services). </p><p>Web sites that would still have value are just gone from
public view, including, ironically the PILIN
project site, which was about persistent identifiers. Even the ICE website
which is full of useful stuff for USQ itself now appears to be <a href="http://web.archive.org/web/20120829154125/http://ice.usq.edu.au/" onclick="pageTracker._trackPageview('/outgoing/web.archive.org/web/20120829154125/http_//ice.usq.edu.au/?referer=');">only
accessible via the Wayback machine</a>. They’re still <i>using</i> it but they turned off the website anyway, the code, however,
is sitting on Google code so we all still have access to it.</p><p>This sort of thing happens all the time. For a couple of us,
the <a href="http://www.virtualcampuses.eu/index.php/NextEd" onclick="pageTracker._trackPageview('/outgoing/www.virtualcampuses.eu/index.php/NextEd?referer=');">NextEd</a> refugees, this was the second redundancy
associated with USQ. Kids, it is prudent to make sure that any code you might
want to re-use later in your career is released under an open licence, and
documentation, web sites etc likewise under creative
commons. Think of it as a professional escape pod. </p><p>The ReDBOX project survived this
ADFI shut down, because it had been open source from the beginning but further
funding had to be redirected to another university which
was willing to host the building of a digital future.</p><p></p><section typeof="http://purl.org/ontology/bibo/Slide"><h2>Lessons</h2><ul><li><p>Open Source can be worth doing even if the
  audience is your future self</p></li><li><p>Don’t trust someone else to keep your website
  up</p></li><li><p>If you want a community you’ll (likely) have
  to build it</p></li><li><p>Every project is different, so you need to structure
  yours around your users</p></li></ul></section><p>Oh, and the answer to most questions is on Stack Exchange. I
decided that this list was worth using as a starting point for discussion.</p><p><a href="http://programmers.stackexchange.com/questions/51553/checklist-for-starting-an-open-source-project" onclick="pageTracker._trackPageview('/outgoing/programmers.stackexchange.com/questions/51553/checklist-for-starting-an-open-source-project?referer=');">http://programmers.stackexchange.com/questions/51553/checklist-for-starting-an-open-source-project</a></p><section typeof="http://purl.org/ontology/bibo/Slide"><h2><a href="http://programmers.stackexchange.com/questions/51553/checklist-for-starting-an-open-source-project" onclick="pageTracker._trackPageview('/outgoing/programmers.stackexchange.com/questions/51553/checklist-for-starting-an-open-source-project?referer=');">Havoc
  P said</a>: [with additions by me post the discussion at USYD]</h2><blockquote><p>Things I&#8217;d put in the early priorities are:</p><ul><li><p>have a simple &#8220;what is
  it?&#8221; web site with links to some discussion forum (whether email or
  chat) and to the source code repository</p><p><b>[Mailing lists are usually best IMO – forums can be
  empty, echoing and make you project look unloved. A tech list is a must,
  always, but other communications should be built around the reality of your
  project. No user community yet? Build one. Others over at Stack Exchange
  added that once you have a tech-list is best to hold or log all your
  discussions there so architectural decisions are transparent and the
  community can engage.</b></p><p><b>On the ReDBOX project there are two
  main mailing lists, one for the techies and one for the users (mostly library
  staff), and lots of virtual and face-to-face get togethers. There is a
  committers group who are in charge of what gets into the trunk and various
  ad-hoc arrangements to sponsor sub-projects at the dozen or so sites using
  the software. The groups and how they interact were all created to serve that
  community, not from some manual of best practice, although it is all informed
  by collective experience of open source projects.]</b></p></li><li><p>be sure the code compiles and usually works,
  don&#8217;t commit work-in-progress or half-ass patches on the main branch that
  break things, because then other people&#8217;s work would be disrupted</p><p><b>[Well, OK, but if you’re releasing an existing code base then
  don’t get too hung up on making things perfect (a) it will be a huge waste if
  there is no demand for your code and (b) don’t be unnecessarily shy, most
  open source projects are like busking, not stadium rock, nobody is watching
  you waiting to pounce on your errors.]</b></p></li><li><p>put a license file in the
  code repository with a well-known license, and mark the copyright owner
  (probably you, or your company). don&#8217;t omit the
  license, make up a license, or use an obscure license.</p></li><li><p>have instructions for how
  to contribute, say in a HACKING file or include in your README. This should
  include where to send patches, how to format patches, code indentation rules,
  any other important conventions of the project</p></li><li><p>have instructions on how to
  report a bug</p></li><li><p>be helpful on the mailing list or whatever
  your forums are</p></li></ul></blockquote></section><blockquote><p></p><section typeof="http://purl.org/ontology/bibo/Slide"><h2>More from Havoc P</h2><blockquote><p>After those priorities I&#8217;d say:</p><ul><li><p>documentation (this saves you work
  on the mailing list&#8230; make a FAQ from your list posts is a simple start)</p></li><li><p>try to do things in a
  &#8220;normal&#8221; way (don&#8217;t invent your own build system or use some weird
  one, don&#8217;t use 1-space indentation, don&#8217;t be annoyingly quirky in general
  because it adds learning curve)</p></li><li><p>promote your project. marketing marketing marketing. You need some blogs and news sites and stuff
  like that to cover you, and then when people show up interested, you need to
  talk to them and be sure they get it working and look at their patches. Maybe
  mention your project in the forums for related projects.</p><p><b>[Yes, this is a huge one. One of the big differences between ReDBOX, which is no hit, but has a solid user base and ICE
  which never made it out of USQ is that Vicki Picasso from Newcastle Uni and I marketed the hell out of ReDBOX
  early to a very specific community of user-organisations. We needed a community
  so the software would have a sustainable base, so we designed the software
  for the community and sought input on the design as broadly as we could. </b></p><p><b>With ICE, I talked about it to lots of the wrong people and
  didn’t sell it to the right ones, other distance ed
  unis, but that was partly because it conferred a competitive advantage on USQ.
  This comes back to the point above about success vs
  failure – there’s more than one way to succeed.]</b></p></li><li><p>always review and accept
  patches as quickly as humanly possible. Immediately is perfect. More than a
  couple days and you are losing lots of people.</p></li><li><p>always reply to email about
  the project as quickly as humanly possible.</p></li><li><p>create a
  welcoming/positive/fun atmosphere. don&#8217;t be a jerk. say please and thank you and hand out praise. chase off any jackasses that turn up and start to poison
  the community. try to meet people in person when you
  can and form bonds.</p></li></ul></blockquote></section></blockquote><p></p><section typeof="http://purl.org/orb/References"><p>[1]  K. Fogel, <i>Producing open source software:
How to run a successful free software project</i>. O’Reilly Media, Inc., 2005.</p></section>

<p><a rel="license" href="http://creativecommons.org/licenses/by/3.0/deed.en_US" onclick="pageTracker._trackPageview('/outgoing/creativecommons.org/licenses/by/3.0/deed.en_US?referer=');"><img alt="Creative Commons License" style="border-width:0" src="http://i.creativecommons.org/l/by/3.0/88x31.png" /></a><br /><span xmlns:dct="http://purl.org/dc/terms/" property="dct:title">Running an Open Source project from a university dev team</span> by <span xmlns:cc="http://creativecommons.org/ns#" property="cc:attributionName">Peter (pt) Sefton</span> is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by/3.0/deed.en_US" onclick="pageTracker._trackPageview('/outgoing/creativecommons.org/licenses/by/3.0/deed.en_US?referer=');">Creative Commons Attribution 3.0 Unported License</a>.

</p></section></article>
	</div>]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2013/03/25/running-an-open-source-project-from-a-university-dev-team.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Repositories! (What are they good for?)</title>
		<link>http://ptsefton.com/2013/03/01/repositories-what-are-they-good-for.htm</link>
		<comments>http://ptsefton.com/2013/03/01/repositories-what-are-they-good-for.htm#comments</comments>
		<pubDate>Fri, 01 Mar 2013 00:39:08 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/?p=1672</guid>
		<description><![CDATA[Repositories! (What are they good for?) by Peter Sefton is licensed under a Creative Commons Attribution 3.0 Unported License. Repositories! (What are they good for?)Georgina Edwards has invited me to Intersect NSW to give a talk to the software engineering team about repositories in eResearch. There were also quite a few eResearch analysts in attendance, [...]]]></description>
			<content:encoded><![CDATA[	<div>
	<span property="http://purl.org/dc/terms/title" content="Repositories! (What are they good for?)"><!--  --></span>
			<span property='http://purl.org/dc/terms/creator' content='ptsefton'><!-- --></span>
	
<article> 
<p><a rel="license" href="http://creativecommons.org/licenses/by/3.0/deed.en_US" onclick="pageTracker._trackPageview('/outgoing/creativecommons.org/licenses/by/3.0/deed.en_US?referer=');"><img alt="Creative Commons License" style="border-width:0" src="http://i.creativecommons.org/l/by/3.0/88x31.png" /></a><br /><span xmlns:dct="http://purl.org/dc/terms/" property="dct:title">Repositories! (What are they good for?)</span> by <span xmlns:cc="http://creativecommons.org/ns#" property="cc:attributionName">Peter Sefton</span> is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by/3.0/deed.en_US" onclick="pageTracker._trackPageview('/outgoing/creativecommons.org/licenses/by/3.0/deed.en_US?referer=');">Creative Commons Attribution 3.0 Unported License</a>.</p>
 <section><h1>Repositories! (What are they good for?)</h1><p>Georgina Edwards has invited me to <a href="http://intersect.org.au" onclick="pageTracker._trackPageview('/outgoing/intersect.org.au?referer=');">Intersect NSW</a> to give a talk to the software
engineering team about repositories in eResearch. There were also quite a few
eResearch analysts in attendance, not to mention a couple of members of the
senior management team. (Just in case you’re wondering, the answer to the
question in the title is <i>not</i>
“absolutely nothing”).</p><p><br />
Here are my notes, with embedded slides, which I put together on the train to
and from the CBD (ie quick and dirty). </p><p><b>The summary:</b> repository
means a lot of different things, but the main sense I talked about with the Intersect
team was ‘data-store component’. I tried to cover why using a repository in an
eResearch project might be important because repositories can provide a lot of
ready-made functionality, particularly in the area of digital preservation, but
also access to indexing services and content-transcoding to generate new
formats from things ingested. I talked about one aid for thinking about
repository services which I think is useful – the Repository
Micro-services framework from the California Digital Libraries, and ran through
some of the repository frameworks that people in the eResearch.au world might encounter.</p><p>The liveliest discussion was around RDF, the <a href="What%25E2%2580%2599s%2520a%2520repository%2520to%2520me%3F%2520%2520The%2520first%2520time%2520many%2520of%2520us%2520heard%2520the%2520term%2520repository%2520in%2520Higher%2520Education%2520was%2520in%2520connection%2520with%2520the%2520Open%2520Access%2520movement,%2520when%2520a%2520few%2520forward%2520thinking%2520universities%2520in%2520Australia%2520QUT,%2520UQ,%2520USQ%2520and%2520even%2520some%2520others%2520outside%2520of%2520Queensland%2520began%2520to%2520set%2520up%2520Institutional%2520Repositories,%2520using%2520software%2520like%2520Eprints%2520or%2520Dspace.%2520These%2520were%2520essentially%2520online%2520databases%2520of%2520PDF%2520files,%2520with%2520bibliographic%2520metadata">Resource
Description Framework</a>, and what <i>it’s</i>
good for. I made the assertion that RDF was the best practice approach to storing
metadata, allowing for built-in extensibility. RDF uses URIs as names for both
things and relations, which reduces ambiguity and aids interoperability. But I
think it’s important to draw the line between RDF as a good way to do metadata,
and annotation and the assumption that an RDF query language (via an RDF
triple-store) is always going to be needed or even work. I’m sceptical about
the promise of RDF as some kind of super semantic world-wide
web of knowledge you can query for the answer to anything, but it’s clearly a
good way to do metadata – there’s no excuse for inventing a new metadata
schema that is not RDF based these days.&nbsp;
(Use the comments if you want to discuss).</p><section><h2>The talk</h2><p>I thought I’d start from something that the developers would
be familiar with. Source-code repositories.</p><section typeof="http://purl.org/ontology/bibo/Slide">
  <h2>To a bunch of software developers…</h2>
  <p>… a repository is a place to put
  code</p>
 </section><p>But it’s not just a place to <i>put</i> things. On a development project, the repository offers a
number of services, like integration with task management systems, versioning, search
and collaboration. I’m sure everyone in the professional eResearch world would
be horrified to find a development project that wasn’t using source-code
management via a code-repository: Git, Mercurial, or at the very least
something ancient like Subversion.</p></section><section><h2>What’s a repository to me?</h2><p>The first time many of us heard the term repository in
Higher Education was in connection with the Open Access movement, when a few
forward thinking universities in Australia <a href="http://eprints.qut.edu.au/" onclick="pageTracker._trackPageview('/outgoing/eprints.qut.edu.au/?referer=');">QUT</a>,
<a href="http://espace.library.uq.edu.au/" onclick="pageTracker._trackPageview('/outgoing/espace.library.uq.edu.au/?referer=');">UQ</a>, <a href="http://eprints.usq.edu.au/" onclick="pageTracker._trackPageview('/outgoing/eprints.usq.edu.au/?referer=');">USQ</a> and even some others outside of
Queensland began to set up Institutional Repositories, using software like Eprints or Dspace. These were
essentially online databases of PDF files for academic works, with
bibliographic metadata. They were also seen as sites for preservation of
materials, and had services to advertise their contents to the world, via the <a href="http://www.openarchives.org/pmh/" onclick="pageTracker._trackPageview('/outgoing/www.openarchives.org/pmh/?referer=');">OAI-PMH</a> metadata
harvesting standard, and via metadata embedded in the web pages that
described the academic works.</p><p>A group of us put together <a href="http://ptsefton.com/2012/07/24/think-local-act-global-institutional-data-repositories-being-built-in-australia-with-lessons-learned-from-institutional-publications-repositories.htm">a
presentation for Open Repositories last year on the growth of Institutional
Research Data Repositories, alongside the ‘traditional’ Institutional
Publications repository</a>.</p><section typeof="http://purl.org/ontology/bibo/Slide">
  <h2>There are a few senses of the word:</h2>
  <ul><li><p>Repository-as-database</p></li><li><p>Repository-as-application</p><p>Institutional Repository or
  Data Repository</p></li><li><p>Repository-as-lifestyle (ie
  analogous to a ‘library’)</p></li></ul>
  
  
  
  </section><p>People tend not to be very careful about these senses of the
word repository and indeed the boundaries are actually quite blurry. If you
have chosen to call your application a repository, then that term brings a certain
gravitas, you’d expect the repository-as-application to be something that’s not
just for Christmas, but something you’ve made a commitment to feeding and
walking at least for <i>some</i> time.</p><p>With that &nbsp;in mind, the point of this
discussion: is what might a repository-as-data store be good for in an
eResearch project?</p><section typeof="http://purl.org/ontology/bibo/Slide">
  <h2>Services in a typical repository-as-datastore
  underneath an application:</h2>
  <ul><li><p>If the app goes away the data is/are safe
  independently of the application services,</p><ul><li><p>with all digital
  objects stored in standard formats</p></li><li><p>with standardised
  metadata</p></li><li><p>so they can be
  preserved*.</p></li></ul></li><li><p>You get OAI-PMH (pull/out) and SWORD (push/in)
  built in</p></li><li><p>Built in security/access control </p><p>(but
  beware of actual real-world performance)</p></li><li><p>Content transcoding </p><p>(thumbnails
  / image viewers / video versions)</p></li></ul>
  
  
  
  
  
  
  
  
 </section><p>Nobody put up their hand and said “Hey that’s just a CMS”
(Content Management System), but the answer would have been, yes, of course. A
repository-as-application is just a serious CMS, one designed for maintaining
important stuff in a well-managed way. Indeed, the University of Queensland is
moving its Institutional Repository to a <a href="http://drupal.org/" onclick="pageTracker._trackPageview('/outgoing/drupal.org/?referer=');">Drupal</a>-based
system, and leaving behind the repository-as-data-store that used to sit
underneath it. </p><p>The <a href="http://journals.tdl.org/jodi/index.php/jodi/article/view/1605/1766" onclick="pageTracker._trackPageview('/outgoing/journals.tdl.org/jodi/index.php/jodi/article/view/1605/1766?referer=');">Repository
Micro-services framework</a> from the University of California captures all
these services really nicely.</p><section typeof="http://purl.org/ontology/bibo/Slide">
  <h2>Repository Micro-services</h2>
  <p><a href="http://journals.tdl.org/jodi/index.php/jodi/article/view/1605/1766" onclick="pageTracker._trackPageview('/outgoing/journals.tdl.org/jodi/index.php/jodi/article/view/1605/1766?referer=');">http://journals.tdl.org/jodi/index.php/jodi/article/view/1605/1766</a></p>
  <p><img border="0" width="477" height="298" src="http://ptsefton.com/wp-content/uploads/2013/03/wpid-repositories.docx.html_repositories.docx_filesimage002.png" v:shapes="Picture_x0020_1"></p>
  <p>This is implemented in <a href="http://merritt.cdlib.org/" onclick="pageTracker._trackPageview('/outgoing/merritt.cdlib.org/?referer=');">http://merritt.cdlib.org/</a>,
  which does not seem to have an obvious application to download.</p>
 </section><section typeof="http://purl.org/ontology/bibo/Slide">
  <h2>Repository micro-services</h2>
  <h2><img border="0" width="802" height="362" src="http://ptsefton.com/wp-content/uploads/2013/03/wpid-repositories.docx.html_repositories.docx_filesimage004.png" v:shapes="Picture_x0020_2"></h2>
  </section><section typeof="http://purl.org/ontology/bibo/Slide">
  <h2>Some repository software you may hear about</h2>
  <ul><li><p><a href="http://www.eprints.org/" onclick="pageTracker._trackPageview('/outgoing/www.eprints.org/?referer=');">Eprints</a> (Perl) </p><p>Good for publications
  repositories, has been used for cultural collections, learning – has every
  imaginable interface to repository content</p></li><li><p><a href="http://dspace.org" onclick="pageTracker._trackPageview('/outgoing/dspace.org?referer=');">DSpace</a> good for a range of digital object collections</p><p>eg Andrea Schweer’s talk
  on a data capture app <a href="http://www.caul.edu.au/content/upload/files/cairss/cairss2012schweer.pdf" onclick="pageTracker._trackPageview('/outgoing/www.caul.edu.au/content/upload/files/cairss/cairss2012schweer.pdf?referer=');">Building a repository for freshwater quality
  data</a></p></li><li><p>Fedora Commons (back end) </p><ul><li><p><a href="http://www.islandora.ca/" onclick="pageTracker._trackPageview('/outgoing/www.islandora.ca/?referer=');">Islandora</a> application/platform (Drupal PHP)</p></li><li><p><a href="https://github.com/the-fascinator/the-fascinator" onclick="pageTracker._trackPageview('/outgoing/github.com/the-fascinator/the-fascinator?referer=');">The Fascinator</a>
  platform / ReDBOX Application (Java + Jython)</p></li><li><p><a href="https://wiki.duraspace.org/display/hydra/The+Hydra+Project" onclick="pageTracker._trackPageview('/outgoing/wiki.duraspace.org/display/hydra/The+Hydra+Project?referer=');">Hydra</a>
  platform (Rails)</p></li></ul></li><li><p><a href="http://ckan.org/" onclick="pageTracker._trackPageview('/outgoing/ckan.org/?referer=');">CKAN</a> – a
  Research data Hub app (Python)</p></li><li><p>Micro-service components like <a href="http://en.wikipedia.org/wiki/BagIt" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/BagIt?referer=');">BagIt</a>
  for packaging and <a href="https://wiki.ucop.edu/display/Curation/PairTree" onclick="pageTracker._trackPageview('/outgoing/wiki.ucop.edu/display/Curation/PairTree?referer=');">PairTree</a> for efficient file-storage. </p></li></ul>
  
  
  
  
  
  
  
  
  
  <p>NOTE: All of the above apart form Eprints
  include built-in search using <a href="http://lucene.apache.org/solr/" onclick="pageTracker._trackPageview('/outgoing/lucene.apache.org/solr/?referer=');">Apache
  Solr</a>.</p>
  </section><p>In conclusion, I asked: why use one of the above,
particularly when on first acquaintance, something like Fedora can look like an
anchor, impeding forward progress? </p><p>The basic answer is that if in the long run your project is
going to require some large percentage of the repository micro-services discussed
above, then you’re going to end up writing your <i>own</i> Fedora-like thing. Also, I think it’s better to be part of a
community looking at these things together. For example Fedora is not a magic
solution to being able to re-use repository content between applications, but
it is reassuring to know that the Hydra and Islandora
communities are talking about interop via their <a href="https://www.conftool.net/or2012/index.php?page=browseSessions&amp;form_session=62" onclick="pageTracker._trackPageview('/outgoing/www.conftool.net/or2012/index.php?page=browseSessions_amp_form_session=62&amp;referer=');">Hylandora</a> project and there is a significant amount of
preservation-work happening in the Fedora world.</p><p>To some of us, the idea of doing <i>certain kinds</i> of eResearch project without a back-end repository
(as in something that has managed services around preservation under some kind
of serious governance) would be like doing software development without a code
repository. The question, of course is which kinds of project? And of course,
if you do need one, where do you put the repository part in the architecture.</p></section></section></article>
	</div>]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2013/03/01/repositories-what-are-they-good-for.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Putting data on the web</title>
		<link>http://ptsefton.com/2013/02/05/putting-data-on-the-web.htm</link>
		<comments>http://ptsefton.com/2013/02/05/putting-data-on-the-web.htm#comments</comments>
		<pubDate>Tue, 05 Feb 2013 03:25:49 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[ScholarlyHTML]]></category>

		<guid isPermaLink="false">http://ptsefton.com/?p=1660</guid>
		<description><![CDATA[I attended this data newsroom (#datanews) event in Melbourne Monday Feb 3rd [Correction - it was the 4th] 2013. David Flanders asked me to come prepared to give a talk on tools and techniques for embedding data into web pages, particularly using Schema.org, the corporate sponsored ontology of everything that matters for commerce.So here are [...]]]></description>
			<content:encoded><![CDATA[	<div>
	<span property="http://purl.org/dc/terms/title" content="Putting data on the web"><!--  --></span>
			<span property='http://purl.org/dc/terms/creator' content='ptsefton'><!-- --></span>
	<article>  <section><p>I attended this <a href="http://www.eventbrite.com/event/4900494511" onclick="pageTracker._trackPageview('/outgoing/www.eventbrite.com/event/4900494511?referer=');">data newsroom</a> (#datanews)
event in Melbourne Monday Feb 3<sup>rd</sup> [Correction - it was the 4th] 2013. David Flanders asked me to
come prepared to give a talk on tools and techniques for embedding data into
web pages, particularly using Schema.org, the corporate sponsored ontology of everything
that matters for commerce.</p><p>So here are my semantically rich<a href="#_edn1" name="_ednref1" title=""><sup>[i]</sup></a>
notes for the presentation. This is neither a tutorial nor a coherent story, so
you may want to leave now, but there is a picture of Tim Berners-Lee about half
way through.</p><section typeof="http://purl.org/ontology/bibo/Slide">
  <h4>Why embed data in web pages?</h4>
  <p>You can make new things happen. Let other people or
  machines do things with the data. <a href="http://wraggelabs.com/shed/presentations/ndf2012/storydata/" onclick="pageTracker._trackPageview('/outgoing/wraggelabs.com/shed/presentations/ndf2012/storydata/?referer=');">Here’s an
  example by Tim Sherratt</a> showing how data embedded in the page (left) can
  drive new behaviour (the stuff on the right).</p>
  <p><img border="0" width="417" height="252" src="http://ptsefton.com/wp-content/uploads/2013/02/wpid-putting-data-copy.html.html_putting-data-copy_filesimage0021.png" v:shapes="Picture_x0020_4"></p>
 </section><section typeof="http://schema.org/" vocab="http://schema.org/">
  <h4>What is this Schema.org? </h4>
  <p>(I have added a couple of tags to discuss later)</p>
  <blockquote><p>Many sites are generated from
  structured data, which is often stored in databases. When this data is
  formatted into HTML, it becomes very difficult to recover the original
  structured data. Many applications, especially search engines, can benefit
  greatly from direct access to this structured data. On-page markup <b>[#inlinedata]</b> enables search engines
  to understand the information on web pages and provide richer search results
  in order to make it easier for users to find relevant information on the web.
  Markup <b>[#semanticsyntax]</b> can also
  enable new tools and applications that make use of the structure.</p><p>A shared markup vocabulary <b>[</b><b>#sharedvocab]</b> makes it easier for webmasters to decide on a markup schema
  and get the maximum benefit for their efforts. So, in the spirit of
  sitemaps.org, search engines have come together to provide a shared
  collection of schemas that webmasters can use.</p><p>http://schema.org/ </p></blockquote>
  
  
  
 </section><section typeof="http://purl.org/ontology/bibo/Slide">
  <h4>Use Schema.org – get snippets</h4>
  <p><img border="0" width="416" height="85" src="http://ptsefton.com/wp-content/uploads/2013/02/wpid-putting-data-copy.html.html_putting-data-copy_filesimage0041.png" v:shapes="Picture_x0020_2"></p>
 </section><p>The point of this, for the vast majority of web practitioners
is to get into the world of ‘rich snippets’. If you use the schema.org way then
you get ‘better’ search results – but you’re also allowing the search
engines, and anyone else who views your page to use the data. Right now if I
search for movie times at my local cinema I enter a Google-trap which shows me
films and when they’re on with no link to the cinema site itself. It’s also
hard to tell what role Schema.org plays in all these search engine things
– some of the data you see is harvested using older conventions for data
services and who knows, maybe the cinemas just give Google a spreadsheet with
the movie times in it.</p><p>For data journalism and research, we presumably want to get
the data out in a form that it can be reused so the concerns are different
– you want the data to be used, and your part in its collection or
creation to be cited.</p><section typeof="http://purl.org/ontology/bibo/Slide">
  <h4>The other thing you need to know about: RDF</h4>
  <p>RDF is the Resource Description Framework.</p>
  <blockquote><p>The<span class="apple-converted-space">&nbsp;</span><b>Resource
  Description Framework</b><span class="apple-converted-space">&nbsp;</span>(<b>RDF</b>)
  is a family of<span class="apple-converted-space">&nbsp;</span><a href="http://en.wikipedia.org/wiki/World_Wide_Web_Consortium" title="World Wide Web Consortium" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/World_Wide_Web_Consortium?referer=');">World Wide Web Consortium</a><span class="apple-converted-space">&nbsp;</span>(W3C)<a href="http://en.wikipedia.org/wiki/Specification" title="Specification" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Specification?referer=');">specifications</a><span class="apple-converted-space">&nbsp;</span><a href="http://en.wikipedia.org/wiki/Resource_Description_Framework#cite_note-1" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Resource_Description_Framework_cite_note-1?referer=');"><sup>[1]</sup></a><span class="apple-converted-space">&nbsp;</span>originally designed as a<span class="apple-converted-space">&nbsp;</span><a href="http://en.wikipedia.org/wiki/Metadata" title="Metadata" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Metadata?referer=');">metadata</a><span class="apple-converted-space">&nbsp;</span><a href="http://en.wikipedia.org/wiki/Data_model" title="Data model" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Data_model?referer=');">data
  model</a>. It has come to be used as a general method for conceptual
  description or modeling of information <b>[#sharedvocabularies]</b>
  that is implemented in web resources, using a variety of syntax formats <b>[#semanitcsyntax]</b>.</p><p><a href="http://en.wikipedia.org/wiki/Resource_Description_Framework" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Resource_Description_Framework?referer=');">http://en.wikipedia.org/wiki/Resource_Description_Framework</a></p><p></p></blockquote>
  
  
  
 </section><p>In the work I do in eResearch systems and repositories, RDF
is clearly a very good framework for extensible metadata, and the associated
“Linked data” approach of using URIs to describe things and concepts is a good
way to implement shared vocabularies, but RDF is very hard to get to grips with
as a general modelling framework.</p><p>Now it’s time to over-simplify the process of getting data
into web pages via schema.org et al.</p><section typeof="http://purl.org/ontology/bibo/Slide">
  <h4>Putting data on the web?</h4>
  <ul><li><p>Is it in some kind of web ready format?*</p><ul><li><p><b>Yes</b>:
  Put it on the web as-is <b>#justpublish</b></p></li><li><p><b>No</b>:
  &nbsp;Make it into a web ready format.
  Options:</p><ol type="i"><li><p>Reformat
  to a spreadsheet or something <b>#justpublish</b></p></li><li><p>Embed the data in human readable HTML </p><p><b>#inlinedata</b> and <b>#semanticmarkup</b>
  &nbsp;</p></li><li><p>Publish as a stand-alone RDF resource**</p></li></ol></li></ul></li><li><p>In any case publish a web page <i>about</i> it***</p></li><li><p>Include metadata in the web page. <b>#pagelevelmetadata</b></p></li><li><p>Make the metadata standards-based and
  proper****. <b>#sharedvocab</b></p></li><li><p>Choose a syntax for the embedding <b>#semanticsyntax</b></p></li></ul>
  
  
  
  
  
  
  
  
  
  
  
  
 </section><section><h4>The fine print</h4><p>*What is a web-ready format depends on how much of a pedant
you are – for some only gold-plated RDF is good enough </p><p>**And, you know, keep the web page UP.</p><p>*** At <a href="http://en.wikipedia.org/wiki/Tim_Berners-Lee" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Tim_Berners-Lee?referer=');">Tim
Berners-Lee</a>’s talk in Melbourne that night David Flanders asked him what
advice he had for researchers re data – should they put it on the web? </p><p>Tim’s response was that researcher should work with their
data in the format that suits them but they should get a ‘shim’ or adaptor
built to provide an RDF interface to the data so others could use it as part of
the semantic web.</p><p>I think that’s easy for Sir Tim to say and he’s right that
it would be a Good Thing, but experience has shown that projects like that run
to about $200K in Australia and don’t always get results, so I’d add “and while
you’re working on the RDF adaptor, publish what you have in the format in which
you have it with as much metadata as you can manage”.</p><p>****Good luck. If anybody comments at all it will be to ask
“why didn’t you use the European/ISO/W3C Standard” (which will turn out to be a
document that has been in development for 5 years but expected to be released
in six months for the last four of those years)</p><p><img border="0" width="417" height="555" src="http://ptsefton.com/wp-content/uploads/2013/02/wpid-putting-data-copy.html.html_putting-data-copy_filesimage0061.png" v:shapes="Picture_x0020_3"></p><p>Figure 1 Tim
Berners-Lee (right) dwarfed by the happy head on a sponsor&#8217;s banner, which in
turn is dwarfed by Art &#8211; at the University of Melbourne</p><section typeof="http://purl.org/ontology/bibo/Slide">
  <h4>Google Scholar</h4>
  <p><img border="0" width="417" height="214" src="http://ptsefton.com/wp-content/uploads/2013/02/wpid-putting-data-copy.html.html_putting-data-copy_filesimage008.png" v:shapes="Picture_x0020_1"></p>
 </section><section typeof="http://purl.org/ontology/bibo/Slide">
  <h4>Case study Getting a scholarly work into Google Scholar</h4>
  <ul><li><p>A repository somewhere advertised the
  existence of the work via extensive use of the venerable meta-tag.
  #page-level-metadata</p></li><li><p>Google found the data, entered it in its
  database. </p></li><li><p>When you search it puts the metadata back in
  the page so other software can scrape it out #microformats* </p></li></ul>
  
  
  
</section><section typeof="http://purl.org/ontology/bibo/Slide">
  <h4>*Microformats mean </h4>
  <p>Worst-case: maintaining a web-load of converters –
  see this from <a href="https://github.com/zotero/translators/issues/432" onclick="pageTracker._trackPageview('/outgoing/github.com/zotero/translators/issues/432?referer=');">a
  patch</a> to keep the Zotero reference manager working with Google Scholar.
  Google changes their page? You change your code and redeploy to millions of
  people. </p>
  <blockquote><p>&#8216;//div[@class="gs_r"]/div[@class="gs_fl"]/a[contains(@href,"q=related:")]&#8216;) +&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
  &#8216;//div[@class="gs_r"]//div[@class="gs_fl"]/a[contains(@href,"q=related:")]&#8216;)&nbsp;&nbsp; </p></blockquote>
  <p>These are XPath expressions looking in the webpage for
  stuff that Google coded for their own reasons, probably to make it <i>look</i> right, not primarily for data
  interchange.</p>
</section><p>You see what’s happening there? Google indexes pages that conform
to a standard they defined (not the one the repository community uses for its
own interchange). Then to get the data back out the scholarly community has to
keep track of a non-standard convention, again invented by Google. </p><section typeof="http://purl.org/ontology/bibo/Slide">
  <h4>Sounds like a case for Schma.org? </h4>
  
  <p>You’d certainly think so.</p>
  
  <p>But don’t underestimate the power of commercial interests
  to distort the shape of the semantic web.</p>
  
</section><section typeof="http://purl.org/ontology/bibo/Slide">
  <h4>There are (at least) two things to be standardised in web
  semantics</h4>
  <ul><li><p>The (hopefully) shared vocabulary / world view
  &#8211; “ontology” <b>#sharedvocab</b></p></li><li><p>The encoding method; how the meaning is
  embroidered on to the web <b>#semanticsyntax</b></p></li></ul>
  
  <p>And of course we have multiple overlapping but incomplete
  standards, best practices, worst practices and flame-wars for both.</p>
</section><p>There are four basic ways to embed data into web pages.</p>
<section typeof="http://purl.org/ontology/bibo/Slide">
  <h4>Four ways to #inlinedata</h4>
  <ul><li><p>Metadata about a <i>whole page</i> via meta tags in the head #pagelevelmetadata&nbsp; #traditional</p></li><li><p>Metadata/data about parts of a page:
  #semanticsyntax</p><ul><li><p><a href="http://en.wikipedia.org/wiki/Microformat" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Microformat?referer=');">Microformats</a> (obsolete
  but persisting) using conventions #byconvention</p></li><li><p><a href="http://en.wikipedia.org/wiki/Microdata_(HTML)" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Microdata_HTML?referer=');">Microdata</a> –
  part of the (non W3c) HTML5 spec&nbsp;
  simple, flawed, controversial #worksbutpissedpeopleoff</p></li><li><p><a href="http://en.wikipedia.org/wiki/Rdfa" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Rdfa?referer=');">RDFa</a>
  &#8211; obscenely complicated unless you use RDFa 1.1 lite #theonetrueway</p></li></ul></li></ul>
  
  
  </section></section>
  
 <p>I have been working with researchers at the Hawkesbury
Institute for the Environment at UWS and the technical folks at Intersect NSW
to implement an HTML readme file to accompany environmental researcher data
sets – we’re working on a case-study that goes into how we made the
choice of RDFa (#semanticsyntax) and how we chose which vocabularies and terms
to use (#sharedvocab) which we’ll publish as soon as possible.</p><br clear="all">

<hr align="left" size="1" width="33%">

<p><a href="#_ednref1" name="_edn1" title=""><sup>[i]</sup></a>
Semantically rich? Look at the source – I’ve used a web-police-approved
mechanism for embedding slides in my prose. That is, I have used a standard
vocabulary (the bibliographic ontology #sharedvocab) and a syntactic
specification (RDFa 1.1 lite #semanticsyntax) for saying that some parts of the
page are special.</p><p></p></section></section></article>
	</div>]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2013/02/05/putting-data-on-the-web.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CAIRSS – CAUL Australasian Institutional Repository Support Service</title>
		<link>http://ptsefton.com/2013/02/01/cairss-caul-australasian-institutional-repository-support-service.htm</link>
		<comments>http://ptsefton.com/2013/02/01/cairss-caul-australasian-institutional-repository-support-service.htm#comments</comments>
		<pubDate>Thu, 31 Jan 2013 22:38:55 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/?p=1629</guid>
		<description><![CDATA[By Dr Peter Sefton (University of Western Sydney) with Ms Caroline Drury (University of Southern Queensland).On Wednesday 5th Dec I (Peter) visited the Japanese Digital Repository Federation at their invitation and expense, to talk about how our respective repository communities are organised. &#160;I’d like to thank the DRF for this opportunity to make the brief [...]]]></description>
			<content:encoded><![CDATA[	<div>
	<span property="http://purl.org/dc/terms/title" content="CAIRSS – CAUL Australasian Institutional Repository Support Service"><!--  --></span>
			<span property='http://purl.org/dc/terms/creator' content='ptsefton'><!-- --></span>
	
<article>  <section><p align="center">By Dr Peter Sefton
(University of Western Sydney) with Ms Caroline Drury (University of Southern
Queensland).</p><p>On Wednesday 5<sup>th</sup> Dec I (Peter) visited the Japanese
Digital Repository Federation at their invitation and expense, to talk about
how our respective repository communities are organised. &nbsp;I’d like to thank the DRF for this
opportunity to make the brief trip to Tokyo. Caroline was invited but was
unable to make it. The DRF folks have put up a <a href="http://drf.lib.hokudai.ac.jp/drf/index.php?DRF%20CAIRSS%20meeting" onclick="pageTracker._trackPageview('/outgoing/drf.lib.hokudai.ac.jp/drf/index.php?DRF_20CAIRSS_20meeting&amp;referer=');">summary
of the meeting</a>, in Japanese. Note that while my comments on that page are
listed as “CAIRSS” I was not representing CAIRSS (the CAUL (Council of Australian
University Librarians) Institutional Repository Support Service), I attended as
a member of the Australian/Australasian repository community. I also attended
the DRF international conference in 2009 on a similar basis, when I did happen
to be associated with CAIRSS, so the organisers knew me. I did talk a fair bit
about CAIRSS, in the context of other projects in Australia.</p><p>Before I went I polled the CAIRSS-list to find out if there
were any questions people would like answered – more on that below.</p><p>First, a bit about me and repositories:</p><ul><li><p>I was the technical lead for the Regional
Universities Building Research Infrastructure Collaboratively (<a href="http://rubric.edu.au" onclick="pageTracker._trackPageview('/outgoing/rubric.edu.au?referer=');">RUBRIC</a>) project which was hosted by the
University of Southern Queensland (and the de-facto project manager for several
months during the project establishment phase).</p></li><li><p>I led a small team at USQ subcontracting to the
ARROW project during 2008, providing technical support to ARROW, and repository
services to small Higher Education institutions in Australia.</p></li><li><p>I worked on USQ’s successful bid to host the
first CAIRSS repository support service in Australia and acted as a senior
strategist for the service, for example working on guides such as the one on <a href="http://cairss.caul.edu.au/cairss/repository-manager-tools/getting-into-google-google-scholar-and-other-search-engines/" onclick="pageTracker._trackPageview('/outgoing/cairss.caul.edu.au/cairss/repository-manager-tools/getting-into-google-google-scholar-and-other-search-engines/?referer=');">how
to get into Google Scholar et al</a>, and negotiating major changes to
repository infrastructure such as the closure of the Australian Digital Theses
search service and its subsumption into the National Library of Australia’s
Trove service. </p></li><li><p>I was <b>not</b>
involved in running the second version of CAIRSS from 2011-2012 but I have
remained part of the repository community in Australia and attended the 2012
community day where I spoke about <a href="http://ptsefton.com/2012/11/19/receding-repository-software.htm">trends
in repository software in the context of organisational governance</a>. </p></li><li><p>I am on the conference committee for the Open
Repositories series of conferences (from 2011) – the <a href="http://or2013.net/content/call-proposals" onclick="pageTracker._trackPageview('/outgoing/or2013.net/content/call-proposals?referer=');">call for papers for the 2013
conference is just out</a>.</p></li></ul><section><h2>The DRF</h2><p>Shigeki SUGITA started off proceedings with a <a href="http://drf.lib.hokudai.ac.jp/drf/index.php?plugin=attach&amp;pcmd=open&amp;file=ActivitiesOfDigitalRepositoryFederation.pdf&amp;refer=DRF%20CAIRSS%20meeting" onclick="pageTracker._trackPageview('/outgoing/drf.lib.hokudai.ac.jp/drf/index.php?plugin=attach_amp_pcmd=open_amp_file=ActivitiesOfDigitalRepositoryFederation.pdf_amp_refer=DRF_20CAIRSS_20meeting&amp;referer=');">presentation
about the activities of the Digital Repository Federation</a>. </p><p>Perhaps the most striking thing from an Australasian point
of view is a staffing issue; talented repository managers are required by
management to rotate through a variety of library jobs meaning that there is
constant turnover and a lack of opportunity to specialise. There are similar
pressures at play in our libraries I guess, with a need to train new repository
staff, and significant turnover but not to the same extent.</p><p>Japanese repositories are very much driven by an Open Access
agenda, which is quite different from the situation in Australia where two
different government measurement schemes collecting information about
publications and push repositories in another direction, more on that below.</p><p>Another interesting dimension to the Japanese scene is that
they have a number of consortial-repositories where a
number of institutions share a repository. This is an idea that came up in
Australia in the mid-to-late 00’s several times, but never got off the ground.
It might be worth revisiting some time both for institutional publications repositories
</p></section><section><h2>The presentation</h2><p>I presented from an earlier version of the ‘slides’ below
– I have added some notes from the discussion and clarifying material.</p><section><h3>CAIRSS background</h3><section typeof="http://purl.org/ontology/bibo/Slide"><h4>Parent projects</h4><p>The Australian government made significant investments in
  institutional repositories via programs such as:</p><ul><li><p><a href="http://apsr.anu.edu.au/" onclick="pageTracker._trackPageview('/outgoing/apsr.anu.edu.au/?referer=');">APSR</a>
  Australian Partnership for Sustainable Repositories (ended 2008)</p></li><li><p><a href="http://arrow.edu.au/" onclick="pageTracker._trackPageview('/outgoing/arrow.edu.au/?referer=');">ARROW</a>
  Australian Research Repositories Online to the World</p></li><li><p><a href="http://rubric.edu.au/" onclick="pageTracker._trackPageview('/outgoing/rubric.edu.au/?referer=');">RUBRIC</a>,
  Regional Universities Building Repository Infrastructure Collaboratively.</p></li></ul></section><p>These projects and other investments in the repository world
were via these funding streams (for which the websites have disappeared):</p><ul><li><p>ASHER2 – Australian Scheme for Higher
Education Repositories [Sponsored the development of repositories in all Higher
Education Institutions in Australia]</p></li><li><p><a href="http://en.wikipedia.org/wiki/Systemic_Infrastructure_Initiative" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Systemic_Infrastructure_Initiative?referer=');">SII3</a> – Systemic Infrastructure
Initiative [APSR, ARROW and RUBRIC]</p></li><li><p><a href="http://en.wikipedia.org/wiki/Backing_Australia%27s_Ability" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Backing_Australia_27s_Ability?referer=');">BAA4</a> – Backing Australia&#8217;s Ability</p></li></ul><p>We talked in
some detail about how these funding schemes have influenced the establishment
of repositories; while the initial driver for Australian repositories was open
access, the Excellence in Research for Australia (<a href="http://en.wikipedia.org/wiki/Excellence_in_Research_for_Australia" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Excellence_in_Research_for_Australia?referer=');">ERA</a>) measurement exercise and its <a href="http://en.wikipedia.org/wiki/Research_Quality_Framework" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Research_Quality_Framework?referer=');">failed predecessor</a> stalled the Open Access movement to
some extent, by requiring universities to collect non-open access materials in
complicated ways. </p><section typeof="http://purl.org/ontology/bibo/Slide"><h4>CAIRSS v1 2009-2011</h4><p>Coming out of the investments outline above, CAIRSS was
  established on &nbsp;March 16, 2009:</p><blockquote><p>The first CAUL service was funded for two years, with the
  approval of Department of Innovation (DIISR), with monies remaining from the
  successful ARROW project, supplemented by CAUL member subscriptions.</p></blockquote></section><p></p><section typeof="http://purl.org/ontology/bibo/Slide"><h4>CAIRSS Structure</h4><p><img border="0" width="417" height="250" src="http://ptsefton.com/wp-content/uploads/2013/02/wpid-CAIRSS-copy.html.html_CAIRSS-copy_filesimage002.png" v:shapes="Diagram_x0020_1"></p></section><section typeof="http://purl.org/ontology/bibo/Slide"><h4>CAIRSS v1 staffing</h4><p>This version of the CAIRSS
  service covered Australian universities, and was staffed with:</p><ul><li><p>A full time repository manager. (USQ)</p></li><li><p>A full time technical staff member. (USQ)</p></li><li><p>A full time copyright officer. (Swinburne)</p></li><li><p>A part-time strategic advisor and other senior
  support.</p></li></ul></section><section typeof="http://purl.org/ontology/bibo/Slide"><h4>CAIRSS v1 approach</h4><p>The initial CAIRSS service included:</p><ul><li><p>Annual meetings with both a general and
  technical strand.</p></li><li><p>Copyright workshops for private discussions of
  copyright issues.</p></li><li><p>Maintained ‘sandbox’ instances of repository
  software.</p></li><li><p>Creation and maintenance of web pages and
  guides on repository issues such as statistics, indexing and an extensive copyright
  guide.</p></li><li><p>Provided direct support for government
  reporting processes – chiefly the establishment of the Excellence in
  Research Australia (ERA) exercise.</p></li></ul></section></section></section><section><h2>CAIRSS v2 2011-2012</h2><section typeof="http://purl.org/ontology/bibo/Slide"><h4>With added New Zealanders</h4><p>The second version of CAIRSS was funded from member
  subscriptions and expanded to include New Zealand:</p><blockquote><p>The second CAUL service is also funded for almost two years
  and incorporates many of New Zealand’s higher education institutions. With
  this expansion, CAIRSS now stands for the CAUL Australasian Institutional
  Repository Support Service.</p></blockquote></section><section typeof="http://purl.org/ontology/bibo/Slide"><h4>CAIRSS v2 Staffing</h4><p>This version of CAIRSS had a reduced team in the central
  office at USQ.</p><ul><li><p>One full time repository manager.</p></li><li><p>One half-time technical officer.</p></li><li><p>Part time senior manager.</p></li><li><p>Part time copyright person at Swinburne.</p></li></ul></section><section typeof="http://purl.org/ontology/bibo/Slide"><h4>CAIRSS v2 approach</h4><p>The second CAIRSS service included:</p><ul><li><p>Annual meetings with a general strand.</p></li><li><p>Discussion list for members only.</p></li><li><p>Copyright workshops for private discussions of
  copyright issues.</p></li><li><p>Maintenance of web pages on repository issues
  such as statistics etc.</p></li><li><p>Provided support for government reporting
  processes (ERA)</p></li></ul></section></section><section><h2></h2></section><section><h2>Post CAIRSS: CRAC 2013-?</h2><p>From 2013 CAIRSS will no longer exist – it is being
replaced with a new service know as CRAC.&nbsp;
I gather that the feeling of CAUL was that the
community is now mature enough to be self-sustaining.</p><section typeof="http://purl.org/ontology/bibo/Slide"><h4>CRAC (CAUL Research Advisory Committee) NEW! from 2013</h4><blockquote><p>CAUL
  Research Advisory Committee</p><p>(will undertake some of the work carried out by CAIRSSAC
  and COSIAC, from 2013)</p><p>Program Research<span class="apple-converted-space">&nbsp;</span><br />
  Chair Heather Gordon (2013-2014)<br />
  Members TBC<br />
  CONZUL Janet Copsey (2013-&nbsp;&nbsp;&nbsp; )<br />
  Practitioners TBC</p></blockquote><p>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <a href="http://www.caul.edu.au/about-caul/caul-committees" onclick="pageTracker._trackPageview('/outgoing/www.caul.edu.au/about-caul/caul-committees?referer=');">http://www.caul.edu.au/about-caul/caul-committees</a>
  </p></section><section typeof="http://purl.org/ontology/bibo/Slide"><h4>CRAC anticipated activities</h4><ul><li><p>Running the annual event</p></li><li><p>Annual copyright workshop</p></li><li><p>Maintaining the CAIRSS discussion list</p></li></ul></section><section typeof="http://purl.org/ontology/bibo/Slide"><h4>New Open Access group: AOASG</h4><p>There is a new Open
  Access group in Australia which is not part of the CAUL/CAIRSS family.</p><p>From Danny Kingsley:</p><blockquote><p>The Australian Open Access Support Group (AOASG) was
  launched during Open Access Week in 2012. It is&nbsp;a consortium of six
  universities with open access policies&nbsp; &#8211; QUT, ANU, Macquarie
  University, Newcastle University, Charles Sturt University and Victoria
  University. The group aims to provide support, lobbying and advocacy for open
  access in Australia. Membership will be extended to other research
  institutions and affiliates during 2013.</p><p><span class="MsoHyperlink">http://www.aoasg.org.au</span><span class="apple-converted-space">&nbsp;</span>[NOTE the website is
  currently being built – may not be live yet]</p></blockquote></section><section><h3>General comments about the CAIRSS/CRAC community</h3><section><h4>Small task-force groups now self-organize</h4><p>The repository community is well established and members of
the community run their own investigations into repository matters. These range
from asking questions on the list about repository practices, to running formal
surveys. An example from the broader CAUL community of which CAIRSS is a part is
the IR / Open Access Funding Survey by Danny Kingsley and Vicki Picasso.</p></section></section><section><h3>Opportunities</h3><section typeof="http://purl.org/ontology/bibo/Slide"><h4>DRF collaboration?</h4><p>From Caroline Drury:</p><blockquote><p>It would be
  interesting for CRAC to consider something similar to the DRF model &nbsp;- eg at the beginning of each two year period, to meet and
  consider what projects could be done in the space, within Australia / NZ.
  Then perhaps a call could be put to institutions who could then (according to
  their strengths) be assigned to do that project in a collaborative model,
  using their own funds. I’m not sure if it would work here, given the big
  physical distances, but I think it&#8217;s a good model in a scenario where there&#8217;s
  limited funding.&nbsp;</p></blockquote></section><p>I’m sure CRAC will consider this.</p><section typeof="http://purl.org/ontology/bibo/Slide"><h4>April event in Tasmania</h4><p>An event is being organised in Tasmania in April around
  the following themes. Regional participating would be most welcome.</p><p>From David Flanders at ANDS:</p><ol type="a"><li><p>linking
  research data and research publications</p></li><li><p>re-architecting
  the repository (if we started now based on what we know).</p></li><li><p>business
  metrics/analytics from scholarly systems</p></li><li><p>research
  profiles and author identifiers</p></li><li><p>emerging
  scholarly vocabularies, linkeddata &amp;&nbsp;</p></li><li><p>scholarly
  search engines (beyond Google Scholar)</p></li><li><p>APIs
  and bringing all these systems together via shared resources.</p></li></ol></section></section></section><section><h2>Questions (with my notes)</h2><p>Natasha Simons at Griffith University had three questions
that provided a great structure for the discussion part of the meeting. I tried
to take notes (included below) as well as talk.</p><blockquote><p><br />
1. How&#8217;s the Memorandum of Understanding between<span class="apple-converted-space">&nbsp;</span>Digital Repository Federation (Japan), UKCoRR<span class="apple-converted-space">&nbsp;</span>and the UK RSP going?
What sorts of things are of the most importance to all parties to share and
experience in this space? What sort of involvement has there been between the
signers to the MoU? How do they envisage this MoU benefiting all parties (particularly long-term)?<br style="mso-special-character:line-break">
</p><blockquote><p>In
January 2012 – DRF heard there were repo managers in the UK, they invited
a rep from <a href="http://www.rsp.ac.uk/" onclick="pageTracker._trackPageview('/outgoing/www.rsp.ac.uk/?referer=');">Repositories
Support Project</a> (RSP) in the UK. &nbsp;Jackie Whickam came
to snowy Hokkaido, where they found out that RDF and RSP carried out similar
activities – eg the re-enactment of online
discussions wearing masks. &nbsp;After
the meeting found out that there were many more things to share. Eg in the UK they carry out residential workshops. Meeting
with JW was about operational things between repo manager communities in UK and
JP. Wanted to do more on activities to do with individual repositories. </p><p>Most
important objective of <a href="http://rspproject.wordpress.com/2012/03/13/japan-and-uk-in-agreement/" onclick="pageTracker._trackPageview('/outgoing/rspproject.wordpress.com/2012/03/13/japan-and-uk-in-agreement/?referer=');">MoU</a> is to send representatives to counterpart meeting to
share more specifics. Since signed the MoU they have
not done so much. First thing was to invite a rep from UK to national workshop.
UK rep was asked to give a presentation about how they promote activities
inside universities. </p></blockquote><p></p><blockquote><p>The
MoU says they will sponsor trips for each other (but
the Brits have not done their bit … yet :). RSP has just come to the end of its
funding cycle. DRF hopes that even though there is uncertainly about funding
the collaboration can continue. Funding is restricted to long term planning is
difficult.</p></blockquote><p><br />
2. Find out what you can about the NII Repositories Program -<span class="apple-converted-space">&nbsp;</span><span class="MsoHyperlink">http://www.nii.ac.jp/irp/en/rfp/</span><span class="apple-converted-space">&nbsp;</span><br />
There are some interesting projects listed. How do they decide on the project
areas? Where does the funding come from? How do they decide on the actual
projects? Are they all 12 month projects? Are they all collaborative? How do
they share the results?<br style="mso-special-character:line-break">
</p><blockquote><p>Cyber
Science Infrastrcuture CSI hosted by NII – has
a selection board, informatics scholars &amp; heads / top management of uni libraries about 10 members. 200 – 300 M
Yen</p><ul><li><p>Launch – circa 50 univerisites
circa 1M Yen</p></li><li><p>R&amp; D (5-6 page proposal docs by multiple
unis) &#8211; examples</p><ul><li><p>DRF (proposal by several unis)</p></li><li><p>Sherpa Romeo – Japan</p></li><li><p>Statistics</p></li></ul></li></ul><p>About 30 submitted – 20
accepted.</p><p>Proposals ask for money never
given more, usually slightly less than proposed. Money goes to the unis as
project operators. Budgets split between the participants (training, workshops,
system development, etc). Budget allocated on fiscal
year basis, CSI checks. Proposal made around March – decision around June
– activities take place from June to March. Following June there is a
results project meeting in Tokyo – 2 day meeting (decreasing because the
number of people launching has dropped. Initial 50 now 10). </p><p>Sharing of results at June
meeting, not much more than that.&nbsp;
Some projects with strong outreach will be well known. Out of 20 –
some projects faded out without much impact.</p><p>Some projects that have done
well:</p><ul><li><p>DRF – The Digital Repositories Federation,
my hosts.</p></li><li><p>Sherpa-Romeo Japan.</p></li><li><p>SCPJ – Society Copyright Policy in Japan
– 600 societies almost all grey lit – a few are ‘green’ <a href="http://en.wikipedia.org/wiki/Open_access" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Open_access?referer=');">OA</a>.</p></li><li><p>ROAT – project to standardise repo stats
same as PIRUS/IRUS</p></li><li><p>Author – ID – (ORCID) participants
on this are involved in ORCID. NII working on a trial basis to have a database
of Japanese researchers.</p></li></ul><ul><li><p>ShareRe – consortial repositories came out of Hiroshima . DSpace commonly used in Japan and EARMAS (original Japanese
repository software) </p><p>There are 14 on a regional basis.
Each one has a lead university acts as a host to provide system and support. &nbsp;</p><ul><li><p>Hiroshima .&nbsp;
Collect funds for future 14 members. 14 * 13K – 420K per kept in
reserve, and used for security maintenance. Operated by regional council of
libraries, Hiroshima uni serves as sectretariat and hosts, additional funding come from
regional 30K Yen per year. Initial launch funds came from CSI – Hiroshima
was the first. </p></li><li><p>7 Member universities Kagoshima as lead. Initial
investment about 2M yen do not collect funds for future rennocation
collectively 250 K pro rata per year contributed by member according to FTE</p></li></ul></li><li><p>UsrCom – Trial
repository system sandbox&nbsp; -
2008-2009</p></li></ul></blockquote><p><br />
3. Are they ways we can communicate better with them? Do they hold any webinars
on topics that would be of interest to us? If so, how do they tell people about
them? Could they tell us? I see they post to the JISC list every now and then
(usually well-deserved achievement boasts). Should we have a &#8216;guest&#8217; from DRF
join our CAIRSS e-list or could they email you and then you post to our e-list?</p><blockquote><p>Don’t do webinars and they do
everything in Japanese so that’s a challenge. Does anyone on the CAIRSS list
have good Japanese? The DRF would like to have a member on the CAIRSS/CRAC list.</p><p>Future of DRF not clear.</p><p>(In Japan moves afoot to
subsidise societies as a way of driving OA) </p><p>NO OA mandates from JP govt – rules being revised now so that theses can go
through IR or must? Policy reads like must but we don’t know. May open the door
for theses to publicised thru network. If this is realised there will be more
possibilities – need to think about metadata standards and talking to
national library. )</p></blockquote></blockquote></section><section><h2>Next steps</h2><p>Once again, thanks to the DRF for having me – I am
following up with CAUL on how we might be able to collaborate further. Now that
we are in the CRAC era, Caroline’s suggestion of having a ‘call for projects’
that then get implemented at the member institutions sounds like it might be a
way forward, and an ongoing relationship with the DRF (and the UK RSP) would be
helpful, as they’ve been down this road before.</p></section></section><a rel="license" href="http://creativecommons.org/licenses/by/3.0/deed.en_US" onclick="pageTracker._trackPageview('/outgoing/creativecommons.org/licenses/by/3.0/deed.en_US?referer=');"><img alt="Creative Commons License" style="border-width:0" src="http://i.creativecommons.org/l/by/3.0/88x31.png" /></a><br /><span xmlns:dct="http://purl.org/dc/terms/" property="dct:title">CAIRSS – CAUL Australasian Institutional Repository Support Service</span> by <span xmlns:cc="http://creativecommons.org/ns#" property="cc:attributionName">Peter Sefton &#038; Caroline Drury</span> is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by/3.0/deed.en_US" onclick="pageTracker._trackPageview('/outgoing/creativecommons.org/licenses/by/3.0/deed.en_US?referer=');">Creative Commons Attribution 3.0 Unported License</a>.</article>
	</div>]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2013/02/01/cairss-caul-australasian-institutional-repository-support-service.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New Avatars of the Book in Digital Culture</title>
		<link>http://ptsefton.com/2012/12/20/new-avatars-of-the-book-in-digital-culture-2.htm</link>
		<comments>http://ptsefton.com/2012/12/20/new-avatars-of-the-book-in-digital-culture-2.htm#comments</comments>
		<pubDate>Thu, 20 Dec 2012 03:10:51 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[jiscPUB]]></category>

		<guid isPermaLink="false">http://ptsefton.com/?p=1617</guid>
		<description><![CDATA[New Avatars of the Book in Digital CultureWhy are we here?This week the University of Western Sydney held a symposium, New Avatars of the Book in Digital Culture. I was invited to contribute to the event, in my capacity as eResearch manager. One member of my team thought it meant I was going to get [...]]]></description>
			<content:encoded><![CDATA[	<div>
	<span property="http://purl.org/dc/terms/title" content="New Avatars of the Book in Digital Culture"><!--  --></span>
			<span property='http://purl.org/dc/terms/creator' content='ptsefton'><!-- --></span>
	
<article>  <section><h1>New Avatars of the Book in Digital Culture</h1><section><h2>Why are we here?</h2><p>This week the University of Western Sydney held a symposium, New
Avatars of the Book in Digital Culture. I was invited to contribute to the
event, in my capacity as eResearch manager. One member of my team thought it
meant I was going to get to slap on the blue body paint, but sadly it wasn’t
that kind of Avatar.</p><blockquote><p>This
symposium focuses on the changing nature and status of those peculiarly useful
interactive objects we call ‘books’ in online contexts. In contrast to
web-pages, files, ‘sites’ and
‘contexts’
of reading, the book still presents a useful model of rich ‘containment’ and
productive constraint.</p><p>But are
books possible in digital form? What elements of the book survive online? Which
ones are transformed and in  what ways? The end of the book has long been
prophesied, but how do we replicate the particular functionality (such as
searching without knowing what you’re looking for), and forms  of
knowledge particular to books in online environments?  What forms of
interface can be envisioned bearing in mind  the characteristic feedback
loops between (especially) literary reading and writing? Distinguishing between
the form of the book and its functionality, this symposium will explore
possibilities for replicating the book’s useful functions in online
environments.</p><p>(Anna
Gibbs and Maria Angel, from the invitation to the event)</p></blockquote><p>In this context I didn’t want to present too formally but prepared
some notes for the discussion, after the keynote talks. This post contains my
pre-event note/slides and a few observations from the event. </p></section><section><h2>The keynotes</h2><p>The keynotes were both really engaging discussions of digital
culture (not so much the book).</p><p>Mitchell Whitelaw, from Canberra University <a href="https://docs.google.com/presentation/d/1lY4V0yXN2yqpubXVJsxsZWj4SMIPkkvaLzyEPkvkJ9Q/edit" onclick="pageTracker._trackPageview('/outgoing/docs.google.com/presentation/d/1lY4V0yXN2yqpubXVJsxsZWj4SMIPkkvaLzyEPkvkJ9Q/edit?referer=');">talked about</a> building “<a href="http://visiblearchive.blogspot.com.au/2012/10/manly-images-generous-prototype.html" onclick="pageTracker._trackPageview('/outgoing/visiblearchive.blogspot.com.au/2012/10/manly-images-generous-prototype.html?referer=');">generous
interfaces</a>” to online databases. The idea is to enable a browse-based
interface that attempts to show as many entry points to a collection as
possible. Mitchell’s visual tools allow you to see the shape of large archival
collections and explore them using both visual cues and metadata facets such as
dates or creators. Dan Cohen recently <a href="http://www.dancohen.org/2012/12/05/generous-interfaces-for-scholarly-sites/" onclick="pageTracker._trackPageview('/outgoing/www.dancohen.org/2012/12/05/generous-interfaces-for-scholarly-sites/?referer=');">asked
how this might work for university web sites</a>, I can’t see how, but it’s an
interesting design challenge (hint for uni web teams – the big images
that take up a third or more of the screen and change all the time don’t
improve the utility of the site).</p><p>Mitchell talked a bit in the discussion about how a ‘generous’ book
interface might look, showing the ‘weight’ of items in the Table of Contents,
giving a sense of the shape of a work,&nbsp; a
potential site for further research. </p><p><a href="http://en.wikipedia.org/wiki/Jason_Nelson" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Jason_Nelson?referer=');">Jason Nelson</a>,
<a href="http://www.griffith.edu.au/humanities-languages/school-humanities/staff/mr-jason-nelson" onclick="pageTracker._trackPageview('/outgoing/www.griffith.edu.au/humanities-languages/school-humanities/staff/mr-jason-nelson?referer=');">from
Griffith</a> uni talked about his practice as a digital poet, which didn’t
involve any pretentious reading out loud, but did involve an array of amusing,
rude and occasionally moving online stuff, which I won’t attempt to describe or
critique it.</p><p>As Jason demoed various Flash and HTML stuff he’d made I was
thinking about all the problems he’d create for archivists, and indeed he does;
he told us about how he’d had to deposit screen-capture walk-throughs into the Griffith
repository as proxies for the actual games and interactive poem-things he
creates<a style="mso-footnote-id:ftn1" href="#_ftn1" name="_ftnref1" title=""><span class="MsoFootnoteReference"><span class="MsoFootnoteReference">[1]</span></span></a>.
He’s a textbook example of the difficult-to-archive new media artist. See <a href="http://www.secrettechnology.com/gamegame/agame.html" onclick="pageTracker._trackPageview('/outgoing/www.secrettechnology.com/gamegame/agame.html?referer=');">game game game and
again game</a>, which apparently may ruin your life.</p><p>I was glad I didn’t have to follow Jason directly with my rather
more sedate stuff about techno-socio-political considerations, standards and
such. But at least my cut-price embedded slide-shows don’t look too shabby
compared to his low-tech, low-culture hand-drawn stuff.</p></section><section><h2>The readings</h2><p>There were two readings for this symposium. One is a long essay <i>Graphesis: Visual Knowledge Production and
Representation</i> by Joanna Drucker <span property="cites" typeof="http://schema.org/ScholarlyArticle"><link itemprop="url" href="data:application/json,%A0%20%7B%22citationID%22%3A%220Pihnv3z%22%2C%22properties%22%3A%7B%22formattedCitation%22%3A%22%282010%29%22%2C%22plainCitation%22%3A%22%282010%29%22%7D%2C%22citationItems%22%3A%5B%7B%22id%22%3A1334%2C%22uris%22%3A%5B%22http%3A//zotero.org/users/568/items/6F8FM8UW%22%5D%2C%22uri%22%3A%5B%22http%3A//zotero.org/users/568/items/6F8FM8UW%22%5D%2C%22itemData%22%3A%7B%22id%22%3A1334%2C%22type%22%3A%22article-journal%22%2C%22title%22%3A%22Graphesis%3A%20Visual%20knowledge%20production%20and%20representation%22%2C%22container-title%22%3A%22Poetess%20Archive%20Journal%22%2C%22page%22%3A%221%u201350%22%2C%22volume%22%3A%222%22%2C%22issue%22%3A%221%22%2C%22shortTitle%22%3A%22Graphesis%22%2C%22author%22%3A%5B%7B%22family%22%3A%22Drucker%22%2C%22given%22%3A%22J.%22%7D%5D%2C%22issued%22%3A%7B%22year%22%3A2010%7D%2C%22accessed%22%3A%7B%22year%22%3A2012%2C%22month%22%3A12%2C%22day%22%3A10%7D%2C%22page-first%22%3A%221%u201350%22%7D%2C%22suppress-author%22%3Atrue%7D%5D%2C%22schema%22%3A%22https%3A//github.com/citation-style-language/schema/raw/master/csl-citation.json%22%7D%20"><span itemprop="label">(2010)</span><span property="cites" typeof="http://schema.org/ScholarlyArticle"><meta itemprop="id" content="1334"><meta itemprop="type" content="article-journal"><meta itemprop="title" content="Graphesis: Visual knowledge production and representation"><meta itemprop="container-title" content="Poetess Archive Journal"><meta itemprop="page" content="1–50"><meta itemprop="volume" content="2"><meta itemprop="issue" content="1"><meta itemprop="shortTitle" content="Graphesis"><span itemprop="author"><meta itemprop="family" content="Drucker"><meta itemprop="given" content="J."></span><span itemprop="issued"><meta itemprop="year" content="2010"></span><span itemprop="accessed"><meta itemprop="year" content="2012"><meta itemprop="month" content="12"><meta itemprop="day" content="10"></span><meta itemprop="page-first" content="1–50"><link itemprop="uri" href="http://zotero.org/users/568/items/6F8FM8UW"></span></span>. It’s a well-referenced
survey of different approaches to “critical understanding of visual knowledge
production” with only passing reference to books. I thought this was a really
useful way of mapping the jagged edges of a huge theoretical hole, but it
doesn’t provide any answers about how we might explore bookishness, or redefine
the book.</p><p>The second reading is a piece from Alan Liu, <span property="cites" typeof="http://schema.org/ScholarlyArticle"><link itemprop="url" href="data:application/json,%A0%20%7B%22citationID%22%3A%22Tehgv0qt%22%2C%22properties%22%3A%7B%22formattedCitation%22%3A%22%282009%29%22%2C%22plainCitation%22%3A%22%282009%29%22%7D%2C%22citationItems%22%3A%5B%7B%22id%22%3A1335%2C%22uris%22%3A%5B%22http%3A//zotero.org/users/568/items/M79TAQSX%22%5D%2C%22uri%22%3A%5B%22http%3A//zotero.org/users/568/items/M79TAQSX%22%5D%2C%22itemData%22%3A%7B%22id%22%3A1335%2C%22type%22%3A%22article-journal%22%2C%22title%22%3A%22The%20End%20of%20the%20End%20of%20the%20Book%3A%20Dead%20Books%2C%20Lively%20Margins%2C%20and%20Social%20Computing%22%2C%22URL%22%3A%22http%3A//hdl.handle.net/2027/spo.act2080.0048.404%22%2C%22shortTitle%22%3A%22The%20End%20of%20the%20End%20of%20the%20Book%22%2C%22author%22%3A%5B%7B%22family%22%3A%22Liu%22%2C%22given%22%3A%22Alan%22%7D%5D%2C%22issued%22%3A%7B%22year%22%3A2009%2C%22season%22%3A%22Fall%22%7D%7D%2C%22suppress-author%22%3Atrue%7D%5D%2C%22schema%22%3A%22https%3A//github.com/citation-style-language/schema/raw/master/csl-citation.json%22%7D%20"><span itemprop="label">(2009)</span><span property="cites" typeof="http://schema.org/ScholarlyArticle"><meta itemprop="id" content="1335"><meta itemprop="type" content="article-journal"><meta itemprop="title" content="The End of the End of the Book: Dead Books, Lively Margins, and Social Computing"><meta itemprop="URL" content="http://hdl.handle.net/2027/spo.act2080.0048.404"><meta itemprop="shortTitle" content="The End of the End of the Book"><span itemprop="author"><meta itemprop="family" content="Liu"><meta itemprop="given" content="Alan"></span><span itemprop="issued"><meta itemprop="year" content="2009"><meta itemprop="season" content="Fall"></span><link itemprop="uri" href="http://zotero.org/users/568/items/M79TAQSX"></span></span> which, I think, assumes that
the end of the book is obvious, apparently because we can put books online and
search and tag them and dismember them in an online environment. He cites three
scholarly web applications to demonstrate that books are dead, but I don’t really
buy it. One of them is the Open Journal Systems software, which is not about
books anyway, it’s a workflow system for managing journal production. Last time
I checked it was mostly used by editors to produce journals consisting of PDF
files, on the web but not <i>of</i> the web.
</p><section typeof='http://purl.org/ontology/bibo/Slide'>
  <h3>The end of the sign?</h3>
  <p><img border="0" width="425" height="319" src="http://ptsefton.com/wp-content/uploads/2012/12/wpid-Avatars-of-the-book-copy.html.html_Avatars-of-the-book-copy_filesimage0022.gif" v:shapes="Picture_x0020_7"></p>
  <p>We don’t herald the “end of the sign” just because people have
  annotated a sign. And remember, books have <i>always</i> been ‘hypertextual’ via referencing, quoting and wholesale
  stealing.</p>
  </section></section><section><h2>The end of the book?</h2><p>Liu says, in reference to some other symposium:</p><blockquote><p>As suggested by the title of this symposium (Bookishness: The New Fate
of Reading in the Digital Age), the best way to think about the book in the
digital age may well be to focus on bookishness. From the
point of view of the digital, the book has already gone away. So the remaining
question is “what happens to bookishness?” Or, again, “where does bookishness
go?”</p></blockquote><p>I don’t know where bookishness goes, or what happens to it.</p><p>From the point of view of this eResearch person books have not gone
away. And from the point of view of the digital (who knew the digital had a
point of view?) books have most emphatically not gone away and do have digital
analogues:</p><section typeof="http://purl.org/ontology/bibo/Slide"><h3>Rumours of demise of book greatly exaggerated</h3><p><img border="0" width="253" height="216" src="http://ptsefton.com/wp-content/uploads/2012/12/wpid-Avatars-of-the-book-copy.html.html_Avatars-of-the-book-copy_filesimage0042.gif" v:shapes="Picture_x0020_1"></p></section><p>That’s actually quite a lot of books, with no apparent existential
crisis. Not so many magazines; they’re not making the transition to eReaders
and tablets as happily as books. I think Craig Mod’s post on ‘<a href="http://craigmod.com/journal/subcompact_publishing/" onclick="pageTracker._trackPageview('/outgoing/craigmod.com/journal/subcompact_publishing/?referer=');">subcompact
publishing</a>’ is worth reading on this issue, he looks at new
models for magazine-like publications which might work better than the current
approach of putting entire magazines into tablet apps complete with page-layout
and ads. </p><p>It goes without saying that an <i>e</i>
book is different in many ways from a <i>paper</i>
book but nobody in the mainstream is having any trouble with the concept of
delivering or buying books digitally (though they may be less relaxed when they
finally realize that they’ve been renting the books, not building the family library).
Scholars can have all the fun they like debating the demise of the form, and
speculating about function, but the book is obviously very much alive. </p><section typeof="http://purl.org/ontology/bibo/Slide"><h3>Craig Mod’s subcompact recipe</h3><blockquote><p>I propose&nbsp;<i>Subcompact Publishing</i>&nbsp;tools and editorial
  ethos begin&nbsp;<i>(but not end)</i>&nbsp;with the following qualities:</p></blockquote><ul><li><p>Small issue sizes (3-7 articles / issue)</p></li><li><p>Small file sizes</p></li><li><p>Digital-aware subscription prices</p></li><li><p>Fluid publishing schedule</p></li><li><p>Scroll (don’t paginate)</p></li><li><p>Clear navigation</p></li><li><p>HTML(ish) based</p></li><li><p>Touching
  the open web</p></li></ul></section><p></p><p>I think that Mod’s list of qualities looks like at least a good check-list
for any bookish project to consider. </p><p>The book is not the only bounded object to make the transition from the
physical to the virtual. </p><section typeof="http://purl.org/ontology/bibo/Slide"><h3>Take today’s Wikipedia entry on <i>album</i>:</h3><blockquote><p>In
  musical usage the word was used for collections of short pieces of <a href="http://en.wikipedia.org/wiki/Printed_music" title="Printed music" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Printed_music?referer=');">printed
  music</a>&nbsp;from
  the early nineteenth century.<a href="http://en.wikipedia.org/wiki/Album#cite_note-2" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Album_cite_note-2?referer=');"><sup>[2]</sup></a>&nbsp;Later, collections
  of related&nbsp;<a href="http://en.wikipedia.org/wiki/78rpm_record" title="78rpm record" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/78rpm_record?referer=');">78rpm records</a>&nbsp;were bundled in book-like albums.<a href="http://en.wikipedia.org/wiki/Album#cite_note-3" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Album_cite_note-3?referer=');"><sup>[3]</sup></a>&nbsp;When&nbsp;<a href="http://en.wikipedia.org/wiki/Long-playing_record" title="Long-playing record" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Long-playing_record?referer=');">long-playing records</a>&nbsp;were introduced, a
  collection of pieces on a single record was called an album; the word was
  extended to other recording media such as&nbsp;<a href="http://en.wikipedia.org/wiki/Compact_disc" title="Compact disc" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Compact_disc?referer=');">compact
  disc</a>,&nbsp;<a href="http://en.wikipedia.org/wiki/MiniDisc" title="MiniDisc" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/MiniDisc?referer=');">MiniDisc</a>,&nbsp;<a href="http://en.wikipedia.org/wiki/Compact_audio_cassette" title="Compact audio cassette" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Compact_audio_cassette?referer=');">Compact audio cassette</a>, and digital or&nbsp;<a href="http://en.wikipedia.org/wiki/MP3" title="MP3" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/MP3?referer=');">MP3</a>&nbsp;albums, as they
  were introduced.<a href="http://en.wikipedia.org/wiki/Album#cite_note-About_Vinyl_Records-4" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/Album_cite_note-About_Vinyl_Records-4?referer=');"><sup>[4]</sup></a><u><sup></sup></u></p><p><span property="cites" typeof="http://schema.org/ScholarlyArticle"><link itemprop="url" href="data:application/json,%A0%20%20%20%7B%22citationID%22%3A%22szdpnOJ5%22%2C%22properties%22%3A%7B%22formattedCitation%22%3A%22%28Anon.%20%20%202012%29%22%2C%22plainCitation%22%3A%22%28Anon.%202012%29%22%7D%2C%22citationItems%22%3A%5B%7B%22id%22%3A1338%2C%22uris%22%3A%5B%22http%3A//zotero.org/users/568/items/SGQM5SUA%22%5D%2C%22uri%22%3A%5B%22http%3A//zotero.org/users/568/items/SGQM5SUA%22%5D%2C%22itemData%22%3A%7B%22id%22%3A1338%2C%22type%22%3A%22entry-encyclopedia%22%2C%22title%22%3A%22Album%22%2C%22container-title%22%3A%22Wikipedia%2C%20%20%20the%20free%20encyclopedia%22%2C%22abstract%22%3A%22An%20album%20is%20a%20book%20%20%20used%20for%20the%20collection%20and%20preservation%20of%20miscellaneous%20items%20such%20as%20%20%20photographs%2C%20postage%20stamps%2C%20newspaper%20clippings%2C%20visitors%27%20comments%2C%20etc.%5B1%5D%20%20%20The%20word%20later%20became%20widely%20used%20to%20describe%20a%20collection%20of%20audio%20%20%20recordings%20of%20pieces%20of%20music%20on%20a%20single%20gramophone%20record%2C%20cassette%2C%20%20%20compact%20disc%2C%20or%20via%20digital%20distribution.%5B1%5D%22%2C%22URL%22%3A%22http%3A//en.wikipedia.org/w/index.php%3Ftitle%3DAlbum%26oldid%3D526533373%22%2C%22note%22%3A%22Page%20%20%20Version%20ID%3A%20%20%20526533373%22%2C%22language%22%3A%22en%22%2C%22issued%22%3A%7B%22year%22%3A2012%2C%22month%22%3A12%2C%22day%22%3A5%7D%2C%22accessed%22%3A%7B%22year%22%3A2012%2C%22month%22%3A12%2C%22day%22%3A11%7D%7D%7D%5D%2C%22schema%22%3A%22https%3A//github.com/citation-style-language/schema/raw/master/csl-citation.json%22%7D%20%20%20"><span itemprop="label">(Anon. 2012)</span><span property="cites" typeof="http://schema.org/ScholarlyArticle"><meta itemprop="id" content="1338"><meta itemprop="type" content="entry-encyclopedia"><meta itemprop="title" content="Album"><meta itemprop="container-title" content="Wikipedia,   the free encyclopedia"><meta itemprop="abstract" content="An album is a book   used for the collection and preservation of miscellaneous items such as   photographs, postage stamps, newspaper clippings, visitors" comments,="" etc.[1]="" the="" word="" later="" became="" widely="" used="" to="" describe="" a="" collection="" of="" audio="" recordings="" pieces="" music="" on="" single="" gramophone="" record,="" cassette,="" compact="" disc,="" or="" via="" digital="" distribution.[1]'=""><meta itemprop="URL" content="http://en.wikipedia.org/w/index.php?title=Album&amp;oldid=526533373"><meta itemprop="note" content="Page   Version ID:   526533373"><meta itemprop="language" content="en"><span itemprop="issued"><meta itemprop="year" content="2012"><meta itemprop="month" content="12"><meta itemprop="day" content="5"></span><span itemprop="accessed"><meta itemprop="year" content="2012"><meta itemprop="month" content="12"><meta itemprop="day" content="11"></span><link itemprop="uri" href="http://zotero.org/users/568/items/SGQM5SUA"></span></span></p></blockquote></section><p>And the ‘album’ is still alive even in all-digital online distribution. If
the recorded-music-album can survive, and it obviously has, then the much older
book is no danger. In a nice bit of back-formation, the scholars at Amazon have
made the link between the book-omnibus or magazine to the record-album and come
up with a way of describing long essays that they presumably feel a bit coy
about marketing as eBooks. </p><p>Singles:<img border="0" width="131" height="21" src="http://ptsefton.com/wp-content/uploads/2012/12/wpid-Avatars-of-the-book-copy.html.html_Avatars-of-the-book-copy_filesimage0062.gif" v:shapes="Picture_x0020_2">
</p><p>I should point out that this workshop was not about the end
of the book. In fact, Maria Angel asked, in the introduction; how might the
constraints, and bounds of the book create an interesting space for innovation?</p></section><section><h2>Interesting things about C21 books</h2><section typeof="http://purl.org/ontology/bibo/Slide"><h3>Mapping the enlightenment book trade</h3><p>The book trade no longer looks like <a href="http://chop.leeds.ac.uk/stn/interface/query_places.php?t=author&amp;e=rawsales&amp;id=au0000023&amp;pa=on&amp;sa=on&amp;ea=on&amp;ta=on&amp;d1=01&amp;m1=01&amp;y1=1769&amp;d2=31&amp;m2=12&amp;y2=1794&amp;g=town&amp;d=map" onclick="pageTracker._trackPageview('/outgoing/chop.leeds.ac.uk/stn/interface/query_places.php?t=author_amp_e=rawsales_amp_id=au0000023_amp_pa=on_amp_sa=on_amp_ea=on_amp_ta=on_amp_d1=01_amp_m1=01_amp_y1=1769_amp_d2=31_amp_m2=12_amp_y2=1794_amp_g=town_amp_d=map&amp;referer=');">this</a>:</p><p><img border="0" width="417" height="124" src="http://ptsefton.com/wp-content/uploads/2012/12/wpid-Avatars-of-the-book-copy.html.html_Avatars-of-the-book-copy_filesimage0082.gif" v:shapes="Picture_x0020_4"></p><p>Figure 1
  Searching for a sales destinations for an author</p><p>(From a project led by Professor Simon Burrows of Leeds, who is <a href="http://frenchbooktrade.wordpress.com/2012/12/08/future-arrangements-for-fbtee/" onclick="pageTracker._trackPageview('/outgoing/frenchbooktrade.wordpress.com/2012/12/08/future-arrangements-for-fbtee/?referer=');">joining
  UWS soon</a>)</p></section><section typeof="http://purl.org/ontology/bibo/Slide"><h3>A map</h3><h2><img border="0" width="417" height="295" src="http://ptsefton.com/wp-content/uploads/2012/12/wpid-Avatars-of-the-book-copy.html.html_Avatars-of-the-book-copy_filesimage0102.gif" v:shapes="Picture_x0020_5"></h2></section><p>Figure 2 The search
returns a map</p><section typeof="http://purl.org/ontology/bibo/Slide"><h3>Potential research project: What does a map of the modern
  book/eBook trade look like?</h3><p>How to visualise this?</p><ul><li><p>Vertical sales channels*</p></li><li><p>Software distribution across channels and
  device types.**</p></li><li><p>Geographic distribution of channels </p></li><li><p>DRM; Digital Rights (Restrictions) Management.</p></li><li><p>Copyright territories</p></li><li><p>Geographic distribution of DMCA-type laws</p></li><li><p>The naughty-net DRM-free distribution system.</p></li></ul></section></section><li><p>* There is some ‘leakage’ between channels, eg Kindle apps on Apple
iOS, but there complicated dynamics at play, such as no in-app purchasing on
Kindle.</p></li><li><p>**Globally, the PC is still the most popular eBook device, but eReaders
win in the US and UK. (According to this <a href="http://www.the-digital-reader.com/2012/03/20/bowker-amazon-dominates-the-world-ebook-market/#.UMeRBpOUpus" onclick="pageTracker._trackPageview('/outgoing/www.the-digital-reader.com/2012/03/20/bowker-amazon-dominates-the-world-ebook-market/_.UMeRBpOUpus?referer=');">summary</a>
of a <a href="http://www.bowker.com/en-US/aboutus/press_room/2012/pr_04302012.shtml" onclick="pageTracker._trackPageview('/outgoing/www.bowker.com/en-US/aboutus/press_room/2012/pr_04302012.shtml?referer=');">2012
report</a>).</p></li><section><h2>Useful functions of the book?</h2><section typeof="http://purl.org/ontology/bibo/Slide"><h3>&nbsp;Persistence,
  archive-ability</h3><h3><img border="0" width="418" height="166" src="http://ptsefton.com/wp-content/uploads/2012/12/wpid-Avatars-of-the-book-copy.html.html_Avatars-of-the-book-copy_filesimage0122.gif" v:shapes="Picture_x0020_6"></h3><p>Figure 3
  Don&#8217;t let your scholarly works/tools end up all, like, 404</p></section><p>The above screenshot is one of the sights I saw when looking for
PreE – the post book research environment mentioned by Liu. All the
researchers at our symposium should be worrying about this. How are you going
to preserve all your works, tools and experiments? </p><section typeof="http://purl.org/ontology/bibo/Slide"><h3>Preservation?</h3><p>When I raised preservation as an issue, Jason Nelson said
  (more or less):</p><blockquote><p>Don’t obsess about preserving
  everything. Build things that people love – and they’ll work to
  preserve. </p></blockquote><p>Good point – but don’t lose sight of the value of a
  professional portfolio (and not everyone’s a rock-star internet poet like
  Jason :).</p></section><section typeof="http://purl.org/ontology/bibo/Slide"><h3>eBooks <i>are</i> little
  web sites</h3><ul><li><p>HTML has more or less ‘won’ as the basis for
  most eBook formats.</p></li><li><p>EPUB is the standard. A book is essentially a
  (complicated) zipped-up website.</p></li><li><p>Beware of licensing/platform traps like the
  iBooks authoring application.</p></li><li><p>See the <a href="http://jiscpub.blogs.edina.ac.uk/final-report/" onclick="pageTracker._trackPageview('/outgoing/jiscpub.blogs.edina.ac.uk/final-report/?referer=');">JISC EPUB project</a>,
  on which I worked in 2011.</p></li><li><p>Think about how to design things and
  interfaces to things that are driven by declarative markup (ie – be explict
  first, delightful second)</p></li></ul></section><section typeof="http://purl.org/ontology/bibo/Slide"><h3>What about apps? Games?</h3><p>&#8211; &nbsp;see Jason Nelson &#8211;</p></section></section><section><h2>Potential projects?</h2><p>I figured there might be postgrads and others at the symposium
interested in where to direct their bookish work here’s a list from the
technical perspective.</p><section typeof="http://purl.org/ontology/bibo/Slide"><h3>Arising from the JISC EPUB project</h3><p><a href="http://jiscpub.blogs.edina.ac.uk/final-report/#43" onclick="pageTracker._trackPageview('/outgoing/jiscpub.blogs.edina.ac.uk/final-report/_43?referer=');">Recommendations
  for further work</a>:</p><ol type="1"><li><p><a name="44"><strong>Provide rich search tools for individual
  collections of ebooks</strong><b></b></a></p></li><li><p><a name="45"><strong>Tools for
  generating or traversing ebook citations</strong></a><b></b></p></li><li><p><a name="46"><strong>Development
  of a pilot to produce ebooks with linked-data content*</strong></a><b></b></p></li><li><p><a name="47"><strong>Native
  EPUB output for Microsoft Word or Open Office</strong></a><b></b></p></li><li><p><a name="48"><strong>LaTeX to
  EPUB 3/MathML</strong></a><b></b></p></li><li><p><a name="49"><strong>Ereading
  systems with scholarly annotation systems</strong></a><strong></strong></p></li><li><p><strong>Community
  resources for individual scholars wishing to epublish</strong><b></b></p></li><li><p><a name="51"><strong>Aggregate
  resources for digital conversion for small scholarly presses</strong></a><b></b></p></li><li><p><a name="52"><strong>Maximize
  use of orphan works</strong></a><b></b></p></li><li><p><a name="53"><strong>Community
  resources for institutions with digital collections</strong></a></p></li></ol></section><section typeof="http://purl.org/ontology/bibo/Slide"><h3>Think about separating data from presentation</h3><p>To finish, I tried to show a little of what is possible
  with declarative semantically-marked up HTML5, as one potential means to
  create new bookish things that might last.</p><ul><li><p>Think about how to separate content from
  presentation and ‘engine’. (Might not be possible in some avant-garde
  experiments, but there are usually ways).</p></li><li><p>Consider standards-based experiments like <a href="http://wraggelabs.com/shed/presentations/ndf2012/storydata/" onclick="pageTracker._trackPageview('/outgoing/wraggelabs.com/shed/presentations/ndf2012/storydata/?referer=');">this one
  from Tim Sheratt</a>.</p></li></ul></section></section><section><h2>Conclusion</h2><p>Well, there isn’t a conclusion really as these are just
notes but I’m sure that continued work led by Anna and Maria and the Writing
and Society Research Centre will be critically important to UWS, seeing as
we’re the first Australian university to give <a href="http://www.uws.edu.au/learning_teaching/learning_and_teaching/ipad_initiative" onclick="pageTracker._trackPageview('/outgoing/www.uws.edu.au/learning_teaching/learning_and_teaching/ipad_initiative?referer=');">Every
New Student a Free iPad</a>. </p><section typeof="http://purl.org/orb/References"><p>Anon. 2012. “Album.” <i>Wikipedia, the Free Encyclopedia</i>.
http://en.wikipedia.org/w/index.php?title=Album&amp;oldid=526533373.</p><p>Drucker, J. 2010. “Graphesis: Visual Knowledge
Production and Representation.” <i>Poetess Archive Journal</i> 2 (1):
1–50.</p><p>Liu, Alan. 2009. “The End of the End of the
Book: Dead Books, Lively Margins, and Social Computing.” http://hdl.handle.net/2027/spo.act2080.0048.404.</p><br clear="all">

<hr align="left" size="1" width="33%">

</section><p><a style="mso-footnote-id:ftn1" href="#_ftnref1" name="_ftn1" title=""><span class="MsoFootnoteReference"><span class="MsoFootnoteReference">[1]</span></span></a>
There was one aspect of Jason’s work which I can’t talk about here which I
think will make a fantastic online case study in research data archiving. I’m
going to suggest it to the eResearch and repository staff at Griffith.</p><p></p></section></section></article>
	</div>]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2012/12/20/new-avatars-of-the-book-in-digital-culture-2.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Receding Repository Software?</title>
		<link>http://ptsefton.com/2012/11/19/receding-repository-software.htm</link>
		<comments>http://ptsefton.com/2012/11/19/receding-repository-software.htm#comments</comments>
		<pubDate>Sun, 18 Nov 2012 23:21:09 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/?p=1558</guid>
		<description><![CDATA[&#8220;Receding Repository Software?&#8221; by Peter (pt) Sefton is licensed under a Creative Commons Attribution 3.0 Unported License.I’m leading a brief session at the CAIRSS community days today (CAIRSS is the national repository support service for Australasia). The title is “Emerging Repository Software”, but I thought I’d turn that around and propose that the future of [...]]]></description>
			<content:encoded><![CDATA[	<div>
	<span property="http://purl.org/dc/terms/title" content="Receding Repository Software?"><!--  --></span>
			<span property='http://purl.org/dc/terms/creator' content='ptsefton'><!-- --></span>
	
<article itemscope="itemscope" itemtype="http://schema.org/ScholarlyArticle"><section>
<a rel="license" href="http://creativecommons.org/licenses/by/3.0/deed.en_US" onclick="pageTracker._trackPageview('/outgoing/creativecommons.org/licenses/by/3.0/deed.en_US?referer=');"><img alt="Creative Commons License" style="border-width:0" src="http://i.creativecommons.org/l/by/3.0/88x31.png" /></a><br /><span xmlns:dct="http://purl.org/dc/terms/" property="dct:title">&#8220;Receding Repository Software?&#8221;</span> by <a xmlns:cc="http://creativecommons.org/ns#" href="http://nla.gov.au/nla.party-541658" property="cc:attributionName" rel="cc:attributionURL" onclick="pageTracker._trackPageview('/outgoing/nla.gov.au/nla.party-541658?referer=');">Peter (pt) Sefton</a> is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by/3.0/deed.en_US" onclick="pageTracker._trackPageview('/outgoing/creativecommons.org/licenses/by/3.0/deed.en_US?referer=');">Creative Commons Attribution 3.0 Unported License</a>.<p>I’m leading a brief session at the CAIRSS community days today
(CAIRSS is the national repository support service for Australasia). The title
is “Emerging Repository Software”, but I thought I’d turn that around and
propose that the future of institutional repositories is to fade into the
background.</p><p>Here are my notes for the session.</p><p>Take this screenshot of the Griffith University repository.
See here’s the default browse-screen for publications.</p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>The Griffith Publications repository</h2><p><img width="417" height="251" src="http://ptsefton.com/wp-content/uploads/2012/11/wpid-Receding_repo-copy.html.html_Receding_repo-copy_filesimage002.png" v:shapes="_x0000_i1029"></p></section><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Hold on! That’s not the repository</h2><p>This is the Research Hub which ties together data from a
  number of different sources to provide a joined-up view of publications in
  the context of other research information.</p><p><a href="http://research-hub.griffith.edu.au/publications" onclick="pageTracker._trackPageview('/outgoing/research-hub.griffith.edu.au/publications?referer=');">http://research-hub.griffith.edu.au/publications</a></p></section><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>The repository has faded into the background</h2><p>Much like this invisible dog.</p><p><img border="0" width="461" height="346" src="http://ptsefton.com/wp-content/uploads/2012/11/wpid-Receding_repo-copy.html.html_Receding_repo-copy_filesimage004.png" v:shapes="Picture_x0020_2"></p></section><p>Just in case these Griffith people get big heads, I do have
to point out that while I think this service points to the future of the
Institutional Repository as an embedded part of the research information
systems of the university, the work’s not all done yet.</p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>But, this is not perfect</h2><p>The ‘find it yourself’ button is sub-optimal: <img border="0" width="202" height="155" src="http://ptsefton.com/wp-content/uploads/2012/11/wpid-Receding_repo-copy.html.html_Receding_repo-copy_filesimage006.png" v:shapes="Picture_x0020_1"></p><p>And check out this URL:</p><p><a href="http://research-hub.griffith.edu.au/collections#fq={!tag=classgroup}classgroup%3A%22http%3A%2F%2Fresearch-hub.griffith.edu.au%2Findividual%2FvitroClassGroupcollections%22" onclick="pageTracker._trackPageview('/outgoing/research-hub.griffith.edu.au/collections_fq=_tag=classgroup_classgroup_3A_22http_3A_2F_2Fresearch-hub.griffith.edu.au_2Findividual_2FvitroClassGroupcollections_22?referer=');">http://research-hub.griffith.edu.au/collections#fq={!tag=classgroup}classgroup%3A%22http%3A%2F%2Fresearch-hub.griffith.edu.au%2Findividual%2FvitroClassGroupcollections%22</a>
  </p><p>That really should be something like: <a href="http://research-hub.griffith.edu.au/collections" onclick="pageTracker._trackPageview('/outgoing/research-hub.griffith.edu.au/collections?referer=');">http://research-hub.griffith.edu.au/collections</a></p></section><p>This stuff is hard to get right. The hardest bit is getting
good quality metadata so things <i>do</i>
join up.</p><p>There are two new kinds of repository we’re seeing in
Australia, thanks to investment from the Australian National Data Service. </p><ol type="1"><li><p>There are many “Data Capture” systems for
researchers to manage data early in its lifecycle. </p></li><li><p>These feed into Data Catalogues or Data
Repositories – there’s a lot of terminological confusion here because of
the way the funding streams have been structured.</p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Data capture Systems</h2><p>See the ANDS <a href="https://projects.ands.org.au/getAllProjects.php?start=dc" onclick="pageTracker._trackPageview('/outgoing/projects.ands.org.au/getAllProjects.php?start=dc&amp;referer=');">list of DC
  projects</a>. Here’s one I selected at random:</p><p><img border="0" width="416" height="157" src="http://ptsefton.com/wp-content/uploads/2012/11/wpid-Receding_repo-copy.html.html_Receding_repo-copy_filesimage008.png" v:shapes="Picture_x0020_3"></p><p>It’s difficult to get useful information about many of
  these projects.</p></section></li></ol><p>There are many data capture projects, and all of them will
presumably need to be hooked up to systems like the Griffith Research Hub at
some stage.</p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>It’s a jungle out there, mount an expedition!</h2><p><img border="0" width="417" height="210" src="http://ptsefton.com/wp-content/uploads/2012/11/wpid-Receding_repo-copy.html.html_Receding_repo-copy_filesimage010.png" v:shapes="Picture_x0020_5"></p></section><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Data <s>is</s> are the new black</h2><p>A few notable projects:</p><ul><li><p>Lincoln’s ‘API first’ research data management
  app, <a href="http://orbital.blogs.lincoln.ac.uk/" onclick="pageTracker._trackPageview('/outgoing/orbital.blogs.lincoln.ac.uk/?referer=');">Orbital</a>. </p></li><li><p><a href="http://imageweb.zoo.ox.ac.uk/wiki/index.php/ADMIRAL" onclick="pageTracker._trackPageview('/outgoing/imageweb.zoo.ox.ac.uk/wiki/index.php/ADMIRAL?referer=');">Admiral</a> at
  Oxford; a two-tier system (capture / archive).</p></li><li><p>Islandora <a href="http://www.slideshare.net/d_wilcox/stewarding-research-data-with-islandora" onclick="pageTracker._trackPageview('/outgoing/www.slideshare.net/d_wilcox/stewarding-research-data-with-islandora?referer=');">in
  a few research data contexts</a>.</p></li></ul><p>See JISC’s <a href="http://www.jisc.ac.uk/whatwedo/programmes/~/link.aspx?_id=CA545CC36C54494CA1D5EFAC8C79320A&amp;_z=z" onclick="pageTracker._trackPageview('/outgoing/www.jisc.ac.uk/whatwedo/programmes/_/link.aspx?_id=CA545CC36C54494CA1D5EFAC8C79320A_amp_z=z&amp;referer=');">Managing
  Research Data</a> programme for more.</p><p>(I wanted to mention the Hydra Fedora-commons toolkit as
  well lots of work on <a href="https://wiki.duraspace.org/display/hydra/Partners+and+Implementations" onclick="pageTracker._trackPageview('/outgoing/wiki.duraspace.org/display/hydra/Partners+and+Implementations?referer=');">archives
  and digital libraries</a>).</p></section><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>The current opportunity for libraries</h2><p>Use your metadata skills to help with “the great joining
  up”.</p><ul><li><p>Get the governance right (<a href="http://metadata-stores.blogspot.com.au/2012/08/project-governance-roles-and.html" onclick="pageTracker._trackPageview('/outgoing/metadata-stores.blogspot.com.au/2012/08/project-governance-roles-and.html?referer=');">see
  the ANDS view of this</a>).</p><p>Research systems are <i>for</i> the researchers.</p><p>&nbsp;(So projects should report to the
  Deputy Vice Chancellor Research).</p></li><li><p>Start working with Research Data – not
  just on repositories, but on <i>useful</i>
  applications.</p></li><li><p>Get involved in tag-and-release programs for
  the feral data capture projects roaming Australia’s universities.</p></li><li><p>Do more work on ‘Digital library’ projects
  beyond the institutional publications repository.</p></li></ul></section></section></article>
	</div>]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2012/11/19/receding-repository-software.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Culture and climate</title>
		<link>http://ptsefton.com/2012/11/12/culture-and-climate.htm</link>
		<comments>http://ptsefton.com/2012/11/12/culture-and-climate.htm#comments</comments>
		<pubDate>Mon, 12 Nov 2012 12:17:48 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/?p=1544</guid>
		<description><![CDATA[Culture and climateI was invited to attend the planning day for the Institute for Culture and Society (ICS) at the University of Western Sydney, to talk about the eResearch team at UWS, discuss collaboration tools, and show a few useful, relevant examples of eResearch in the humanities. Here are some rough notes for the discussion. [...]]]></description>
			<content:encoded><![CDATA[	<div>
	<span property="http://purl.org/dc/terms/title" content="Culture and climate"><!--  --></span>
			<span property='http://purl.org/dc/terms/creator' content='ptsefton'><!-- --></span>
	<article itemscope="itemscope" itemtype="http://schema.org/ScholarlyArticle"><section><h1>Culture and climate</h1><p>I was invited to attend the planning day
for the Institute for Culture and Society (<a href="http://www.uws.edu.au/ics" onclick="pageTracker._trackPageview('/outgoing/www.uws.edu.au/ics?referer=');">ICS</a>)
at the University of Western Sydney, to talk about the <a href="http://eresearch.uws.edu.au" onclick="pageTracker._trackPageview('/outgoing/eresearch.uws.edu.au?referer=');">eResearch team</a> at UWS, discuss
collaboration tools, and show a few useful, relevant examples of eResearch in
the humanities. </p><p>Here are some rough notes for the
discussion. </p><p>For eResearch I will talk about our small
eResearch website, and on the subject of collaboration tools I’ll be evasive. </p><section><h2>The problem with surveys of collaboration tools</h2><p>While lots of people are interested in
finding out how to collaborate using modern techniques we really need to talk
this through on a project by project basis. &nbsp;I tried to write about collaboration tools at the
Australian Digital Futures Institute after complaints from an education
researcher in the institute about the bewildering array of <i>stuff </i>we used to get things done. I gather it was like turning up
for work as a carpenter’s apprentice and being introduced to <i>all</i> the tools in the ute at once. </p><p>(That piece is still online, but it is of
historical interest only, as the tools have all changed. Not to mention it is
very long winded and mentions some USQ tools that aren’t relevant to you, still
if you’d like to see <a href="http://ptsefton.com/2009/07/08/adfi-collaboration-and-documentation-tools.htm">how
I explained Twitter and hashtags, and predicted the demise of Google Docs ‘cos
Google Wave had arrived</a> then you might enjoy it. Otherwise, file as too
long, don’t read.)</p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Dr Sefton’s quick cure for a lack of
  online collaboration</h2><p>If in doubt, start a Google Group. If
  symptoms persist, see me in the morning; I may put you into one of my group
  therapy sessions.</p></section><p>Ok, so maybe that advice about
collaboration tools is a bit too short. But rather than list tools, I’ll put up
this list of collaborative tasks (not tools) as potential discussion topics to
come back to, either in this session or in a future dedicated workshop or
consultation.</p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Some collaboration modes/tasks</h2><ul><li><p><b>Talking to each other:</b> email, video/audio
  conferencing, discussions </p></li><li><p><b>Writing together:</b> word processing,
  wikis, Content Management Systems</p></li><li><p><b>Publishing:</b> blogs, wikis, Content
  Management Systems, pod/video-casting, CVs, microblogging</p></li><li><p><b>Remembering and sharing:</b> links,
  reference materials, bibliographic references</p></li><li><p><b>Storing:</b> stuff</p></li></ul><p><b>Which
  tools do you favour and why?</b></p></section></section><section><h2>eResearch for Culture and Society</h2><p>Back to the more interesting topic –
eResearch as it relates to culture and society. </p><p>On the way to work on Monday I rode through
local instances of some lovely spring weather (cold enough for me to want a
jacket descending the mountains, warm by the time I got to the river), which
got me thinking about the climate and in turn the Hawkesbury Institute for the
Environment (<a href="http://www.uws.edu.au/hie" onclick="pageTracker._trackPageview('/outgoing/www.uws.edu.au/hie?referer=');">HIE</a>), which is just downstream
from Penrith.</p><p>The eResearch team does a lot of work with
HIE and the connection is easy to see. We obviously need large amounts of data
to document, let alone model, climate, and we need to run climate simulations at
atmospheric and oceanic scale as well as at smaller scale, like models of
leaves or trees – all of which involves data management, computational tools
and global collaboration.</p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Weather, climate, and the ICS
  planning day reminded me of an <a href="http://books.google.com.au/books?hl=en&amp;lr=&amp;id=R26GCAZHw5MC&amp;oi=fnd&amp;pg=PA1&amp;ots=Nz766k4s4q&amp;sig=WaHhKvSWan_xYqrA3RJuQLq4QYU#v=onepage&amp;q&amp;f=false" onclick="pageTracker._trackPageview('/outgoing/books.google.com.au/books?hl=en_amp_lr=_amp_id=R26GCAZHw5MC_amp_oi=fnd_amp_pg=PA1_amp_ots=Nz766k4s4q_amp_sig=WaHhKvSWan_xYqrA3RJuQLq4QYU_v=onepage_amp_q_amp_f=false&amp;referer=');">analogy
  of Michael Halliday’s</a>:</h2><blockquote><p>We&nbsp;can&nbsp;perhaps&nbsp;use&nbsp;an&nbsp;analogy&nbsp;from
  the&nbsp;physical&nbsp;world: the&nbsp;difference&nbsp;between&nbsp;“culture” and
  “situation” is&nbsp;rather&nbsp;like&nbsp;that&nbsp;between&nbsp;the “climate” and the
  “weather”.&nbsp;</p></blockquote></section><p>I used to think about this analogy a lot,
particularly when some lecturer was getting us undergrads to formulate grammar
rules from half an A4 page of dodgy examples. Those ‘models’ (including
Halliday’s) were severely limited by the number of data points that supported them.
</p><p>Then I was introduced to corpus linguistics
in the early 1990s in a workshop by <a href="http://en.wikipedia.org/wiki/John_McHardy_Sinclair" title="John McHardy Sinclair" onclick="pageTracker._trackPageview('/outgoing/en.wikipedia.org/wiki/John_McHardy_Sinclair?referer=');">John Sinclair</a>. In the workshop multiple
instances of words in context were used as data to help decide what they
actually <i>mean</i>.&nbsp; The Collins COBUILD dictionaries that
Sinclair was involved in producing gave quite a different picture of the
‘climate’ of English that the traditional dictionary approach of
forward-copying definitions by using, you know, evidence to decide what words
mean. </p><p>Fast forward to 2012 and the Macquarie
dictionary decided to re-look at its definition of misogyny, after the word got
a bit of an airing in the Australian Parliament, as noted in <a href="http://www.macquariedictionary.com.au/pdf/letter_from_ed.pdf" onclick="pageTracker._trackPageview('/outgoing/www.macquariedictionary.com.au/pdf/letter_from_ed.pdf?referer=');">this letter</a>
from the Macquarie’s editor. I knew that they would have been able to get
plenty of data on the term’s use, and I thought of John Sinclair again. But the
letter didn’t talk about data, curiously, it talked about house-work.</p><blockquote><p>As Editor of the Macquarie Dictionary, I picture myself as the woman
with the broom and mop and bucket cleaning the language off the floor after the
party is over.</p></blockquote><p>The dictionary is one sort of ‘cultural
climate’ record, so of course we have to have sceptics, like <a href="http://www.heraldsun.com.au/opinion/macquarie-dictionary-widens-definition-of-misogyny-after-julia-gillards-putdown-of-tony-abbott/story-fncw91kq-1226498914785" onclick="pageTracker._trackPageview('/outgoing/www.heraldsun.com.au/opinion/macquarie-dictionary-widens-definition-of-misogyny-after-julia-gillards-putdown-of-tony-abbott/story-fncw91kq-1226498914785?referer=');">this
example</a> form the Herald Sun’s Patrick Carlyon, who
like a good climate change sceptic brings his own data to the table.</p><blockquote><p>Given the
ever-changing flow of words and their meaning, Macquarie has announced a raft
of further definition shadings to reflect recent political events and current
affairs:</p><p>Dog: To be known
also as &#8220;cat&#8221;, after a two-year-old boy at an East Brighton childcare
centre pointed at a chihuahua and meowed.</p></blockquote><p>These days, dictionary editors don’t need no
fancy ‘corpus’ like they used on those revolutionary Collins Dictionaries, as
we find out from <a href="http://www.macquariedictionary.com.au/pdf/editors_response.pdf" onclick="pageTracker._trackPageview('/outgoing/www.macquariedictionary.com.au/pdf/editors_response.pdf?referer=');">another
letter</a> they have the Internets, and not only that, they can still copy from
others, just like they used to.</p><blockquote><p>When it is brought to our attention, we are lucky these days to be able
to draw on the immense resources of the internet such as newsfeeds, blogs,
videos, etc., to research the use of the word over time, in different areas of
the world, and in different kinds of texts. Of course, we can also check other
dictionaries, to see if the same conclusions have been reached by our fellow
lexicographers. </p><p>http://www.macquariedictionary.com.au/pdf/editors_response.pdf</p></blockquote><p>I’m telling you this because I wanted to
show a simple eResearch example from the cultural sphere. Halliday’s climate
analogy seemed apt. Just as we know climate science is done with lots of data
points, recording the weather at the highest possible scale that add up to a
climate record, we can study cultural phenomena such as language by looking at
data-points of various kinds. Text is an easy example, because it’s easy to
search and there’s now a lot of it to search.</p><p>Anyway, with all that in mind, I wanted
to ask the researchers from the institute:</p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>What infrastructure do you need to
  do ‘culture science’?</h2><p>Or is that a stupid/naïve/offensive question?</p></section><p></p><p>While people think about that I thought I’d
continue with a few examples, and come back to the discussion of collaboration
and eResearch tools at the end.</p><p>The Feds don’t seem to think this idea of
‘culture science’ it is entirely stupid, as they have funded a couple of
million-dollar plus projects to build virtual laboratories, not just in
sciences but in the humanities.</p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>NeCTAR (Aus govt funding) Round 1
  Virtual lab projects</h2><ul><li><p><a href="http://www.nectar.org.au/virtual-geophysics-laboratory" onclick="pageTracker._trackPageview('/outgoing/www.nectar.org.au/virtual-geophysics-laboratory?referer=');"><b>CSIRO &#8211; Virtual Geophysics Laboratory</b></a></p></li><li><p><a href="https://www.nectar.org.au/genomics-virtual-laboratory-0" onclick="pageTracker._trackPageview('/outgoing/www.nectar.org.au/genomics-virtual-laboratory-0?referer=');"><b>University of Queensland &#8211; Virtual Genomics
  Laboratory</b></a></p></li><li><p><a href="http://www.nectar.org.au/marine-virtual-laboratory" onclick="pageTracker._trackPageview('/outgoing/www.nectar.org.au/marine-virtual-laboratory?referer=');"><b>University of Tasmania &#8211; Marine Virtual
  Laboratory</b></a></p></li><li><p><a href="https://www.nectar.org.au/all-sky-virtual-observatory#overlay-context=virtual-laboratories-contract-negotiation-phase" target="_blank" onclick="pageTracker._trackPageview('/outgoing/www.nectar.org.au/all-sky-virtual-observatory_overlay-context=virtual-laboratories-contract-negotiation-phase?referer=');"><b>The All Sky Virtual
  Observatory</b></a></p></li><li><p><a href="https://www.nectar.org.au/climate-and-weather-science-laboratory" onclick="pageTracker._trackPageview('/outgoing/www.nectar.org.au/climate-and-weather-science-laboratory?referer=');"><b>Climate and Weather Science Laboratory</b></a></p></li><li><p><b>&gt;&gt;&gt; <a href="https://www.nectar.org.au/humanities-networked-infrastructure-huni-virtual-laboratory" target="_blank" onclick="pageTracker._trackPageview('/outgoing/www.nectar.org.au/humanities-networked-infrastructure-huni-virtual-laboratory?referer=');">Humanities
  Networked Infrastructure (HuNI) unlocking and uniting Australia&#8217;s cultural
  data</a> &lt;&lt;&lt; This one relates to culture</b></p></li><li><p><a href="https://www.nectar.org.au/genomics-virtual-laboratory#overlay-context=virtual-laboratories-contract-negotiation-phase" target="_blank" onclick="pageTracker._trackPageview('/outgoing/www.nectar.org.au/genomics-virtual-laboratory_overlay-context=virtual-laboratories-contract-negotiation-phase?referer=');"><b>The Genomics Virtual
  Laboratory</b></a></p></li><li><p><a href="https://www.nectar.org.au/characterisation-virtual-laboratory" target="_blank" onclick="pageTracker._trackPageview('/outgoing/www.nectar.org.au/characterisation-virtual-laboratory?referer=');"><b>The Characterisation Virtual
  Laboratory: research environments for exploring inner space</b></a></p></li></ul><p>A question for ICS researchers – what
  kind of cultural data is important to you?</p></section><p>And again in round 2, there is a UWS-led
bid to build a virtual laboratory which is partly in the cultural domain;
contract negotiations proceeding on that one. This lab is very much like
‘climate science’ for linguistics, musicology et al, bringing together data
sets and letting researchers run tools on them, generate new analyses and
annotations and feed back, to be built by the UWS MARCS institute. </p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Round 2 Virtual Laboratories
  recommended for funding</h2>ul><li><p>The Industrial Ecology Lab&nbsp; </p></li><li><p>Marine Virtual Laboratory
(MARVL)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </p></li><li><p>Biodiversity and Climate Change&nbsp;&nbsp;&nbsp; </p></li><li><p>Endocrine Genomics (EndoVL)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </p></li><li><p><b>&gt;&gt;&gt; Human Communication Science&lt;&lt;&lt; This one is
also in the humanities (and it’s UWS-led)</b></p></li></ul></section><<p></p><p>One of the data sources that HuNI is linking
into their Virtual Lab is the National Library’s Trove. To demonstrate this
I’ll want to try out Tim Sherratt’s QueryPic tool, searching across the Trove
Newspaper database for occurrances of terms to do with the workshop topic,
which broadly speaking means stuff about Asian studies, and the Asian century.
Tim’s tool is an example of an eResearch tool that’s completely data driven.</p><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>QueryPic showing searches for Asia
  and Asian in Trove Newspapers 1803–1954</h2><p><a href="http://dhistory.org/querypic/4t/" onclick="pageTracker._trackPageview('/outgoing/dhistory.org/querypic/4t/?referer=');">http://dhistory.org/querypic/4t/</a></p><p><img border="0" width="553" height="287" src="http://ptsefton.com/wp-content/uploads/2012/11/wpid-Culture_climate.docx.htm_Culture_climate.docx_filesimage002.jpg" v:shapes="Picture_x0020_1"></p></section><p></p><p>You can click on a data point to see a list
of articles. </p><p><img border="0" width="554" height="283" src="http://ptsefton.com/wp-content/uploads/2012/11/wpid-Culture_climate.docx.htm_Culture_climate.docx_filesimage004.jpg" v:shapes="Picture_x0020_2"></p><p>But be careful with these results! </p><p>Q. Why were the Aussie papers talking about
Asia so much in 1820? </p><section><h4>A. They were
talking about&nbsp; <span class="apple-converted-space"><u>a
ship.</u></span></h4><p>If only the Macquarie had something like
this.</p><p>(There are some issues with this tool, not
least of which is that this is not a stable, fixed data set, people are
actively improving it via crowdsourced editing, and the data set is expanding
so it would be impossible to reproduce results. I’ve suggested that a solution
would be to place snapshots of the data into the Research Data Storage
Infrastructure starting to roll out now via lead agency, The University of
Queensland so that researchers could work on known-stable corpora, and perform
tricks like reindexing to improve performance on this class of query.)</p><p>Contrast this approach of re-using existing
data in a fairly generic database to ask new questions with a very different
kind of eResearch application, the Dictionary of Sydney, a project of the Arts
Computing Lab at the University of Sydney; we can search for the <a href="http://dictionaryofsydney.org/entry/art_gallery_of_new_south_wales" onclick="pageTracker._trackPageview('/outgoing/dictionaryofsydney.org/entry/art_gallery_of_new_south_wales?referer=');">Art
Gallery of NSW</a> where we’ll be meeting, and from there browse a rich curated
web of relationships between entries about buildings, people, institutions etc.</p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Another way of recording culture:
  The Dictionary of Sydney</h2><p><a href="http://dictionaryofsydney.org/organisation/art_gallery_of_new_south_wales" onclick="pageTracker._trackPageview('/outgoing/dictionaryofsydney.org/organisation/art_gallery_of_new_south_wales?referer=');"><b>http://dictionaryofsydney.org/organisation/art_gallery_of_new_south_wales</b></a></p><p><img border="0" width="553" height="339" src="http://ptsefton.com/wp-content/uploads/2012/11/wpid-Culture_climate.docx.htm_Culture_climate.docx_filesimage006.jpg" v:shapes="Picture_x0020_3"></p></section><p></p><p>So, back to the question.</p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>What infrastructure do you need to
  do ‘culture science’?</h2></section><p></p></section></section></section>
<p>Copyright  Peter Sefton 2012. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia.</p>
<p>[Updated 2012-11-13, removed Andrew Leahy as co author]</article>
	</div>]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2012/11/12/culture-and-climate.htm/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Tip: Arrange dock icons by shape, colour to reduce seek-time</title>
		<link>http://ptsefton.com/2012/10/16/tip-arrange-dock-icons-by-shape-colour-to-reduce-seek-time.htm</link>
		<comments>http://ptsefton.com/2012/10/16/tip-arrange-dock-icons-by-shape-colour-to-reduce-seek-time.htm#comments</comments>
		<pubDate>Tue, 16 Oct 2012 12:38:22 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/?p=1534</guid>
		<description><![CDATA[Like the guy in this video I used to think it would be a good idea to arrange icons in the OS X dock by how often I used them, or maybe by type. But I found that whatever ordering I used I would have trouble finding things. I know that iTunes is a blue [...]]]></description>
			<content:encoded><![CDATA[	<div>
	<span property="http://purl.org/dc/terms/title" content="Tip: Arrange dock icons by shape, colour to reduce seek-time"><!--  --></span>
			<span property='http://purl.org/dc/terms/creator' content='ptsefton'><!-- --></span>
	<article itemscope="itemscope" itemtype="http://schema.org/ScholarlyArticle"><section><p>Like the guy in <a href="http://www.youtube.com/watch?v=dVT16BPr2DU" onclick="pageTracker._trackPageview('/outgoing/www.youtube.com/watch?v=dVT16BPr2DU&amp;referer=');">this video</a> I used to
think it would be a good idea to arrange icons in the OS X dock by how often I
used them, or maybe by type. But I found that whatever ordering I used I would
have trouble finding things. I know that iTunes
is a blue circle, but so is Skype, and Safari (I rarely use it but sometimes I
want to test something) – so the task of finding the app I want meant scanning
for blue circles and often zooming my attention to the wrong one – I hate to
say it, but they all look the same to me those pictures.</p><p>So, I now arrange them by shape, then colour. To seek, zoom in to the right shape-group. It’s easy to see the differences because they’re side by side rather than
spread out.</p><p>Circles and circle-like things:</p><p><img border="0" width="437" height="91" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-dock-tip.docx.htm_dock-tip.docx_filesimage002.jpg" v:shapes="Picture_x0020_5">’</p><p>Squarish things:</p><p><img border="0" width="553" height="72" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-dock-tip.docx.htm_dock-tip.docx_filesimage004.jpg" v:shapes="Picture_x0020_6"></p><p>Love those Apple apps that are so easy to
tell apart:</p><p><img border="0" width="156" height="56" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-dock-tip.docx.htm_dock-tip.docx_filesimage006.jpg" v:shapes="Picture_x0020_8"></p><p>Oh, and might as well arrange things with
letter-icons in alphabetical order.</p><p><img border="0" width="276" height="60" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-dock-tip.docx.htm_dock-tip.docx_filesimage007.png" v:shapes="Picture_x0020_3"></p><p>Et volia!</p><p><img border="0" width="553" height="26" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-dock-tip.docx.htm_dock-tip.docx_filesimage009.jpg" v:shapes="Picture_x0020_7"></p></section></article>
	</div>]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2012/10/16/tip-arrange-dock-icons-by-shape-colour-to-reduce-seek-time.htm/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>NeCTAR Über Dojo, Reproducible Research (UWS eResearch team in Cloud Land II)</title>
		<link>http://ptsefton.com/2012/10/08/nectar-uber-dojo-reproducible-research-uws-eresearch-team-in-cloud-land-ii.htm</link>
		<comments>http://ptsefton.com/2012/10/08/nectar-uber-dojo-reproducible-research-uws-eresearch-team-in-cloud-land-ii.htm#comments</comments>
		<pubDate>Mon, 08 Oct 2012 05:39:02 +0000</pubDate>
		<dc:creator>ptsefton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://ptsefton.com/?p=1521</guid>
		<description><![CDATA[Alf (Andrew) Leahy and I were recently in Melbourne for the NeCTAR Über Dojo event. By coming to this two day event you will be able to go back to your institution with a signed certificate showing that you&#8217;ve been &#8216;black belted&#8217; as a Cloud expert, specifically we&#8217;ll train and qualify you in using the [...]]]></description>
			<content:encoded><![CDATA[	<div>
	<span property="http://purl.org/dc/terms/title" content="NeCTAR Über Dojo, Reproducible Research (UWS eResearch team in Cloud Land II)"><!--  --></span>
			<span property='http://purl.org/dc/terms/creator' content='ptsefton'><!-- --></span>
	

<article itemscope="itemscope" itemtype="http://schema.org/ScholarlyArticle"><section><h1></h1><p>Alf (Andrew) Leahy and I were recently in
Melbourne for the <a href="http://melbourne-uberdojo.eventbrite.com/" onclick="pageTracker._trackPageview('/outgoing/melbourne-uberdojo.eventbrite.com/?referer=');">NeCTAR Über
Dojo event</a>. </p><blockquote><p>By coming to this
two day event you will be able to go back to your institution with a signed
certificate showing that you&#8217;ve been &#8216;black belted&#8217; as a Cloud expert,
specifically we&#8217;ll train and qualify you in using the following tools and data
in the Cloud:</p><ul><li><p>How to use image management tools for production level VM
management on the Cloud, i.e. Puppet or Chef</p></li><li><p>How to use libraries to access storage data APIs, e.g.
SWIFT/S3, NFS, Object Store, etc.</p></li><li><p>We&#8217;ll also use this event to get you the experts to tell us
what new features we need for all these new toys (I mean infrastructure ;-)</p></li></ul></blockquote><p></p><p></p><p>The climax of this thing was an Iron
Chef-style challenge. This is an initial rough post about our two hour hack
(which we cleaned up over a few more hours post-event). Andrew will write up
the event for the UWS eResearch blog. Thanks to Remko Duursma and Craig Barton
for letting use their code and data for the demo.</p><p></p><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Background: HIE* researchers use <a href="http://www.r-project.org/" onclick="pageTracker._trackPageview('/outgoing/www.r-project.org/?referer=');">R</a> </h2><p></p><p><img border="0" width="553" height="399" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-NeCTAR_uber_Dojo.docx.htm_NeCTAR_uber_Dojo.docx_filesimage002.jpg" v:shapes="Picture_x0020_1"></p><p>*Hawkesbury Institute for the Environment</p></section><p></p><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Meet Remko</h2><p><img border="0" width="554" height="282" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-NeCTAR_uber_Dojo.docx.htm_NeCTAR_uber_Dojo.docx_filesimage004.jpg" v:shapes="Picture_x0020_10"></p><p><a href="http://www.uws.edu.au/hie/people/researchers/doctor_remko_duursma" onclick="pageTracker._trackPageview('/outgoing/www.uws.edu.au/hie/people/researchers/doctor_remko_duursma?referer=');">http://www.uws.edu.au/hie/people/researchers/doctor_remko_duursma</a>
  </p></section><p></p><p></p><p></p><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>And Craig</h2><p></p><p><img border="0" width="554" height="281" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-NeCTAR_uber_Dojo.docx.htm_NeCTAR_uber_Dojo.docx_filesimage006.jpg" v:shapes="Picture_x0020_11"></p><p><a href="http://www.uws.edu.au/hie/people/admin_and_technical_staff/dr_craig_barton" onclick="pageTracker._trackPageview('/outgoing/www.uws.edu.au/hie/people/admin_and_technical_staff/dr_craig_barton?referer=');">http://www.uws.edu.au/hie/people/admin_and_technical_staff/dr_craig_barton</a> </p></section><p></p><p></p><p></p><p></p><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>They run R to clean data &amp; model
  stuff…</h2><p><img border="0" width="553" height="273" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-NeCTAR_uber_Dojo.docx.htm_NeCTAR_uber_Dojo.docx_filesimage008.jpg" v:shapes="Picture_x0020_4"></p></section><p></p><p></p><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>This R Notebook shows code and
  output together in HTML</h2><p><img border="0" width="553" height="589" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-NeCTAR_uber_Dojo.docx.htm_NeCTAR_uber_Dojo.docx_filesimage010.jpg" v:shapes="Picture_x0020_6"></p></section><p>The night before the challenge I asked
Remko and Craig if we could use some sample data and R code:</p><blockquote><p>Andrew (Alf) and I are at a workshop in Melbourne &#8211; I was wondering if I
can use this as a demo tomorrow &#8211; ie it will appear on screen but not be made
available over the net. The short notice is because the idea has only emerged
today.</p><p></p><p>The thing we&#8217;d be demoing would be to _simulate_ the following:</p><ul><li><p>Data set +
scripts like the attached is sitting in a web based repository.</p></li><li><p>Repository
offers user the opportunity to download the package&nbsp;OR &#8211; Get an interactive
R-Studio shell where they can re-run the data</p><ul><li><p>Use clicks
a &#8220;see with interactive shell&#8221; link.</p></li><li><p>Our
server:</p><ul><li><p>Fires up a virtual server in the cloud</p></li><li><p>Installs the server version of R-Studio</p></li><li><p>Creates a user account&nbsp;</p></li><li><p>Pushes the data package onto the new server</p></li><li><p>Unpacks the package</p></li><li><p>Sets up R-studio to run Knitr on the main
script to create HTML</p></li><li><p>sends a link to the R-Studio to the user (maybe
by email cos this might take a few minutes)</p></li></ul></li></ul></li><li><p>The user
can see the plots etc, and will have access to the R environment to tweak
things</p></li></ul></blockquote><p></p><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Our idea…</h2><p><img border="0" width="553" height="545" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-NeCTAR_uber_Dojo.docx.htm_NeCTAR_uber_Dojo.docx_filesimage012.png" v:shapes="Picture_x0020_8"></p><p></p></section><p></p><p></p><p></p><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Remko’s reponse</h2><blockquote><p>That sounds fantastic. Please go ahead! Wait actually those are
  Craig&#8217;s data :)**</p><p></p><p>&gt;*&nbsp;Get an interactive R-Studio shell where they can re-run the
  data</p><p></p><p>YES that is perfect!</p><p></p><p>greetings</p><p>remko</p><p></p></blockquote><p>**Craig gave us the go-ahead as well</p></section><p></p><p></p><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Initial quick and dirty demo approach</h2><p>We have written a simple CGI web script
  in Python which simulates a repository, with the capacity to orchestrate the
  below:</p><ul><li><p>Create a new NeCTAR machine
  using Python with the Boto library</p></li><li><p>Connect via SSH and install
  R-Studio and dependencies, and make a user account</p><p>TODO: Use Chef
  for the initial server build – we learned about Chef on Monday but not enough
  to be able to create a new ‘recipe’ in an hour or so.</p><p>TODO: Use a
  snapshot image so users don’t have to wait ten minutes for a machine to start
  up and install all the prerequisites</p></li><li><p>Copy the sample data set onto
  the VM</p><p>TODO:&nbsp; Use the cloud object storage so the data
  set is near to the VM (We know how, just didn’t have time)</p><p>TODO: See if we
  can get R-Studio to launch with an R Notebook by default and log-in
  automatically</p></li></ul><p></p></section><p></p><p></p><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Why is this cool?</h2><ul><li><p><b>Our
  researcher-colleagues like the idea</b></p></li><li><p>It uses the cloud to solve a set of real problems*:</p><ol type="1"><li><p>Makes it easy for ‘data
  shoppers’ to evaluate data sets</p></li><li><p>Potentially enables research
  outputs to be re-run in an exact environment for real reproducible workflows (Remko
  notes that you really need the exact R library versions sometimes, this could
  be an option.)</p></li></ol></li></ul><p>*Assuming a real implementation</p><p></p></section><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Screenshot : Script creating a
  virtual server</h2><p><img border="0" width="553" height="376" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-NeCTAR_uber_Dojo.docx.htm_NeCTAR_uber_Dojo.docx_filesimage014.jpg" v:shapes="Picture_x0020_2"></p><p></p><p></p></section><p></p><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Screenshot: our script Installs
  R-Studio*</h2><p><img border="0" width="554" height="392" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-NeCTAR_uber_Dojo.docx.htm_NeCTAR_uber_Dojo.docx_filesimage016.jpg" v:shapes="Picture_x0020_3"></p><p>*This process is really too slow to do
  on-demand, we should launch from a pre-built snapshot. And the automation
  here is really hacky and crude – it would be better to do it using something
  like Chef but that’s not a one-day project if you have never used it before.</p></section><p></p><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Result – an RStudio Server…</h2><p><img border="0" width="553" height="408" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-NeCTAR_uber_Dojo.docx.htm_NeCTAR_uber_Dojo.docx_filesimage018.jpg" v:shapes="Picture_x0020_5"></p><p>TODO: Automate the login by passing some
  kind of token?</p></section><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>With a data-set pre-loaded*</h2><p></p><p><img border="0" width="553" height="398" src="http://ptsefton.com/wp-content/uploads/2012/10/wpid-NeCTAR_uber_Dojo.docx.htm_NeCTAR_uber_Dojo.docx_filesimage020.jpg" v:shapes="Picture_x0020_7"></p><p></p><p>*We have not automated the installation
  of Knitr – so no R Notebook yet, but the code is runnable, and will spit out
  plot-images</p></section><p></p><p></p><p></p><section itemtype="http://purl.org/ontology/bibo/Slide" itemscope=""><h2>Show me the code</h2><p>The code we produced is demo quality
  hack-day code. This is not something you’d use in real life but we’re
  releasing it on <a href="http://code.google.com/p/uws-eresearch/source/browse/#svn%2Ftrunk%2Fnectar-cloud%2Frstudio" onclick="pageTracker._trackPageview('/outgoing/code.google.com/p/uws-eresearch/source/browse/_svn_2Ftrunk_2Fnectar-cloud_2Frstudio?referer=');">Google
  Code</a> as an example and so we can remember where to start when we come
  back to this and try to do it properly, maybe as part of the data capture
  project at the <a href="http://eresearch.uws.edu.au/blog/projects/data-capture-for-climate-change-and-energy-research/" onclick="pageTracker._trackPageview('/outgoing/eresearch.uws.edu.au/blog/projects/data-capture-for-climate-change-and-energy-research/?referer=');">Hawkesbury
  Institute for the Environment</a>.</p></section><p></p><p></p><p></p><p>Copyright<span class="apple-converted-space">&nbsp; </span>Peter Sefton and Andrew Leahy, 2012. Licensed under<span class="apple-converted-space">&nbsp;</span>Creative Commons Attribution-Share Alike 2.5<span class="apple-converted-space">&nbsp;</span>Australia. &lt;<a href="http://creativecommons.org/licenses/by-sa/2.5/au/" onclick="pageTracker._trackPageview('/outgoing/creativecommons.org/licenses/by-sa/2.5/au/?referer=');">http://creativecommons.org/licenses/by-sa/2.5/au/</a>&gt;</p><p></p></section></article>
	</div>]]></content:encoded>
			<wfw:commentRss>http://ptsefton.com/2012/10/08/nectar-uber-dojo-reproducible-research-uws-eresearch-team-in-cloud-land-ii.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
