To bring more clarity to such discussions, the PILIN Project has devised an abstract model of identifiers and identifier services, which is presented here in summary. Given such an abstract model, it is possible to compare different identifier schemes, despite variations in terminology; and policies and strategies can be formulated for persistence without committing to particular systems. The abstract model is formal and layered; in this article, we give an overview of the distinctions made in the model. This presentation is not exhaustive, but it presents some of the key concepts represented, and some of the insights that result.</p> <p>The main goal of the Persistent Identifier Linking Infrastructure (PILIN) project [<a href="#1">1</a>] has been to scope the infrastructure necessary for a national persistent identifier service. There are a variety of approaches and technologies already on offer for persistent digital identification of objects. But true identity persistence cannot be bound to particular technologies, domain policies, or information models: any formulation of a persistent identifier strategy needs to outlast current technologies, if the identifiers are to remain persistent in the long term.</p> <p>For that reason, PILIN has modelled the digital identifier space in the abstract. It has arrived at an ontology [<a href="#2">2</a>] and a service model [<a href="#3">3</a>] for digital identifiers, and for how they are used and managed, building on previous work in the identifier field [<a href="#4">4</a>] (including the thinking behind URI [<a href="#5">5</a>], DOI [<a href="#6">6</a>], XRI [<a href="#7">7</a>] and ARK [<a href="#8">8</a>]), as well as semiotic theory [<a href="#9">9</a>]. The ontology, as an abstract model, addresses the questions 'what is (and isn't) an identifier?' and 'what does an identifier management system do?'. 
This more abstract view also brings clarity to the ongoing conversation of whether URIs can be (and should be) universal persistent identifiers.</p> <h2 id="Identifier_Model">Identifier Model</h2> <p>For the identifier model to be abstract, it cannot commit to a particular information model. The notion of an identifier depends crucially on the understanding that an identifier only identifies one distinct thing. But different domains will have different understandings of what things are distinct from each other, and what can legitimately count as a single thing. (This includes aggregations of objects, and different versions or snapshots of objects.) In order for the abstract identifier model to be applicable to all those domains, it cannot impose its own definitions of what things are distinct: it must rely on the distinctions specific to the domain.</p> <p>This means that information modelling is a critical prerequisite to introducing identifiers to a domain, as we discuss elsewhere [<a href="#10">10</a>]: identifier users should be able to tell whether any changes in a thing's content, presentation, or location mean it is no longer identified by the same identifier (i.e. whether the identifier is restricted to a particular version, format, or copy).</p> <p>The abstract identifier model also cannot commit to any particular protocols or service models. In fact, the abstract identifier model should not even presume the Internet as a medium. A sufficiently abstract model of identifiers should apply just as much to URLs as it does to ISBNs, or names of sheep; the model should not be inherently digital, in order to avoid restricting our understanding of identifiers to the current state of digital technologies. This means that our model of identifiers comes close to the understanding in semiotics of signs, as our definitions below make clear.</p> <p>There are two important distinctions between digital identifiers and other signs which we needed to capture. 
First, identifiers are managed through some system, in order to guarantee the stability of certain properties of the identifier. This is different to other signs, whose meaning is constantly renegotiated in a community. Those identifier properties requiring guarantees include the accountability and persistence of various facets of the identifier—most crucially, what is being identified. For digital identifiers, the <strong>identifier management system</strong> involves registries, accessed through defined services. An HTTP server, a PURL [<a href="#11">11</a>] registry, and an XRI registry are all instances of identifier management systems.</p> <p>Second, digital identifiers are straightforwardly <strong>actionable</strong>: actions can be made to happen in connection with the identifier. Those actions involve interacting with computers, rather than other people: the computer consistently does what the system specifies is to be done with the identifier, and has no latitude for subjective interpretation. This is in contrast with human language, which can involve complex processes of interpretation, and where there can be considerable disconnect between what a speaker intends and how a listener reacts. Because the interactions involved are much simpler, the model can concentrate on two actions which are core to digital identifiers, but which are only part of the picture in human communication: working out what is being identified (<em>resolution</em>), and accessing a representation of what is identified (<em>retrieval</em>).</p> <p>So to model managing and acting on digital identifiers, we need a concept of things that can be identified, names for things, and the relations between them. (Semiotics already gives us such concepts.) 
We also need a model of the systems through which identifiers are managed and acted on; what those systems do, and who requests them to do so; and what aspects of identifiers the systems manage.</p> <p>Our identifier model (as an ontology) thus encompasses:</p> <ul> <li><strong>Entities</strong>, including actors and identifier systems;</li> <li><strong>Relations</strong> between entities;</li> <li><strong>Qualities</strong>, as desirable properties of entities. Actions are typically undertaken in order to make qualities apply to entities;</li> <li><strong>Actions</strong>, as the processes carried out on entities (and corresponding to <strong>services</strong> in implementations).</li> </ul> <p>An individual identifier system can be modelled using concepts from the ontology, with an identifier system model.</p> <p>In the remainder of this article, we go through the various concepts introduced in the model under these classes. We present the concept definitions under each section, before discussing issues that arise out of them. <em>Resolution</em> and <em>Retrieval</em> are crucial actions for identifiers, whose definition involves distinct issues; they are discussed separately from other Actions.</p> <p>We briefly discuss the standing of HTTP URIs in the model at the end.</p> <p>For many this could also be the sign on the home page of their organisation's intranet as, with business-critical decisions to make, they begin the daily hunt for information that they are sure should be somewhere in the application. It could just as easily be the sign on the door of the intranet manager of the organisation, though this door usually also carries a number of other job descriptions, all of which seem to be given more priority by the organisation than the care and development of the intranet. 
Most organisations of any size will have a full-time web manager, often with a support team, but this is rarely the case with the intranet.</p> <p>There are a substantial number of intranets in the UK. Statistics from the Office for National Statistics indicate that 22% of all businesses have an intranet [<a href="#1">1</a>]. As the size of the business increases so does the level of penetration, and most businesses of more than 500 people will now have some form of intranet. Given the number of businesses in the UK the author estimates that there are probably around 300,000 intranets in the commercial sector, and at a guess a further 100,000 in the public sector, charities, Higher Education institutions (HEIs) and other organisations. Only over the last few years has any reliable statistical information become available on intranet use and development, and this is an in-depth global survey of only around 300 intranets [<a href="#2">2</a>]. In the UK HEI sector a major opportunity was lost in a survey commissioned in 2009 by Eduserv into the management of web content in the HEI sector as no account of intranet use of CMS applications was included in the scope of the survey [<a href="#3">3</a>]. A survey of SharePoint use in HEIs undertaken for Eduserv in late 2009 [<a href="#4">4</a>] did indicate that a number of institutions were using SharePoint for intranet applications but the survey did not look in detail at intranet implementation.</p> <p>It is also only over the last few years that forums have been set up in which intranet managers are able to share experiences and challenges with others. The work of the Intranet Benchmark Forum [<a href="#5">5</a>] is focused on providing services to large organisations, but there are also other virtual and physical discussion forums, such as the Intranet Forum [<a href="#6">6</a>] run by UKeiG for its members. 
It is probably reasonable to suggest that the majority of intranet managers have seen very few intranets from which to gain a sense of good practice, whereas web managers have an almost unlimited supply of sites from which to gain ideas for their own use. This is as true in the HEI sector as in other sectors. Given the installed base of intranets in the UK it is also surprising that there is no 'intranet conference' event even though intranet management does feature in events such as Online Information [<a href="#7">7</a>]. Most countries in northern Europe have an intranet conference [<a href="#8">8</a>], often with several hundred delegates, so why there is no equivalent in the UK is a mystery.</p> <h2 id="Intranets_Are_Different">Intranets Are Different</h2> <p>All too often an intranet is regarded as an internal web site. The reality is that about the only commonality between an intranet and a web site is the use of web browser technology. Many very successful intranets do not even use a web content management application but instead are based on Notes technology or portal applications. Intranet content contribution is usually highly distributed, with individual members of staff publishing content direct to the intranet perhaps only a few times a year. This means that the web content management system has to be highly intuitive, and enable Word documents to be rendered into clean HTML code to create web pages. The teams supporting public web sites are using the systems every working day, working often in HTML and having a much more limited range of content to cope with. Many of the problems that arise in keeping content current on an intranet are a result of staff having to use a complex Web publishing system that was specified for Web site management and not intranet management.</p> <p>Another factor to be considered is that increasingly intranets are federated applications [<a href="#9">9</a>]. 
This is often the situation in HEIs where each department wants to have its own intranet, and on top of all these individual intranets there is some form of top-level 'corporate' home page and navigation. Often there is no central coordination of these intranets, and so each adopts some or none of the visual design standards of the HEI.</p> <p>As far as enterprise applications are concerned, intranets are different because they are not based on business processes or work-flow. Finance, registry, personnel and most other applications support well-defined processes, usually within a specific department, and where the content requirements are usually specified in database terms. Anything approaching text content is usually relegated to a single field in the database. Intranets exist because there is a substantial amount of information in any organisation that is not based on business processes and cannot be managed within a formal database structure, such as policies, procedures, campus maps, events, staff notices and hundreds of other information formats produced by every department and location within the organisation.</p> <p>As a result the intranet becomes an information dumping ground. Under-resourced intranet managers cannot maintain content quality, and so multiple versions of documents with no visible ownership or provenance proliferate. Employees leave or change responsibility but the intranet is based on a 'file-and-forget' principle and no effort is made to ensure that document ownership is transferred to another member of staff. Very quickly the information architecture of the intranet, based usually on the structure of the organisation at the time of the last WCMS (Web content management system) deployment, is not fit for purpose. The decision is taken to implement a search engine, and only then does the scale of the problem of information decay become apparent.</p>
<p>It can also be an interesting exercise to search for 'Confidential' and see just how many documents are returned!</p> <p>This implies new ways of collaboration, dissemination and reuse of research results, specifically via the Web.</p> <p>Developing countries are also able to exploit the opportunity to make their knowledge output more widely known and accessible and to co-operate in research partnerships.</p> <p>Every now and then, a new idea comes along and sparks a wave of interest, the first stage in the Internet hype cycle.</p> <p>The idea of a VRE, which in this context includes cyberinfrastructure and e-infrastructure, arises from, and remains intrinsically linked with, the development of e-science.</p> <p>The 2nd edition of Peter Griffiths' <em>Managing Your Internet &amp; Intranet Services</em> not only recognizes this, but argues that perhaps it is LIS professionals who are best suited for managing Web sites and intranets.</p> ETD-db: Choosing Software to Manage Electronic Theses and Dissertations http://www.ariadne.ac.uk/issue38/jones <div class="field field-type-text field-field-teaser-article"> <div class="field-items"> <div class="field-item odd"> <p><a href="/issue38/jones#author1">Richard Jones</a> examines the similarities and differences between DSpace and ETD-db to determine their applicability in a modern E-theses service.</p> </div> </div> </div> <p>The <a href="http://www.thesesalive.ac.uk/">Theses Alive!</a> [<a href="#1">1</a>] Project, based at Edinburgh University Library and funded under the JISC Fair Programme [<a href="#2">2</a>], is aiming to produce, among other things, a software solution for institutions in the UK to implement their own E-theses or Electronic Theses and Dissertations (ETD) online submission system and repository.</p>
<p>In order to achieve this it has been necessary to examine existing packages that may provide all or part of the solution we desire before considering what extra development we may need to do.</p> <p>Content management? That's what librarians do, right? But we've already got a <i>library management system</i> (LMS) – why should we consider a <i>content management system</i> (CMS)?</p> <p>The second initial is perhaps misleading – "manipulation" rather than "management" might better summarise the goals of a CMS.</p> <p>Content creation and content re-purposing are fundamental aspects which tend to lie outside the current LMS domain.</p> <p>This document describes some common concerns of libraries, archival institutions and museums as they work together to address the issues the Programme raises. This accounts for three major emphases in the document.</p> <p>First, discussion is very much about what brings these organisations together, rather than about what separates them.</p> <p>There has been a phenomenal growth in interest and activity, as seen in many new publications, conferences, IT products, and job advertisements (including a post advertised by HEFCE).</p> <p>Various professional groups, notably HR professionals, IT specialists, and librarians, are staking their claims, seeing KM as an opportunity to move centre stage.</p> <p>We would like to thank all of the readers who participated in the Ariadne Web Survey, part of an evaluation of the Ariadne project being carried out by Dr. Anne L. 
Barker, Department of Information and Library Studies at the University of Wales Aberystwyth.</p> <p>These web protocols, all of which are concerned with the way in which information can be represented and displayed, were initially <strong>Working Drafts</strong> which were developed by the appropriate W3C Working Group.</p> This involves the use of an innovative approach to handling the hyperlinks between Web-based resources, which could have significant implications for on-line journals and publishing.</p> </div> </div> </div> <p>Everyone involved with scholarly journals has a problem with the Web. Readers wonder whether they will find what they want, and librarians want to bring some order to the Web's unregulated chaos. Authors are concerned with recognition, and publishers with how they can make money from the Web. The Web presents many new opportunities, especially for academic works.</p>