Overview of content related to 'dns'

Abstract Modelling of Digital Identifiers

Nick Nicholas, Nigel Ward and Kerry Blinco present an information model of digital identifiers, to help bring clarity to the vocabulary debates from which this field has suffered.

Discussion of digital identifiers, and persistent identifiers in particular, has often been confused by differences in underlying assumptions and approaches. To bring more clarity to such discussions, the PILIN Project has devised an abstract model of identifiers and identifier services, which is presented here in summary. Given such an abstract model, it is possible to compare different identifier schemes, despite variations in terminology; and policies and strategies can be formulated for persistence without committing to particular systems. The abstract model is formal and layered; in this article, we give an overview of the distinctions made in the model. This presentation is not exhaustive, but it presents some of the key concepts represented, and some of the insights that result.

The main goal of the Persistent Identifier Linking Infrastructure (PILIN) project [1] has been to scope the infrastructure necessary for a national persistent identifier service. There are a variety of approaches and technologies already on offer for persistent digital identification of objects. But true identity persistence cannot be bound to particular technologies, domain policies, or information models: any formulation of a persistent identifier strategy needs to outlast current technologies, if the identifiers are to remain persistent in the long term.

For that reason, PILIN has modelled the digital identifier space in the abstract. It has arrived at an ontology [2] and a service model [3] for digital identifiers, and for how they are used and managed, building on previous work in the identifier field [4] (including the thinking behind URI [5], DOI [6], XRI [7] and ARK [8]), as well as semiotic theory [9]. The ontology, as an abstract model, addresses the question 'what is (and isn't) an identifier?' and 'what does an identifier management system do?'. This more abstract view also brings clarity to the ongoing conversation of whether URIs can be (and should be) universal persistent identifiers.

Identifier Model

For the identifier model to be abstract, it cannot commit to a particular information model. The notion of an identifier depends crucially on the understanding that an identifier only identifies one distinct thing. But different domains will have different understandings of what things are distinct from each other, and what can legitimately count as a single thing. (This includes aggregations of objects, and different versions or snapshots of objects.) In order for the abstract identifier model to be applicable to all those domains, it cannot impose its own definitions of what things are distinct: it must rely on the distinctions specific to the domain.

This means that information modelling is a critical prerequisite to introducing identifiers to a domain, as we discuss elsewhere [10]: identifier users should be able to tell whether any changes in a thing's content, presentation, or location mean it is no longer identified by the same identifier (i.e. whether the identifier is restricted to a particular version, format, or copy).

The abstract identifier model also cannot commit to any particular protocols or service models. In fact, the abstract identifier model should not even presume the Internet as a medium. A sufficiently abstract model of identifiers should apply just as much to URLs as it does to ISBNs, or names of sheep; the model should not be inherently digital, in order to avoid restricting our understanding of identifiers to the current state of digital technologies. This means that our model of identifiers comes close to the understanding in semiotics of signs, as our definitions below make clear.

There are two important distinctions between digital identifiers and other signs which we needed to capture. First, identifiers are managed through some system, in order to guarantee the stability of certain properties of the identifier. This is different to other signs, whose meaning is constantly renegotiated in a community. Those identifier properties requiring guarantees include the accountability and persistence of various facets of the identifier, most crucially what is being identified. For digital identifiers, the identifier management system involves registries, accessed through defined services. An HTTP server, a PURL [11] registry, and an XRI registry are all instances of identifier management systems.

Second, digital identifiers are straightforwardly actionable: actions can be made to happen in connection with the identifier. Those actions involve interacting with computers, rather than other people: the computer consistently does what the system specifies is to be done with the identifier, and has no latitude for subjective interpretation. This is in contrast with human language, which can involve complex processes of interpretation, and where there can be considerable disconnect between what a speaker intends and how a listener reacts. Because the interactions involved are much simpler, the model can concentrate on two actions which are core to digital identifiers, but which are only part of the picture in human communication: working out what is being identified (resolution), and accessing a representation of what is identified (retrieval).

So to model managing and acting on digital identifiers, we need a concept of things that can be identified, names for things, and the relations between them. (Semiotics already gives us such concepts.) We also need a model of the systems through which identifiers are managed and acted on; what those systems do, and who requests them to do so; and what aspects of identifiers the systems manage.

Our identifier model (as an ontology) thus encompasses:

Entities - including actors and identifier systems;
Relations between entities;
Qualities, as desirable properties of entities. Actions are typically undertaken in order to make qualities apply to entities.
Actions, as the processes carried out on entities (and corresponding to services in implementations);

An individual identifier system can be modelled using concepts from the ontology, with an identifier system model.

In the remainder of this article, we go through the various concepts introduced in the model under these classes. We present the concept definitions under each section, before discussing issues that arise out of them. Resolution and Retrieval are crucial actions for identifiers, whose definition involves distinct issues; they are discussed separately from other Actions. We briefly discuss the standing of HTTP URIs in the model at the end.
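As a rough illustration of the four ontology classes named above, the following Python sketch models entities, one relation, one quality and three actions. It is not the PILIN ontology itself: the class names `Entity`, `Relation` and `IdentifierSystem`, the relation kind `"identifies"`, and the choice of uniqueness as the example quality are all assumptions made for this sketch.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Entity:
    """Anything the model talks about: a thing, a name, an actor, a system."""
    label: str


@dataclass(frozen=True)
class Relation:
    """A named link between two entities (hypothetical kind: 'identifies')."""
    kind: str
    source: Entity
    target: Entity


@dataclass
class IdentifierSystem:
    """Hypothetical in-memory identifier management system (a registry)."""
    relations: list = field(default_factory=list)
    representations: dict = field(default_factory=dict)

    def register(self, name, thing, representation):
        """Action: record that `name` identifies `thing`.

        Enforces one example quality, uniqueness: a name may identify
        only one distinct thing.
        """
        for rel in self.relations:
            if rel.kind == "identifies" and rel.source == name and rel.target != thing:
                raise ValueError(f"{name.label} already identifies {rel.target.label}")
        self.relations.append(Relation("identifies", name, thing))
        self.representations[thing] = representation

    def resolve(self, name):
        """Action (resolution): work out what is being identified."""
        for rel in self.relations:
            if rel.kind == "identifies" and rel.source == name:
                return rel.target
        raise KeyError(name.label)

    def retrieve(self, name):
        """Action (retrieval): access a representation of what is identified."""
        return self.representations[self.resolve(name)]


# Usage: the model is not inherently digital, so a sheep's name works too.
system = IdentifierSystem()
name = Entity("sheep:dolly")
thing = Entity("a particular sheep")
system.register(name, thing, "photograph of the sheep")
print(system.resolve(name).label)
print(system.retrieve(name))
```

Keeping resolution and retrieval as separate methods mirrors the distinction the text draws between working out what is identified and accessing a representation of it.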

issue62 feature article
Sat, 30 Jan 2010 00:00:00 +0000

Embedding Web Preservation Strategies Within Your Institution

Christopher Eddie reports on the third one-day workshop of the JISC-PoWR (Preservation of Web Resources) Project held at the University of Manchester on 12 September 2008.

issue57 event report
Thu, 30 Oct 2008 00:00:00 +0000

Persistent Identifiers: Considering the Options

Emma Tonkin looks at the current landscape of persistent identifiers, describes several current services, and examines the theoretical background behind their structure and use.

What Is a Persistent Identifier, and Why?

Persistent identifiers (PIs) are simply maintainable identifiers that allow us to refer to a digital object – a file or set of files, such as an e-print (article, paper or report), an image or an installation file for a piece of software. The only interesting persistent identifiers are also persistently actionable (that is, you can "click" them); however, unlike a simple hyperlink, persistent identifiers are supposed to continue to provide access to the resource, even when it moves to other servers or even to other organisations.
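One common way to get this behaviour is a level of indirection: the identifier stays fixed while a resolver maps it to the object's current location. The sketch below is a minimal in-memory illustration of that idea only; the `Resolver` class, the `pid:` identifier syntax and the example.org and example.net URLs are hypothetical and do not describe any particular service.

```python
class Resolver:
    """Hypothetical resolver: persistent identifier -> current location."""

    def __init__(self):
        self._locations = {}

    def register(self, pid, url):
        """Bind a new persistent identifier to its first location."""
        if pid in self._locations:
            raise ValueError(f"{pid} is already registered")
        self._locations[pid] = url

    def move(self, pid, new_url):
        """Update the location; the identifier itself never changes."""
        if pid not in self._locations:
            raise KeyError(pid)
        self._locations[pid] = new_url

    def resolve(self, pid):
        """Return the current location for the identifier."""
        return self._locations[pid]


# Usage: the e-print moves to another organisation's server,
# but anyone holding the identifier still reaches it.
resolver = Resolver()
resolver.register("pid:eprint-42", "https://eprints.example.org/42")
resolver.move("pid:eprint-42", "https://repository.example.net/eprints/42")
print(resolver.resolve("pid:eprint-42"))
```

The persistence here depends entirely on someone calling `move` when the object relocates, which is why such identifiers are described as maintainable rather than automatically permanent.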

issue56 feature article
Tue, 29 Jul 2008 23:00:00 +0000

Distributed Services Registry Workshop

John Gilby reports on the UKOLN/IESR two-day workshop at Scarman House, University of Warwick on 14-15 July 2005.

The number of available online digital collections is growing all the time and with this comes the need to discover these collections, both by machine (m2m) and by end-users. There is also a trend towards service-orientated architectures, and a likely critical part of this will be service registries to assist with discovering services and their associated collections. UKOLN and the JISC Information Environment Services Registry Project (IESR) organised a two-day workshop to look at some of the issues that are likely to be present in building a distributed approach.

issue45 event report
Sat, 29 Oct 2005 23:00:00 +0000

World Wide Web Conference 2004

Dave Beckett reports on the international WWW2004 conference held in New York, 19-21 May 2004.

WWW2004 was the 13th conference in the series of international World Wide Web conferences organised by the IW3C2 (International World Wide Web Conference Committee). This was the annual gathering of Web researchers and technologists to present the latest work on the Web and Web standardisation at the World Wide Web Consortium (W3C).

issue40 event report
Thu, 29 Jul 2004 23:00:00 +0000

ERPANET Seminar on Persistent Identifiers

Monica Duke reports on a two-day training seminar on persistent identifiers held by ERPANET in Cork, Ireland over 17-18 June 2004.

Day One

Introduction
Welcome and Keynote
Overview of Persistent Identifier initiatives
URN
OpenURL - The Rough Guide
Info URIs
The DCMI Persistent Identifier Working Group
The CENDI Report
ARK
PURLs
Overview of the Handle System
DOI

issue40 event report
Thu, 29 Jul 2004 23:00:00 +0000

Web Watch: A Survey Of Numbers of UK University Web Servers

How many web servers are there in the UK Higher Education community? Brian Kelly provides some answers.

How many web servers are there in use within the UK higher education community? What is the profile of server usage within the community - do most institutions take a distributed approach, running many servers, or is a centralised approach more popular? A WebWatch survey has been carried out recently in an attempt to answer these questions.

issue24 tooled up
Thu, 22 Jun 2000 23:00:00 +0000

Performance and Security: Notes for System Administrators

Andy Powell offers some hints and tips on the performance and security aspects of running electronic library services on UNIX based machines.

The eLib Technical Concertation day last November brought together techies from many of the eLib projects. (See Clare McClean's report in Ariadne issue 6 for more details [1].) A wide range of the technical issues associated with running electronic library services were discussed at the meeting but inevitably, given the time constraints, some of these were not covered in any great detail.

issue8 tooled up
Wed, 19 Mar 1997 00:00:00 +0000

Putting the UK on the Map

Peter Burden of the University of Wolverhampton's School of Computing and Information Technology describes the history behind his clickable maps of the UK, an essential and well established (though unfunded) resource for quickly locating academic and research Web sites.

Many of you are probably familiar with the WWW active maps of UK academic resources operated by the University of Wolverhampton's School of Computing and Information Technology. If you're not, point your WWW browser at http://www.scit.wlv.ac.uk/ukinfo/uk.map.html before going any further. I thought it might be of interest to Ariadne readers to hear how and why these maps were created.

issue5 project update
Wed, 18 Sep 1996 23:00:00 +0000

AC/DC

Dave Beckett and Neil Smith explain a search engine that only indexes sites in the .ac.uk domain.

WWW Crawlers - Why A New One? 

All the major WWW crawling programs, such as Alta Vista (Digital), InfoSeek, Lycos, Webcrawler and Excite, are based in the USA and collect their pages across the transatlantic link. There are two problems with the USA-based services:

issue3 feature article
Sat, 18 May 1996 23:00:00 +0000