Web Magazine for Information Professionals

A Bug's Life?: How Metaphors from Ecology Can Articulate the Messy Details of Repository Interactions

R. John Robertson, Mahendra Mahey and Phil Barker introduce work investigating an alternative model of repository and service interaction.

Visions

In ‘Lost in the IE’, published in the last issue of Ariadne and in subsequent discussion on various blogs [1], [2] there has some thoughtful reflection on the vision of the JISC Information Environment (IE), its architecture and standards, the role of the IE and the role of ‘that diagram’ [3]. It is clear that the development of work on repositories and services in the UK has benefitted from the IE Architecture diagram but it is also clear that such a model does not (and was not intended to) reflect the reality of the ‘messiness’ that inevitably surrounds connecting actual repositories and services [4].

The development, implementation, and support of real services challenges how we have traditionally articulated, represented, and tried to communicate the context of those services. We need abstract visions of an information environment, recommended standards, and models of software architectures (or component software functions) that can inform how we begin to develop local repositories and services. However, we, as a community of managers, librarians, researchers, and developers of technology, also need approaches that help us engage with the complex details of local contexts that shape how and why particular repository implementations succeed or fail. This article, based on work carried out by the Repositories Research Team (RRT) [5], proposes an ecologically inspired approach to tackle that challenge.

This article will discuss the type of approach we think is missing, and, based on our investigation of ecology as a metaphor, present an overview of an ecologically influenced approach, and discuss an example of its use.

A Complex Context

Setting up a repository to manage appropriate copies of, or metadata about, journal articles is perhaps the most understood and established purpose for a repository. Repository software (for example: ePrints, DSpace, Fedora, Intralibrary, Digitool, and Bepress) is designed to accommodate this functional requirement. Standards, such as OAI-PMH and SWORD, have developed with the dissemination of scholarly communication as a crucial use case. Business cases for Open Access have been thoughtfully put forward and numerous suggestions about advocacy to encourage faculty deposit have been explored. Numerous institutions have deployed repositories to support this particular purpose. There is however, no simple, off-the-shelf, recipe to establish a repository for scholarly communications. Selecting and installing repository software is a relatively minor part of the challenge. The growth of institutional repositories for scholarly communications depends on finding the right combination of people, tools, standards, support, networking, and advocacy. Even a cursory review of the stories of successful and unsuccessful repositories highlight many different contextual issues such as: the key role of a particular departmental champion, the importance of good relations with university system managers [6], the role of funding body mandates to deposit [7], the benefits of automatically generated publication lists for faculty Web pages [8], the impact of an Open Access copy on paper citation [9] and the advocacy value of usage statistics. No one of these factors, or any other for that matter, will ensure the creation of a successful repository for every institution, and to discover which intricate combination might work requires not only trial and error, but also a thorough understanding of the particular complex context of the repository or service.

An essential part of gaining such an understanding of a repository or service in its context is finding a way to articulate and consequently discuss the key dependencies and interactions that shape the repository. This articulation of a complex situation has to be done in a way that can cross skill sets and quickly convey the most important systems, relationships, and other influences that need to be considered for all the stakeholders. However, when we approach such an articulation, we often pick an analytical modelling tool, such as UML (Unified Modelling Language) [10] or BPMN (Business Process Modelling Notation) [11], to represent the situation with that tool, and use that as the basis of our presentation and analysis. As useful as this can be, it is –if used on its own – also an inadequate and potentially flawed approach. This is not as a result of an inherent difficulty with the tools but rather because we are not adequately addressing the true nature of a complex situation.

Defining Complexity

Ritchey (2002), drawing on Ackoff (1974) and Pidd (1996), summarises a view that there are three types of complex situations in what he terms a socio-technical system; these are: messes, problems, and puzzles.

Ritchey captures implications of the above distinction between ‘mess’, ‘problem’, and ‘puzzle’ in the following quotation from Pidd,

One of the greatest mistakes that can be made when dealing with a mess is to carve off part of the mess, treat it as a problem and then solve it as a puzzle – ignoring its links with other aspects of the mess.[13]

Repositories and services often exist in this sort of mess. Not as a result of any failing or sloppiness on the part of the managers or developers, but because as organisational, technical and cultural entities - like many other services and systems - repositories exist in the midst of an extremely complex set of interactions and influences (only a small percentage of which are technical). As a community we run the risk of reducing this complexity to whichever form of transaction that fits with our chosen problem-solving tool. If we do this, we sidestep some of the complexity of the mess, view only one dimension of the problem, and then treat it as a puzzle. As a result, the proposed solutions may neglect the context they are intended for and consequently struggle to succeed because the scope of the tools was too narrow, and critical factors were overlooked. We need approaches to support discussing the ‘mess’, and that help us to draw out the problems that need to be considered. To this end, the Repositories Research Team (RRT) has examined the possible benefits of using ecology as a metaphor to shape how we, as a community, articulate complex situations.

Ecology as a Metaphor for the Complex

Ecology – ‘the branch of biology dealing with the relations and interactions between organisms and their environment, including other organisms’ – examines highly complex processes and interactions as they affect a particular organism being studied [14]. Although a branch of biology, the interactions it considers are not restricted to the biological; other types of interaction, such as chemical, physical, meteorological, geological, may play a part. For example, the survival of a herd of deer in a particular area will depend on multitude of factors including the availability of food, the presence of predators, the climate, hunting restrictions, and building developments. Investigating the details of such an ecosystem and assessing its sustainability could be a detailed and lengthy process, but a basic understanding of it is also possible without specialist knowledge.

If we consider the allotments pictured below as an example of an ecosystem, we intuitively know that the success or failure of one of the plots, such as one on the middle right, will depend on a complex set of interactions. The interactions between the gardener, the location of the plot, the chosen crops, weeds, the neighbouring gardeners and any wildlife, form a small ecosystem. The whole ecosystem is affected by differing environmental factors ranged from the weather and the location of the plot relative to the shed, to how often the gardener can visit the plot, local byelaws and regulations.

photo (66KB) : Figure 1 : Figure 1: Allotments & estuary

Figure 1: Allotments & estuary [15]

Perhaps because, at least on some levels, people seem to have an intuitive understanding of ecology, it is often used as a metaphor for complex situations. It has been used before in the field of library and information science, by Nardi and O’Day [16], and by Davenport [17], and in connection to digital libraries and repositories by Blinco and Maclean [18]. Our investigation explored the relevance of the metaphor for repositories, services, and their interactions, to see what concepts and approaches might be of use in helping articulate specifics of complex dependencies and supplement existing analytical tools and skills. An ecological approach to repository and service interactions [19], the introductory report, we produced on this topic, goes into more detail about how these related concepts might be applied, the following section briefly presents some ideas that may be of use.

What to Model: Key Ideas

We suggest that the metaphor of an ecosystem provides the following useful concepts that can help articulate complex situations:

Thinking about the Model: Key Ideas

An ecological metaphor also provides us with some useful concepts that can inform how we approach modelling and how to assess the interactions being represented. These are concepts that provide an easily grasped way to present and analyse a set of complex interactions:

Implications of an Ecological Metaphor for Modelling

Models using ecology as a metaphor are not as restricted as traditional problem-solving tools and allow the key parts of a complex situation to be expressed. The intuitive nature of the ecological metaphor may be more accessible to a larger group of people. The development and availability of such a model facilitates discussion and allows non-specialists to grasp the key features of a setting. This may lead to shaping a complex mess into discrete problems that can be tackled with other tools.

Presentation Example

We recently presented a poster at DC2008 about an academic’s dissemination of a presentation, specifically looking at what happened to his presentations and associated metadata [20]. A brief examination of a couple of its key features can illustrate one outcome of this type of ecological modelling.

The poster presented the ecosystem of a researcher ‘Phil Robertson’ trying to disseminate a presentation. Phil created a workshop presentation based on a recently published paper. After the workshop he sought to make the presentation more widely available. The repository at his institution had accepted a preprint of the paper, but it was not interested in the presentation. He uploaded a copy of the presentation onto SlideShare and blogged about both the paper and presentation. The presentation was then discovered by a number of researchers. The entities included: Phil, the institutional repository and its manager, SlideShare, flickr, the CETIS blog, two researchers, the journal publishing process, and the abstracting and indexing process. Some of the interactions were: manage account or profile, cite, deposit, and discover. Key environmental factors were: the collection policies and rights policies associated with the different tools, the network of the academic discipline, and social networks around Web2.0 tools. The poster primarily took a resource-tracking approach to study and asked questions about the metadata associated with the various forms of the work and their different points of access.

diagram (89KB) : Figure 2 : An ecosystem of presentation dissemination

Figure 2: An ecosystem of presentation dissemination

Two key issues emerged from being able to have an overview of this ecosystem that could articulate technical, cultural, and human context of this researcher’s efforts to disseminate his work. These were the differences between the author identities and the challenge of preservation.

Differences between the Author’s Identities

The different tools that academics use to disseminate their work all have different ways of creating and maintaining that author’s identity. In this instance, the identity associated with the paper carries a different organisational affiliation to that of the identity associated with the dissemination of the presentation and the blog. The paper carries an institutional affiliation requested from the author as part of the paper; while the repository carries a departmental affiliation created as part of a profile (with the institution implied). On the other hand, the blog, hosted by an organisation Phil belongs to, carries organisational branding and affiliation; whereas the academic’s identity on SlideShare carries no affiliation or other identification beyond what he added to his profile when it was last updated.

The Challenge of Preservation

If we consider questions about the access to the presentation in the longer term, it is clear that, outside any personal archiving, long-term access to Phil Robertson’s presentation is dependent on the existence of SlideShare, and its continuation of a business model supporting free access to the content it stores. Discovery of the presentation in the longer term may also be a challenge. If the link between the blog and SlideShare were broken, SlideShare only supports very basic metadata (a title, tags, and an association with an optional author profile). Practitioners seeking to preserve presentations as part of their work will encounter considerable difficulty unless they actually know the author’s SlideShare identity.

We found that this ecological model provided an easy way to identify, highlight, and discuss key issues about managing presentations. It could also be used to assess quickly the impact of a desktop deposit tool on the current dissemination workflows, and to identify the technical, practical and administrative issues such a change would raise.

Conclusion

In this article we have highlighted the need for approaches to supplement the current use of tools and models within the digital library, repository and service community. We then introduced our exploration of ecology as a possible alternative metaphor for work that has been carried out in connection with it. We introduced an example of a developed ecology and illustrated its usefulness through a few of the issues it had raised. The next stage of development of our investigation of repository ecology is the release of the final version of the introductory report and the completion of three more case studies of using this approach.

It will, we hope, be clear that the ecologically influenced perspective we are suggesting is intended to support discussion and reflection on how we implement and use systems. In providing a way to articulate the context of a service more fully, we hope that this will lead to richer discussion about why repositories succeed or fail. We hope it offers repository managers a way to model the context of the repository that does not rely on knowledge of a particular modelling tool and which can engage a variety of stakeholders (whether librarians, developers, academics, or administrators).

References

  1. Walk, P. (2008) All models are wrong, but some are useful http://blog.paulwalk.net/2008/08/20/all-models-are-wrong-but-some-are-useful/
  2. Powell, A. (2008) Lost in the JISC Information Environment http://efoundations.typepad.com/efoundations/2008/08/lost.html
  3. Powell, A. (2006) Information Environment Architecture Diagram http://www.ukoln.ac.uk/distributed-systems/jisc-ie/arch/jisc-ie-arch-big.gif
  4. Powell, A. (2007). You sharin’? You askin’? http://efoundations.typepad.com/efoundations/2007/07/you-sharin-you-.html
  5. Repositories Research Team http://www.ukoln.ac.uk/repositories/digirep/index/Repositories_Research
  6. Salo, D. ‘Innkeeper at the Roach Motel’ Library Trends 57:2 (Fall 2008), pre-print available: http://minds.wisconsin.edu/handle/1793/22088
  7. Sherpa Juliet http://www.sherpa.ac.uk/juliet/ lists the growing number of funding bodies mandating Open Access to research outputs.
  8. Fried Foster, N. and Gibbons, S. (2005) ‘Understanding Faculty to Improve Content Recruitment for Institutional Repositories’ D-Lib Magazine Vol 11, No. 1 http://www.dlib.org/dlib/january05/foster/01foster.html
  9. OpCit Project (2008) The effect of open access and downloads (‘hits’) on citation impact: a bibliography of studies
    http://opcit.eprints.org/oacitation-biblio.html
  10. Object Management Group (2008) UML Resource Page http://www.uml.org/
  11. Object Management Group (2008) Business Process Modelling Notation (BPMN) Information http://www.bpmn.org/
  12. Ritchey, T. “Modelling Complex Socio-Technical Systems using Morphological Analysis” : Adapted from an address to the Swedish Parliamentary IT Commission, Stockholm, December 2002; available: http://www.swemorph.com/it-art.html . The quote is Ritchey’s summary of R. Ackoff (1974) Redesigning the Future and M. Pidd (1996) Tools for thinking
  13. Pidd, M. (1996) Tools for thinking quoted in Ritchey.
  14. ecology. (n.d.). Dictionary.com Unabridged (v 1.1). Retrieved 8 June, 2007, from Dictionary.com Web site: http://dictionary.reference.com/browse/ecology
  15. MGSpiller (2006), Allotments & Estuary http://www.flickr.com/photos/mgspiller/318023291/ work is licensed Creative Commons: Attribution-Share Alike 2.0 Generic
    http://creativecommons.org/licenses/by-sa/2.0/deed.en_GB
  16. Nardi, B.A. and V. L O’Day (1999), ‘Information ecologies: using technology with heart. Chapter Four: Information ecologies.’ First Monday Vol. 4, No 5 http://www.firstmonday.org/issues/issue4_5/nardi_chapter4.html
  17. Davenport, T. H (1997). Information ecology, OUP
  18. Blinco K. and & N. McLean, “A ‘Cosmic’ View of the Repositories Space (Wheel of Fortune)”, 2004
    http://www.rubric.edu.au/extrafiles/wheel/main.swf
  19. Robertson, R.J., M. Mahey, and Allinson, J (2008) An ecological approach to repository and service interactions
    http://purl.oclc.org/NET/RepositoryEcology/IntroductoryReport
  20. Robertson, R.J., P. Barker, and M. Mahey (2008) Metadata in an Ecosystem of Presentation Dissemination, Poster presented at International Conference on Dublin Core and Metadata Applications 22-26 September, Berlin.
    http://dc2008.de/wp-content/uploads/2008/10/05_robertson__poster.pdf

Author Details

R. John Robertson
Research Fellow
University of Strathclyde

Email: robert.robertson@strath.ac.uk
Web site: http://blogs.cetis.ac.uk/johnr/

Mahendra Mahey
Repositories Research Officer
UKOLN
University of Bath

Email: m.mahey@ukoln.ac.uk
Web site: http://www.ukoln.ac.uk/

Phil Barker
Learning Technology Adviser
Institute for Computer Based Learning,
Heriot Watt University

Email: philb@icbl.hw.ac.uk

Return to top