Skip to Content

Overview of all keyword tags in articles

Inspecting content image: copyright, used under license from shutterstock.com

This page provides an overview of 42 tags, ordered by trending factor. Column headings allow re-sorting by other criteria. In the expanding tab below you can adjust filters to display sub-sets of tags and narrow the focus to specific items of interest (see FAQs on filtering for usage tips). Select this link to remove all filters.

Term Brief description Total articles Total usage Trending factor Charts

open data

Open data is a philosophy and practice requiring that certain data be freely available to everyone, without restrictions from copyright, patents or other mechanisms of control. It has a similar ethos to a number of other "Open" movements and communities such as open source and open access. However these are not logically linked and many combinations of practice are found. The practice and ideology itself is well established (for example in the Mertonian tradition of science) but the term "open data" itself is recent. Much of the emphasis in this entry is on data from scientific research and from the data-driven web. In some cases open data may be considered as more properly Open Metadata and there is not yet a consistent formalisation. (Excerpt from Wikipedia article: Open data)

Percentage of Ariadne articles tagged with this term: 1.9%.
33 83 1012.1

linked data

Linked Data describes a method of publishing structured data, so that it can be interlinked and become more useful. It builds upon standard Web technologies, such as HTTP and URIs - but rather than using them to serve web pages for human readers, it extends them to share information in a way that can be read automatically by computers. This enables data from different sources to be connected and queried. Tim Berners-Lee, director of the World Wide Web Consortium, coined the term in a design note discussing issues around the Semantic Web project. (Excerpt from Wikipedia article: Linked Data)

Percentage of Ariadne articles tagged with this term: 1.8%.
32 120 701.79

data

The term data refers to qualitative or quantitative attributes of a variable or set of variables. Data (plural of "datum") are typically the results of measurements and can be the basis of graphs, images, or observations of a set of variables. Data are often viewed as the lowest level of abstraction from which information and then knowledge are derived. Raw data, i.e. unprocessed data, refers to a collection of numbers, characters, images or other outputs from devices that collect information to convert physical quantities into symbols. (Excerpt from Wikipedia article: Data)

Percentage of Ariadne articles tagged with this term: 56.9%.
992 9403 384

data set

A data set (or dataset) is a collection of data, usually presented in tabular form. Each column represents a particular variable. Each row corresponds to a given member of the data set in question. Its values for each of the variables, such as height and weight of an object or values of random numbers. Each value is known as a datum. The data set may comprise data for one or more members, corresponding to the number of rows. (Excerpt from Wikipedia article: Data set)

Percentage of Ariadne articles tagged with this term: 13.1%.
229 937 108.8

data management

Data management comprises all the disciplines related to managing data as a valuable resource. The official definition provided by DAMA International, the professional organization for those in the data management profession, is: "Data Resource Management is the development and execution of architectures, policies, practices and procedures that properly manage the full data lifecycle needs of an enterprise." This definition is fairly broad and encompasses a number of professions which may not have direct technical contact with lower-level aspects of data management, such as relational database management. (Excerpt from Wikipedia article: Data management)

Percentage of Ariadne articles tagged with this term: 5.7%.
100 525 5

metadata

Metadata can be defined literally as "data about data," but the term is normally understood to mean structured data about digital (and non-digital) resources that can be used to help support a wide range of operations. These might include, for example, resource description and discovery, the management of information resources (including rights management) and their long-term preservation. In the context of digital resources, there exists a wide variety of metadata formats. Viewed on a continuum of increasing complexity, these range from the basic records used by robot-based Internet search services, through relatively simple formats like the Dublin Core Metadata Element Set (DCMES) and the more detailed Text Encoding Initiative (TEI) header and MARC formats, to highly specific formats like the FGDC Content Standard for Digital Geospatial Metadata, the Encoded Archival Description (EAD) and the Data Documentation Initiative (DDI) Codebook. (Excerpt from this source)

Percentage of Ariadne articles tagged with this term: 37%.
646 4953 4.8

data visualisation

Data visualization is the study of the visual representation of data, meaning "information which has been abstracted in some schematic form, including attributes or variables for the units of information". Data visualization is closely related to Information graphics, Information visualization, Scientific visualization and Statistical graphics. In the new millennium data visualization has become active area of research, teaching and development. (Excerpt from Wikipedia article: Data visualization)

Percentage of Ariadne articles tagged with this term: 0.8%.
14 23 4.3

database

A database is a system intended to organize, store, and retrieve large amounts of data easily. It consists of an organized collection of data for one or more uses, typically in digital form. One way of classifying databases involves the type of their contents, for example: bibliographic, document-text, statistical. Digital databases are managed using database management systems, which store database contents, allowing data creation and maintenance, and search and other access. (Excerpt from Wikipedia article: Database)

Percentage of Ariadne articles tagged with this term: 45.3%.
790 3295 2.7

data citation

Data citation refers to the practice of providing a reference to data in the same way as researchers routinely provide a bibliographic reference to printed resources. The need to cite data is starting to be recognised as one of the key practices underpinning the recognition of data as a primary research output rather than as a by-product of research. While data has often been shared in the past, it is rarely, if ever, cited in the same way as a journal article or other publication might be. If datasets were cited, they would achieve a validity and significance within the cycle of activities associated with scholarly communications and recognition of scholarly effort. (Excerpt from this source)

Percentage of Ariadne articles tagged with this term: 0.8%.
14 47 2.1

data mining

Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems. The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use. Aside from the raw analysis step, it involves database and data management aspects, data pre-processing, model and inference considerations, interestingness metrics, complexity considerations, post-processing of discovered structures, visualization, and online updating. (Excerpt from Wikipedia article: Data Mining)

Percentage of Ariadne articles tagged with this term: 2.8%.
48 116 0.9

geospatial data

A geographic information system (GIS), geographical information system, or geospatial information system is a system that captures, stores, analyzes, manages and presents data with reference to geographic location data. In the simplest terms, GIS is the merging of cartography, statistical analysis and database technology. GIS may be used in archaeology, geography, cartography, remote sensing, land surveying, public utility management, natural resource management, precision agriculture, photogrammetry, urban planning, emergency management, landscape architecture, navigation, aerial video and localized search engines. (Excerpt from Wikipedia article: Geographic information system)

Percentage of Ariadne articles tagged with this term: 3.6%.
62 138 0.7

bibliographic data

A bibliographic database is a database of bibliographic records, an organized digital collection of references to published literature, including journal and newspaper articles, conference proceedings, reports, government and legal publications, patents, books, etc. In contrast to library catalogue entries, a large proportion of the bibliographic records in bibliographic databases describe analytics (articles, conference papers, etc.) rather than complete monographs, and they generally contain very rich subject descriptions in the form of keywords, subject classification terms, or abstracts. (Excerpt from Wikipedia article: Bibliographic data)

Percentage of Ariadne articles tagged with this term: 20%.
348 728 0.1

automatic metadata generation

Use cases for the automatic generation and use of metadata will be developed using an approach that maximises the use of current community knowledge. Through knowledge-gathering workshops, commissioned expert reports, and surveys of the wider community it will be possible to identify a number of key use cases defining how metadata can be automatically gathered and how that metadata will be used. This approach builds on existing expertise and allows the focus to be on gathering use cases, analysing them, identifying tools and services, prioritising them, and assessing the main costs and benefits. The project will deliver general guidance for service providers in HE, a synthesis of previous work on automated metadata generation and a set of recommendations on tools and services required in the future. Project start date: 2009-03-01. Project end date: 2009-08-31. (Excerpt from this source)

Percentage of Ariadne articles tagged with this term: 0.2%.
3 4

bath information and data services

Bath Information and Data Services (BIDS) provided bibliographic database services to the academic community in the UK from 1991 to 2005. BIDS academic and scholarly journals services are now incorporated into IngentaConnect www.ingentaconnect.com (Excerpt from this source)

Percentage of Ariadne articles tagged with this term: 0.2%.
3 2

data curation for e-science

The DTI and the Research Councils are committing £118M to a government-industry programme on e-Science. The reason for this investment is that GRID technology is seen as the natural successor to the world wide web and the UK wants to take a leading role in order to develop solutions for its scientists and developing opportunities for its industry. The world wide web has revolutionised the way companies do business and fundamentally altered people's personal lives but it can no longer cope with the demands being placed on it by science. The world wide web allows very easy access to information, Grid allows that same easy access to computing power, data processing and communication of the results. The opportunities are immense, it will allow the efficient manipulation of vast amounts of information such as that contained in the human genome or the results from experiments in CERN's new Large Hadron Collider. It will also allow the ability to mine data again and again by comparing existing data sets collected for one purpose with new and previously unrelated information, so generating new knowledge. This consultancy will establish the current provision and future requirements for curation of primary research data being generated within e-science in the UK. This will include the e-science core programme but is anticipated to extend beyond this to other e-science research and primary research data. A consultancy report will provide a synthesis of findings and make recommendations for future action. The consultancy will support aims to manage JISC involvement in e-Science and the Research Grid, and to work in partnership to support the research community through activities such as its digital preservation programme. Project start date: 2003-02-01. Project end date: 2004-02-02. (Excerpt from this source)

Percentage of Ariadne articles tagged with this term: 0.1%.
1 1

data train project

The DataTrain project aims to build on findings and tools developed in the Incremental project (JISC 07/09 funding strand) by developing disciplinary focussed data management training modules for post-graduate courses in Archaeology and Social Anthropology at the University of Cambridge. To this end, the project will develop training modules for each of the two departments, and pilot these as part of the departments' postgraduate training provision in Spring of 2011. Beyond this, the modules would be embedded within research methods courses in each department. To extend its impact, the project would also make the training resources available through the University of Cambridge's institutional repository's support provision and via the Archaeology Data Service (ADS) and Digital Curation Centre (DCC). Project start date: 2010-08-01. Project end date: 2011-07-31. (Excerpt from this source)

Percentage of Ariadne articles tagged with this term: 0.1%.
1 1

data without boundaries

The Data without Boundaries û DwB û project exists to support equal and easy access to official microdata for the European Research Area, within a structured framework where responsibilities and liability are equally shared. Europe needs a comprehensive and easy-to-access research data infrastructure to be able to continuously produce cutting-edge research and reliable policy evaluations. (Excerpt from this source)

Percentage of Ariadne articles tagged with this term: 0.1%.
1 4

datagovuk

data.gov.uk is a UK Government project to open up almost all non-personal data acquired for official purposes for free re-use. Sir Tim Berners-Lee and Professor Nigel Shadbolt are the two key figures behind the project. The beta version of data.gov.uk has been online since the 30 September 2009 and by January 2010 more than 2,400 developers had registered to test the site, provide feedback and start experimenting with the data. When the project was officially launched in January 2010 it contained 2,500 data sets and developers had already built a site that showed the location of schools according to the rating assigned to them by education watchdog Ofsted. (Excerpt from Wikipedia article: Data.gov.uk)

Percentage of Ariadne articles tagged with this term: 0.1%.
2 4

datashare

DataShare, led by Edina, arises from an existing UK consortium of data support professionals working in departments and academic libraries in universities (Data Information Specialists Committee-UK), and builds on an international network with a tradition of data sharing and data archiving dating back to the 1960s in the social sciences. By working together across four universities and internally with colleagues already engaged in managing open access repositories for e-prints, this partnership will introduce and test a new model of data sharing and archiving to UK research institutions. By supporting academics within the four partner institutions who wish to share datasets on which written research outputs are based, this network of institution-based data repositories develops a niche model for deposit of 'orphaned datasets' currently filled neither by centralised subject-domain data archives/centres/grids nor by e-print based institutional repositories (IRs). The project's overall aim is to contribute to new models, workflows and tools for academic data sharing within a complex and dynamic information environment which includes increased emphasis on stewardship of institutional knowledge assets of all types; new technologies for doing e-Research; new research council policies and mandates; and the growth of the Open Access / Open Data movement. Project start date: 2007-03-01. Project end date: 2009-03-31. (Excerpt from this source)

Percentage of Ariadne articles tagged with this term: 0.5%.
8 19

dealing with data

UKOLN was asked to undertake a small-scale consultancy for JISC to investigate the relationships between data centres and institutions which may develop data repositories. The resulting direction-setting report will be used to advance the digital repository development agenda within the JISC Capital programme (2006 - 2009), to assist in the co-ordination of research data repositories and to inform an emerging Vision and Roadmap. The study includes a synthesis of some of the lessons learned from the projects within the Digital Repositories programme that were concerned with research data. Project start date: 2006-11-01. Project end date: 2007-05-31. (Excerpt from this source)

Percentage of Ariadne articles tagged with this term: 0.3%.
5 6
CSVXML


by Dr. Radut