FACTOID # 120: Nepal’s flag isn’t square or rectangular. It’s a double triangle.
 
 Home   Encyclopedia   Statistics   Countries A-Z   Flags   Maps   Education   Forum   FAQ   About 
 
WHAT'S NEW
RELATED ARTICLES
People who viewed "Metadata" also viewed:
RECENT ARTICLES
More Recent Articles »
 

SEARCH ALL

FACTS & STATISTICS    Advanced view

Search encyclopedia, statistics and forums:

 

 

(* = Graphable)

 

 


Encyclopedia > Metadata

Metadata is data about data. An item of metadata may describe an individual datum, or content item, or a collection of data including multiple content items.


Metadata (sometimes written 'meta data') is used to facilitate the understanding, use and management of data. The metadata required for effective data management varies with the type of data and context of use. In a library, where the data is the content of the titles stocked, metadata about a title would typically include a description of the content, the author, the publication date and the physical location. In the context of a camera, where the data is the photographic image, metadata would typically include the date the photograph was taken and details of the camera settings. On a portable music player such as an Apple iPod, the album names, song titles and album art embedded in the music files are used to generate the artist and song listings, and are metadata. In the context of an information system, where the data is the content of the computer files, metadata about an individual data item would typically include the name of the field and its length. Metadata about a collection of data items, a computer file, might typically include the name of the file, the type of file and the name of the data administrator. Julio Pérez Ferrero Library - Cúcuta, Colombia A modern-style library in Chambéry A library is a collection of information, sources, resources, and services: it is organized for use and maintained by a public body, an institution, or a private individual. ... For other uses, see Author (disambiguation). ... This article is about the photographing device. ... For other uses, see Photograph (disambiguation). ... A grayscale fourth-generation iPod with earphones. ... It has been suggested that this article or section be merged with Information systems. ... This article is about the machine. ...

Contents

What is metadata?

Metadata gives any kind of data context. Any item of data is a description of something. Metadata is a type of data where the something being described is data. Or, as it is often put, metadata is data about data. If we consider a particular place in the real world, this may be described by many items of data, for example:

  • 1 “E83BJ” .
  • 2 “17”
  • 3 “Sunny”

To make sense of and use this data, it is necessary to have access to some form of description of the sort of data it is, or, in other words, have access to its metadata. So, for example, the metadata for the above three items of data might include:

  • 1.1 “Post Code” – This is a description of the data item “E83BJ”
  • 1.2 “The unique identifier of a postal district" - This is another description of “E83BJ”
  • 1.3 “27th June 2006” – This is another description of “E83BJ”
  • 2 “Average temperature in degrees Celsius” – This is a description of “17”
  • 3 "Yesterday’s weather”. – This is a description of “sunny”

An item of metadata is itself data and therefore may have its own metadata. This might (not particularly usefully) be referred to as meta-metadata. So, for example, “Post Code” might have the following metadata:

  • 1.1.1 “data item name”
  • 1.1.2 “8 characters, starting with A – Z”

“27th June 2006” might have the following metadata:

  • 1.3.1 “Date last changed”

The hierarchy of data, metadata, meta-metadata etc. can go on forever. Fortunately we have sufficient background knowledge so that we can usually make sense of and use an item of data with access to very little, if any, formally defined metadata. So, for example, with the “Post Code” metadata “8 characters, starting with A – Z” , it would be possible using background knowledge to know that this is a description of the format of a post code, without having access to any defined metadata for “8 characters, starting with A – Z”. A postal code is a series of letters and/or digits appended to a postal address for the purpose of sorting mail. ...


Levels

As indicated, there are hierarchies of data and metadata. However, any particular item of data may be on different levels of a hierarchy depending on the context. For example, when considering the geography of London, “E83BJ” would be data and “Post Code” would be metadata. But, when considering the data management of an automated system that manages geographical data, “Post Code” might be data and then “data item name” and “8 characters, starting with A – Z” would be metadata.


In any particular context, metadata must be at a higher level of abstraction than the data it is describing. So, in relation to “E83BJ”, the item of data “is in London” is a further description of the place in the real world which has the post code “E83BJ” and is at the same level of abstraction. Therefore, although it is providing information about “E83BJ” (It is telling us that this is the post code of a place in London) this would not normally be considered metadata, as it is describing “E83BJ” qua place in the real world and not qua data.


Definitions

The term was introduced intuitively, without a formal definition. Because of that, today there are various definitions. The most common one is the literal translation:

  • Metadata is data about data.

Example: "12345" is data, and with no additional context is meaningless. When "12345" is given a meaningful name (metadata) of "ZIP code", one can understand (at least in the United States, and further placing "ZIP code" within the context of a postal address) that "12345" refers to the General Electric plant in Schenectady, New York. Mr. ... An address is a code and abstract concept expressing the fixed location of a home, business or other building on the earths surface. ... “GE” redirects here. ... Schenectady (IPA ) is a city in Schenectady County, New York, United States, of which it is the county seat. ...


As for most people the difference between data and information is merely a philosophical one of no relevance in practical use, other definitions are: The ASCII codes for the word Wikipedia represented in binary, the numeral system most commonly used for encoding computer information. ... Philosophy (from the Greek words philos and sophia meaning love of wisdom) is understood in different ways historically and by different philosophers. ...

  • Metadata is information about data.
  • Metadata is information about information.

There are more sophisticated definitions, such as:

  • "Metadata is structured, encoded data that describe characteristics of information-bearing entities to aid in the identification, discovery, assessment, and management of the described entities."[1]
  • "[Metadata is a set of] optional structured descriptions that are publicly available to explicitly assist in locating objects."[2]

These are used more rarely because they tend to concentrate on one purpose of metadata — to find "objects", "entities" or "resources" — and ignore others, such as using metadata to optimize compression algorithms, or to perform additional computations using the data. Source coding redirects here. ...


The metadata concept has been extended into the world of systems to include any "data about data": the names of tables, columns, programs, and the like. Different views of this "system metadata" are detailed below, but beyond that is the recognition that metadata can describe all aspects of systems: data, activities, people and organizations involved, locations of data and processes, access methods, limitations, timing and events, as well as motivation and rules.


Fundamentally, then, metadata is "the data that describe the structure and workings of an organization's use of information, and which describe the systems it uses to manage that information". To do a model of metadata is to do an "Enterprise model" of the information technology industry itself.[3] Enterprise modeling is the process of improving the enterprise performance through the creation of enterprise models. ...


Hierarchies of metadata

When structured into a hierarchical arrangement, metadata is more properly called an ontology or schema. Both terms describe "what exists" for some purpose or to enable some action. For instance, the arrangement of subject headings in a library catalog serves not only as a guide to finding books on a particular subject in the stacks, but also as a guide to what subjects "exist" in the library's own ontology and how more specialized topics are related to or derived from the more general subject headings. In both computer science and information science, an ontology is a data model that represents a set of concepts within a domain and the relationships between those concepts. ... The word schema comes from the Greek word σχήμα (skhēma) that means shape or more generally plan. ...


Metadata is frequently stored in a central location and used to help organizations standardize their data. This information is typically stored in a metadata registry. A Metadata Registry is a central place where metadata definitions are stored and maintained. ...


Difference between data and metadata

Usually it is not possible to distinguish between (raw) data and metadata because:

  • Something can be data and metadata at the same time. The headline of an article is both its title (metadata) and part of its text (data).
  • Data and metadata can change their roles. A poem, as such, would be regarded as data, but if there were a song that used it as lyrics, the whole poem could be attached to an audio file of the song as metadata. Thus, the labeling depends on the point of view.

These considerations apply no matter which of the above definitions is considered.


Use

Metadata has many different applications; this section lists some of the most common.


Metadata is used to speed up and enrich searching for resources. In general, search queries using metadata can save users from performing more complex filter operations manually. It is now common for web browsers (with the notable exception of Mozilla Firefox), P2P applications and media management software to automatically download and locally cache metadata, to improve the speed at which files can be accessed and searched [citation needed].


Metadata may also be associated to files manually. This is often the case with documents which are scanned into a document storage repository such as FileNet or Documentum. Once the documents have been converted into an electronic format a user brings the image up in a viewer application, manually reads the document and keys values into an online application to be stored in a metadata repository.


Metadata provide additional information to users of the data it describes. This information may be descriptive ("These pictures were taken by children in the school's third grade class.") or algorithmic ("Checksum=139F").


Metadata helps to bridge the semantic gap. By telling a computer how data items are related and how these relations can be evaluated automatically, it becomes possible to process even more complex filter and search operations. For example, if a search engine understands that "Van Gogh" was a "Dutch painter", it can answer a search query on "Dutch painters" with a link to a web page about Vincent Van Gogh, although the exact words "Dutch painters" never occur on that page. This approach, called knowledge representation, is of special interest to the semantic web and artificial intelligence. In many layered systems, some conflicts when concepts at a high level of abstraction need to be translated into lower, more concrete artifacts. ... W3Cs Semantic Web logo The Semantic Web is an evolving extension of the World Wide Web in which web content can be expressed not only in natural language, but also in a format that can be read and used by software agents, thus permitting them to find, share and... AI redirects here. ...


Certain metadata is designed to optimize lossy compression algorithms. For example, if a video has metadata that allows a computer to tell foreground from background, the latter can be compressed more aggressively to achieve a higher compression rate. A lossy data compression method is one where compressing data and then decompressing it retrieves data that may well be different from the original, but is close enough to be useful in some way. ...


Some metadata is intended to enable variable content presentation. For example, if a picture has metadata that indicates the most important region — the one where there is a person — an image viewer on a small screen, such as on a mobile phone's, can narrow the picture to that region and thus show the user the most interesting details. A similar kind of metadata is intended to allow blind people to access diagrams and pictures, by converting them for special output devices or reading their description using text-to-speech software. Speech synthesis is the artificial production of human speech. ...


Other descriptive metadata can be used to automate workflows. For example, if a "smart" software tool knows content and structure of data, it can convert it automatically and pass it to another "smart" tool as input. As a result, users save the many copy-and-paste operations required when analyzing data with "dumb" tools. Windows, GNOME and KDE keys for cut and pasting: Control + x (cut), Control + c (copy), Control + v (paste) In human-computer interaction, cut and paste or copy and paste is a user interface paradigm for transferring text, data, files or objects from a source to a destination. ...


Metadata is becoming an increasingly important part of electronic discovery. [1] Application and file system metadata derived from electronic documents and files can be important evidence. Recent changes to the Federal Rules of Civil Procedure make metadata routinely discoverable as part of civil litigation. Parties to litigation are required to maintain and produce metadata as part of discovery, and spoliation of metadata can lead to sanctions. Electronic discovery, or e-discovery, refers to discovery in civil litigation which deals with information in electronic form. ... Electronic document means any computer data (other than programs or system files) that are intended to be used in their computerized form, without being printed (although printing is usually possible). ... The Federal Rules of Civil Procedure (FRCP) govern civil procedure in the United States district courts, or more simply, court procedures for civil suits. ... This article is about civil law within the common law legal system. ... In law, discovery is the pre-trial phase in a lawsuit in which each party through the law of civil procedure can request documents and other evidence from other parties or can compel the production of evidence by using a subpoena or through other discovery devices, such as requests for... Lawyers and courts use the term spoliation to refer to destruction of evidence relevant to a legal proceeding. ...


Metadata has become important on the World Wide Web because of the need to find useful information from the mass of information available. Manually-created metadata adds value because it ensures consistency. If a web page about a certain topic contains a word or phrase, then all web pages about that topic should contain that same word or phrase. Metadata also ensures variety, so that if a topic goes by two names each will be used. For example, an article about "sport utility vehicles" would also be tagged "4 wheel drives", "4WDs" and "four wheel drives", as this is how SUVs are known in some countries. WWWs historical logo designed by Robert Cailliau The World Wide Web (commonly shortened to the Web) is a system of interlinked, hypertext documents accessed via the Internet. ... A fourth-generation (2006-) Ford Explorer, the best-selling mid-size SUV in the United States. ... For a proposal for tagging in Wikipedia, see Wikipedia:WikiProject Microformats#MediaWiki issues A tag cloud with terms related to Web 2. ...


Examples of metadata for an audio CD include the MusicBrainz project and AMG's All Music Guide. Similarly, MP3 files have metadata tags in a format called ID3. CD redirects here. ... MusicBrainz (MusicBrainz. ... All Media Guide (commonly known as AMG), is the company which owns and maintains All Music Guide, All Game Guide and All Movie Guide. ... The All Music Guide (AMG) is a metadata database about music, owned by All Media Guide. ... For other uses, see MP3 (disambiguation). ... This article or section does not cite its references or sources. ...


Types of metadata

Metadata can be classified by:

  • Content. Metadata can either describe the resource itself (for example, name and size of a file) or the content of the resource (for example, "This video shows a boy playing football").
  • Mutability. With respect to the whole resource, metadata can be either immutable (for example, the "Title" of a video does not change as the video itself is being played) or mutable (the "Scene description" does change).
  • Logical function. There are three layers of logical function: at the bottom the subsymbolic layer that contains the raw data itself, then the symbolic layer with metadata describing the raw data, and on the top the logical layer containing metadata that allows logical reasoning using the symbolic layer.

Important issues

To successfully develop and use metadata, several important issues should be treated with care:


Metadata risks

Microsoft Office files include metadata beyond their printable content, such as the original author's name, the creation date of the document, and the amount of time spent editing it. Unintentional disclosure can be awkward or even raise malpractice concerns. Some of Microsoft Office document's metadata can be seen by clicking File then Properties from the program's menu. Other metadata is not visible except through external analysis of a file, such as is done in forensics. The author of the Microsoft Word-based Melissa computer virus in 1999 was caught due to Word metadata that uniquely identified the computer used to create the original infected document. Microsoft Office is an office suite from Microsoft for Microsoft Windows and Apple Mac OS X operating systems. ... The Melissa worm, also known as Mailissa, Simpsons, Kwyjibo, or Kwejeebo, is a mass-mailing macro virus, hence leading some to classify it as a computer worm. ...


Metadata lifecycle

Even in the early phases of planning and designing it is necessary to keep track of all metadata created. It is not economical to start attaching metadata only after the production process has been completed. For example, if metadata created by a digital camera at recording time is not stored immediately, it may have to be restored afterwards manually with great effort. Therefore, it is necessary for different groups of resource producers to cooperate using compatible methods and standards.

  • Manipulation. Metadata must adapt if the resource it describes changes. It should be merged when two resources are merged. These operations are seldom performed by today's software; for example, image editing programs usually do not keep track of the Exif metadata created by digital cameras.
  • Destruction. It can be useful to keep metadata even after the resource it describes has been destroyed, for example in change histories within a text document or to archive file deletions due to digital rights management. None of today's metadata standards considers this phase.

Exchangeable image file format (official abbreviation Exif, not EXIF[1]) is a specification for the image file format used by digital cameras. ...

Storage

Metadata can be stored either internally, in the same file as the data, or externally, in a separate file. Both ways have advantages and disadvantages:

  • Internal storage allows transferring metadata together with the data it describes; thus, metadata is always at hand and can be manipulated easily. This method creates high redundancy and does not allow holding metadata together.
  • External storage allows bundling metadata, for example in a database, for more efficient searching. There is no redundancy and metadata can be transferred simultaneously when using streaming. However, as most formats use URIs for that purpose, the method of how the metadata is linked to its data should be treated with care. What if a resource does not have an URI (resources on a local hard disk or web pages that are created on-the-fly using a content management system)? What if metadata can only be evaluated if there is a connection to the Web, especially when using RDF? How to realize that a resource is replaced by another with the same name but different content?

Moreover, there is the question of data format: storing metadata in a human-readable format such as XML can be useful because users can understand and edit it without specialized tools. On the other hand, these formats are not optimized for storage capacity; it may be useful to store metadata in a binary, non-human-readable format instead to speed up transfer and save memory. Streaming media is multimedia that is continuously received by, and normally displayed to, the end-user while it is being delivered by the provider. ... A Uniform Resource Identifier (URI), is a compact string of characters used to identify or name a resource. ... Resource Description Framework (RDF) is a family of World Wide Web Consortium (W3C) specifications originally designed as a metadata model but which has come to be used as a general method of modeling information, through a variety of syntax formats. ...


Criticisms

Although the majority of computer scientists see metadata as a chance for better interoperability, some critics argue:

  • Metadata is too expensive and time-consuming. The argument is that companies will not produce metadata without need because it costs extra money, and private users also will not produce complex metadata because its creation is very time-consuming.
  • Metadata is too complicated. Private users will not create metadata because existing formats, especially MPEG-7, are too complicated. As long as there are no automatic tools for creating metadata, it will not be created.
  • Metadata is subjective and depends on context. Most probably, two persons will attach different metadata to the same resource due to their different points of view. Moreover, metadata can be misinterpreted due to its dependency on context. For example searching for "post-modern art" may miss a certain item because the expression was not in use at the time when that work of art was created, or searching for "pictures taken at 1:00" may produce confusing results due to local time differences.
  • There is no end to metadata. For example, when annotating a match of soccer with metadata, one can describe all the players and their actions in time and stop there. One can also describe the advertisements in the background and the clothes the players wear. One can also describe each fan on the tribune and the clothes they wear. All of this metadata can be interesting to one party or another — such as the spectators, sponsors or a counterterrorist unit of the police — and even for a simple resource the amount of possible metadata can be gigantic.
  • Metadata is useless. Many of today's search engines allow finding text very efficiently. Other techniques for finding pictures, videos and music (namely query-by-example) will become more and more powerful in the future. Thus, there is no real need for metadata.

The opposers of metadata sometimes use the term metacrap to refer to the unsolved problems of metadata in some scenarios. MPEG-7 is a multimedia content description standard. ... Metacrap is a portmanteau word drawn from metadata and crap. ...


Types

In general, there are two distinct classes of metadata: structural or control metadata and guide metadata.[4] Structural metadata is used to describe the structure of computer systems such as tables, columns and indexes. Guide metadata is used to help humans find specific items and is usually expressed as a set of keywords in a natural language.


Metatadata can be divided into 3 distinct categories:

  • Descriptive
  • Administrative
  • Structural

Relational database metadata

Each relational database system has its own mechanisms for storing metadata. Examples of relational-database metadata include: A relational database is a database that conforms to the relational model, and refers to a databases data and schema (the databases structure of how that data is arranged). ...

  • Tables of all tables in database, their names, sizes and number of rows in each table.
  • Tables of columns in each database, what tables they are used in, and the type of data stored in each column.

In database terminology, this set of metadata is referred to as the catalog. The SQL standard specifies a uniform means to access the catalog, called the INFORMATION_SCHEMA, but not all databases implement it, even if they implement other aspects of the SQL standard. For an example of database-specific metadata access methods, see Oracle metadata. This article or section does not cite its references or sources. ... SQL (IPA: or ), commonly expanded as Structured Query Language, is a computer language designed for the retrieval and management of data in relational database management systems, database schema creation and modification, and database object access control management. ... The Oracle relational database keeps metadata in several tables ALL_TABLES - list of all tables in the current database ALL_TAB_COLS - list of all columns in the database The following is a list of sample Oracle queries: Here are some sample queries of Oracle Metadata using the Food and Nutrition Name Conventions. ...


Data warehouse metadata

Data warehouse metadata systems are sometimes separated into two sections: A data warehouse is the main repository of an organizations historical data, its corporate memory. ...

  1. back room metadata that are used for Extract, transform, load functions to get OLTP data into a data warehouse
  2. front room metadata that are used to label screens and create reports

Kimball[5] lists the following types of metadata in a data warehouse (See also [2]): ETL also means Express Toll Lanes, see Express Toll Lanes. ... OLTP (Online Transaction Processing) is a form of transaction processing conducted via computer network. ...

  • source system metadata
    • source specifications, such as repositories, and source schemas
    • source descriptive information, such as ownership descriptions, update frequencies, legal limitations, and access methods
    • process information, such as job schedules and extraction code
  • data staging metadata
  • DBMS metadata, such as:
    • DBMS system table contents
    • processing hints

Michael Bracket defines metadata (what he calls "Data resource data") as "any data about the organization’s data resource".[6] Adrienne Tannenbaum defines metadata as "the detailed description of instance data. The format and characteristics of populated instance data: instances and values, dependent on the role of the metadata recipient".[7] These definitions are characteristic of the "data about data" definition. A repository is a central place where data is stored and maintained. ... A Logical schema is a data model of a specific problem domain that has more detail than a conceptual schema, but does not include the design considerations and physical storage parameters found in a physical schema. ... In networking, access method can refer to several concepts. ... Data acquisition is the sampling of the real world to generate data that can be manipulated by a computer. ... Data transmission is the conveyance of any kind of information from one space to another. ... A dimension table is a data warehousing concept. ... A surrogate key is a unique primary key generated by the relational database management system that is not derived from any data in the database and whose only significance is to act as the primary key. ... ÁInsert non-formatted text here ... Look up Aggregation in Wiktionary, the free dictionary The term aggregation may refer to— in economics, combining entities into a single entity which represent them, like aggregation of individual demand to total, or market, demand. ... A database management system (DBMS) is a computer program (or more typically, a suite of them) designed to manage a database, a large set of structured data, and run operations on the data requested by numerous users. ...


Business Intelligence metadata

Business Intelligence is the process of analyzing large amounts of corporate data, usually stored in large databases such as the Data Warehouse, tracking business performance, detecting patterns and trends, and helping enterprise business users make better decisions. Business Intelligence metadata describes how data is queried, filtered, analyzed, and displayed in Business Intelligence software tools, such as Reporting tools, OLAP tools, Data Mining tools. Business intelligence (BI) is a business management term, which refers to applications and technologies that are used to gather, provide access to, and analyze data and information about company operations. ... A data warehouse is the main repository of an organizations historical data, its corporate memory. ... Business intelligence (BI) is a business management term, which refers to applications and technologies that are used to gather, provide access to, and analyze data and information about company operations. ...


Examples:

  • OLAP metadata: The descriptions and structures of Dimensions, Cubes, Measures (Metrics), Hierarchies, Levels, Drill Paths
  • Reporting metadata: The descriptions and structures of Reports, Charts, Queries, DataSets, Filters, Variables, Expressions
  • Data Mining metadata: The descriptions and structures of DataSets, Algorithms, Queries

Business Intelligence metadata can be used to understand how corporate financial reports reported to Wall Street are calculated, how the revenue, expense and profit are aggregated from individual sales transactions stored in the data warehouse. A good understanding of Business Intelligence metadata is required to solve complex problems such as compliance with corporate governance standards, such as Sarbanes Oxley (SOX) or Basel II. Online Analytical Processing, or OLAP (IPA: ), is an approach to quickly provide answers to analytical queries that are multidimensional in nature. ... Data mining is the principle of sorting through large amounts of data and picking out relevant information. ... Elaborate marble facade of NYSE as seen from the intersection of Broad and Wall Streets For other uses, see Wall Street (disambiguation). ... The Sarbanes-Oxley Act of 2002 (107 H.R. 3763), signed into law on 30 July 2002, is considered the most significant change to federal securities laws in the United States since the New Deal. ...


General IT metadata

In contrast, David Marco, another metadata theorist, defines metadata as "all physical data and knowledge from inside and outside an organization, including information about the physical data, technical and business processes, rules and constraints of the data, and structures of the data used by a corporation."[8] Others have included web services, systems and interfaces. In fact, the entire Zachman framework (see Enterprise Architecture) can be represented as metadata.[9] This article is about Enterprise Architecture. ...


Notice that such definitions expand metadata's scope considerably, to encompass most or all of the data required by the Management Information Systems capability. In this sense, the concept of metadata has significant overlaps with the ITIL concept of a Configuration Management Database (CMDB), and also with disciplines such as Enterprise Architecture and IT portfolio management. Management Information Systems (MIS), sometimes referred to as Information Management and Systems, is the discipline covering the application of people, technologies, and procedures — collectively called information systems — to solving business problems. ... Atil, also spelled Itil (Turkic for Big River), was a name of the Volga River and of the capital of Khazaria from the middle of the 8th century until towards the end of the 10th century. ... A configuration management database (CMDB) is a repository of information related to all the components of an information system. ... This article is about Enterprise Architecture. ... IT portfolio management is the application of systematic management to large classes of items managed by enterprise Information technology (IT) capabilities. ...


This broader definition of metadata has precedent. Third generation corporate repository products (such as those eventually merged into the CA Advantage line) not only store information about data definitions (COBOL copybooks, DBMS schema), but also about the programs accessing those data structures, and the Job Control Language and batch job infrastructure dependencies as well. These products (some of which are still in production) can provide a very complete picture of a mainframe computing environment, supporting exactly the kinds of impact analysis required for ITIL-based processes such as Incident and Change Management. The ITIL Back Catalogue includes the Data Management volume which recognizes the role of these metadata products on the mainframe, posing the CMDB as the distributed computing equivalent. CMDB vendors however have generally not expanded their scope to include data definitions, and metadata solutions are also available in the distributed world. Determining the appropriate role and scope for each is thus a challenge for large IT organizations requiring the services of both. Job Control Language (JCL) is a scripting language used on IBM mainframe operating systems to instruct the Job Entry Subsystem (that is, JES2 or JES3) on how to run a batch program or start a subsystem. ... Atil, also spelled Itil (Turkic for Big River), was a name of the Volga River and of the capital of Khazaria from the middle of the 8th century until towards the end of the 10th century. ... ITIL Change Management is one of the eleven IT Service Management disciplines. ... Atil, also spelled Itil (Turkic for Big River), was a name of the Volga River and of the capital of Khazaria from the middle of the 8th century until towards the end of the 10th century. ... A configuration management database (CMDB) is a repository of information related to all the components of an information system. ...


Since metadata is pervasive, centralized attempts at tracking it need to focus on the most highly leveraged assets. Enterprise Assets may only constitute a small percentage of the entire IT portfolio.


Some practitioners have successfully managed IT metadata using the Dublin Core metamodel.[10] The introduction to this article provides insufficient context for those unfamiliar with the subject matter. ...


IT metadata management products

First generation data dictionary/metadata repository tools would be those only supporting a specific DBMS, such as IDMS's IDD (integrated data dictionary), the IMS Data Dictionary, and Adabas's Predict. A database management system (DBMS) is a computer program (or more typically, a suite of them) designed to manage a database, a large set of structured data, and run operations on the data requested by numerous users. ... IDMS (Integrated Database Management System) is a (network) CODASYL database management system first developed at B.F. Goodrich and later marketed by Cullinane Database Systems (renamed Cullinet in 1983). ... Information Management System (IMS) is a joint hierarchical database and information management system. ... Adabas is Software AG’s advanced database management system. ...


Second generation would be ASG's DATAMANAGER product which could support many different file and DBMS types.


Third generation repository products became briefly popular in the early 1990s along with the rise of widespread use of RDBMS engines such as IBM's DB2. A relational database management system (RDBMS) is a database management system (DBMS) that is based on the relational model as introduced by Edgar F. Codd. ... DB2 is one of IBMs lines of relational database management system (or, as IBM now calls it, data server) software products within IBMs broader Information Management Software line. ...


Fourth generation products link the repository with more Extract, transform, load tools and can be connected with architectural modeling tools. Examples include Adaptive Metadata Manager from Adaptive, Rochade from ASG,InfoLibrarian Metadata Integration Framework and Troux Technologies Metis Server product. ETL also means Express Toll Lanes, see Express Toll Lanes. ... Troux Technologies (pronounced true) is a global provider of IT Governance software to accelerate IT and business transformation. ...


File system metadata

Nearly all file systems keep metadata about files out-of-band. Some systems keep metadata in directory entries; others in specialized structure like inodes or even in the name of a file. Metadata can range from simple timestamps, mode bits, and other special-purpose information used by the implementation itself, to icons and free-text comments, to arbitrary attribute-value pairs. For library and office filing systems, see Library classification. ... Out-of-band signaling is telecommunication signaling (exchange of information in order to control a telephone call) that is done on a channel that is dedicated for the purpose and separate from the channels used for the telephone call. ... In computing, a directory, catalog, or folder, is an entity in a file system which can contain a group of files and/or other directories. ... In computing, an inode is a data structure on a traditional Unix-style file system such as UFS. An inode stores basic information about a regular file, directory, or other file system object. ... Timestamp can refer to a time code or to a digitally signed timestamp whose signer vouches for the existence of the signed document or content at the time given as part of the digital signature. ... Nuvola icons for KDE are available as PNG images, which come in six sizes, and SVG images, which are scalable On computer displays, a computer icon is a small pictogram. ... Attribute-value pairs are a fundamental data representation in many computing systems and applications. ...


With more complex and open-ended metadata, it becomes useful to search for files based on the metadata contents. The Unix find utility was an early example, although inefficient when scanning hundreds of thousands of files on a modern computer system. Apple Computer's current version of its Mac OS X operating system (Tiger) supports cataloguing and searching for file metadata through a feature known as Spotlight. Microsoft worked in the development of similar functionality with the Instant Search system in Windows Vista, as well as being present in SharePoint Server. Linux implements file metadata using extended file attributes. Filiation of Unix and Unix-like systems Unix (officially trademarked as UNIX®, sometimes also written as or ® with small caps) is a computer operating system originally developed in 1969 by a group of AT&T employees at Bell Labs including Ken Thompson, Dennis Ritchie and Douglas McIlroy. ... For other uses, see Find (disambiguation). ... Apple Inc. ... Mac OS X (pronounced ) is a line of graphical operating systems developed, marketed, and sold by Apple Inc. ... Mac OS X version 10. ... Spotlight is an as-you-type, system-wide desktop search feature found in Mac OS X v10. ... Microsoft Corporation, (NASDAQ: MSFT, HKSE: 4338) is a multinational computer technology corporation with global annual revenue of US$44. ... Windows Vista (formerly codenamed Windows Longhorn) has many significant new features compared with previous Microsoft Windows versions, covering most aspects of the operating system. ... Windows Vista is a line of graphical operating systems used on personal computers, including home and business desktops, notebook computers, Tablet PCs, and media centers. ... Microsoft Office SharePoint Portal Server - Wikipedia, the free encyclopedia /**/ @import /skins-1. ... This article is about operating systems that use the Linux kernel. ... Extended file attributes is a file system feature that enables users to associate computer files with metadata not interpreted by the filesystem, whereas regular attributes have a purpose strictly defined by the filesystem (such as permissions or records of creation and modification times). ...


Image metadata

Examples of image files containing metadata include Exchangeable Image File Format (EXIF) and Tagged Image File Format (TIFF). Exchangeable image file format (official abbreviation Exif, not EXIF[1]) is a specification for the image file format used by digital cameras. ... “TIFF” redirects here. ...


Having metadata about images embedded in TIFF or EXIF files is one way of acquiring additional data about an image. Image metadata are attained through tags. Tagging pictures with subjects, related emotions, and other descriptive phrases helps Internet users find pictures easily rather than having to search through entire image collections. A prime example of an image tagging service is Flickr, where users upload images and then describe the contents. Other patrons of the site can then search for those tags. Flickr uses a folksonomy: a free-text keyword system in which the community defines the vocabulary through use rather than through a controlled vocabulary. For a proposal for tagging in Wikipedia, see Wikipedia:WikiProject Microformats#MediaWiki issues A tag cloud with terms related to Web 2. ... Flickr is a photo sharing website and web services suite, and an online community platform, which is generally considered an early example of a Web 2. ... A folksonomy is a user-generated taxonomy used to categorize and retrieve web content such as Web pages, photographs and Web links, using open-ended labels called tags. ... Controlled vocabularies are used in indexing schemes, subject headings, thesauri and taxonomies. ...


Digital photography is increasingly making use of metadata tags. Photographers shooting Camera RAW file formats can use applications such as Adobe Bridge or Apple Computer's Aperture to work with camera metadata for post-processing. Users can also tag photos for organization purposes using Adobe's Extensible Metadata Platform (XMP) language, for example. A raw image file contains minimally processed data from the image sensor of a digital camera or image scanner. ... Adobe Bridge is an organizational program created and released by Adobe Systems as a part of the Adobe Creative Suite 2 (CS2) in May, 2005. ... Aperture is a software program for Mac OS X announced by Apple Inc at a New York media event on October 19, 2005, designed to assist professional photographers in post-production work. ... The Extensible Metadata Platform or XMP is a specific type of extensible markup language used in PDF, photography and photo editing applications. ...


Program metadata

Metadata is casually used to describe the controlling data used in software architectures that are more abstract or configurable. Most executable file formats include what may be termed "metadata" that specifies certain, usually configurable, behavioral runtime characteristics. However, it is difficult if not impossible to precisely distinguish program "metadata" from general aspects of stored-program computing architecture; if the machine reads it and acts upon it, it is a computational instruction, and the prefix "meta" has little significance. An executable or executable file, in computer science, is a file whose contents are meant to be interpreted as a program by a computer. ... In computer science, runtime or run time describes the operation of a computer program, the duration of its execution, from beginning to termination (compare compile time). ... Design of the Von Neumann architecture For the robotic architecture also named after Von Neumann, see Von Neumann machine The von Neumann architecture is a computer design model that uses a single storage structure to hold both instructions and data. ... In computer science, an instruction typically refers to a single operation of a processor within a computer architecture. ...


In Java, the class file format contains metadata used by the Java compiler and the Java virtual machine to dynamically link classes and to support reflection. The J2SE 5.0 version of Java included a metadata facility to allow additional annotations that are used by development tools. Java language redirects here. ... The Java source files (.java files) get compiled into . ... It has been suggested that this article or section be merged into Java programming language. ... A Java Virtual Machine (JVM) is a set of computer software programs and data structures which implements a specific virtual machine model. ... A linker or link editor is a program that takes one or more objects generated by compilers and assembles them into a single executable program. ... In object-oriented programming, a class is a programming language construct used to group related fields and methods. ... In computer science, reflection is the process by which a computer program of the appropriate type can be modified in the process of being executed, in a manner that depends on abstract features of its code and its runtime behavior. ... Java 2 Platform, Standard Edition or J2SE is a collection of java Application Programming Interfaces targeting Java platform applications running on a workstation. ... The Metadata Facility for Java is a specification for Java that defines an API for annotating fields, methods, and classes as having particular attributes that indicate they should be processed in special ways by development tools, deployment tools, or run-time libraries. ... A programming tool is a program or application that software developers use to create, debug, or maintain other programs and applications. ...


In MS-DOS, the COM file format does not include metadata, while the EXE file and Windows PE formats do. These metadata can include the company that published the program, the date the program was created, the version number and more. Microsofts disk operating system, MS-DOS, was Microsofts implementation of DOS, which was the first popular operating system for the IBM PC, and until recently, was widely used on the PC compatible platform. ... In MS-DOS and compatible DOSes, and in 8-bit CP/M, a COM file is a simple type of executable file. ... EXE is the common filename extension for denoting an executable file (a program) in the OpenVMS, DOS, Microsoft Windows, ReactOS, and OS/2 operating systems. ... The Portable Executable (PE) format is a file format for executables, object code, and DLLs, used in 32-bit and 64-bit versions of Windows operating systems. ...


In the Microsoft .NET executable format, extra metadata is included to allow reflection at runtime. The Microsoft . ... In computer science, reflection is the process by which a computer program of the appropriate type can be modified in the process of being executed, in a manner that depends on abstract features of its code and its runtime behavior. ...


Existing software metadata

Object Management Group (OMG) has defined metadata format for representing entire existing applications for the purposes of software mining, software modernization and software assurance. This specification, called the OMG Knowledge Discovery Metamodel (KDM) is the OMG's foundation for "modeling in reverse". KDM is a common language-independent intermediate representation that provides an integrated view of an entire enterpise application, including its behavior (program flow), data, and structure. One of the applications of KDM is Business Rules Mining. Object Management Group (OMG) is a consortium, originally aimed at setting standards for distributed object-oriented systems, and is now focused on modeling (programs, systems and business processes) and model-based standards in some 20 vertical markets. ... Software mining is a promising application of Knowledge discovery in the area of software modernization which involves understanding existing software artifacts. ... // Software Modernization is the process of understanding and evolving existing software assets. ... Knowledge Discovery Metamodel (KDM) is publicly available specification from the Object Management Group (OMG). ...


Knowledge Discovery Metamodel includes a fine grained low-level representation (called "micro KDM"), suitable for performing static analysis of programs. Knowledge Discovery Metamodel (KDM) is publicly available specification from the Object Management Group (OMG). ...


Document metadata

Most programs that create documents, including Microsoft SharePoint, Microsoft Word and other Microsoft Office products, save metadata with the document files. These metadata can contain the name of the person who created the file (obtained from the operating system), the name of the person who last edited the file, how many times the file has been printed, and even how many revisions have been made on the file. Other saved material, such as deleted text (saved in case of an undelete command), document comments and the like, is also commonly referred to as "metadata", and the inadvertent inclusion of this material in distributed files has sometimes led to undesirable disclosures. SharePoint - Wikipedia, the free encyclopedia /**/ @import /skins-1. ... Microsoft Word is a word processing application from Microsoft. ... Microsoft Office is an office suite from Microsoft for Microsoft Windows and Apple Mac OS X operating systems. ...


Document Metadata is particularly important in legal environments where litigation can request this sensitive information (metadata) which can include many elements of private detrimental data. This data has been linked to multiple lawsuits that have got corporations into legal complications.


Many legal firms today use "Metadata Management Software", also known as "Metadata Removal Tools". This software can be used to clean documents before they are sent outside of their firm. This process, known as metadata management, protects lawfirms from potentially unsafe leaking of sensitive data through Electronic Discovery. Electronic discovery, or e-discovery, refers to discovery in civil litigation which deals with information in electronic form. ...


For a list of executable formats, see object file. In computer science, object file or object code is an intermediate representation of code generated by a compiler after it processes a source code file. ...


Metamodels

Metadata on Models are called Metamodels. In Model Driven Engineering, a Model has to conform to a given Metamodel. According to the MDA guide, a metamodel is a model and each model conforms to a given metamodel. Meta-modeling allows strict and agile automatic processing of models and metamodels. In general, a metamodel is a model which describes a model. ... // Presentation MDE (Model Driven Engineering) is an emerging technique in software, system and data engineering, based on the systematic use of models. ... An abstract model (or conceptual model) is a theoretical construct that represents something, with a set of variables and a set of logical and quantitative relationships between them. ... In general, a metamodel is a model which describes a model. ... Model-driven architecture (MDAâ„¢) is a software design approach launched by the Object Management Group (OMG)[1] in 2001. ... To meet Wikipedias quality standards, this article or section may require cleanup. ...


The Object Management Group (OMG) defines 4 layers of meta-modeling. Each level of modeling is defined, validated by the next layer: Object Management Group (OMG) is a consortium, originally aimed at setting standards for distributed object-oriented systems, and is now focused on modeling (programs, systems and business processes) and model-based standards in some 20 vertical markets. ...

Xegy uses metadata. In the field of software engineering, the Unified Modeling Language (UML) is a standardized specification language for object modeling. ... The Common Warehouse Metamodel (CWM) is a specification for modeling metadata for relational, non-relational, multi-dimensional, and most other objects found in a data warehousing environment. ... Knowledge Discovery Metamodel (KDM) is publicly available specification from the Object Management Group (OMG). ... The Meta-Object Facility (MOF), is an Object Management Group (OMG) standard for Model Driven Engineering. ...


Strange metadata

Since metadata are also data, it is possible to have metadata of metadata–"meta-metadata." Machine-generated meta-metadata, such as the reversed index created by a free-text search engine, is generally not considered metadata, though.


Metadata that are embedded with content is called embedded metadata. A data repository typically stores the metadata detached from the data.


Digital library metadata

There are three categories of metadata that are frequently used to describe objects in a digital library [3][4]:

  1. descriptive - Information describing the intellectual content of the object, such as MARC cataloguing records, finding aids or similar schemes. It is typically used for bibliographic purposes and for search and retrieval.
  2. structural - Information that ties each object to others to make up logical units (e.g., information that relates individual images of pages from a book to the others that make up the book).
  3. administrative - Information used to manage the object or control access to it. This may include information on how it was scanned, its storage format, copyright and licensing information, and information necessary for the long-term preservation of the digital objects.

Marc (or Mark) is a western European forename, a variation on the Roman name Marcus (see ‎List of names derived from Marcus ). This version, originally the French form, is spelled in the Latin manner, using a c. This variation is becoming increasingly common in the UK and US. Marc is... Not to be confused with copywriting. ... Digital preservation refers to the management of digital information over time. ...

Geospatial metadata

Metadata that describe geographic objects (such as datasets, maps, features, or simply documents with a geospatial component) have a history going back to at least 1994 (refer MIT Library page on FGDC Metadata). This class of metadata is described more fully on the Geospatial metadata page. Geospatial metadata (also geographic metadata, or simply metadata when used in a geographic context) is a type of metadata that is applicable to objects that have an explicit or implicit geographic extent, in other words, are associated with some position on the surface of the Globe. ...


See also

AgMES is the abbreviation for Agricultural Metadata Element Set. ... APEv2 tags are used to add metadata, such as the title, artist, or track number, to digital audio files. ... The Common Warehouse Metamodel (CWM) is a specification for modeling metadata for relational, non-relational, multi-dimensional, and most other objects found in a data warehousing environment. ... A data dictionary is a set of metadata that contains definitions and representations of data elements. ... A domain-specific language (DSL) is a programming language designed to be useful for a specific set of tasks, in contrast to general-purpose programming languages. ... Domain-specific modelling (DSM) is a higher level of CASE, a way to model data structures and logic in domain concepts independent from programming languages and thus also syntax details. ... The introduction to this article provides insufficient context for those unfamiliar with the subject matter. ... Exchangeable image file format (official abbreviation Exif, not EXIF[1]) is a specification for the image file format used by digital cameras. ... A folksonomy is a user-generated taxonomy used to categorize and retrieve web content such as Web pages, photographs and Web links, using open-ended labels called tags. ... Generic Modeling Environment (GME) is a model-integrated program synthesis tool for creating domain-specific models of large-scale systems. ... Geospatial metadata (also geographic metadata, or simply metadata when used in a geographic context) is a type of metadata that is applicable to objects that have an explicit or implicit geographic extent, in other words, are associated with some position on the surface of the Globe. ... GNUBrain is a framework for creating personal software agents, building a multi-agent system, manage your personal meta data and execute distributed algorithms. ... This article or section does not cite its references or sources. ... The International Press Telecommunications Council, based in Windsor, United Kingdom, is a consortium of the worlds major news agencies and news industry vendors. ... ISO/IEC 11179 (formally known as the ISO/IEC 11179 Metadata Registry (MDR) standard) is a standard for representing metadata for an organization in a Metadata Registry. ... The Kendra initiative is an initiative to create ways of using the Internet to discover, use, share and pay for digital media such as music, web content, books and films, by creating compromises between the desires of producers and consumers of digital media. ... Learning Object Metadata is a data model, usually encoded in XML, used to describe a learning object and similar digital resources used to support learning. ... In computer programming, the term magic number has multiple meanings. ... Material eXchange Format (MXF) is a container format for professional digital video and audio media defined by a set of SMPTE standards. ... Meta tags are used to provide structured data about data. ... In metadata, metadata discovery is the process of using automated tools to discover the semantics of a data element in data sets. ... A Metadata Registry is a central place where metadata definitions are stored and maintained. ... The Metadata Facility for Java is a specification for Java that defines an API for annotating fields, methods, and classes as having particular attributes that indicate they should be processed in special ways by development tools, deployment tools, or run-time libraries. ... The Meta-Object Facility (MOF), is an Object Management Group (OMG) standard for Model Driven Engineering. ... // Presentation MDE (Model Driven Engineering) is an emerging technique in software, system and data engineering, based on the systematic use of models. ... 3,4-mehtylenedioxy-N-methylamphetamine, or MDE, is a psychedelic hallucinogenic drug and entactogen of the phenethylamine family. ... Metadata publishing is the process of making metadata Data elements available to external users, both people and machines. ... Meta noise refers to inaccurate or irrelevant metadata. ... Metatables are the section of a database or other data holding structure that is designated to hold data that will act as source code or metadata. ... In information technology, MicroContent is the smallest piece of content, usually up to 60 characters of text. ... Microformat logo A microformat (sometimes abbreviated μF or uF) is a web-based[1] data formatting approach that seeks to re-use existing content as metadata, using only XHTML and HTML classes[2] and attributes. ... Model-based testing is software testing in which test cases are derived in whole or in part from a model that describes some (usually functional) aspects of the system under test (SUT). ... To meet Wikipedias quality standards, this article or section may require cleanup. ... MPEG-7 is a multimedia content description standard. ... The PBCore (Public Broadcasting Metadata Dictionary) was created by the public broadcasting community in the United States of America for use by public broadcasters and related communities. ... REBOL, the Relative Expression Based Object Language (pronounced [rebl]), is a data exchange and programming language designed specifically for network communications and distributed computing. ... Resource Description Framework (RDF) is a family of World Wide Web Consortium (W3C) specifications originally designed as a metadata model but which has come to be used as a general method of modeling information, through a variety of syntax formats. ... RIFE is a full-stack open source Java web application framework with tools and APIs to implement most common web features. ... W3Cs Semantic Web logo The Semantic Web is an evolving extension of the World Wide Web in which web content can be expressed not only in natural language, but also in a format that can be read and used by software agents, thus permitting them to find, share and... “TIFF” redirects here. ... Topic Maps are an ISO standard for the representation and interchange of knowledge, with an emphasis on the findability of information. ... Look up transformation in Wiktionary, the free dictionary. ... A Vocabulary-Based Transformation (VBT) is a transformation that is aided by the use of a semantic equivalence statements within a controlled vocabulary. ... The Extensible Markup Language (XML) is a general-purpose markup language. ... XMI (XML Metadata Interchange) is an Object Management Group standard for exchanging metadata information via XML. It can be used for any metadata whose metamodel can be expressed in MOF. The most common use of XMI is as an interchange format for UML, although it can also be used for... The Extensible Metadata Platform or XMP is a specific type of extensible markup language used in PDF, photography and photo editing applications. ...

References

  1. ^ American Library Association, Task Force on Metadata Summary Report., June 1999
  2. ^ D. C. A. Bultermann, Is It Time For a Moratorium on Metadata?, IEEE MultiMedia, Oct-Dec 2004
  3. ^ William R. Durrell, Data Administration: A Practical Guide to Data Administration, McGraw-Hill, 1985
  4. ^ Bretherton, F. P. and Singley, P. T. 1994, Metadata: A User's View, Proceedings of the International Conference on Very Large Data Bases (VLDB), 1091-1094
  5. ^ Ralph Kimball, The Data Warehouse Lifecycle Toolkit, Wiley, 1998, ISBN 0-471-25547-5
  6. ^ Guy V Tozer, Metadata Management for Information Control and Business Success, Artech House, 1999, ISBN 0-89006-280-3
  7. ^ Adrienne Tannenbaum, Metadata Solutions: Using Metamodels, Repositories, XML, and Enterprise Portals to Generate Information on Demand, Addison-Wesley, 2002, ISBN 0-201-71976-2
  8. ^ David Marco, Building and Managing the Meta Data Repository: A Full Lifecycle Guide, Wiley, 2000, ISBN 0-471-35523-2
  9. ^ David C. Hay, Data Model Patterns: A Metadata Map, Morgan Kaufman, 2006, ISBN 0-12-088798-3
  10. ^ R. Todd Stephens (2003). Utilizing Metadata as a Knowledge Communication Tool. Proceedings of the International Professional Communication Conference 2004. Minneapolis, MN: Institute of Electrical and Electronics Engineers, Inc.
  • Michael H. Brackett, Data Resource Quality, Addison-Wesley, 2000, ISBN 0-201-71306-3

Ralph Kimball, PhD, is a widely published author on the subject of data warehousing and business intelligence. ...

External links

Look up metadata in Wiktionary, the free dictionary.

  Results from FactBites:
 
Metadata - Wikipedia, the free encyclopedia (3837 words)
Metadata are of special interest in various fields of computer science, e.
Fundamentally, then, metadata are "the data that describe the structure and workings of an organization’s use of information, and which describe the systems it uses to manage that information." To do a model of metadata is to do an "Enterprise model" of the information technology industry itself.
Metadata are more properly called an ontology or schema when structured into a hierarchical arrangement.
  More results at FactBites »


 

COMMENTARY     


Share your thoughts, questions and commentary here
Your name
Your comments
Please enter the 5-letter protection code

Want to know more?
Search encyclopedia, statistics and forums:

 


Lesson Plans | Student Area | Student FAQ | Reviews | Press Releases |  Feeds | Contact
The Wikipedia article included on this page is licensed under the GFDL.
Images may be subject to relevant owners' copyright.
All other elements are (c) copyright NationMaster.com 2003-5. All Rights Reserved.
Usage implies agreement with terms.