FACTOID # 59: People might eat oats when they're hungry, but people from Hungary don't eat oats.
 
 Home   Encyclopedia   Statistics   Countries A-Z   Flags   Maps   Education   Forum   FAQ   About 
 
 
 
WHAT'S NEW
RECENT ARTICLES
More Recent Articles »
 

SEARCH ALL

FACTS & STATISTICS    Advanced view

Search encyclopedia, statistics and forums:

 

 

(* = Graphable)

 

 


Encyclopedia > Corpora

In law a corpus (Latin: "body") is a set, a collection of documents and sources. See Corpus Juris Civilis. Corruption Jurisprudence Philosophy of law Law (principle) List of legal abbreviations Legal code Intent Letter versus Spirit Natural Justice Natural law Religious law Witness intimidation Legal research Critical legal studies External links Wikibooks Wikiversity has more about this subject: School of Law Look up law in Wiktionary, the free dictionary... Latin is the language originally spoken in the region around Rome called Latium. ... The Corpus Juris Civilis (Body of Civil Law) is a fundamental work in jurisprudence issued from 529 to 534 by order of Justinian I, Byzantine Emperor. ...


In linguistics, corpus (plural corpora) is a large and structured set of texts (now usually electronically stored and processed). A corpus may contain texts in single language (monolingual corpus) or text data in multiple languages (multilingual corpus). Multilingual corpora that have been specially formatted for side-by-side comparison are called aligned parallel corpora. Broadly conceived, linguistics is the scientific study of human language, and a linguist is someone who engages in this study. ...


In order to make the corpora more useful for doing linguistic research, they are often subjected to a process known as annotation. An example of annotating a corpus is part-of-speech tagging, or POS-tagging, in which information about each word's part of speech (verb, noun, adjective, etc.) are added to the corpus in the form of tags. Part-of-speech tagging is the process of marking up the words in a text with their corresponding parts of speech. ...


Corpora are the main knowledge base in corpus linguistics. Corpus linguistics is the study of language as expressed in samples (corpora) or real world text. ...


In biology, corpus refers to the main body/mass/part of an organ or other anatomical structure, distinguished from the head or tail. Main articles: Life All organisms (viruses not included) consist of cells, which in turn, are based on a common carbon-based biochemistry. ...


In Christianity, the corpus refers to the body of Christ and often is used as a technical term for the figure which hangs upon a crucifix. Christianity is a monotheistic religion based on the life and teachings of Jesus of Nazareth as presented in the New Testament writings of his early followers. ... Christ, is the English representation of the Greek word Χριστός (transliterated as Khristós), which means anointed. ... For the California hardcore punk band see Crucifix (band) A small, handheld crufix. ...


See also

The Bank of English is the name of the COBUILD corpus, a collection of English texts. ... For the California hardcore punk band see Crucifix (band) A small, handheld crufix. ... A parallel text is a text in one language together with its translation in another language. ... A translation memory, or TM, is a software program designed as an aid for human translators. ...

External links

  • WebCorp - The Web as a Corpus: http://www.webcorp.org.uk/

  Results from FactBites:
 
Hugh's Corpora print friendly article (1719 words)
Nevertheless, it seems to me that despite all these advances, corpora linguistics has also had several negative side-effects on the way teachers perceive their roles, and that they have actually enslaved us in ways which are not entirely healthy.
Corpora linguists repeatedly promote their products with often highly-detailed reference to frequency counts and the idea that frequency is central has become a common one.
What corpora have done is to place language back at the centre of classrooms and, as such, we all now have to think much more about how we actually use language.
Corpus linguistics - Wikipedia, the free encyclopedia (605 words)
This method represents a digestive approach to deriving a set of abstract rules by which a natural language is governed or else relates to another language.
The core of a corpus is the derivation of a set of Part-of-speech tags, representing a formal overview of the various types of words and word-relationships in a given language.
Other corpora represent many languages, varieties and modes, and include The British National Corpus, a 100 million word collection of a range of spoken and written texts, created in the 1990s by a consortium of publishers, universities (Oxford and Lancaster) and the British Library.
  More results at FactBites »


 
 

COMMENTARY     


Share your thoughts, questions and commentary here
Your name
Your comments

Want to know more?
Search encyclopedia, statistics and forums:

 


Lesson Plans | Student Area | Student FAQ | Reviews | Press Releases |  Feeds | Contact
The Wikipedia article included on this page is licensed under the GFDL.
Images may be subject to relevant owners' copyright.
All other elements are (c) copyright NationMaster.com 2003-5. All Rights Reserved.
Usage implies agreement with terms, 1022, m