FACTOID # 51: Russia won the first World Air Games, held in Turkey in 1997. Events included hang-gliding, sky-surfing, and ballooning.
 
 Home   Encyclopedia   Statistics   Countries A-Z   Flags   Maps   Education   Forum   FAQ   About 
 
 
 
WHAT'S NEW
RECENT ARTICLES
More Recent Articles »
 

SEARCH ALL

FACTS & STATISTICS    Advanced view

Search encyclopedia, statistics and forums:

 

 

(* = Graphable)

 

 


Encyclopedia > Apache Hadoop

Hadoop is a collection of Free Java software previously developed by the Nutch project but now maintainted by Lucene[1]. The system includes a distributed filesystem reminiscent of GoogleFS named the "Hadoop Distributed File System" (or just DFS[1]), a clone of MapReduce called "HadoopMapReduce"[2] and a few other miscellaneous pieces of software. The collection is intended to support the Lucene project's search engine by allowing it to be distributed over a network of computers. This article is about free software as defined by the sociopolitical free software movement; for information on software distributed without charge, see freeware. ... Java is an object-oriented programming language developed by James Gosling and colleagues at Sun Microsystems in the early 1990s. ... Computer software (or simply software) refers to one or more computer programs and data held in the storage of a computer for some purpose. ... Nutch is an effort to build an open source search engine. ... Lucene is a free open source, information retrieval API originally implemented in Java by Doug Cutting. ... A distributed file system is a file system that supports sharing of files and resources in the form of persisent storage over a network. ... Google File System (GFS) is a proprietary distributed file system based on Linux and developed by Google for their applications use. ... MapReduce is a programming tool developed by Google in C++ (Python and Java are supported through interfaces), in which parallel computations over large (> 1 terabyte) data sets are performed. ... A search engine or search service is a program designed to help find information stored on a computer system such as the World Wide Web, inside a corporate or proprietary network or a personal computer. ...


External links

  • Hadoop website
    • Hadoop wiki
    • Hadoop Distributed File System requirements
  • Mention of Nutch and Hadoop in an article on Google


 
 

COMMENTARY     


Share your thoughts, questions and commentary here
Your name
Your comments

Want to know more?
Search encyclopedia, statistics and forums:

 


Lesson Plans | Student Area | Student FAQ | Reviews | Press Releases |  Feeds | Contact
The Wikipedia article included on this page is licensed under the GFDL.
Images may be subject to relevant owners' copyright.
All other elements are (c) copyright NationMaster.com 2003-5. All Rights Reserved.
Usage implies agreement with terms, 1022, m