Nutch

Nutch is an effort to build an open source search engine. It uses Lucene for its search and indexing components, while the fetcher (the robot that retrieves pages) was written from scratch for this project.


Nutch has a highly modular architecture allowing developers to create plugins for the following activities: media-type parsing, data retrieval, querying and clustering.
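
To make the plugin idea concrete, here is a minimal sketch in Java of how a media-type parsing extension point could be organized. It does not use Nutch's actual plugin API: the names `PluginSketch`, `Parser`, `ParserRegistry`, `HtmlParser`, and `extractText` are hypothetical, invented only for illustration, and the HTML handling is deliberately crude.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;

public class PluginSketch {

    /** Hypothetical extension point: turns raw fetched bytes into plain text. */
    interface Parser {
        String parse(byte[] content);
    }

    /** Hypothetical registry that maps a media type to the plugin handling it. */
    static final class ParserRegistry {
        private final Map<String, Parser> parsers = new LinkedHashMap<>();

        void register(String mediaType, Parser parser) {
            parsers.put(mediaType, parser);
        }

        Optional<Parser> lookup(String mediaType) {
            return Optional.ofNullable(parsers.get(mediaType));
        }
    }

    /** Illustrative plugin: strips anything that looks like a tag, then collapses whitespace. */
    static final class HtmlParser implements Parser {
        @Override
        public String parse(byte[] content) {
            String html = new String(content, StandardCharsets.UTF_8);
            return html.replaceAll("<[^>]*>", " ").replaceAll("\\s+", " ").trim();
        }
    }

    /** Dispatches to the registered plugin; returns an empty string for unknown media types. */
    static String extractText(ParserRegistry registry, String mediaType, byte[] content) {
        return registry.lookup(mediaType).map(p -> p.parse(content)).orElse("");
    }

    public static void main(String[] args) {
        ParserRegistry registry = new ParserRegistry();
        registry.register("text/html", new HtmlParser());
        // A second plugin supplied as a lambda, to show that plugins are interchangeable.
        registry.register("text/plain", c -> new String(c, StandardCharsets.UTF_8).trim());

        byte[] page = "<html><body><h1>Nutch</h1><p>open source search</p></body></html>"
                .getBytes(StandardCharsets.UTF_8);
        System.out.println(extractText(registry, "text/html", page));
    }
}
```

The point of the design is that the core only knows the `Parser` interface, so support for a new media type can be added by registering another implementation without touching the dispatch code. The other plugin activities listed above (data retrieval, querying, clustering) could follow the same pattern with their own interfaces.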


Tim O'Reilly has a seat on Nutch's board of directors.


Doug Cutting is the lead developer.


As of June 2005, Nutch has graduated from the Apache Incubator and is now a subproject of Lucene.


Nutch is written entirely in Java, but its data is stored in language-independent formats. In June 2003, a 100-million-page demo system was run successfully.


External links

  • Official page of the project
  • Nutch & Lucene Consulting from Otis Gospodnetić, Lucene developer and Lucene in Action co-author.
  • Nutch/Lucene Consultant - offers Nutch/Lucene-based solutions
  • Building Nutch: Open Source Search (2004) - ACM Queue vol. 2, no. 2
  • An article about Nutch (2003) - Search Engine Watch
  • Another article about Nutch (2003) - Tech News World
  • Unofficial documentation

The Wikipedia article included on this page is licensed under the GFDL.