FACTOID # 100: The United States puts 0.7 % of its population in Prison - a vastly higher percentage than any other nation.
 
 Home   Encyclopedia   Statistics   Countries A-Z   Flags   Maps   Education   Forum   FAQ   About 
 
WHAT'S NEW
RECENT ARTICLES
More Recent Articles »
 

SEARCH ALL

FACTS & STATISTICS    Advanced view

Search encyclopedia, statistics and forums:

 

 

(* = Graphable)

 

 


Encyclopedia > Fuzzy string searching

Fuzzy string searching is the name for a category of techniques for finding one or more substrings of a text that approximately match some given pattern string. The bitap algorithm is one fuzzy string searching algorithm that is efficient for relatively short pattern strings. The bitap algorithm computes the levenshtein distance between two strings assuming it is smaller than a small integer n. ...


The Unix utility agrep is a fuzzy string searching program. Unix or UNIX is a computer operating system originally developed in the 1960s and 1970s by a group of AT&T Bell Labs employees including Ken Thompson, Dennis Ritchie, and Douglas McIlroy. ... A programming tool is a program or application that software developers use to create, debug, or maintain other programs and applications. ... Agrep (Approximate grep) is a fuzzy string searching program, developed by Udi Manber and Sun Wu between 1988 and 1991, for use with the Unix operating system. ...


See also

Soundex is a phonetic algorithm, an algorithm for indexing names by their sound when pronounced in English. ... In computing terms, a spelling checker (also spell checker) is a software program designed to verify the spelling of words in a file, helping a user ensure his/her spelling is correct. ... String searching algorithms are an important class of string algorithms that try to find a place where one or several strings (also called patterns) are found within a larger string or text. ... The term wildcard character has the following meanings: Telecommunication In telecommunications, a wildcard character is a character that may be substituted for any of a defined subset of all possible characters. ...

External links

  • http://laurikari.net/tre/

  Results from FactBites:
 
Search, MFM Communication Software, Inc. (1887 words)
You may be searching for a word which appears on a page as a substring of a larger word, or a long word that has more than one form.
If you are trying to search for a phrase that contains a comma, either precede the comma with a backslash character, or type a semicolon instead of the comma in your search string.
"Fuzzy" searching is beneficial because it reduces the chance of missing a web page due to incorrect spelling of a word or phrase, or the use of acronyms on a web page.
String searching algorithm Summary (1083 words)
String searching algorithms, sometimes called string matching algorithms, are an important class of string algorithms that try to find a place where one or several strings (also called patterns) are found within a larger string or text.
Baeza-Yates and Gonnet uses bits in a word to keep track of whether the previous j characters were a prefix of the search string, and is therefore adaptable to fuzzy string searching.
Some search methods, for instance trigram search, are intended to find a "closeness" score between the search string and the text rather than a "match/non-match".
  More results at FactBites »


 

COMMENTARY     


Share your thoughts, questions and commentary here
Your name
Your comments
Please enter the 5-letter protection code

Want to know more?
Search encyclopedia, statistics and forums:

 


Lesson Plans | Student Area | Student FAQ | Reviews | Press Releases |  Feeds | Contact
The Wikipedia article included on this page is licensed under the GFDL.
Images may be subject to relevant owners' copyright.
All other elements are (c) copyright NationMaster.com 2003-5. All Rights Reserved.
Usage implies agreement with terms.