Collaborative filtering

Collaborative filtering (CF) is the method of making automatic predictions (filtering) about the interests of a user by collecting taste information from many users (collaborating). The underlying assumption of the CF approach is that those who agreed in the past tend to agree again in the future. For example, a collaborative filtering or recommendation system for music tastes could predict which music a user should like given a partial list of that user's tastes (likes or dislikes). Note that these predictions are specific to the user, but use information gleaned from many users. This differs from the simpler approach of giving an average (non-specific) score for each item of interest, for example based on its number of votes.

Methodology

Collaborative filtering systems usually take two steps:

  1. Look for users who share the same rating patterns with the active user (the user whom the prediction is for).
  2. Use the ratings from those like-minded users found in step 1 to calculate a prediction for the active user.
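
The two steps above can be sketched in a few lines of Python. This is only an illustrative sketch: the ratings data, the function names and the choice of cosine similarity as the measure of "like-mindedness" are all assumptions made for the example, not something the article prescribes.

```python
from math import sqrt

# Hypothetical ratings: user -> {item: rating on a 1-5 scale}
ratings = {
    "alice": {"A": 5, "B": 3, "C": 4},
    "bob":   {"A": 5, "B": 3, "C": 4, "D": 5},
    "carol": {"A": 1, "B": 5, "D": 2},
}

def cosine_similarity(r1, r2):
    """Cosine similarity over the items both users rated (0.0 if none)."""
    common = set(r1) & set(r2)
    if not common:
        return 0.0
    dot = sum(r1[i] * r2[i] for i in common)
    norm1 = sqrt(sum(r1[i] ** 2 for i in common))
    norm2 = sqrt(sum(r2[i] ** 2 for i in common))
    return dot / (norm1 * norm2)

def predict(ratings, active, item):
    """Similarity-weighted average of other users' ratings for `item`."""
    num = den = 0.0
    for user, user_ratings in ratings.items():
        if user == active or item not in user_ratings:
            continue
        # Step 1: how closely does this user's rating pattern match?
        sim = cosine_similarity(ratings[active], user_ratings)
        # Step 2: let like-minded users count for more in the prediction.
        num += sim * user_ratings[item]
        den += sim
    return num / den if den else None

prediction = predict(ratings, "alice", "D")
```

With this toy data, alice's ratings match bob's exactly and differ from carol's, so the prediction for item D (roughly 3.8) lands closer to bob's 5 than to carol's 2.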

Alternatively, item-based collaborative filtering, popularized by Amazon.com (users who bought x also bought y) and first proposed in the context of rating-based collaborative filtering by Vucetic and Obradovic in 2000, proceeds in an item-centric manner:

  1. Build an item-item matrix determining relationships between pairs of items.
  2. Using the matrix and the data on the current user, infer that user's tastes.
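
As a rough illustration of the item-centric steps, the sketch below builds the item-item matrix from simple co-purchase counts, in the spirit of "users who bought x also bought y". The purchase data and function names are hypothetical, and co-occurrence counting is just one assumed way of relating pairs of items.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical purchase histories: user -> set of items bought
purchases = {
    "u1": {"x", "y"},
    "u2": {"x", "y", "z"},
    "u3": {"x", "z"},
    "u4": {"y"},
}

def build_item_matrix(purchases):
    """Step 1: count, for each pair of items, how many users bought both."""
    matrix = defaultdict(lambda: defaultdict(int))
    for items in purchases.values():
        for a, b in combinations(sorted(items), 2):
            matrix[a][b] += 1
            matrix[b][a] += 1
    return matrix

def recommend(matrix, owned, top_n=3):
    """Step 2: score unseen items by co-occurrence with the user's items."""
    scores = defaultdict(int)
    for item in owned:
        for other, count in matrix[item].items():
            if other not in owned:
                scores[other] += count
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda i: (-scores[i], i))[:top_n]

matrix = build_item_matrix(purchases)
suggestions = recommend(matrix, {"y"})  # ['x', 'z']
```

Because the matrix depends only on items, it can be built once and reused for every user, which is one reason the item-centric formulation is attractive.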

See, for example, the Slope One item-based collaborative filtering family.
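
For readers unfamiliar with Slope One, here is a minimal sketch of the weighted variant: the rating a user would give a target item is estimated from the average rating difference between the target and each item the user has already rated, weighted by how many users rated both. The data and the function name are hypothetical and the code is an illustration, not a reference implementation.

```python
# Hypothetical ratings: user -> {item: rating}
ratings = {
    "john": {"A": 5, "B": 3, "C": 2},
    "mark": {"A": 3, "B": 4},
    "lucy": {"B": 2, "C": 5},
}

def slope_one_predict(ratings, user, target):
    """Weighted Slope One estimate of `user`'s rating for `target`."""
    num = den = 0.0
    for item, user_rating in ratings[user].items():
        if item == target:
            continue
        # Rating differences (target - item) from users who rated both.
        diffs = [r[target] - r[item] for r in ratings.values()
                 if target in r and item in r]
        if not diffs:
            continue
        avg_diff = sum(diffs) / len(diffs)
        # Weight each item's estimate by the number of supporting users.
        num += (user_rating + avg_diff) * len(diffs)
        den += len(diffs)
    return num / den if den else None

estimate = slope_one_predict(ratings, "lucy", "A")  # 13/3, about 4.33
```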


Another form of collaborative filtering can be based on implicit observations of normal user behavior (as opposed to the artificial behavior imposed by a rating task). These systems observe what a user has done together with what all users have done (what music they have listened to, what items they have bought) and use that data to predict the user's behavior in the future, or to predict how a user might like to behave if only they were given a chance. These predictions then have to be filtered through business logic to determine how they should affect what a business system does. It is, for instance, not useful to offer to sell somebody some music if they have already demonstrated that they own that music.
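
The business-logic step described above can be as simple as a filter applied after prediction. The sketch below assumes predictions arrive as (item, score) pairs and that the set of items a user already owns is known; both the data layout and the names are hypothetical.

```python
def filter_recommendations(predictions, owned):
    """Keep predicted items the user has not already shown they own."""
    return [(item, score) for item, score in predictions if item not in owned]

# Hypothetical predictions, best first: (item, predicted interest)
predictions = [("album_a", 0.9), ("album_b", 0.7), ("album_c", 0.4)]
owned = {"album_b"}

offers = filter_recommendations(predictions, owned)
```

Here album_b is predicted to be of interest but is dropped from the offers because the user already owns it.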


In the age of information explosion such techniques can prove very useful, as the number of items in even a single category (such as music, movies, books, news, web pages) has become so large that a single person cannot possibly view them all in order to select the relevant ones. Relying on a scoring or rating system that is averaged across all users ignores the specific demands of a user, and is particularly poor in tasks where there is large variation in interest, for example in the recommendation of music. Other methods to combat information explosion also exist, such as web search and data clustering.


History

Collaborative filtering stems from the earlier system of information filtering, where relevant information is brought to the attention of the user by observing patterns in previous behaviour and building a user profile. This system was essentially unable to help with exploration of the web and suffered from the cold-start problem: new users had to build up tendencies before the filtering became effective.


The first system to use collaborative filtering was the Information Tapestry project at Xerox PARC. This system allowed users to find documents based on previous comments by other users. It had many problems: it only worked for small groups of people and had to be accessed through word-specific queries, which largely defeated the purpose of collaborative filtering.


USENET Net news extended collaborative filtering to a mass scale of users while providing a simpler method for accessing articles. The system allowed users to rate material based on popularity, which then allowed other users to search for articles based on these ratings.


Types

Active filtering

Active filtering has become increasingly popular in recent years because the base of information available to users of the World Wide Web keeps growing. As the amount of information added to the internet grows exponentially, finding useful information efficiently becomes more difficult. A basic web search now returns thousands of results, a high percentage of which are ineffective and, more often than not, irrelevant. Many databases and search engines are available for searching, but most people are not familiar with all of the options, and this is where active filtering comes into effect.


Active filtering differs from other methods of collaborative filtering in that it uses a peer-to-peer approach: peers, coworkers, and people with similar interests rate products, reports, and other material objects, and share this information over the web for other people to see. It is based on the premise that people want to share consumer information with their peers. Users of active filtering use lists of commonly used links to send the information over the web, where others can view it and use the product ratings to make their own decisions.


Active collaborative filtering can be useful to many people in many situations. It can be especially effective where an unguided web search produces thousands of results that are not useful to the person looking for information, and where people are not comfortable with or knowledgeable about the array of databases available to them.


Advantages

Active collaborative filtering has several advantages. One is that the rating is given by a person who has actually viewed the topic or product of interest, so the explanation and rank come from a reliable source: someone who has come into contact with the product. Another is that people want to provide information regarding the matter at hand, and ultimately do so.


Disadvantages

There are a few disadvantages of active filtering. One is that the opinion may be biased. Also, as providing feedback requires action by the user, less data may be available than with a passive approach.


Passive filtering

A method of collaborative filtering that is thought to have great potential in the future is passive filtering, which collects information implicitly. A web browser is used to record a user’s preferences by following and measuring their actions. These implicit filters are then used to determine what else the user will like and recommend potential items of interest. Implicit filtering relies on the actions of users to determine a value rating for specific content, such as:

  • Purchasing an item
  • Repeatedly using, saving, or printing an item
  • Referring or linking to a site
  • The number of times an item is queried

An important feature of passive collaborative filtering is its use of time to determine whether a user is scanning a document or fully reading the material. The greatest strength of the system is that it removes certain variables from the analysis that would normally be present in active filtering. For example, only certain types of people will take the time to rate a site, whereas in passive collaborative filtering anyone accessing the site automatically provides data.
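
One way to picture how implicit actions become a value rating is a weighted sum of observed events, with viewing time used to separate scanning from reading. Everything specific in the sketch below is an assumption made for illustration: the weights, the 10-second threshold, the event names and the function name.

```python
# Hypothetical weights for implicit signals; the values are illustrative only.
SIGNAL_WEIGHTS = {"purchase": 5.0, "save": 2.0, "print": 2.0,
                  "link": 3.0, "query": 1.0}
# Assumed cutoff: shorter views are treated as scanning, not reading.
SCAN_THRESHOLD_SECONDS = 10

def implicit_score(events):
    """Turn observed (kind, value) events into a single value rating.

    For "view" events the value is the viewing time in seconds;
    for every other kind it is the number of times the action occurred.
    """
    score = 0.0
    for kind, value in events:
        if kind == "view":
            # Only views long enough to suggest reading add to the score.
            if value >= SCAN_THRESHOLD_SECONDS:
                score += 1.0
        else:
            score += SIGNAL_WEIGHTS.get(kind, 0.0) * value
    return score

events = [("purchase", 1), ("query", 3), ("view", 4), ("view", 120)]
score = implicit_score(events)  # 5.0 + 3.0 + 0.0 + 1.0 = 9.0
```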


Item based filtering

Item based filtering is another method of collaborative filtering in which items, rather than users, are rated and used as parameters. It uses the ratings to group items together so that consumers can compare them, and provides a consumer-based rating scale that manufacturers can use to see where their product stands in the market.


Through this method of filtering, users or user groups use and test the product and give it a rating that is relevant to the product and the product class in which it falls. These users test many products and with the results, the products are classified based on the information which the rating holds. The products are used and tested by the same user or group in order to get an accurate rating and eliminate some of the error that is possible in the tests that take place under this type of filtering.


Explicit versus implicit filtering

Within active and passive filtering there are explicit and implicit methods for determining user preferences. Explicit collection of user preferences requires the evaluator to indicate a value for the content on a rating scale. This adds a cognitive step to collaborative filtering, but can mean that the feedback received is more accurate. Implicit collection does not involve the direct input of opinion by the user; instead, it is assumed that their opinion is implied by their actions. This reduces variability amongst users and reduces the demand on the user, which can mean that much more data is available. However, this behaviour data does not necessarily represent the user's true opinion of an item accurately.


Applications

In commercial systems

Commercial sites that implement collaborative filtering systems include:

  • Amazon.com
  • Barnes & Noble
  • Half. ...
  • Movie Gallery
  • Last. ...
  • LibraryThing
  • Musicmatch
  • Netflix
  • TiVo

In non-commercial systems

Non-commercial sites that implement collaborative filtering systems include:

  • like-i-like.org - movies
  • AmphetaRate - RSS articles
  • Clinko - music & movies
  • Everyone's a Critic - movies
  • FilmAffinity - movies
  • GiveALink.org - websites
  • Gnod (The Global Network of Dreams) - music, movies and authors of books
  • Gnomoradio - free music
  • Grepr - podcasts
  • Indy - free music
  • iRATE radio - free music
  • Jester - Jokes
  • KindaKarma - authors, video games, movies and music
  • Moonranker - music, movies, and books
  • MovieCritic - movies (closed by Macromedia)
  • MovieLens - movies
  • Music Recommendation System for iTunes - music
  • Musicmobs - music
  • MyStrands - music
  • Popularism - movies
  • Rate Your Music - music
  • StumbleUpon - websites
  • The SuggestBot - a Wikipedia bot that recommends interesting Wikipedia pages to edit using collaborative filtering among other algorithms
  • Upto11 - music
  • WikiLens - various (movies, books, albums, restaurants, beers, etc.). You can add your own items and categories.
  • Naboj - scientific articles.
  • WheretoCycle - bicycle touring locations.

Software libraries

There are also software libraries which allow a developer to add collaborative filtering to an application or web site:

  • Taste - open-source, Java
  • Cofi - open-source, Java
  • CoFE - open-source, Java
  • RACOFI - open-source, Java
  • SUGGEST - Free, written in C. (A library, not open source.)
  • Rating-Based Item-to-Item - public domain, PHP
  • Vogoo PHP Lib - open-source, PHP
  • consensus - open-source, Python
  • C/Matlab Toolkit for Collaborative Filtering - open-source, Matlab, C

See also

  • The Long Tail
  • Recommendation system
  • Reputation system
  • Preference elicitation

External links

  • Word of Mouse: The Marketing Power of Collaborative Filtering
  • Collaborative Filtering bibliography by James Thornton
  • Collaborative Filtering by Francis Heylighen
  • Collaborative Filtering Resources by Jun Wang
  • Evaluating collaborative filtering recommender systems (DOI: 10.1145/963770.963772)
  • GroupLens research papers. GroupLens is one of the research labs that did a lot of pioneering research in collaborative filtering.
  • Content-Boosted Collaborative Filtering for Improved Recommendations. Prem Melville, Raymond J. Mooney, and Ramadass Nagarajan
  • 'Social Information Filtering: Algorithms for Automating "Word of Mouth"' by Upendra Shardanand
  • 'Learning utility graphs for multi-issue negotiation using collaborative filtering' - Valentin Robu
  • A collection of past and present "information filtering" projects (including collaborative filtering) at MIT Media Lab
  • Collaborative filtering visualized as a network using Amazon data on political book purchases
  • Eigentaste: A Constant Time Collaborative Filtering Algorithm. Ken Goldberg, Theresa Roeder, Dhruv Gupta, and Chris Perkins. Information Retrieval, 4(2), 133-151. July 2001.
  • Methods and Metrics for Cold-Start Recommendations
  • Competition to write a better Collaborative filter for Netflix - prize $1M

The Wikipedia article included on this page is licensed under the GFDL.
Images may be subject to relevant owners' copyright.
All other elements are (c) copyright NationMaster.com 2003-5. All Rights Reserved.