March 19, 2008

Advanced Duplicate Detection (also related to spam detection and clustering)

This area deserves a dedicated article, but in the meantime I wanted to share some material we have already written about it, which will likely reappear in a future article.

In our recent newsletter article, we covered the problem of generic duplicate detection in search, and then duplicate detection in federated search.

In a SearchDev posting, Mark talked more about why checksums aren't always enough for duplicate detection, in messages 485 and 490.
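To give a rough feel for the checksum limitation, here is a minimal sketch. It is not taken from the SearchDev messages: the sample documents, the function names, and the choice of word shingles with Jaccard similarity are all assumptions made for illustration. It shows two documents that differ only by a trailing copyright notice getting completely different checksums, while a simple overlap measure still flags them as near-duplicates.

```python
import hashlib
import re


def checksum(text: str) -> str:
    """Exact-match fingerprint: any one-character change yields a new value."""
    return hashlib.md5(text.encode("utf-8")).hexdigest()


def shingles(text: str, k: int = 3) -> set:
    """Set of overlapping k-word sequences (hypothetical helper for this sketch)."""
    words = re.findall(r"\w+", text.lower())
    if len(words) < k:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: set, b: set) -> float:
    """Share of shingles the two sets have in common (0.0 to 1.0)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


# Made-up sample documents: the second only adds a footer.
doc_a = ("Acme Corp announced record quarterly earnings on Tuesday, "
         "beating analyst estimates.")
doc_b = doc_a + " Copyright 2008."

same_checksum = checksum(doc_a) == checksum(doc_b)
similarity = jaccard(shingles(doc_a), shingles(doc_b))

print("checksums match:", same_checksum)
print("shingle similarity: %.2f" % similarity)
```

A checksum can only answer "identical or not", so headers, footers, timestamps, or whitespace differences defeat it. A similarity score needs a threshold instead, and where to set that threshold depends on the content being indexed.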
