March 19, 2008

Advanced Duplicate Detection (also related to spam detection and clustering)

We need to write a dedicated article about this area, but in the meantime I wanted to share some material we have already written about it, which will likely reappear in a future article.

In our recent newsletter article, we covered the problem of generic duplicate detection in search, and then duplicate detection in federated search.

In a SearchDev posting, Mark talked more about why checksums aren't always enough for duplicate detection, in messages 485 and 490.
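To make the checksum point concrete, here is a minimal sketch in Python. It is not taken from the SearchDev messages; it only illustrates one commonly cited limitation, namely that a checksum changes completely when a single character differs, so near-duplicates look unrelated. The sample documents, the function names, the choice of CRC32 as the checksum, and the use of 3-word shingles with Jaccard similarity are all assumptions made for illustration.

```python
import re
import zlib


def checksum(text: str) -> int:
    """Exact-match fingerprint: any byte difference gives a different value."""
    return zlib.crc32(text.encode("utf-8"))


def shingles(text: str, k: int = 3) -> set:
    """Set of overlapping k-word sequences from lowercased text."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    if len(words) < k:
        return {tuple(words)}
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: set, b: set) -> float:
    """Share of shingles the two sets have in common (0.0 to 1.0)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


# Hypothetical documents: doc_a and doc_b differ only in a page count.
doc_a = "Quarterly sales report for the northeast region. Page 1 of 3."
doc_b = "Quarterly sales report for the northeast region. Page 1 of 4."
doc_c = "Installation guide for the new search appliance."

same_checksum = checksum(doc_a) == checksum(doc_b)
sim_ab = jaccard(shingles(doc_a), shingles(doc_b))
sim_ac = jaccard(shingles(doc_a), shingles(doc_c))

print("checksums match:", same_checksum)
print("similarity a/b:", round(sim_ab, 2))
print("similarity a/c:", round(sim_ac, 2))
```

In this sketch the two report pages get different checksums even though they share most of their shingles, while the unrelated document shares none. A real system would have to pick a similarity threshold and a shingle size to suit its own content, and would likely need something more scalable than comparing every pair of documents.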
