The term weighting and ranking function is at the core of any information retrieval system.
The vector space model with the cosine similarity is maybe the best known and most widely used, but there are plenty of alternatives.
We're looking at two here, the BM25 function based around a probabilistic model, and a function based around language modeling.
Just to get something to work with we'll we'll build a quick index on some strings which will stand in...
Ian Barber's Blog: Alternative Term Weighting
In this new post from Ian Barber he takes a look at something that can come in very handy when you need something a bit more complex than the standard search results - term weighting. The...
PageRank In PHP - Ian Barber
Google was a better search engine than it's predecessors for a number of reasons, but probably the most well known one is PageRank, the algorithm for measuring the importance of a page based...
Linksys WRT610N Simultaneous Dual-N Band Wireless...
The sleek Simultaneous Dual-N Band WRT610N Router sets a new standard for design, expanded bandwidth, and robust performance. It's the ideal router for all your current and future digital...
1 GB SanDisk MicroSD TransFlash Memory Card (Bulk)
The MicroSD card is based on TransFlash, which was developed by SanDisk in cooperation with Motorola and is the worlds smalles flash memory card form factor.
Post new comment