Sameer Borate's Blog: Porter Stemming algorithm for search

Courtesy PHPDeveloper.org  Wed, 04/29/2009 - 06:57

In a recent post to his blog Sameer looks at implementing a Stemming algorithm to search an array of words.

It uses this library (as written by Richard Heyes).

A stemming algorithm lets you reduce each English input word to its basic root or stem (e.g.

'walking' to 'walk') so that variations on a word ('walks', 'walked', 'walking') are considered equivalent when searching.

This stems can than be used in a search query rather than the...


 

More related items

Sameer Borate's Blog: Checking your site for...
Sameer Borate, in the wake of having security issues with his site, has posted a hint you could use to help detect when something has changed in important files in your application by...

Sameer Borate's Blog: Read the version of a PDF in PHP
Sameer Borate has a quick post to his blog today with some code that lets you read the version of a PDF document programmatically without a dependency on a PDF extension or library being...

Sameer Borate's Blog: Generating a color palette from...
Sameer Borate has posted a simple method for creating a color palette from a given image using just PHP and GD. In the previous post we saw how we can generate a screenshot from a url. In...


 

Post new comment

The content of this field is kept private and will not be shown publicly.
computer-internet.marc8.com