Monday, January 12, 2015

Jazzy-based Spell Checker


Jazzy is a set of APIs that lets you add spell-checking functionality to Java applications easily.

For a misspelled word, Jazzy can provide a list of suggested corrections to choose from.
The remaining task is to judge which suggestion is best. We have two schemes for that: 1. Edit distance; 2. Input frequency or context. Context information is based on the Jazzy dictionary. For a specific search engine, however, a general-purpose dictionary is not enough. For example, Amazon's search engine would need merchant brand names added to its dictionary.
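To make the first scheme concrete, here is a minimal sketch of plain Levenshtein edit distance in Java. It is only an illustration of the idea, not Jazzy's internal implementation, and the class and method names (EditDistanceSketch, levenshtein) are made up for this post.

```java
final class EditDistanceSketch {
    private EditDistanceSketch() {}

    /**
     * Levenshtein distance: the minimum number of single-character
     * insertions, deletions, and substitutions that turn a into b.
     */
    static int levenshtein(String a, String b) {
        // Only two rows of the dynamic-programming table are kept in memory.
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // turning "" into b[0..j) takes j insertions
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // turning a[0..i) into "" takes i deletions
            for (int j = 1; j <= b.length(); j++) {
                int substitution = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(
                        Math.min(curr[j - 1] + 1, prev[j] + 1),
                        prev[j - 1] + substitution);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    public static void main(String[] args) {
        System.out.println(levenshtein("amazn", "amazon"));   // 1 (one insertion)
        System.out.println(levenshtein("kitten", "sitting")); // 3
    }
}
```

A smaller distance means the candidate is closer to what the user typed, so on this scheme alone "amazon" would be the preferred correction for "amazn".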

There are still two problems here:
  1. How do we customize our own dictionary? How do we confirm the validity of the data source for building the dictionary?
  2. How do we balance the results of edit distance and context?
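One possible way to approach the second question is a weighted score that mixes the two signals. The sketch below is only an assumption about how that could look and does not use Jazzy's API: the class SuggestionRankerSketch, the weight alpha, the scoring formula, and the word counts are all hypothetical. The custom dictionary is modeled as a simple word-to-count map, which is also where brand names could be added.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class SuggestionRankerSketch {
    // Hypothetical custom dictionary: word -> how often users searched for it.
    private final Map<String, Integer> frequency = new HashMap<>();
    // Weight of edit distance in [0, 1]; the rest goes to frequency.
    private final double alpha;

    SuggestionRankerSketch(double alpha) {
        this.alpha = alpha;
    }

    void addWord(String word, int count) {
        frequency.merge(word, count, Integer::sum);
    }

    /** Higher is better: alpha * similarity + (1 - alpha) * popularity. */
    double score(String input, String candidate) {
        double similarity = 1.0 / (1 + levenshtein(input, candidate));
        int maxCount = Math.max(1,
                frequency.values().stream().mapToInt(Integer::intValue).max().orElse(1));
        // Log scaling keeps very frequent words from dominating completely.
        double popularity =
                Math.log1p(frequency.getOrDefault(candidate, 0)) / Math.log1p(maxCount);
        return alpha * similarity + (1 - alpha) * popularity;
    }

    /** All dictionary words, best candidate first. */
    List<String> rank(String input) {
        List<String> candidates = new ArrayList<>(frequency.keySet());
        candidates.sort(
                Comparator.comparingDouble((String c) -> score(input, c)).reversed());
        return candidates;
    }

    // Plain Levenshtein distance (insertions, deletions, substitutions).
    private static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) prev[j] = j;
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1), prev[j - 1] + cost);
            }
            int[] tmp = prev; prev = curr; curr = tmp;
        }
        return prev[b.length()];
    }

    public static void main(String[] args) {
        for (double alpha : new double[] {0.7, 0.1}) {
            SuggestionRankerSketch ranker = new SuggestionRankerSketch(alpha);
            ranker.addWord("amazing", 1000); // made-up counts
            ranker.addWord("kindle", 800);   // a brand name added by hand
            ranker.addWord("amazon", 500);
            // alpha = 0.7 ranks "amazon" first; alpha = 0.1 ranks "amazing" first.
            System.out.println(alpha + " -> " + ranker.rank("amazn"));
        }
    }
}
```

With a high alpha the closest spelling wins, and with a low alpha the most frequent word wins, so the balance comes down to choosing alpha, most likely by testing against real query logs.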
