PHONETIC MATCHING TOOLKIT WITH STATE-OF-THE-ART META-SOUNDEX ALGORITHM (ENGLISH AND SPANISH)

dc.contributor.advisorVarol, Cihan
dc.contributor.committeeMemberKarpoor, Shashidhar
dc.contributor.committeeMemberZhou, Bing
dc.creatorKoneru, Keerthi
dc.creator.orcid0000-0002-0112-6858
dc.date.accessioned2016-10-27T20:18:29Z
dc.date.available2016-10-27T20:18:29Z
dc.date.created2016-12
dc.date.issued2016-10-27
dc.date.submittedDecember 2016
dc.date.updated2016-10-27T20:18:30Z
dc.description.abstractResearchers confront major problems while searching for various kinds of data in large imprecise databases, as they are not spelled correctly or in the way they were expected to be spelled. As a result, they cannot find the word they sought. Over the years of struggle, pronunciation of words was considered to be one of the practices to solve the problem effectively. The technique used to acquire words based on sounds is known as “Phonetic Matching”. Soundex was the first algorithm developed and other algorithms like Metaphone, Caverphone, DMetaphone, Phonex etc., are also used for information retrieval in different environments. This project mainly deals with the analysis and implementation of newly proposed Meta-Soundex algorithm for English and Spanish languages which retrieves suggestions for the misspelled words. The newly developed Meta-Soundex algorithm addresses the limitations of Metaphone and Soundex algorithms. Specifically, the new algorithm has more accuracy compared to both Soundex and Metaphone algorithm. The new algorithm also has higher precision compared to Soundex, thus reducing the noise in the considered arena. A phonetic matching toolkit is also developed enclosing the different phoneticmatching algorithms along with the state-of-the-art Meta-Soundex algorithm for both Spanish and English languages.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/20.500.11875/46
dc.language.isoen
dc.subjectCaverphone
dc.subjectDMetaphone
dc.subjectInformation retrieval
dc.subjectMisspelled words
dc.subjectMetaphone
dc.subjectNYSIIS
dc.subjectPhonetic matching
dc.subjectSoundex
dc.titlePHONETIC MATCHING TOOLKIT WITH STATE-OF-THE-ART META-SOUNDEX ALGORITHM (ENGLISH AND SPANISH)
dc.typeThesis
dc.type.materialtext
thesis.degree.departmentComputer Science
thesis.degree.grantorSam Houston State University
thesis.degree.levelMasters
thesis.degree.nameMaster of Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
KONERU-THESIS-2016.pdf
Size:
787.77 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.85 KB
Format:
Plain Text
Description: