buchspektrum Internet-Buchhandlung

Neuerscheinungen 2012

Stand: 2020-01-07
Schnellsuche
ISBN/Stichwort/Autor
Herderstraße 10
10625 Berlin
Tel.: 030 315 714 16
Fax 030 315 714 14
info@buchspektrum.de

André Reckhemke

The Construction of String Similarity Predicates


Application-specific and Index-supported String Similarity Predicates - Fundamentals and Design of Similarity Queries
Aufl. 2012. 88 S. 220 mm
Verlag/Jahr: AV AKADEMIKERVERLAG 2012
ISBN: 3-639-43969-4 (3639439694) / 3-8364-6638-4 (3836466384)
Neue ISBN: 978-3-639-43969-4 (9783639439694) / 978-3-8364-6638-7 (9783836466387)

Preis und Lieferzeit: Bitte klicken


Revision with unchanged content. In times of worldwide globalisation the knowledge of useful information is becoming increasingly important. Parallel to genetic engineering, the expansion of the Internet produces similar volumes of data - frequently saved in text files. One of the most relevant intersection is the usage of approximate string matching in large text data. The Internet has to face the challenge of not only to concentrating on request times but also finding more context-relevant information. Associated with this aim, further steps in this field have to take into consideration that documents can include mistakes in orthography or words being abbreviated. Other areas of information are substituted with their acronyms or are less important and can be ignored. All of these tasks are united in the fields of computational linguistics. This master thesis shows stepwise the tokenising of real text, the homogenisation of words, and the storage in a specific index structure for subsequent approximate string matching - in consideration of secondary storage. A prototype programmed in Java completes the current work.
André Reckhemke, geb. am 30.04.1973 in Braunschweig, Ausbildung: Dipl. Informatiker (FH) und Master of Science.