A Self-Learning Context-Aware Lemmatizer for German
Abstract
Accurate lemmatization of German nouns requires a lexicon. Comprehensive lexicons, however, are expensive to build and maintain. We present a self-learning lemmatizer that automatically builds a full-form lexicon by processing German documents.
Durm German Lemmatizer v1.0 Released
Submitted by rene on Thu, 2007-05-31 08:59.
I'm happy to announce the first public release of our free/open source Durm Lemmatization System for the German language.
The release comes with source code, binaries, documentation, resources (German lexicon, Case Tagger probabilities), and manually annotated texts from the German Wikipedia for evaluation.
