TüBa-D/Z lemmatizer available in WebLicht
The "TüBa-D/Z lemmatizer", a syntax-based lemmatizer for German developed in the context of the TüBa-D/Z treebank, is now available as a component via the WebLicht interface to non-commercial users from the entire CLARIN community. By integrating morphological and syntactic information, the tool produces lemmas and morphological tags that are often richer or more precise than the output of a surface-based model. Using frequency heuristics, the tool also automatically classifies prefix verbs as separable or inseparable and heuristically completes truncated words. In the WebLicht version of the TüBa-D/Z lemmatizer, the syntactic information is provided by the Berkeley parser and an internally developed grammar model.

1st CLARIN-D Doktorandentage - Corpora
Venue: Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart
Location: Universität Stuttgart, Institut für Maschinelle Sprachverarbeitung (IMS), Forschungszentrum Informatik (FZI), Pfaffenwaldring 5b, 70569 Stuttgart, Seminarraum 1, V 5.01 (on the ground floor)
Date: 25th-26th March 2013
Target group: PhD students, young researchers.
Information about the event at: http://fr46.uni-saarland.de/lsteich/ClarindDS2013
Workshop: Exploring data from language documentation
Dates: 10.05.2013 - 11.05.2013
Location: ZAS Berlin
Languages: English, German
Webpage: http://www.zas.gwz-berlin.de/workshop_edla.html
Organizers: Felix Rau, Kilu von Prince
CLARIN Standards Guide
The CLARIN Standards Guide provides information on standards, guidelines and standard-promulgating organizations that deal with language technology resources such as text corpora, lexica, and language databases. It was created at IDS Mannheim. A thorough description is available on this website, or you can visit the Standards Guide's own site.










