Accessing Suitable Data in CLARIN-D

To make use of a resource, or a part thereof, you need to first find it. The data that can be searched for comes in diverse forms and the CLARIN centres offer the possibility to retrieve data in three ways. First, there is the CLARIN Virtual Language Observatory (VLO), which helps identifying a resource in its entirety. Second, there is the Federated Content Search (FCS) which offers a string-based search access to resources, and also retrieves the citation information for the resource. Third, there are a number of resource-specific portals with purpose-built, easy-to-use access methods for browsing the resources' structures and content. Below you can find direct access to VLO, FCS and several other resources within CLARIN.

Search engine for language resources within CLARIN


Finding strings in accessible resources


Query tools for specific resources

GermaNet is a lexical semantic network for German, maintained by the University of Tübingen. Part of it is integrated in EuroWordNet, which integrates various European languages with the help of an interlingual index.

Learn more »

The German Reference Corpus DeReKo is an electronic archive of German text corpora of spoken language, existing since 1964. It is maintained and build by the Institute of German Language (IDS) in Mannheim.

Learn more »

The German Text Archive (DTA) is a digital archive of multidisciplinary texts. Funded by the German Research Foundation, it is hosted at the Berlin-Brandenburg Academy of Sciences since July 2007.

Learn more »

The German Vocabulary database contains 35 million sentences with 500 million running words, with more than 9 million different words and groups of words. The data are selected from publically accessible sources and automatically extracted.

Learn more »

The dictionary of contemporary German, the "Wörterbuch der deutschen Gegenwartssprache", is linked to a text corpus covering material from 1900-2000.

Learn more »