|CWB Homepage||Online CQP Demos|
Online CQP Demos
BUNDESTAG corpus contains Hansards of the German Parliament (Bundestag)
from the parliamentary term running from 1994 to 1997.
This corpus amounts to a total of 5.7 million running words. It has been
annotated with a rich variety of linguistic information. The token-level annotations comprise
part-of-speech tags (TreeTagger),
lemmata, and morpho-syntactic information (both IMSLex).
In addition, a partial phrase-structure analysis was performed with the
developed by Hannah Kermes.