BOOKQUE

No image available
A Hybrid Architecture for Robust Parsing of German
Erhard Hinrichs , Sandra Kübler , Frank H. Müller , Tylman Ule
· 2002
No image available
Language Resources, Taxonomies and Metadata
Andreas Witt , Lothar Lemnitzer , Erhard Hinrichs
· 2018
No image available
TüSBL: A Similarity-based Chunk Parser for Robust Syntactic Processing
Sandra Kübler , Erhard Hinrichs
· 2001
No image available
Avoiding Data Graveyards : from Heterogeneous Data Collected in Multiple Research Projects to Sustainable Linguistic Resources
Thomas Schmidt , Christian Chiarcos , Timm Lehmberg , Georg Rehm , Andreas Witt , Erhard Hinrichs
· 2014
No image available
From Chunks to Function-argument Structure: A Similarity-based Approach
Sandra Kübler , Erhard W. Hinrichs
· 2001
Chunk parsing has focused on the recognition of partial constituent structures at the level of individual chunks. Little attention has been paid to the question of how such partial analyses can be combined into larger structures for complete utterances. Such larger structures are not only desirable for a deeper syntactic analysis. They also constitute a necessary prerequisite for assigning function-argument structure. The present paper offers a similaritybased algorithm for assigning functional labels such as subject, object, head, complement, etc. to complete syntactic structures on the basis of prechunked input. The evaluation of the algorithm has concentrated on measuring the quality of functional labels. It was performed on a German and an English treebank using two different annotation schemes at the level of function argument structure. The results of 89.73% correct functional labels for German and 90.40%for English validate the general approach.
No image available
Connecting Resources: Which Issues Have to be Solved to Integrate CMC Corpora from Heterogeneous Sources and for Different Languages?
Michael Beißwenger , Carole Etienne , Darja Fišer , Holger Grumt Suárez , Laura Herzberg , Erhard Hinrichs , Tobias Horsmann , Natali Karlova-Bourbonus , Lothar Lemnitzer , Julien Longhi , Harald Lüngen , Lydia-Mai Ho-Dac , Christophe Parisse , Céline Poudat , Thomas Schmidt , Angelika Storrer , Torsten Zesch
· 2017
No image available
Recent Developments in Linguistic Annotations of the TüBa-D/Z Treebank
Erhard Hinrichs , Sandra Kübler , Karin Naumann , Heike Telljohann , Julia Trushkina
· 2004
The purpose of this paper is to describe recent developments in the morphological, syntactic, and semantic annotation of the TüBa-D/Z treebank of German. The TüBa-D/Z annotation scheme is derived from the Verbmobil treebank of spoken German [4, 10], but has been extended along various dimensions to accommodate the characteristics of written texts. TüBa-D/Z uses as its data source the "die tageszeitung" (taz) newspaper corpus. The Verbmobil treebank annotation scheme distinguishes four levels of syntactic constituency: the lexical level, the phrasal level, the level of topological fields, and the clausal level. The primary ordering principle of a clause is the inventory of topological fields, which characterize the word order regularities among different clause types of German, and which are widely accepted among descriptive linguists of German [3, 6]. The TüBa-D/Z annotation relies on a context-free backbone (i.e. proper trees without crossing branches) of phrase structure combined with edge labels that specify the grammatical function of the phrase in question. The syntactic annotation scheme of the TüBa-D/Z is described in more detail in [12, 11]. TüBa-D/Z currently comprises approximately 15 000 sentences, with approximately 7 000 sentences being in the correction phase. The latter will be released along with an updated version of the existing treebank before the end of this year. The treebank is available in an XML format, in the NEGRA export format [1] and in the Penn treebank bracketing format. The XML format contains all types of information as described above, the NEGRA export format contains all sentenceinternal information while the Penn treebank format includes only those layers of information that can be expressed as pure tree structures. ...
No image available
The Tüba-D/Z Treebank: Annotating German with a Context-free Backbone
Heike Telljohann , Erhard Hinrichs , Sandra Kübler
· 2004
No image available
A Compositional Semantics for Aktionsarten and NP Reference in English
Erhard Hinrichs
· 2002
No image available
Linguistically Annotated Corpora: Quality Assurance, Reusability and Sustainability
Heike Zinsmeister , Andreas Witt , Sandra Kübler , Erhard Hinrichs
· 2015

A Hybrid Architecture for Robust Parsing of German

Language Resources, Taxonomies and Metadata

TüSBL: A Similarity-based Chunk Parser for Robust Syntactic Processing

Avoiding Data Graveyards : from Heterogeneous Data Collected in Multiple Research Projects to Sustainable Linguistic Resources

From Chunks to Function-argument Structure: A Similarity-based Approach

Connecting Resources: Which Issues Have to be Solved to Integrate CMC Corpora from Heterogeneous Sources and for Different Languages?

Recent Developments in Linguistic Annotations of the TüBa-D/Z Treebank

The Tüba-D/Z Treebank: Annotating German with a Context-free Backbone

A Compositional Semantics for Aktionsarten and NP Reference in English

Linguistically Annotated Corpora: Quality Assurance, Reusability and Sustainability