treebank

treebank
noun /ˈtɹiː.bæŋk/
A database of sentences which are annotated with syntactic information, often in the form of a tree.

If one wants to use a treebank for linguistic investigation,


Wikipedia foundation.

Игры ⚽ Нужен реферат?

Look at other dictionaries:

  • Treebank — A treebank or parsed corpus is a text corpus in which each sentence has been parsed, i.e. annotated with syntactic structure. Syntactic structure is commonly represented as a tree structure, hence the name Treebank. The term Parsed Corpus is… …   Wikipedia

  • Baumbank — Eine Baumbank (engl. Treebank), auch geparstes Korpus, ist ein Textkorpus, in dem jeder Satz geparst, also mit syntaktischer Struktur annotiert wurde. Der Begriff Baumbank bezieht sich darauf, dass die syntaktische Struktur gewöhnlich als eine… …   Deutsch Wikipedia

  • Dependency grammar — Hybrid constituency/dependency tree from the Quranic Arabic Corpus Dependency grammar (DG) is a class of syntactic theories developed by Lucien Tesnière. It is distinct from phrase structure grammars, as it lacks phrasal nodes. Structure is… …   Wikipedia

  • Stochastic context-free grammar — A stochastic context free grammar (SCFG; also probabilistic context free grammar, PCFG) is a context free grammar in which each production is augmented with a probability. The probability of a derivation (parse) is then the product of the… …   Wikipedia

  • Natural language processing — (NLP) is a field of computer science and linguistics concerned with the interactions between computers and human (natural) languages; it began as a branch of artificial intelligence.[1] In theory, natural language processing is a very attractive… …   Wikipedia

  • Corpus linguistics — is the study of language as expressed in samples (corpora) or real world text. This method represents a digestive approach to deriving a set of abstract rules by which a natural language is governed or else relates to another language. Originally …   Wikipedia

  • Text corpus — In linguistics, a corpus (plural corpora ) or text corpus is a large and structured set of texts (now usually electronically stored and processed). They are used to do statistical analysis and hypothesis testing, checking occurrences or… …   Wikipedia

  • Parsing — In computer science and linguistics, parsing, or, more formally, syntactic analysis, is the process of analyzing a sequence of tokens to determine their grammatical structure with respect to a given (more or less) formal grammar.Parsing is also… …   Wikipedia

  • Tree-adjoining grammar — (TAG) is a grammar formalism defined by Aravind Joshi. Tree adjoining grammars are somewhat similar to context free grammars, but the elementary unit of rewriting is the tree rather than the symbol. Whereas context free grammars have rules for… …   Wikipedia

  • PDT — may refer to: Computers: PHP Development Tools, an IDE plugin for the Eclipse platform Portable data terminal, an electronic device that is used to enter or retrieve data via wireless transmission Medicine: Patient delivered therapy Photodynamic… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”