Corpus Gesproken Nederlands

(130594 corpus graphs)

1. General information

Name:

Corpus Gesproken Nederlands

ID:

CGN

Format:

NeGra format, version 3

Description:

Tiger index files for CGN version 2.0. January 2006, NTU/INL/TST-centrale

2. Corpus details

Features (T):

word, pos, morph

Features (NT):

cat

Labelled edges:

yes

Crossing edges:

yes

Secondary edges:

yes

3. Statistical information

Number of corpus graphs:

130594

Number of tokens:

1142420

Average number of tokens per corpus graph:

8.7

Number of inner nodes:

625104

Number of edges:

1766833
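
The average of 8.7 appears to be the token count divided by the number of corpus graphs. A minimal sketch of that check follows, assuming this is how the figure was derived; the variable names are illustrative and not part of the corpus tooling.

```python
# Figures copied from the statistical information above.
corpus_graphs = 130594
tokens = 1142420

# Assumption: the reported average is tokens divided by corpus graphs,
# rounded to one decimal place.
average_tokens = tokens / corpus_graphs

print(f"{average_tokens:.1f}")  # prints 8.7
```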

4. Feature documentation

Feature values: cat

Feature values: pos

Feature values: morph

Edge labels

Secondary edge labels