Noun Phrases The annotated data and guidelines described in the paper: Adding Noun Phrase Structure to the Penn Treebank are now available. The data script requires the Penn Treebank 3 corpus, and Python. If you have any problems, comments or corrections, then please email me: dvadas1 [at] cs.usyd.edu.au
Version 0.9 This initial version of the data was experimented with in the ACL07 paper.
Version 1.0 I've been using this version of the NP data since the ALTA07 paper. It's treatment of conjunctions, brackets and speech marks varies from the previous version. Read the guidelines for a better description.
CCG Data The data from the ACL08 paper, where I converted the NP structure to CCGbank, will be up soon. Email me (dvadas1 [at] cs.usyd.edu.au) if you'd like to be notified when it is.
|