Item Infomation
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.creator | Berzak, Yevgeni | - |
dc.creator | Kenney, Jessica | - |
dc.creator | Spadine, Carolyn | - |
dc.creator | Wang, Jing Xian | - |
dc.creator | Lam, Lucia | - |
dc.creator | Mori, Keiko Sophie | - |
dc.creator | Garza, Sebastian | - |
dc.creator | Katz, Boris | - |
dc.date | 2016-06-30T20:31:54Z | - |
dc.date | 2016-06-30T20:31:54Z | - |
dc.date | 2016-08-01 | - |
dc.date.accessioned | 2023-04-13T10:03:45Z | - |
dc.date.available | 2023-04-13T10:03:45Z | - |
dc.identifier | http://hdl.handle.net/1721.1/103401 | - |
dc.identifier | arXiv:1605.04278v2 [cs.CL] | - |
dc.identifier.uri | http://lib.yhn.edu.vn/handle/YHN/720 | - |
dc.description | We introduce the Treebank of Learner English (TLE), the first publicly available syntactic treebank for English as a Second Language (ESL). The TLE provides manually annotated POS tags and Universal Dependency (UD) trees for 5,124 sentences from the Cambridge First Certificate in English (FCE) corpus. The UD annotations are tied to a pre-existing error annotation of the FCE, whereby full syntactic analyses are provided for both the original and error corrected versions of each sentence. Further on, we delineate ESL annotation guidelines that allow for consistent syntactic treatment of ungrammatical English. Finally, we benchmark POS tagging and dependency parsing performance on the TLE dataset and measure the effect of grammatical errors on parsing accuracy. We envision the treebank to support a wide range of linguistic and computational research o n second language acquisition as well as automatic processing of ungrammatical language. | - |
dc.description | This work was supported by the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF – 1231216. | - |
dc.format | application/pdf | - |
dc.language | en_US | - |
dc.publisher | Center for Brains, Minds and Machines (CBMM), arXiv | - |
dc.relation | CBMM Memo Series;052 | - |
dc.rights | Attribution-NonCommercial-ShareAlike 3.0 United States | - |
dc.rights | http://creativecommons.org/licenses/by-nc-sa/3.0/us/ | - |
dc.subject | Treebank of Learner English (TLE) | - |
dc.subject | English as Second Language (ESL) | - |
dc.subject | Universal Dependency (UD) | - |
dc.subject | Cambridge First Certificate in English (FCE) | - |
dc.title | Universal Dependencies for Learner English | - |
dc.type | Technical Report | - |
dc.type | Working Paper | - |
dc.type | Other | - |
Appears in Collections | Tài liệu ngoại văn |
Files in This Item: