publication . Conference object . Article . Preprint . 2017

Complex Word Identification: Challenges in Data Annotation and System Performance

Zampieri, Marcos; Malmasi, Shervin; Paetzold, Gustavo Henrique; Specia, Lucia
Open Access
  • Published: 01 Dec 2017
Abstract
Comment: Proceedings of the 4th Workshop on NLP Techniques for Educational Applications (NLPTEA 2017)
Subjects
free text keywords: Computer Science - Computation and Language
Funded by
EC | SIMPATICO
Project
SIMPATICO
SIMplifying the interaction with Public Administration Through Information technology for Citizens and cOmpanies
  • Funder: European Commission (EC)
  • Project Code: 692819
  • Funding stream: H2020 | RIA
Validated by funder