LREC 2020 Marseille : Twelfth International Conference on Language Resources and Evaluation : May 11-16, 2020, Palais du Pharo, Marseille, France : conference proceedings (English)
Open access
- International Conference on Language Resources and Evaluation: http://d-nb.info/gnd/1206105291
- European Language Resources Association: http://d-nb.info/gnd/3038785-1
- Calzolari, Nicoletta: http://d-nb.info/gnd/1179733398
2020
Conference proceedings / Electronic resource
Title: LREC 2020 Marseille : Twelfth International Conference on Language Resources and Evaluation : May 11-16, 2020, Palais du Pharo, Marseille, France : conference proceedings
Additional titles: Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020)
LREC 2020, Twelfth International Conference on Language Resources and Evaluation
LREC 2020 conference proceedings
Contributors: Calzolari, Nicoletta (editor) / International Conference on Language Resources and Evaluation (author) / European Language Resources Association (issuing body, organizer)
Conference: LREC ; 12 ; 2020 ; Marseille
International Conference on Language Resources and Evaluation ; 12 ; 2020 ; Marseille
Publisher: The European Language Resources Association (ELRA)
Place of publication: Paris
Publication date: 2020
Format / Extent: 1 online resource (lxxii, 7252 pages)
Notes: Illustrations
Includes bibliographical references
Long-term archiving by Technische Informationsbibliothek (TIB) / Leibniz-Informationszentrum Technik und Naturwissenschaften und Universitätsbibliothek
Media type: Conference proceedings
Format: Electronic resource
Language: English
Classification (Basisklassifikation): 17.46 / 18.00 / 54.75
The tables of contents are generated automatically and are based on the individual records of the contained contributions that are available in the index of the TIB portal. The tables of contents shown may therefore be incomplete or contain gaps.
- 1
Neural Mention Detection
Yu, Juntao / Bohnet, Bernd / Poesio, Massimo et al. | 2020
- 11
A Cluster Ranking Model for Full Anaphora Resolution
Yu, Juntao / Uma, Alexandra / Poesio, Massimo et al. | 2020
- 21
Mandarinograd: A Chinese Collection of Winograd Schemas
Bernard, Timothée / Han, Ting et al. | 2020
- 27
On the Influence of Coreference Resolution on Word Embeddings in Lexical-semantic Evaluation Tasks
Henlein, Alexander / Mehler, Alexander et al. | 2020
- 34
NoEl: An Annotated Corpus for Noun Ellipsis in English
Khullar, Payal / Majmundar, Kushal / Shrivastava, Manish et al. | 2020
- 44
An Annotated Dataset of Coreference in English Literature
Bamman, David / Lewke, Olivia / Mansoor, Anya et al. | 2020
- 55
GerDraCor-Coref: A Coreference Corpus for Dramatic Texts in German
Pagel, Janis / Reiter, Nils et al. | 2020
- 65
A Study on Entity Resolution for Email Conversations
Dakle, Parag Pravin / Desai, Takshak / Moldovan, Dan et al. | 2020
- 74
Model-based Annotation of Coreference
Aralikatte, Rahul / Søgaard, Anders et al. | 2020
- 80
French Coreference for Spoken and Written Language
Wilkens, Rodrigo / Oberle, Bruno / Landragin, Frédéric / Todirascu, Amalia et al. | 2020
- 90
Cross-lingual Zero Pronoun Resolution
Aloraini, Abdulrahman / Poesio, Massimo et al. | 2020
- 99
Exploiting Cross-Lingual Hints to Discover Event Pronouns
Loáiciga, Sharid / Hardmeier, Christian / Sayeed, Asad et al. | 2020
- 104
MuDoCo: Corpus for Multidomain Coreference Resolution and Referring Expression Generation
Martin, Scott / Poddar, Shivani / Upasani, Kartikeya et al. | 2020
- 112
Affection Driven Neural Networks for Sentiment Analysis
Xiang, Rong / Long, Yunfei / Wan, Mingyu / Gu, Jinghang / Lu, Qin / Huang, Chu-Ren et al. | 2020
- 120
The Alice Datasets: fMRI & EEG Observations of Natural Language Comprehension
Bhattasali, Shohini / Brennan, Jonathan / Luh, Wen-Ming / Franzluebbers, Berta / Hale, John et al. | 2020
- 126
Modelling Narrative Elements in a Short Story: A Study on Annotation Schemes and Guidelines
Mikhalkova, Elena / Protasov, Timofei / Sokolova, Polina / Bashmakova, Anastasiia / Drozdova, Anastasiia et al. | 2020
- 133
Cortical Speech Databases For Deciphering the Articulatory Code
Höge, Harald et al. | 2020
- 138
ZuCo 2.0: A Dataset of Physiological Recordings During Natural Reading and Annotation
Hollenstein, Nora / Troendle, Marius / Zhang, Ce / Langer, Nicolas et al. | 2020
- 147
Linguistic, Kinematic and Gaze Information in Task Descriptions: The LKG-Corpus
Reinboth, Tim / Gross, Stephanie / Bishop, Laura / Krenn, Brigitte et al. | 2020
- 156
The ACQDIV Corpus Database and Aggregation Pipeline
Jancso, Anna / Moran, Steven / Stoll, Sabine et al. | 2020
- 166
Providing Semantic Knowledge to a Set of Pictograms for People with Disabilities: a Set of Links between WordNet and Arasaac: Arasaac-WN
Schwab, Didier / Trial, Pauline / Vaschalde, Céline / Vial, Loïc / Esperanca-Rodier, Emmanuelle / Lecouteux, Benjamin et al. | 2020
- 172
Orthographic Codes and the Neighborhood Effect: Lessons from Information Theory
Tulkens, Stéphan / Sandra, Dominiek / Daelemans, Walter et al. | 2020
- 182
Understanding the Dynamics of Second Language Writing through Keystroke Logging and Complexity Contours
Kerz, Elma / Pruneri, Fabio / Wiechmann, Daniel / Qiao, Yu / Ströbel, Marcus et al. | 2020
- 189
Design of BCCWJ-EEG: Balanced Corpus with Human Electroencephalography
Oseki, Yohei / Asahara, Masayuki et al. | 2020
- 195
Using the RUPEX Multichannel Corpus in a Pilot fMRI Study on Speech Disfluencies
Smirnova, Katerina / Korotaev, Nikolay / Panikratova, Yana / Lebedeva, Irina / Pechenkova, Ekaterina / Fedorova, Olga et al. | 2020
- 204
Construction of an Evaluation Corpus for Grammatical Error Correction for Learners of Japanese as a Second Language
Koyama, Aomi / Kiyuna, Tomoshige / Kobayashi, Kenji / Arai, Mio / Komachi, Mamoru et al. | 2020
- 212
Effective Crowdsourcing of Multiple Tasks for Comprehensive Knowledge Extraction
Nam, Sangha / Lee, Minho / Kim, Donghwan / Han, Kijong / Kim, Kuntae / Yoon, Sooji / Kim, Eun-kyung / Choi, Key-Sun et al. | 2020
- 220
Developing a Corpus of Indirect Speech Act Schemas
Roque, Antonio / Tsuetaki, Alexander / Sarathy, Vasanth / Scheutz, Matthias et al. | 2020
- 229
Quality Estimation for Partially Subjective Classification Tasks via Crowdsourcing
Sato, Yoshinao / Miyazawa, Kouki et al. | 2020
- 236
Crowdsourcing in the Development of a Multilingual FrameNet: A Case Study of Korean FrameNet
Hahm, Younggyun / Noh, Youngbin / Han, Ji Yoon / Oh, Tae Hwan / Choe, Hyonsu / Kim, Hansaem / Choi, Key-Sun et al. | 2020
- 245
Towards a Reliable and Robust Methodology for Crowd-Based Subjective Quality Assessment of Query-Based Extractive Text Summarization
Iskender, Neslihan / Polzehl, Tim / Möller, Sebastian et al. | 2020
- 254
A Seed Corpus of Hindu Temples in India
Radhakrishnan, Priya et al. | 2020
- 259
Do You Believe It Happened? Assessing Chinese Readers' Veridicality Judgments
Chang, Yu-Yun / Hsieh, Shu-Kai et al. | 2020
- 268
Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning
Nicolas, Lionel / Lyding, Verena / Borg, Claudia / Forascu, Corina / Fort, Karën / Zdravkova, Katerina / Kosem, Iztok / Čibej, Jaka / Arhar Holdt, Špela / Millour, Alice et al. | 2020
- 279
MAGPIE: A Large Corpus of Potentially Idiomatic Expressions
Haagsma, Hessel / Bos, Johan / Nissim, Malvina et al. | 2020
- 288
CRWIZ: A Framework for Crowdsourcing Real-Time Wizard-of-Oz Dialogues
Chiyah Garcia, Francisco Javier / Lopes, José / Liu, Xingkun / Hastie, Helen et al. | 2020
- 298
Effort Estimation in Named Entity Tagging Tasks
Gomes, Inês / Correia, Rui / Ribeiro, Jorge / Freitas, João et al. | 2020
- 307
Using Crowdsourced Exercises for Vocabulary Training to Expand ConceptNet
Rodosthenous, Christos / Lyding, Verena / Sangati, Federico / König, Alexander / ul Hassan, Umair / Nicolas, Lionel / Horbacauskiene, Jolita / Katinskaia, Anisia / Aparaschivei, Lavinia et al. | 2020
- 317
Predicting Multidimensional Subjective Ratings of Children’s Readings from the Speech Signals for the Automatic Assessment of Fluency
Bailly, Gérard / Godde, Erika / Piat-Marchand, Anne-Laure / Bosse, Marie-Line et al. | 2020
- 323
Constructing Multimodal Language Learner Texts Using LARA: Experiences with Nine Languages
Akhlaghi, Elham / Bédi, Branislav / Bektaş, Fatih / Berthelsen, Harald / Butterweck, Matthias / Chua, Cathy / Cucchiarini, Catia / Eryiğit, Gülşen / Gerlach, Johanna / Habibi, Hanieh et al. | 2020
- 332
A Dataset for Investigating the Impact of Feedback on Student Revision Outcome
Pilan, Ildiko / Lee, John / Yeung, Chak Yan / Webster, Jonathan et al. | 2020
- 340
Creating Corpora for Research in Feedback Comment Generation
Nagata, Ryo / Inui, Kentaro / Ishikawa, Shin'ichiro et al. | 2020
- 346
Using Multilingual Resources to Evaluate CEFRLex for Learner Applications
Graën, Johannes / Alfter, David / Schneider, Gerold et al. | 2020
- 356
Immersive Language Exploration with Object Recognition and Augmented Reality
Platte, Benny / Platte, Anett / Roschke, Christian / Thomanek, Rico / Rolletschke, Thony / Zimmer, Frank / Ritter, Marc et al. | 2020
- 363
A Process-oriented Dataset of Revisions during Writing
Conijn, Rianne / Dux Speltz, Emily / van Zaanen, Menno / Van Waes, Luuk / Chukharev-Hudilainen, Evgeny et al. | 2020
- 369
Automated Writing Support Using Deep Linguistic Parsers
Morgado da Costa, Luís / V P Winder, Roger / Li, Shu Yun / Lin Tzer Liang, Benedict Christopher / Mackinnon, Joseph / Bond, Francis et al. | 2020
- 378
TLT-school: a Corpus of Non Native Children Speech
Gretter, Roberto / Matassoni, Marco / Bannò, Stefano / Falavigna, Daniele et al. | 2020
- 386
Toward a Paradigm Shift in Collection of Learner Corpora
Katinskaia, Anisia / Ivanova, Sardana / Yangarber, Roman et al. | 2020
- 392
Quality Focused Approach to a Learner Corpus Development
Darģis, Roberts / Auziņa, Ilze / Levāne-Petrova, Kristīne / Kaija, Inga et al. | 2020
- 397
An Exploratory Study into Automated Précis Grading
De Clercq, Orphee / Van Hoecke, Senne et al. | 2020
- 405
Adjusting Image Attributes of Localized Regions with Low-level Dialogue
Lin, Tzu-Hsiang / Rudnicky, Alexander / Bui, Trung / Kim, Doo Soon / Oh, Jean et al. | 2020
- 413
Alignment Annotation for Clinic Visit Dialogue to Clinical Note Sentence Language Generation
Yim, Wen-wai / Yetisgen, Meliha / Huang, Jenny / Grossman, Micah et al. | 2020
- 422
MultiWOZ 2.1: A Consolidated Multi-Domain Dialogue Dataset with State Corrections and State Tracking Baselines
Eric, Mihail / Goel, Rahul / Paul, Shachi / Sethi, Abhishek / Agarwal, Sanchit / Gao, Shuyang / Kumar, Adarsh / Goyal, Anuj / Ku, Peter / Hakkani-Tur, Dilek et al. | 2020
- 429
A Comparison of Explicit and Implicit Proactive Dialogue Strategies for Conversational Recommendation
Kraus, Matthias / Fischbach, Fabian / Jansen, Pascal / Minker, Wolfgang et al. | 2020
- 436
Conversational Question Answering in Low Resource Scenarios: A Dataset and Case Study for Basque
Otegi, Arantxa / Agirre, Aitor / Campos, Jon Ander / Soroa, Aitor / Agirre, Eneko et al. | 2020
- 443
Construction and Analysis of a Multimodal Chat-talk Corpus for Dialog Systems Considering Interpersonal Closeness
Yamazaki, Yoshihiro / Chiba, Yuya / Nose, Takashi / Ito, Akinori et al. | 2020
- 449
BLISS: An Agent for Collecting Spoken Dialogue Data about Health and Well-being
van Waterschoot, Jelte / Hendrickx, Iris / Khan, Arif / Klabbers, Esther / de Korte, Marcel / Strik, Helmer / Cucchiarini, Catia / Theune, Mariët et al. | 2020
- 459
The JDDC Corpus: A Large-Scale Multi-Turn Chinese Dialogue Dataset for E-commerce Customer Service
Chen, Meng / Liu, Ruixue / Shen, Lei / Yuan, Shaozu / Zhou, Jingyan / Wu, Youzheng / He, Xiaodong / Zhou, Bowen et al. | 2020
- 467
"Cheese!": a Corpus of Face-to-face French Interactions. A Case Study for Analyzing Smiling and Conversational Humor
Priego-Valverde, Béatrice / Bigi, Brigitte / Amoyal, Mary et al. | 2020
- 476
The Margarita Dialogue Corpus: A Data Set for Time-Offset Interactions and Unstructured Dialogue Systems
Chierici, Alberto / Habash, Nizar / Bicec, Margarita et al. | 2020
- 485
How Users React to Proactive Voice Assistant Behavior While Driving
Schmidt, Maria / Minker, Wolfgang / Werner, Steffen et al. | 2020
- 491
Emotional Speech Corpus for Persuasive Dialogue System
Asai, Sara / Yoshino, Koichiro / Shinagawa, Seitaro / Sakti, Sakriani / Nakamura, Satoshi et al. | 2020
- 498
Multimodal Analysis of Cohesion in Multi-party Interactions
Bangalore Kantharaju, Reshmashree / Langlet, Caroline / Barange, Mukesh / Clavel, Chloé / Pelachaud, Catherine et al. | 2020
- 508
Treating Dialogue Quality Evaluation as an Anomaly Detection Problem
Nedelchev, Rostislav / Usbeck, Ricardo / Lehmann, Jens et al. | 2020
- 513
Evaluation of Argument Search Approaches in the Context of Argumentative Dialogue Systems
Rach, Niklas / Matsuda, Yuki / Daxenberger, Johannes / Ultes, Stefan / Yasumoto, Keiichi / Minker, Wolfgang et al. | 2020
- 523
PATE: A Corpus of Temporal Expressions for the In-car Voice Assistant Domain
Zarcone, Alessandra / Alam, Touhidul / Kolagar, Zahra et al. | 2020
- 531
Mapping the Dialog Act Annotations of the LEGO Corpus into ISO 24617-2 Communicative Functions
Ribeiro, Eugénio / Ribeiro, Ricardo / Martins de Matos, David et al. | 2020
- 540
Estimating User Communication Styles for Spoken Dialogue Systems
Miehle, Juliana / Feustel, Isabel / Hornauer, Julia / Minker, Wolfgang / Ultes, Stefan et al. | 2020
- 549
The ISO Standard for Dialogue Act Annotation, Second Edition
Bunt, Harry / Petukhova, Volha / Gilmartin, Emer / Pelachaud, Catherine / Fang, Alex / Keizer, Simon / Prévot, Laurent et al. | 2020
- 559
The AICO Multimodal Corpus - Data Collection and Preliminary Analyses
Jokinen, Kristiina et al. | 2020
- 565
A Corpus of Controlled Opinionated and Knowledgeable Movie Discussions for Training Neural Conversation Models
Galetzka, Fabian / Eneh, Chukwuemeka Uchenna / Schlangen, David et al. | 2020
- 574
A French Medical Conversations Corpus Annotated for a Virtual Patient Dialogue System
Laleye, Fréjus A. A. / de Chalendar, Gaël / Blanié, Antonia / Brouquet, Antoine / Behnamou, Dan et al. | 2020
- 581
Getting To Know You: User Attribute Extraction from Dialogues
Wu, Chien-Sheng / Madotto, Andrea / Lin, Zhaojiang / Xu, Peng / Fung, Pascale et al. | 2020
- 590
Augmenting Small Data to Classify Contextualized Dialogue Acts for Exploratory Visualization
Kumar, Abhinav / Di Eugenio, Barbara / Aurisano, Jillian / Johnson, Andrew et al. | 2020
- 600
RDG-Map: A Multimodal Corpus of Pedagogical Human-Agent Spoken Interactions.
Paetzel, Maike / Karkada, Deepthi / Manuvinakurike, Ramesh et al. | 2020
- 610
MPDD: A Multi-Party Dialogue Dataset for Analysis of Emotions and Interpersonal Relationships
Chen, Yi-Ting / Huang, Hen-Hsen / Chen, Hsin-Hsi et al. | 2020
- 615
"Alexa in the wild" - Collecting Unconstrained Conversations with a Modern Voice Assistant in a Public Environment
Siegert, Ingo et al. | 2020
- 620
EDA: Enriching Emotional Dialogue Acts using an Ensemble of Neural Annotators
Bothe, Chandrakant / Weber, Cornelius / Magg, Sven / Wermter, Stefan et al. | 2020
- 628
PACO: a Corpus to Analyze the Impact of Common Ground in Spontaneous Face-to-Face Interaction
Amoyal, Mary / Priego-Valverde, Béatrice / Rauzy, Stephane et al. | 2020
- 634
Dialogue Act Annotation in a Multimodal Corpus of First Encounter Dialogues
Navarretta, Costanza / Paggio, Patrizia et al. | 2020
- 644
A Conversation-Analytic Annotation of Turn-Taking Behavior in Japanese Multi-Party Conversation and its Preliminary Analysis
Enomoto, Mika / Den, Yasuharu / Ishimoto, Yuichi et al. | 2020
- 653
Understanding User Utterances in a Dialog System for Caregiving
Asao, Yoshihiko / Kloetzer, Julien / Mizuno, Junta / Saiki, Dai / Kadowaki, Kazuma / Torisawa, Kentaro et al. | 2020
- 662
Designing Multilingual Interactive Agents using Small Dialogue Corpora
Lin, Donghui / Otani, Masayuki / Okuno, Ryosuke / Ishida, Toru et al. | 2020
- 668
Multimodal Corpus of Bidirectional Conversation of Human-human and Human-robot Interaction during fMRI Scanning
Rauchbauer, Birgit / Hmamouche, Youssef / Bigi, Brigitte / Prévot, Laurent / Ochs, Magalie / Chaminade, Thierry et al. | 2020
- 676
The Brain-IHM Dataset: a New Resource for Studying the Brain Basis of Human-Human and Human-Machine Conversations
Ochs, Magalie / Bertrand, Roxane / Goujon, Aurélie / Bolger, Deirdre / Dubarry, Anne-Sophie / Blache, Philippe et al. | 2020
- 684
Dialogue-AMR: Abstract Meaning Representation for Dialogue
Bonial, Claire / Donatelli, Lucia / Abrams, Mitchell / Lukin, Stephanie M. / Tratz, Stephen / Marge, Matthew / Artstein, Ron / Traum, David / Voss, Clare et al. | 2020
- 696
Relation between Degree of Empathy for Narrative Speech and Type of Responsive Utterance in Attentive Listening
Ito, Koichiro / Murata, Masaki / Ohno, Tomohiro / Matsubara, Shigeki et al. | 2020
- 702
Intent Recognition in Doctor-Patient Interviews
Rojowiec, Robin / Roth, Benjamin / Fink, Maximilian et al. | 2020
- 710
BrainPredict: a Tool for Predicting and Visualising Local Brain Activity
Hmamouche, Youssef / Prévot, Laurent / Ochs, Magalie / Chaminade, Thierry et al. | 2020
- 717
MTSI-BERT: A Session-aware Knowledge-based Conversational Agent
Senese, Matteo Antonio / Rizzo, Giuseppe / Dragoni, Mauro / Morisio, Maurizio et al. | 2020
- 726
Predicting Ratings of Real Dialogue Participants from Artificial Data and Ratings of Human Dialogue Observers
Georgila, Kallirroi / Gordon, Carla / Yanov, Volodymyr / Traum, David et al. | 2020
- 735
Which Model Should We Use for a Real-World Conversational Dialogue System? A Cross-Language Relevance Model or a Deep Neural Net?
Alavi, Seyed Hossein / Leuski, Anton / Traum, David et al. | 2020
- 743
Chinese Whispers: A Multimodal Dataset for Embodied Language Grounding
Kontogiorgos, Dimosthenis / Sibirtseva, Elena / Gustafson, Joakim et al. | 2020
- 750
AMUSED: A Multi-Stream Vector Representation Method for Use in Natural Dialogue
Kumar, Gaurav / Joshi, Rishabh / Singh, Jaspreet / Yenigalla, Promod et al. | 2020
- 759
An Annotation Approach for Social and Referential Gaze in Dialogue
Somashekarappa, Vidya / Howes, Christine / Sayeed, Asad et al. | 2020
- 766
A Penn-style Treebank of Middle Low German
Booth, Hannah / Breitbarth, Anne / Ecay, Aaron / Farasyn, Melissa et al. | 2020
- 776
Books of Hours. The First Liturgical Data Set for Text Segmentation.
Hazem, Amir / Daille, Beatrice / Kermorvant, Christopher / Stutzmann, Dominique / Bonhomme, Marie-Laurence / Maarand, Martin / Boillet, Mélodie et al. | 2020
- 785
Corpus of Chinese Dynastic Histories: Gender Analysis over Two Millennia
Zinin, Sergey / Xu, Yang et al. | 2020
- 794
The Royal Society Corpus 6.0: Providing 300+ Years of Scientific Writing for Humanistic Study
Fischer, Stefan / Knappen, Jörg / Menzel, Katrin / Teich, Elke et al. | 2020
- 803
Corpus REDEWIEDERGABE
Brunner, Annelen / Engelberg, Stefan / Jannidis, Fotis / Tu, Ngoc Duyen Tanja / Weimer, Lukas et al. | 2020
- 813
WeDH - a Friendly Tool for Building Literary Corpora Enriched with Encyclopedic Metadata
Egloff, Mattia / Picca, Davide et al. | 2020
- 817
Automatic Section Recognition in Obituaries
Sabbatino, Valentino / Bostan, Laura Ana Maria / Klinger, Roman et al. | 2020
- 826
SLäNDa: An Annotated Corpus of Narrative and Dialogue in Swedish Literary Fiction
Stymne, Sara / Östman, Carin et al. | 2020
- 835
RiQuA: A Corpus of Rich Quotation Annotation for English Literary Text
Papay, Sean / Padó, Sebastian et al. | 2020
- 842
A Corpus Linguistic Perspective on Contemporary German Pop Lyrics with the Multi-Layer Annotated "Songkorpus"
Schneider, Roman et al. | 2020
- 849
The BDCamões Collection of Portuguese Literary Documents: a Research Resource for Digital Humanities and Language Technology
Grilo, Sara / Bolrinha, Márcia / Silva, João / Vaz, Rui / Branco, António et al. | 2020
- 855
Dataset for Temporal Analysis of English-French Cognates
Frossard, Esteban / Coustaty, Mickael / Doucet, Antoine / Jatowt, Adam / Hengchen, Simon et al. | 2020
- 860
Material Philology Meets Digital Onomastic Lexicography: The NordiCon Database of Medieval Nordic Personal Names in Continental Sources
Waldispühl, Michelle / Dannells, Dana / Borin, Lars et al. | 2020
- 868
NLP Scholar: A Dataset for Examining the State of NLP Research
Mohammad, Saif M. et al. | 2020
- 878
The DReaM Corpus: A Multilingual Annotated Corpus of Grammars for the World’s Languages
Virk, Shafqat Mumtaz / Hammarström, Harald / Forsberg, Markus / Wichmann, Søren et al. | 2020
- 885
LiViTo: Linguistic and Visual Features Tool for Assisted Analysis of Historic Manuscripts
Müller, Klaus / Tikhonov, Aleksej / Meyer, Roland et al. | 2020
- 891
TextAnnotator: A UIMA Based Tool for the Simultaneous and Collaborative Annotation of Texts
Abrami, Giuseppe / Stoeckel, Manuel / Mehler, Alexander et al. | 2020
- 901
Deduplication of Scholarly Documents using Locality Sensitive Hashing and Word Embeddings
Gyawali, Bikash / Anastasiou, Lucas / Knoth, Petr et al. | 2020
- 911
"Voices of the Great War": A Richly Annotated Corpus of Italian Texts on the First World War
Boschetti, Federico / De Felice, Irene / Dei Rossi, Stefano / Dell'Orletta, Felice / Di Giorgio, Michele / Miliani, Martina / Passaro, Lucia C. / Puddu, Angelica / Venturi, Giulia / Labanca, Nicola et al. | 2020
- 919
DEbateNet-mig15: Tracing the 2015 Immigration Debate in Germany Over Time
Lapesa, Gabriella / Blessing, Andre / Blokker, Nico / Dayanik, Erenay / Haunss, Sebastian / Kuhn, Jonas / Padó, Sebastian et al. | 2020
- 928
A Corpus of Spanish Political Speeches from 1937 to 2019
Álvarez-Mellado, Elena et al. | 2020
- 933
A New Latin Treebank for Universal Dependencies: Charters between Ancient Latin and Romance Languages
Cecchini, Flavio Massimiliano / Korkiakangas, Timo / Passarotti, Marco et al. | 2020
- 943
Identification of Indigenous Knowledge Concepts through Semantic Networks, Spelling Tools and Word Embeddings
Rocha Souza, Renato / Dorn, Amelie / Piringer, Barbara / Wandl-Vogt, Eveline et al. | 2020
- 948
A Multi-Orthography Parallel Corpus of Yiddish Nouns
Saleva, Jonne et al. | 2020
- 953
An Annotated Corpus of Adjective-Adverb Interfaces in Romance Languages
Gerhalter, Katharina / Schneider, Gerlinde / Pollin, Christopher / Hummel, Martin et al. | 2020
- 958
Language Resources for Historical Newspapers: the Impresso Collection
Ehrmann, Maud / Romanello, Matteo / Clematide, Simon / Ströbel, Phillip Benjamin / Barman, Raphaël et al. | 2020
- 969
Allgemeine Musikalische Zeitung as a Searchable Online Corpus
Kampe, Bernd / Duan, Tinghui / Hahn, Udo et al. | 2020
- 977
Stylometry in a Bilingual Setup
Cinkova, Silvie / Rybicki, Jan et al. | 2020
- 985
Dialect Clustering with Character-Based Metrics: in Search of the Boundary of Language and Dialect
Sato, Yo / Heffernan, Kevin et al. | 2020
- 991
DiscSense: Automated Semantic Analysis of Discourse Markers
Sileo, Damien / Van de Cruys, Tim / Pradel, Camille / Muller, Philippe et al. | 2020
- 1000
ThemePro: A Toolkit for the Analysis of Thematic Progression
Dominguez, Monica / Soler, Juan / Wanner, Leo et al. | 2020
- 1008
Machine-Aided Annotation for Fine-Grained Proposition Types in Argumentation
Jo, Yohan / Mayfield, Elijah / Reed, Chris / Hovy, Eduard et al. | 2020
- 1019
Chinese Discourse Parsing: Model and Evaluation
Lin, Chuan-An / Hung, Shyh-Shiun / Huang, Hen-Hsen / Chen, Hsin-Hsi et al. | 2020
- 1025
Shallow Discourse Annotation for Chinese TED Talks
Long, Wanqiu / Cai, Xinyi / Reid, James / Webber, Bonnie / Xiong, Deyi et al. | 2020
- 1033
The Discussion Tracker Corpus of Collaborative Argumentation
Olshefski, Christopher / Lugini, Luca / Singh, Ravneet / Litman, Diane / Godley, Amanda et al. | 2020
- 1044
Shallow Discourse Parsing for Under-Resourced Languages: Combining Machine Translation and Annotation Projection
Sluyter-Gäthje, Henny / Bourgonje, Peter / Stede, Manfred et al. | 2020
- 1051
A Corpus of Encyclopedia Articles with Logical Forms
Rasmussen, Nathan / Schuler, William et al. | 2020
- 1061
The Potsdam Commentary Corpus 2.2: Extending Annotations for Shallow Discourse Parsing
Bourgonje, Peter / Stede, Manfred et al. | 2020
- 1067
On the Creation of a Corpus for Coherence Evaluation of Discursive Units
Mohammadi, Elham / Beiko, Timothe / Kosseim, Leila et al. | 2020
- 1073
Joint Learning of Syntactic Features Helps Discourse Segmentation
Desai, Takshak / Dakle, Parag Pravin / Moldovan, Dan et al. | 2020
- 1081
Creating a Corpus of Gestures and Predicting the Audience Response based on Gestures in Speeches of Donald Trump
Ruf, Verena / Navarretta, Costanza et al. | 2020
- 1089
GeCzLex: Lexicon of Czech and German Anaphoric Connectives
Poláková, Lucie / Rysová, Kateřina / Rysová, Magdaléna / Mírovský, Jiří et al. | 2020
- 1097
DiMLex-Bangla: A Lexicon of Bangla Discourse Connectives
Das, Debopam / Stede, Manfred / Ghosh, Soumya Sankar / Chatterjee, Lahari et al. | 2020
- 1103
Semi-Supervised Tri-Training for Explicit Discourse Argument Expansion
Knaebel, Rene / Stede, Manfred et al. | 2020
- 1110
WikiPossessions: Possession Timeline Generation as an Evaluation Benchmark for Machine Reading Comprehension of Long Texts
Chinnappa, Dhivya / Palmer, Alexis / Blanco, Eduardo et al. | 2020
- 1118
TED-Q: TED Talks and the Questions they Evoke
Westera, Matthijs / Mayol, Laia / Rohde, Hannah et al. | 2020
- 1128
CzeDLex 0.6 and its Representation in the PML-TQ
Mírovský, Jiří / Poláková, Lucie / Synková, Pavlína et al. | 2020
- 1135
Corpus for Modeling User Interactions in Online Persuasive Discussions
Egawa, Ryo / Morio, Gaku / Fujita, Katsuhide et al. | 2020
- 1142
Simplifying Coreference Chains for Dyslexic Children
Wilkens, Rodrigo / Todirascu, Amalia et al. | 2020
- 1152
Adapting BERT to Implicit Discourse Relation Classification with a Focus on Discourse Connectives
Kishimoto, Yudai / Murawaki, Yugo / Kurohashi, Sadao et al. | 2020
- 1159
What Speakers really Mean when they Ask Questions: Classification of Intentions with a Supervised Approach
Barbedette, Angèle / Eshkol-Taravella, Iris et al. | 2020
- 1167
Modeling Dialogue in Conversational Cognitive Health Screening Interviews
Farzana, Shahla / Valizadeh, Mina / Parde, Natalie et al. | 2020
- 1178
Stigma Annotation Scheme and Stigmatized Language Detection in Health-Care Discussions on Social Media
Straton, Nadiya / Jang, Hyeju / Ng, Raymond et al. | 2020
- 1191
An Annotated Dataset of Discourse Modes in Hindi Stories
Dhanwal, Swapnil / Dutta, Hritwik / Nankani, Hitesh / Shrivastava, Nilay / Kumar, Yaman / Li, Junyi Jessy / Mahata, Debanjan / Gosangi, Rakesh / Zhang, Haimin / Shah, Rajiv Ratn et al. | 2020
- 1197
Multi-class Multilingual Classification of Wikipedia Articles Using Extended Named Entity Tag Set
Shavarani, Hassan S. / Sekine, Satoshi et al. | 2020
- 1202
An Algerian Corpus and an Annotation Platform for Opinion and Emotion Analysis
Moudjari, Leila / Akli-Astouati, Karima / Benamara, Farah et al. | 2020
- 1211
Transfer Learning from Transformers to Fake News Challenge Stance Detection (FNC-1) Task
Slovikovskaya, Valeriya / Attardi, Giuseppe et al. | 2020
- 1219
Scientific Statement Classification over arXiv.org
Ginev, Deyan / Miller, Bruce R et al. | 2020
- 1227
Cross-domain Author Gender Classification in Brazilian Portuguese
Dias, Rafael / Paraboni, Ivandré et al. | 2020
- 1235
LEDGAR: A Large-Scale Multi-label Corpus for Text Classification of Legal Provisions in Contracts
Tuggener, Don / von Däniken, Pius / Peetz, Thomas / Cieliebak, Mark et al. | 2020
- 1242
Online Near-Duplicate Detection of News Articles
Rodier, Simon / Carter, Dave et al. | 2020
- 1250
Automated Essay Scoring System for Nonnative Japanese Learners
Hirao, Reo / Arai, Mio / Shimanaka, Hiroki / Katsumata, Satoru / Komachi, Mamoru et al. | 2020
- 1258
A Real-World Data Resource of Complex Sensitive Sentences Based on Documents from the Monsanto Trial
Neerbek, Jan / Eskildsen, Morten / Dolog, Peter / Assent, Ira et al. | 2020
- 1268
Discovering Biased News Articles Leveraging Multiple Human Annotations
Lazaridou, Konstantina / Löser, Alexander / Mestre, Maria / Naumann, Felix et al. | 2020
- 1278
Corpora and Baselines for Humour Recognition in Portuguese
Gonçalo Oliveira, Hugo / Clemêncio, André / Alves, Ana et al. | 2020
- 1286
FactCorp: A Corpus of Dutch Fact-checks and its Multiple Usages
van der Meulen, Marten / Reijnierse, W. Gudrun et al. | 2020
- 1293
Automatic Orality Identification in Historical Texts
Ortmann, Katrin / Dipper, Stefanie et al. | 2020
- 1303
Using Deep Neural Networks with Intra- and Inter-Sentence Context to Classify Suicidal Behaviour
Song, Xingyi / Downs, Johnny / Velupillai, Sumithra / Holden, Rachel / Kikoler, Maxim / Bontcheva, Kalina / Dutta, Rina / Roberts, Angus et al. | 2020
- 1311
A First Dataset for Film Age Appropriateness Investigation
Mohamed, Emad / Ha, Le An et al. | 2020
- 1318
Habibi - a multi Dialect multi National Arabic Song Lyrics Corpus
El-Haj, Mahmoud et al. | 2020
- 1327
Age Suitability Rating: Predicting the MPAA Rating Based on Movie Dialogues
Shafaei, Mahsa / Safi Samghabadi, Niloofar / Kar, Sudipta / Solorio, Thamar et al. | 2020
- 1336
Email Classification Incorporating Social Networks and Thread Structure
Alkhereyf, Sakhar / Rambow, Owen et al. | 2020
- 1346
Development and Validation of a Corpus for Machine Humor Comprehension
Tseng, Yuen-Hsien / Wu, Wun-Syuan / Chang, Chia-Yueh / Chen, Hsueh-Chih / Hsu, Wei-Lun et al. | 2020
- 1353
Alector: A Parallel Corpus of Simplified French Texts with Alignments of Misreadings by Poor and Dyslexic Readers
Gala, Núria / Tack, Anaïs / Javourey-Drevet, Ludivine / François, Thomas / Ziegler, Johannes C. et al. | 2020
- 1362
A Corpus for Detecting High-Context Medical Conditions in Intensive Care Patient Notes Focusing on Frequently Readmitted Patients
Moseley, Edward T. / Wu, Joy T. / Welt, Jonathan / Foote, John / Tyler, Patrick D. / Grant, David W. / Carlson, Eric T. / Gehrmann, Sebastian / Dernoncourt, Franck / Celi, Leo Anthony et al. | 2020
- 1368
Multilingual Stance Detection in Tweets: The Catalonia Independence Corpus
Zotova, Elena / Agerri, Rodrigo / Nuñez, Manuel / Rigau, German et al. | 2020
- 1376
An Evaluation of Progressive Neural Networks for Transfer Learning in Natural Language Processing
Moeed, Abdul / Hagerer, Gerhard / Dugar, Sumit / Gupta, Sarthak / Ghosh, Mainak / Danner, Hannah / Mitevski, Oliver / Nawroth, Andreas / Groh, Georg et al. | 2020
- 1382
WAC: A Corpus of Wikipedia Conversations for Online Abuse Detection
Cécillon, Noé / Labatut, Vincent / Dufour, Richard / Linarès, Georges et al. | 2020
- 1391
FloDusTA: Saudi Tweets Dataset for Flood, Dust Storm, and Traffic Accident Events
Hamoui, Btool / Mars, Mourad / Almotairi, Khaled et al. | 2020
- 1397
An Annotated Corpus for Sexism Detection in French Tweets
Chiril, Patricia / Moriceau, Véronique / Benamara, Farah / Mari, Alda / Origgi, Gloria / Coulomb-Gully, Marlène et al. | 2020
- 1404
-
Measuring the Impact of Readability Features in Fake News DetectionSantos, Roney / Pedro, Gabriela / Leal, Sidney / Vale, Oto / Pardo, Thiago / Bontcheva, Kalina / Scarton, Carolina et al. | 2020
- 1414
-
When Shallow is Good Enough: Automatic Assessment of Conceptual Text Complexity using Shallow Semantic FeaturesStajner, Sanja / Hulpuș, Ioana et al. | 2020
- 1423
-
DecOp: A Multilingual and Multi-domain Corpus For Detecting Deception In Typed TextCapuozzo, Pasquale / Lauriola, Ivano / Strapparava, Carlo / Aiolli, Fabio / Sartori, Giuseppe et al. | 2020
- 1431
-
Age Recommendation for TextsBlandin, Alexis / Lecorvé, Gwénolé / Battistelli, Delphine / Étienne, Aline et al. | 2020
- 1440
-
Multilingual Twitter Corpus and Baselines for Evaluating Demographic Bias in Hate Speech RecognitionHuang, Xiaolei / Xing, Linzi / Dernoncourt, Franck / Paul, Michael J. et al. | 2020
- 1449
-
VICTOR: a Dataset for Brazilian Legal Documents ClassificationLuz de Araujo, Pedro Henrique / de Campos, Teófilo Emídio / Ataides Braz, Fabricio / Correia da Silva, Nilton et al. | 2020
- 1459
-
Dynamic Classification in Web Archiving CollectionsPatel, Krutarth / Caragea, Cornelia / Phillips, Mark et al. | 2020
- 1469
-
Aspect Flow Representation and Audio Inspired Analysis for TextsVasconcelos, Larissa / Campelo, Claudio / Jeronimo, Caio et al. | 2020
- 1478
-
Annotating and Analyzing Biased Sentences in News Articles using CrowdsourcingLim, Sora / Jatowt, Adam / Färber, Michael / Yoshikawa, Masatoshi et al. | 2020
- 1485
-
Evaluation of Deep Gaussian Processes for Text ClassificationJayashree, P. / Srijith, P. K. et al. | 2020
- 1492
-
EmoEvent: A Multilingual Emotion Corpus based on different EventsPlaza del Arco, Flor Miriam / Strapparava, Carlo / Urena Lopez, L. Alfonso / Martin, Maite et al. | 2020
- 1499
-
MuSE: a Multimodal Dataset of Stressed EmotionJaiswal, Mimansa / Bara, Cristian-Paul / Luo, Yuanhang / Burzo, Mihai / Mihalcea, Rada / Provost, Emily Mower et al. | 2020
- 1511
-
Affect in Tweets: A Transfer Learning ApproachZhang, Linrui / Huang, Hsin-Lun / Yu, Yang / Moldovan, Dan et al. | 2020
- 1517
-
Annotation of Emotion Carriers in Personal NarrativesTammewar, Aniruddha / Cervone, Alessandra / Messner, Eva-Maria / Riccardi, Giuseppe et al. | 2020
- 1526
-
Towards Interactive Annotation for Hesitation in Conversational SpeechWottawa, Jane / Tahon, Marie / Marin, Apolline / Audibert, Nicolas et al. | 2020
- 1533
-
Abusive language in Spanish children and young teenager’s conversations: data preparation and short text classification with contextual word embeddingsCosta-jussà, Marta R. / González, Esther / Moreno, Asuncion / Cumalat, Eudald et al. | 2020
- 1538
-
IIIT-H TEMD Semi-Natural Emotional Speech Database from Professional Actors and Non-ActorsRambabu, Banothu / Botsa, Kishore Kumar / Paidi, Gangamohan / Gangashetty, Suryakanth V et al. | 2020
- 1546
-
The POTUS Corpus, a Database of Weekly Addresses for the Study of Stance in Politics and Virtual AgentsJanssoone, Thomas / Bailly, Kévin / Richard, Gaël / Clavel, Chloé et al. | 2020
- 1554
-
GoodNewsEveryone: A Corpus of News Headlines Annotated with Emotions, Semantic Roles, and Reader PerceptionBostan, Laura Ana Maria / Kim, Evgeny / Klinger, Roman et al. | 2020
- 1567
-
SOLO: A Corpus of Tweets for Examining the State of Being AloneKiritchenko, Svetlana / Hipson, Will / Coplan, Robert / Mohammad, Saif M. et al. | 2020
- 1578
-
PoKi: A Large Dataset of Poems by ChildrenHipson, Will / Mohammad, Saif M. et al. | 2020
- 1590
-
AlloSat: A New Call Center French Corpus for Satisfaction and Frustration AnalysisMacary, Manon / Tahon, Marie / Estève, Yannick / Rousseau, Anthony et al. | 2020
- 1598
-
Learning the Human Judgment for the Automatic Evaluation of ChatbotWu, Shih-Hung / Chien, Sheng-Lun et al. | 2020
- 1603
-
Korean-Specific Emotion Annotation Procedure Using N-Gram-Based Distant Supervision and Korean-Specific-Feature-Based Distant SupervisionLee, Young-Jun / Lim, Chae-Gyun / Choi, Ho-Jin et al. | 2020
- 1611
-
Semi-Automatic Construction and Refinement of an Annotated Corpus for a Deep Learning Framework for Emotion ClassificationXu, Jiajun / Masuda, Kyosuke / Nishizaki, Hiromitsu / Fukumoto, Fumiyo / Suzuki, Yoshimi et al. | 2020
- 1618
-
CEASE, a Corpus of Emotion Annotated Suicide notes in EnglishGhosh, Soumitra / Ekbal, Asif / Bhattacharyya, Pushpak et al. | 2020
- 1627
-
Training a Broad-Coverage German Sentiment Classification Model for Dialog SystemsGuhr, Oliver / Schumann, Anne-Kathrin / Bahrmann, Frank / Böhme, Hans Joachim et al. | 2020
- 1633
-
An Event-comment Social Media Corpus for Implicit Emotion AnalysisLee, Sophia Yat Mei / Lau, Helena Yan Ping et al. | 2020
- 1643
-
An Emotional Mess! Deciding on a Framework for Building a Dutch Emotion-Annotated CorpusDe Bruyne, Luna / De Clercq, Orphee / Hoste, Veronique et al. | 2020
- 1652
-
PO-EMO: Conceptualization, Annotation, and Modeling of Aesthetic Emotions in German and English PoetryHaider, Thomas / Eger, Steffen / Kim, Evgeny / Klinger, Roman / Menninghaus, Winfried et al. | 2020
- 1664
-
Learning Word Ratings for Empathy and Distress from Document-Level User ResponsesSedoc, João / Buechel, Sven / Nachmany, Yehonathan / Buffone, Anneke / Ungar, Lyle et al. | 2020
- 1674
-
Evaluation of Sentence Representations in PolishDadas, Slawomir / Perełkiewicz, Michał / Poświata, Rafał et al. | 2020
- 1681
-
Identification of Primary and Collateral Tracks in Stuttered SpeechRiad, Rachid / Bachoud-Lévi, Anne-Catherine / Rudzicz, Frank / Dupoux, Emmanuel et al. | 2020
- 1689
-
How to Compare Automatically Two Phonological Strings: Application to Intelligibility Measurement in the Case of Atypical SpeechGhio, Alain / Lalain, Muriel / Giusti, Laurence / Fredouille, Corinne / Woisard, Virginie et al. | 2020
- 1695
-
Evaluating Text Coherence at Sentence and Paragraph LevelsLiu, Sennan / Zeng, Shuang / Li, Sujian et al. | 2020
- 1704
-
HardEval: Focusing on Challenging Tokens to Assess Robustness of NERBernier-Colborne, Gabriel / Langlais, Phillippe et al. | 2020
- 1712
-
An Evaluation Dataset for Identifying Communicative Functions of Sentences in English Scholarly PapersIwatsuki, Kenichi / Boudin, Florian / Aizawa, Akiko et al. | 2020
- 1721
-
An Automatic Tool For Language EvaluationFassetti, Fabio / Fassetti, Ilaria et al. | 2020
- 1727
-
Which Evaluations Uncover Sense Representations that Actually Make Sense?Boyd-Graber, Jordan / Guo, Fenfei / Findlater, Leah / Iyyer, Mohit et al. | 2020
- 1739
-
Diversity, Density, and Homogeneity: Quantitative Characteristic Metrics for Text CollectionsLai, Yi-An / Zhu, Xuan / Zhang, Yi / Diab, Mona et al. | 2020
- 1747
-
Towards Few-Shot Event Mention Retrieval: An Evaluation Framework and A Siamese Network ApproachMin, Bonan / Chan, Yee Seng / Zhao, Lingjun et al. | 2020
- 1753
-
Linguistic Appropriateness and Pedagogic Usefulness of Reading Comprehension QuestionsHorbach, Andrea / Aldabe, Itziar / Bexte, Marie / Lopez de Lacalle, Oier / Maritxalar, Montse et al. | 2020
- 1763
-
Dataset Reproducibility and IR Methods in Timeline SummarizationBorn, Leo / Bacher, Maximilian / Markert, Katja et al. | 2020
- 1772
-
Database Search vs. Information Retrieval: A Novel Method for Studying Natural Language Querying of Semi-Structured DataNadig, Stefanie / Braschler, Martin / Stockinger, Kurt et al. | 2020
- 1780
-
Why Attention is Not Explanation: Surgical Intervention and Causal Reasoning about Neural ModelsGrimsley, Christopher / Mayfield, Elijah / R.S. Bursten, Julia et al. | 2020
- 1791
-
Have a Cake and Eat it Too: Assessing Discriminating Performance of an Intelligibility Index Obtained from a Reduced Sample SizeMarczyk, Anna / Ghio, Alain / Lalain, Muriel / Rebourg, Marie / Fredouille, Corinne / Woisard, Virginie et al. | 2020
- 1796
-
Evaluation Metrics for Headline Generation Using Deep Pre-Trained EmbeddingsMoeed, Abdul / An, Yang / Hagerer, Gerhard / Groh, Georg et al. | 2020
- 1803
-
LinCE: A Centralized Benchmark for Linguistic Code-switching EvaluationAguilar, Gustavo / Kar, Sudipta / Solorio, Thamar et al. | 2020
- 1814
-
Paraphrase Generation and Evaluation on Colloquial-Style SentencesSjöblom, Eetu / Creutz, Mathias / Scherrer, Yves et al. | 2020
- 1823
-
Analyzing Word Embedding Through Structural Equation ModelingHan, Namgi / Hayashi, Katsuhiko / Miyao, Yusuke et al. | 2020
- 1833
-
Evaluation of Lifelong Learning SystemsProkopalo, Yevhenii / Meignier, Sylvain / Galibert, Olivier / Barrault, Loic / Larcher, Anthony et al. | 2020
- 1842
-
Interannotator Agreement for Lexico-Semantic Annotation of a CorpusHajnicz, Elżbieta et al. | 2020
- 1849
-
An In-Depth Comparison of 14 Spelling Correction Tools on a Common BenchmarkNäther, Markus et al. | 2020
- 1858
-
Sentence Level Human Translation Quality Estimation with Attention-based Neural NetworksYuan, Yu / Sharoff, Serge et al. | 2020
- 1866
-
Evaluating Language Tools for Fifteen EU-official Under-resourced LanguagesAlves, Diego / Thakkar, Gaurish / Tadić, Marko et al. | 2020
- 1874
-
Word Embedding Evaluation for SinhalaLakmal, Dimuthu / Ranathunga, Surangika / Peramuna, Saman / Herath, Indu et al. | 2020
- 1882
-
Stress Test Evaluation of Transformer-based Models in Natural Language Understanding TasksAspillaga, Carlos / Carvallo, Andrés / Araujo, Vladimir et al. | 2020
- 1895
-
Brand-Product Relation Extraction Using Heterogeneous Vector Space RepresentationsJanz, Arkadiusz / Kopociński, Łukasz / Piasecki, Maciej / Pluwak, Agnieszka et al. | 2020
- 1902
-
A Tale of Three Parsers: Towards Diagnostic Evaluation for Meaning Representation ParsingBuljan, Maja / Nivre, Joakim / Oepen, Stephan / Øvrelid, Lilja et al. | 2020
- 1910
-
Headword-Oriented Entity Linking: A Special Entity Linking Task with Dataset and BaselineYang, Mu / Chen, Chi-Yen / Lee, Yi-Hui / Zeng, Qian-hui / Ma, Wei-Yun / Shih, Chen-Yang / Chen, Wei-Jhih et al. | 2020
- 1918
-
TableBank: Table Benchmark for Image-based Table Detection and RecognitionLi, Minghao / Cui, Lei / Huang, Shaohan / Wei, Furu / Zhou, Ming / Li, Zhoujun et al. | 2020
- 1926
-
WIKIR: A Python Toolkit for Building a Large-scale Wikipedia-based English Information Retrieval DatasetFrej, Jibril / Schwab, Didier / Chevallet, Jean-Pierre et al. | 2020
- 1934
-
Constructing a Public Meeting CorpusTanaka, Koji / Chu, Chenhui / Ren, Haolin / Renoust, Benjamin / Nakashima, Yuta / Takemura, Noriko / Nagahara, Hajime / Fujikawa, Takao et al. | 2020
- 1941
-
Annotating and Extracting Synthesis Process of All-Solid-State Batteries from Scientific LiteratureKuniyoshi, Fusataka / Makino, Kohei / Ozawa, Jun / Miwa, Makoto et al. | 2020
- 1951
-
WEXEA: Wikipedia EXhaustive Entity AnnotationStrobl, Michael / Trabelsi, Amine / Zaiane, Osmar et al. | 2020
- 1959
-
Handling Entity Normalization with no Annotated Corpus: Weakly Supervised Methods Based on Distributional Representation and Ontological InformationFerré, Arnaud / Bossy, Robert / Ba, Mouhamadou / Deléger, Louise / Lavergne, Thomas / Zweigenbaum, Pierre / Nédellec, Claire et al. | 2020
- 1967
-
HBCP Corpus: A New Resource for the Analysis of Behavioural Change Intervention ReportsBonin, Francesca / Gleize, Martin / Finnerty, Ailbhe / Moore, Candice / Jochim, Charles / Norris, Emma / Hou, Yufang / Wright, Alison J. / Ganguly, Debasis / Hayes, Emily et al. | 2020
- 1976
-
Cross-lingual Structure Transfer for Zero-resource Event ExtractionLu, Di / Subburathinam, Ananya / Ji, Heng / May, Jonathan / Chang, Shih-Fu / Sil, Avi / Voss, Clare et al. | 2020
- 1982
-
Cross-Domain Evaluation of Edge Detection for Biomedical Event ExtractionRamponi, Alan / Plank, Barbara / Lombardo, Rosario et al. | 2020
- 1990
-
Semantic Annotation for Improved Safety in Construction WorkThompson, Paul / Yates, Tim / Inan, Emrah / Ananiadou, Sophia et al. | 2020
- 2000
-
Social Web Observatory: A Platform and Method for Gathering Knowledge on Entities from Different Textual SourcesTsekouras, Leonidas / Petasis, Georgios / Giannakopoulos, George / Kosmopoulos, Aris et al. | 2020
- 2009
-
Development of a Corpus Annotated with Medications and their Attributes in Psychiatric Health RecordsChaturvedi, Jaya / Viani, Natalia / Sanyal, Jyoti / Tytherleigh, Chloe / Hasan, Idil / Baird, Kate / Velupillai, Sumithra / Stewart, Robert / Roberts, Angus et al. | 2020
- 2017
-
Do not let the history haunt you: Mitigating Compounding Errors in Conversational Question AnsweringMandya, Angrosh / O'Neill, James / Bollegala, Danushka / Coenen, Frans et al. | 2020
- 2026
-
CLEEK: A Chinese Long-text Corpus for Entity LinkingZeng, Weixin / Zhao, Xiang / Tang, Jiuyang / Tan, Zhen / Huang, Xuqian et al. | 2020
- 2036
-
The Medical Scribe: Corpus Development and Model Performance AnalysesShafran, Izhak / Du, Nan / Tran, Linh / Perry, Amanda / Keyes, Lauren / Knichel, Mark / Domin, Ashley / Huang, Lei / Chen, Yu-hui / Li, Gang et al. | 2020
- 2045
-
A Contract Corpus for Recognizing Rights and ObligationsFunaki, Ruka / Nagata, Yusuke / Suenaga, Kohei / Mori, Shinsuke et al. | 2020
- 2054
-
Recognition of Implicit Geographic Movement in TextPezanowski, Scott / Mitra, Prasenjit et al. | 2020
- 2064
-
Extraction of the Argument Structure of Tokyo Metropolitan Assembly Minutes: Segmentation of Question-and-Answer SetsTakamaru, Keiichi / Kimura, Yasutomo / Shibuki, Hideyuki / Ototake, Hokuto / Uchida, Yuzu / Sakamoto, Kotaro / Ishioroshi, Madoka / Mitamura, Teruko / Kando, Noriko et al. | 2020
- 2069
-
A Term Extraction Approach to Survey Analysis in Health CareRobin, Cécile / Isazad Mashinchi, Mona / Ahmadi Zeleti, Fatemeh / Ojo, Adegboyega / Buitelaar, Paul et al. | 2020
- 2078
-
A Scientific Information Extraction Dataset for Nature Inspired EngineeringKruiper, Ruben / Vincent, Julian F.V. / Chen-Burger, Jessica / Desmulliez, Marc P.Y. / Konstas, Ioannis et al. | 2020
- 2086
-
Automated Discovery of Mathematical Definitions in TextVanetik, Natalia / Litvak, Marina / Shevchuk, Sergey / Reznik, Lior et al. | 2020
- 2095
-
WN-Salience: A Corpus of News Articles with Entity Salience AnnotationsWu, Chuan / Kanoulas, Evangelos / de Rijke, Maarten / Lu, Wei et al. | 2020
- 2103
-
Event Extraction from Unstructured Amharic TextTadesse, Ephrem / Tsegaye, Rosa / Qaqqabaa, Kuulaa et al. | 2020
- 2110
-
Comparing Machine Learning and Deep Learning Approaches on NLP Tasks for the Italian LanguageMagnini, Bernardo / Lavelli, Alberto / Magnolini, Simone et al. | 2020
- 2120
-
MyFixit: An Annotated Dataset, Annotation Tool, and Baseline Methods for Information Extraction from Repair ManualsNabizadeh, Nima / Kolossa, Dorothea / Heckmann, Martin et al. | 2020
- 2129
-
Towards Entity Spacesvan Erp, Marieke / Groth, Paul et al. | 2020
- 2138
-
Love Me, Love Me, Say (and Write!) that You Love Me: Enriching the WASABI Song Corpus with Lyrics AnnotationsFell, Michael / Cabrio, Elena / Korfed, Elmahdi / Buffa, Michel / Gandon, Fabien et al. | 2020
- 2148
-
Evaluating Information Loss in Temporal Dependency TreesOcal, Mustafa / Finlayson, Mark et al. | 2020
- 2157
-
Populating Legal Ontologies using Semantic Role LabelingHumphreys, Llio / Boella, Guido / Di Caro, Luigi / Robaldo, Livio / van der Torre, Leon / Ghanavati, Sepideh / Muthuri, Robert et al. | 2020
- 2167
-
PST 2.0 - Corpus of Polish Spatial TextsMarcińczuk, Michał / Oleksy, Marcin / Wieczorek, Jan et al. | 2020
- 2175
-
Natural Language Premise Selection: Finding Supporting Statements for Mathematical TextFerreira, Deborah / Freitas, André et al. | 2020
- 2183
-
Odinson: A Fast Rule-based Information Extraction FrameworkValenzuela-Escárcega, Marco A. / Hahn-Powell, Gus / Bell, Dane et al. | 2020
- 2192
-
The STEM-ECR Dataset: Grounding Scientific Entity References in STEM Scholarly Content to Authoritative Encyclopedic and Lexicographic SourcesD'Souza, Jennifer / Hoppe, Anett / Brack, Arthur / Jaradeh, Mohmad Yaser / Auer, Sören / Ewerth, Ralph et al. | 2020
- 2204
-
MathAlign: Linking Formula Identifiers to their Contextual Natural Language DescriptionsAlexeeva, Maria / Sharp, Rebecca / Valenzuela-Escárcega, Marco A. / Kadowaki, Jennifer / Pyarelal, Adarsh / Morrison, Clayton et al. | 2020
- 2213
-
Domain Adapted Distant Supervision for Pedagogically Motivated Relation ExtractionSainz, Oscar / Lopez de Lacalle, Oier / Aldabe, Itziar / Maritxalar, Montse et al. | 2020
- 2223
-
Temporal Histories of Epidemic Events (THEE): A Case Study in Temporal Annotation for Public HealthNiu, Jingcheng / Ng, Victoria / Penn, Gerald / Rees, Erin E. et al. | 2020
- 2231
-
Exploiting Citation Knowledge in Personalised Recommendation of Recent Scientific PublicationsKhadka, Anita / Cantador, Iván / Fernandez, Miriam et al. | 2020
- 2241
-
A Platform for Event Extraction in HindiSahoo, Sovan Kumar / Saha, Saumajit / Ekbal, Asif / Bhattacharyya, Pushpak et al. | 2020
- 2251
-
Rad-SpatialNet: A Frame-based Resource for Fine-Grained Spatial Relations in Radiology ReportsDatta, Surabhi / Ulinski, Morgan / Godfrey-Stovall, Jordan / Khanpara, Shekhar / Riascos-Castaneda, Roy F. / Roberts, Kirk et al. | 2020
- 2261
-
NLP Analytics in Finance with DoRe: A French 250M Tokens Corpus of Corporate Annual ReportsMasson, Corentin / Paroubek, Patrick et al. | 2020
- 2268
-
The Language of Brain Signals: Natural Language Processing of Electroencephalography ReportsMaldonado, Ramon / Harabagiu, Sanda et al. | 2020
- 2276
-
Humans Keep It One Hundred: an Overview of AI JourneyShavrina, Tatiana / Emelyanov, Anton / Fenogenova, Alena / Fomin, Vadim / Mikhailov, Vladislav / Evlampiev, Andrey / Malykh, Valentin / Larin, Vladimir / Natekin, Alex / Vatulin, Aleksandr et al. | 2020
- 2285
-
Towards Data-driven Ontologies: a Filtering Approach using Keywords and Natural Language Constructsde Boer, Maaike / Verhoosel, Jack P. C. et al. | 2020
- 2293
-
A French Corpus and Annotation Schema for Named Entity Recognition and Relation Extraction of Financial NewsJabbari, Ali / Sauvage, Olivier / Zeine, Hamada / Chergui, Hamza et al. | 2020
- 2300
-
Inferences for Lexical Semantic Resource Building with Less SupervisionBebeshina, Nadia / Lafourcade, Mathieu et al. | 2020
- 2306
-
Acquiring Social Knowledge about Personality and Driving-related BehaviorIwai, Ritsuko / Kawahara, Daisuke / Kumada, Takatsune / Kurohashi, Sadao et al. | 2020
- 2316
-
Implicit knowledge in argumentative texts : an annotated corpusBecker, Maria / Korfhage, Katharina / Frank, Anette et al. | 2020
- 2325
-
Multiple Knowledge GraphDB (MKGDB)Faralli, Stefano / Velardi, Paola / Yusifli, Farid et al. | 2020
- 2332
-
Orchestrating NLP Services for the Legal DomainMoreno-Schneider, Julian / Rehm, Georg / Montiel-Ponsoda, Elena / Rodriguez-Doncel, Víctor / Revenko, Artem / Karampatakis, Sotirios / Khvalchik, Maria / Sageder, Christian / Gracia, Jorge / Maganza, Filippo et al. | 2020
- 2341
-
Evaluation Dataset and Methodology for Extracting Application-Specific Taxonomies from the Wikipedia Knowledge GraphBordea, Georgeta / Faralli, Stefano / Mougin, Fleur / Buitelaar, Paul / Diallo, Gayo et al. | 2020
- 2348
-
Subjective Evaluation of Comprehensibility in Movie InteractionsRandria, Estelle / Fontan, Lionel / Le Coz, Maxime / Ferrané, Isabelle / Pinquier, Julien et al. | 2020
- 2358
-
Representing Multiword Term Variation in a Terminological Knowledge Base: a Corpus-Based StudyLeón-Araúz, Pilar / Reimerink, Arianne / Cabezas-García, Melania et al. | 2020
- 2368
-
Understanding Spatial Relations through Multiple ModalitiesDan, Soham / He, Hangfeng / Roth, Dan et al. | 2020
- 2373
-
A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource LanguagesRoy, Dwaipayan / Bhatia, Sumit / Jain, Prateek et al. | 2020
- 2381
-
Pártélet: A Hungarian Corpus of Propaganda Texts from the Hungarian Socialist EraKmetty, Zoltán / Vincze, Veronika / Demszky, Dorottya / Ring, Orsolya / Nagy, Balázs / Szabó, Martina Katalin et al. | 2020
- 2389
-
KORE 50^DYWC: An Evaluation Data Set for Entity Linking Based on DBpedia, YAGO, Wikidata, and CrunchbaseNoullet, Kristian / Mix, Rico / Färber, Michael et al. | 2020
- 2396
-
Eye4Ref: A Multimodal Eye Movement Dataset of Referentially Complex SituationsAlacam, Özge / Ruppert, Eugen / Salama, Amr Rekaby / Staron, Tobias / Menzel, Wolfgang et al. | 2020
- 2405
-
SiBert: Enhanced Chinese Pre-trained Language Model with Sentence InsertionChen, Jiahao / Cao, Chenjie / Jiang, Xiuyan et al. | 2020
- 2413
-
Processing South Asian Languages Written in the Latin Script: the Dakshina DatasetRoark, Brian / Wolf-Sonkin, Lawrence / Kirov, Christo / Mielke, Sabrina J. / Johny, Cibu / Demirsahin, Isin / Hall, Keith et al. | 2020
- 2424
-
GM-RKB WikiText Error Correction Task and BaselinesMelli, Gabor / Eldallal, Abdelrhman / Lazem, Bassim / Moreira, Olga et al. | 2020
- 2431
-
Embedding Space Correlation as a Measure of Domain SimilarityBeyer, Anne / Kauermann, Göran / Schütze, Hinrich et al. | 2020
- 2440
-
Wiki-40B: Multilingual Language Model DatasetGuo, Mandy / Dai, Zihang / Vrandečić, Denny / Al-Rfou, Rami et al. | 2020
- 2453
-
Know thy Corpus! Robust Methods for Digital Curation of Web corporaSharoff, Serge et al. | 2020
- 2461
-
Evaluating Approaches to Personalizing Language ModelsKing, Milton / Cook, Paul et al. | 2020
- 2470
-
Class-based LSTM Russian Language Model with Linguistic InformationKipyatkova, Irina / Karpov, Alexey et al. | 2020
- 2475
-
Adaptation of Deep Bidirectional Transformers for Afrikaans LanguageRalethe, Sello et al. | 2020
- 2479
-
FlauBERT: Unsupervised Language Model Pre-training for FrenchLe, Hang / Vial, Loïc / Frej, Jibril / Segonne, Vincent / Coavoux, Maximin / Lecouteux, Benjamin / Allauzen, Alexandre / Crabbé, Benoit / Besacier, Laurent / Schwab, Didier et al. | 2020
- 2491
-
Accelerated High-Quality Mutual-Information Based Word ClusteringCiosici, Manuel R. / Assent, Ira / Derczynski, Leon et al. | 2020
- 2497
-
Rhythmic Proximity Between Natives And Learners Of French - Evaluation of a metric based on the CEFC corpusCoulange, Sylvain / Rossato, Solange et al. | 2020
- 2503
-
From Linguistic Resources to Ontology-Aware Terminologies: Minding the Representation GapSperanza, Giulia / di Buono, Maria Pia / Monti, Johanna / Sangati, Federico et al. | 2020
- 2511
-
Modeling Factual Claims with Semantic FramesArslan, Fatma / Caraballo, Josue / Jimenez, Damian / Li, Chengkai et al. | 2020
- 2521
-
Automatic Transcription Challenges for Inuktitut, a Low-Resource Polysynthetic LanguageGupta, Vishwa / Boulianne, Gilles et al. | 2020
- 2528
-
Geographically-Balanced Gigaword Corpora for 50 Language VarietiesDunn, Jonathan / Adams, Ben et al. | 2020
- 2537
-
Data Augmentation using Machine Translation for Fake News Detection in the Urdu LanguageAmjad, Maaz / Sidorov, Grigori / Zhila, Alisa et al. | 2020
- 2543
-
Evaluation of Greek Word EmbeddingsOutsios, Stamatis / Karatsalos, Christos / Skianis, Konstantinos / Vazirgiannis, Michalis et al. | 2020
- 2552
-
A Dataset of Mycenaean Linear B SequencesPapavassiliou, Katerina / Owens, Gareth / Kosmopoulos, Dimitrios et al. | 2020
- 2562
-
The Nunavut Hansard Inuktitut-English Parallel Corpus 3.0 with Preliminary Machine Translation ResultsJoanis, Eric / Knowles, Rebecca / Kuhn, Roland / Larkin, Samuel / Littell, Patrick / Lo, Chi-kiu / Stewart, Darlene / Micher, Jeffrey et al. | 2020
- 2573
-
Exploring Bilingual Word Embeddings for Hiligaynon, a Low-Resource LanguageMichel, Leah / Hangya, Viktor / Fraser, Alexander et al. | 2020
- 2581
-
A Finite-State Morphological Analyser for EvenkiZueva, Anna / Kuznetsova, Anastasia / Tyers, Francis et al. | 2020
- 2590
-
Morphology-rich Alphasyllabary EmbeddingsMersha, Amanuel / Wu, Stephen et al. | 2020
- 2596
-
Localization of Fake News Detection via Multitask Transfer LearningCruz, Jan Christian Blaise / Tan, Julianne Agatha / Cheng, Charibeth et al. | 2020
- 2605
-
Evaluating Sentence Segmentation in Different Datasets of Neuropsychological Language Tests in Brazilian PortugueseCasanova, Edresson / Treviso, Marcos / Hübner, Lilian / Aluísio, Sandra et al. | 2020
- 2615
-
Jejueo Datasets for Machine Translation and Speech SynthesisPark, Kyubyong / Choe, Yo Joong / Ham, Jiyeon et al. | 2020
- 2622
-
Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu LanguageMatsuura, Kohei / Ueno, Sei / Mimura, Masato / Sakai, Shinsuke / Kawahara, Tatsuya et al. | 2020
- 2629
-
Development of a Guarani - Spanish Parallel CorpusChiruzzo, Luis / Amarilla, Pedro / Ríos, Adolfo / Giménez Lugo, Gustavo et al. | 2020
- 2634
-
AR-ASAG An ARabic Dataset for Automatic Short Answer Grading EvaluationOuahrani, Leila / Bennouar, Djamal et al. | 2020
- 2644
-
Processing Language Resources of Under-Resourced and Endangered Languages for the Generation of Augmentative Alternative Communication BoardsFerger, Anne et al. | 2020
- 2649
-
The Nisvai Corpus of Oral Narrative Practices from Malekula (Vanuatu) and its Associated Language ResourcesAznar, Jocelyn / Gala, Núria et al. | 2020
- 2657
-
Building a Time-Aligned Cross-Linguistic Reference Corpus from Language Documentation Data (DoReCo)Paschen, Ludger / Delafontaine, François / Draxler, Christoph / Fuchs, Susanne / Stave, Matthew / Seifart, Frank et al. | 2020
- 2667
-
Benchmarking Neural and Statistical Machine Translation on Low-Resource African LanguagesDuh, Kevin / McNamee, Paul / Post, Matt / Thompson, Brian et al. | 2020
- 2676
-
Improved Finite-State Morphological Analysis for St. Lawrence Island Yupik Using Paradigm Function MorphologyChen, Emily / Park, Hyunji Hayley / Schwartz, Lane et al. | 2020
- 2685
-
Towards a Spell Checker for Zamboanga Chavacano OrthographyHimoro, Marcelo Yuji / Pareja-Lora, Antonio et al. | 2020
- 2698
-
Identifying Sentiments in Algerian Code-switched User-generated CommentsAdouane, Wafia / Touileb, Samia / Bernardy, Jean-Philippe et al. | 2020
- 2706
-
Automatic Creation of Text Corpora for Low-Resource Languages from the Internet: The Case of Swiss GermanLinder, Lucy / Jungo, Michael / Hennebert, Jean / Musat, Claudiu Cristian / Fischer, Andreas et al. | 2020
- 2712
-
Evaluating Sub-word Embeddings in Cross-lingual ModelsHakimi Parizi, Ali / Cook, Paul et al. | 2020
- 2720
-
A Swiss German Dictionary: Variation in Speech and WritingSchmidt, Larissa / Linder, Lucy / Djambazovska, Sandra / Lazaridis, Alexandros / Samardžić, Tanja / Musat, Claudiu et al. | 2020
- 2726
-
Towards a Corsican Basic Language Resource KitKevers, Laurent / Retali-Medori, Stella et al. | 2020
- 2736
-
Evaluating the Impact of Sub-word Information and Cross-lingual Word Embeddings on Mi'kmaq Language ModellingBoudreau, Jeremie / Patra, Akankshya / Suvarna, Ashima / Cook, Paul et al. | 2020
- 2746
-
Exploring a Choctaw Language Corpus with Word Vectors and Minimum Distance LengthBrixey, Jacqueline / Sides, David / Vizthum, Timothy / Traum, David / Iskarous, Khalil et al. | 2020
- 2754
-
Massive vs. Curated Embeddings for Low-Resourced Languages: the Case of Yorùbá and TwiAlabi, Jesujoba / Amponsah-Kaakyire, Kwabena / Adelani, David / España-Bonet, Cristina et al. | 2020
- 2763
-
TRopBank: Turkish PropBank V2.0Kara, Neslihan / Aslan, Deniz Baran / Marşan, Büşra / Bakay, Özge / Ak, Koray / Yıldız, Olcay Taner et al. | 2020
- 2773
-
Collection and Annotation of the Romanian Legal CorpusTufiș, Dan / Mitrofan, Maria / Păiș, Vasile / Ion, Radu / Coman, Andrei et al. | 2020
- 2778
-
An Empirical Evaluation of Annotation Practices in Corpora from Language Documentationvon Prince, Kilu / Nordhoff, Sebastian et al. | 2020
- 2788
-
Annotated Corpus for Sentiment Analysis in Odia LanguageMohanty, Gaurav / Mishra, Pruthwik / Mamidi, Radhika et al. | 2020
- 2796
-
Building a Task-oriented Dialog System for Languages with no Training Data: the Case for BasqueLópez de Lacalle, Maddalen / Saralegi, Xabier / San Vicente, Iñaki et al. | 2020
- 2803
-
SENCORPUS: A French-Wolof Parallel CorpusNguer, Elhadji Mamadou / Lo, Alla / Dione, Cheikh M. Bamba / Ba, Sileye O. / Lo, Moussa et al. | 2020
- 2812
-
A Major Wordnet for a Minority Language: Scottish GaelicBella, Gábor / McNeill, Fiona / Gorman, Rody / O Donnaile, Caoimhin / MacDonald, Kirsty / Chandrashekar, Yamini / Freihat, Abed Alhakim / Giunchiglia, Fausto et al. | 2020
- 2819
-
Crowdsourcing Speech Data for Low-Resource Languages from Low-Income WorkersAbraham, Basil / Goel, Danish / Siddarth, Divya / Bali, Kalika / Chopra, Manu / Choudhury, Monojit / Joshi, Pratik / Jyoti, Preethi / Sitaram, Sunayana / Seshadri, Vivek et al. | 2020
- 2827
-
A Resource for Studying Chatino Verbal MorphologyCruz, Hilaria / Anastasopoulos, Antonios / Stump, Gregory et al. | 2020
- 2832
-
Learnings from Technological Interventions in a Low Resource Language: A Case-Study on GondiMehta, Devansh / Santy, Sebastin / Mothilal, Ramaravind Kommiya / Srivastava, Brij Mohan Lal / Sharma, Alok / Shukla, Anurag / Prasad, Vishnu / U, Venkanna / Sharma, Amit / Bali, Kalika et al. | 2020
- 2839
-
Irony Detection in Persian Language: A Transfer Learning Approach Using Emoji PredictionGolazizian, Preni / Sabeti, Behnam / Ashrafi Asli, Seyed Arad / Majdabadi, Zahra / Momenzadeh, Omid / Fahmi, Reza et al. | 2020
- 2846
-
Towards Computational Resource Grammars for Runyankore and RukigaBamutura, David / Ljunglöf, Peter / Nebende, Peter et al. | 2020
- 2855
-
Optimizing Annotation Effort Using Active Learning Strategies: A Sentiment Analysis Case Study in PersianAshrafi Asli, Seyed Arad / Sabeti, Behnam / Majdabadi, Zahra / Golazizian, Preni / Fahmi, Reza / Momenzadeh, Omid et al. | 2020
- 2862
-
BanFakeNews: A Dataset for Detecting Fake News in BanglaHossain, Md Zobaer / Rahman, Md Ashraful / Islam, Md Saiful / Kar, Sudipta et al. | 2020
- 2872
-
A Resource for Computational Experiments on MapudungunDuan, Mingjun / Fasola, Carlos / Rallabandi, Sai Krishna / Vega, Rodolfo / Anastasopoulos, Antonios / Levin, Lori / Black, Alan W et al. | 2020
- 2878
-
Automated Parsing of Interlinear Glossed Text from Page Images of Grammatical DescriptionsRound, Erich / Ellison, Mark / Macklin-Cordes, Jayden / Beniamine, Sacha et al. | 2020
- 2884
-
The Johns Hopkins University Bible Corpus: 1600+ Tongues for Typological ExplorationMcCarthy, Arya D. / Wicks, Rachel / Lewis, Dylan / Mueller, Aaron / Wu, Winston / Adams, Oliver / Nicolai, Garrett / Post, Matt / Yarowsky, David et al. | 2020
- 2893
-
Towards Building an Automatic Transcription System for Language Documentation: Experiences from MuyuZahrer, Alexander / Zgank, Andrej / Schuppler, Barbara et al. | 2020
- 2901
-
Towards Flexible Cross-Resource Exploitation of Heterogeneous Language Documentation DataJettka, Daniel / Lehmberg, Timm et al. | 2020
- 2906
-
CantoMap: a Hong Kong Cantonese MapTask CorpusWinterstein, Grégoire / Tang, Carmen / Lai, Regine et al. | 2020
- 2914
-
No Data to Crawl? Monolingual Corpus Creation from PDF Files of Truly low-Resource Languages in PeruBustamante, Gina / Oncevay, Arturo / Zariquiey, Roberto et al. | 2020
- 2924
-
Creating a Parallel Icelandic Dependency Treebank from Raw Text to Universal DependenciesJónsdóttir, Hildur / Ingason, Anton Karl et al. | 2020
- 2932
-
Building a Universal Dependencies Treebank for OccitanMiletic, Aleksandra / Bras, Myriam / Vergez-Couret, Marianne / Esher, Louise / Poujade, Clamença / Sibille, Jean et al. | 2020
- 2940
-
Building the Old Javanese WordnetMoeljadi, David / Aminullah, Zakariya Pamuji et al. | 2020
- 2947
-
CPLM, a Parallel Corpus for Mexican Languages: Development and InterfaceSierra Martínez, Gerardo / Montaño, Cynthia / Bel-Enguix, Gemma / Córdova, Diego / Mota Montoya, Margarita et al. | 2020
- 2953
-
SiNER: A Large Dataset for Sindhi Named Entity RecognitionAli, Wazir / Lu, Junyu / Xu, Zenglin et al. | 2020
- 2962
-
Construct a Sense-Frame Aligned Predicate Lexicon for Chinese AMR CorpusSong, Li / Dai, Yuling / Liu, Yihuan / Li, Bin / Qu, Weiguang et al. | 2020
- 2970
-
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel CorporaHan, Lifeng / Jones, Gareth / Smeaton, Alan et al. | 2020
- 2980
-
A Myanmar (Burmese)-English Named Entity Transliteration Dictionary
Myat Mon, Aye / Ding, Chenchen / Kaing, Hour / Mar Soe, Khin / Utiyama, Masao / Sumita, Eiichiro et al. | 2020
- 2984
-
CA-EHN: Commonsense Analogy from E-HowNet
Li, Peng-Hsuan / Yang, Tsan-Yu / Ma, Wei-Yun et al. | 2020
- 2991
-
Building Semantic Grams of Human Knowledge
Leone, Valentina / Siragusa, Giovanni / Di Caro, Luigi / Navigli, Roberto et al. | 2020
- 3001
-
Automatically Building a Multilingual Lexicon of False Friends With No Supervision
Uban, Ana Sabina / Dinu, Liviu P. et al. | 2020
- 3008
-
A Parallel WordNet for English, Swedish and Bulgarian
Angelov, Krasimir et al. | 2020
- 3016
-
ENGLAWI: From Human- to Machine-Readable Wiktionary
Sajous, Franck / Calderone, Basilio / Hathout, Nabil et al. | 2020
- 3027
-
Opening the Romance Verbal Inflection Dataset 2.0: A CLDF lexicon
Beniamine, Sacha / Maiden, Martin / Round, Erich et al. | 2020
- 3036
-
word2word: A Collection of Bilingual Lexicons for 3,564 Language Pairs
Choe, Yo Joong / Park, Kyubyong / Kim, Dongwoo et al. | 2020
- 3046
-
Introducing Lexical Masks: a New Representation of Lexical Entries for Better Evaluation and Exchange of Lexicons
Cartoni, Bruno / Calvelo Aros, Daniel / Vrandecic, Denny / Lertpradit, Saran et al. | 2020
- 3053
-
A Large-Scale Leveled Readability Lexicon for Standard Arabic
Al Khalil, Muhamed / Habash, Nizar / Jiang, Zhengyang et al. | 2020
- 3063
-
Preserving Semantic Information from Old Dictionaries: Linking Senses of the 'Altfranzösisches Wörterbuch' to WordNet
Stein, Achim et al. | 2020
- 3069
-
Cifu: a Frequency Lexicon of Hong Kong Cantonese
Lai, Regine / Winterstein, Grégoire et al. | 2020
- 3078
-
Odi et Amo. Creating, Evaluating and Extending Sentiment Lexicons for Latin.
Sprugnoli, Rachele / Passarotti, Marco / Corbetta, Daniela / Peverelli, Andrea et al. | 2020
- 3087
-
WordWars: A Dataset to Examine the Natural Selection of Words
Mohammad, Saif M. et al. | 2020
- 3096
-
Challenge Dataset of Cognates and False Friend Pairs from Indian Languages
Kanojia, Diptesh / Kulkarni, Malhar / Bhattacharyya, Pushpak / Haffari, Gholamreza et al. | 2020
- 3103
-
Development of a Japanese Personality Dictionary based on Psychological Methods
Iwai, Ritsuko / Kawahara, Daisuke / Kumada, Takatsune / Kurohashi, Sadao et al. | 2020
- 3109
-
A Lexicon-Based Approach for Detecting Hedges in Informal Text
Islam, Jumayel / Xiao, Lu / Mercer, Robert E. et al. | 2020
- 3114
-
Word Complexity Estimation for Japanese Lexical Simplification
Nishihara, Daiki / Kajiwara, Tomoyuki et al. | 2020
- 3121
-
Inducing Universal Semantic Tag Vectors
Huo, Da / de Melo, Gerard et al. | 2020
- 3128
-
LexiDB: Patterns & Methods for Corpus Linguistic Database Management
Coole, Matthew / Rayson, Paul / Mariani, John et al. | 2020
- 3136
-
Towards a Semi-Automatic Detection of Reflexive and Reciprocal Constructions and Their Representation in a Valency Lexicon
Kettnerová, Václava / Lopatkova, Marketa / Vernerová, Anna / Barancikova, Petra et al. | 2020
- 3145
-
Languages Resources for Poorly Endowed Languages : The Case Study of Classical Armenian
Vidal-Gorène, Chahan / Decours-Perez, Aliénor et al. | 2020
- 3153
-
Constructing Web-Accessible Semantic Role Labels and Frames for Japanese as Additions to the NPCMJ Parsed Corpus
Takeuchi, Koichi / Butler, Alastair / Nagasaki, Iku / Okamura, Takuya / Pardeshi, Prashant et al. | 2020
- 3162
-
Large-scale Cross-lingual Language Resources for Referencing and Framing
Vossen, Piek / Ilievski, Filip / Postma, Marten / Fokkens, Antske / Minnema, Gosse / Remijnse, Levi et al. | 2020
- 3172
-
Modelling Etymology in LMF/TEI: The Grande Dicionário Houaiss da Língua Portuguesa Dictionary as a Use Case
Khan, Fahad / Romary, Laurent / Salgado, Ana / Bowers, Jack / Khemakhem, Mohamed / Tasovac, Toma et al. | 2020
- 3181
-
Linking the TUFS Basic Vocabulary to the Open Multilingual Wordnet
Bond, Francis / Nomoto, Hiroki / Morgado da Costa, Luís / Bond, Arthur et al. | 2020
- 3189
-
Some Issues with Building a Multilingual Wordnet
Bond, Francis / Morgado da Costa, Luis / Goodman, Michael Wayne / McCrae, John Philip / Lohk, Ahti et al. | 2020
- 3198
-
Collocations in Russian Lexicography and Russian Collocations Database
Khokhlova, Maria et al. | 2020
- 3207
-
Methodological Aspects of Developing and Managing an Etymological Lexical Resource: Introducing EtymDB-2.0
Fourrier, Clémentine / Sagot, Benoît et al. | 2020
- 3217
-
OFrLex: A Computational Morphological and Syntactic Lexicon for Old French
Guibon, Gaël / Sagot, Benoît et al. | 2020
- 3226
-
Automatic Reconstruction of Missing Romanian Cognates and Unattested Latin Words
Ciobanu, Alina Maria / Dinu, Liviu P. / Zoicas, Laurentiu et al. | 2020
- 3232
-
A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment
Ahmadi, Sina / McCrae, John Philip / Nimb, Sanni / Khan, Fahad / Monachini, Monica / Pedersen, Bolette / Declerck, Thierry / Wissik, Tanja / Bellandi, Andrea / Pisani, Irene et al. | 2020
- 3243
-
A Broad-Coverage Deep Semantic Lexicon for Verbs
Allen, James / An, Hannah / Bose, Ritwik / de Beaumont, Will / Teng, Choh Man et al. | 2020
- 3252
-
Computational Etymology and Word Emergence
Wu, Winston / Yarowsky, David et al. | 2020
- 3260
-
A Dataset of Translational Equivalents Built on the Basis of plWordNet-Princeton WordNet Synset Mapping
Rudnicka, Ewa / Naskręt, Tomasz et al. | 2020
- 3265
-
TRANSLIT: A Large-scale Name Transliteration Resource
Benites, Fernando / Duivesteijn, Gilbert François / von Däniken, Pius / Cieliebak, Mark et al. | 2020
- 3272
-
Computing with Subjectivity Lexicons
L. M. Jeronimo, Caio / E. C. Campelo, Claudio / Balby Marinho, Leandro / Sales, Allan / Veloso, Adriano / Viola, Roberta et al. | 2020
- 3281
-
The ACoLi Dictionary Graph
Chiarcos, Christian / Fäth, Christian / Ionov, Maxim et al. | 2020
- 3291
-
Resources in Underrepresented Languages: Building a Representative Romanian Corpus
Midrigan-Ciochina, Ludmila / Boyd, Victoria / Sanchez-Ortega, Lucila / Malancea-Malac, Diana / Midrigan, Doina / Corina, David P. et al. | 2020
- 3297
-
World Class Language Technology - Developing a Language Technology Strategy for Danish
Kirchmeier, Sabine / Pedersen, Bolette / Nimb, Sanni / Diderichsen, Philip / Henrichsen, Peter Juel et al. | 2020
- 3302
-
A Corpus for Automatic Readability Assessment and Text Simplification of German
Battisti, Alessia / Pfütze, Dominik / Säuberli, Andreas / Kostrzewa, Marek / Ebling, Sarah et al. | 2020
- 3312
-
The CLARIN Knowledge Centre for Atypical Communication Expertise
van den Heuvel, Henk / Oostdijk, Nelleke / Rowland, Caroline / Trilsbeek, Paul et al. | 2020
- 3317
-
Corpora of Disordered Speech in the Light of the GDPR: Two Use Cases from the DELAD Initiative
van den Heuvel, Henk / Kelli, Aleksei / Klessa, Katarzyna / Salaasti, Satu et al. | 2020
- 3322
-
The European Language Technology Landscape in 2020: Language-Centric and Human-Centric AI for Cross-Cultural Communication in Multilingual Europe
Rehm, Georg / Marheinecke, Katrin / Hegele, Stefanie / Piperidis, Stelios / Bontcheva, Kalina / Hajic, Jan / Choukri, Khalid / Vasiļjevs, Andrejs / Backfried, Gerhard / Prinz, Christoph et al. | 2020
- 3333
-
A Framework for Shared Agreement of Language Tags beyond ISO 639
Gillis-Webber, Frances / Tittel, Sabine et al. | 2020
- 3340
-
Gigafida 2.0: The Reference Corpus of Written Standard Slovene
Krek, Simon / Arhar Holdt, Špela / Erjavec, Tomaž / Čibej, Jaka / Repar, Andraz / Gantar, Polona / Ljubešić, Nikola / Kosem, Iztok / Dobrovoljc, Kaja et al. | 2020
- 3346
-
Corpus Query Lingua Franca part II: Ontology
Evert, Stefan / Harlamov, Oleg / Heinrich, Philipp / Banski, Piotr et al. | 2020
- 3353
-
A CLARIN Transcription Portal for Interview Data
Draxler, Christoph / van den Heuvel, Henk / van Hessen, Arjan / Calamai, Silvia / Corti, Louise et al. | 2020
- 3360
-
Ellogon Casual Annotation Infrastructure
Petasis, Georgios / Tsekouras, Leonidas et al. | 2020
- 3366
-
European Language Grid: An Overview
Rehm, Georg / Berger, Maria / Elsholz, Ela / Hegele, Stefanie / Kintzel, Florian / Marheinecke, Katrin / Piperidis, Stelios / Deligiannis, Miltos / Galanis, Dimitris / Gkirtzou, Katerina et al. | 2020
- 3381
-
The Competitiveness Analysis of the European Language Technology Market
Vasiļjevs, Andrejs / Skadina, Inguna / Samite, Indra / Kauliņš, Kaspars / Ajausks, Ēriks / Meļņika, Jūlija / Bērziņš, Aivars et al. | 2020
- 3390
-
Constructing a Bilingual Hadith Corpus Using a Segmentation Tool
Altammami, Shatha / Atwell, Eric / Alsalka, Ammar et al. | 2020
- 3399
-
Facilitating Corpus Usage: Making Icelandic Corpora More Accessible for Researchers and Language Users
Steingrímsson, Steinþór / Barkarson, Starkaður / Örnólfsson, Gunnar Thor et al. | 2020
- 3406
-
Interoperability in an Infrastructure Enabling Multidisciplinary Research: The case of CLARIN
de Jong, Franciska / Maegaard, Bente / Fišer, Darja / van Uytvanck, Dieter / Witt, Andreas et al. | 2020
- 3414
-
Language Technology Programme for Icelandic 2019-2023
Nikulásdóttir, Anna / Guðnason, Jón / Ingason, Anton Karl / Loftsson, Hrafn / Rögnvaldsson, Eiríkur / Sigurðsson, Einar Freyr / Steingrímsson, Steinþór et al. | 2020
- 3423
-
Privacy by Design and Language Resources
Kamocki, Pawel / Witt, Andreas et al. | 2020
- 3428
-
Making Metadata Fit for Next Generation Language Technology Platforms: The Metadata Schema of the European Language Grid
Labropoulou, Penny / Gkirtzou, Katerina / Gavriilidou, Maria / Deligiannis, Miltos / Galanis, Dimitris / Piperidis, Stelios / Rehm, Georg / Berger, Maria / Mapelli, Valérie / Rigault, Michael et al. | 2020
- 3438
-
Related Works in the Linguistic Data Consortium Catalog
Jaquette, Daniel / Cieri, Christopher / DiPersio, Denise et al. | 2020
- 3443
-
Language Data Sharing in European Public Services - Overcoming Obstacles and Creating Sustainable Data Sharing Infrastructures
Smal, Lilli / Lösch, Andrea / van Genabith, Josef / Giagkou, Maria / Declerck, Thierry / Busemann, Stephan et al. | 2020
- 3449
-
A Progress Report on Activities at the Linguistic Data Consortium Benefitting the LREC Community
Cieri, Christopher / Fiumara, James / Strassel, Stephanie / Wright, Jonathan / DiPersio, Denise / Liberman, Mark et al. | 2020
- 3457
-
Digital Language Infrastructures - Documenting Language Actors
Lyding, Verena / König, Alexander / Pretti, Monica et al. | 2020
- 3463
-
Samrómur: Crowd-sourcing Data Collection for Icelandic Speech Recognition
Mollberg, David Erik / Jónsson, Ólafur Helgi / Þorsteinsdóttir, Sunneva / Steingrímsson, Steinþór / Magnúsdóttir, Eydís Huld / Gudnason, Jon et al. | 2020
- 3468
-
Semi-supervised Development of ASR Systems for Multilingual Code-switched Speech in Under-resourced Languages
Biswas, Astik / Yilmaz, Emre / De Wet, Febe / van der Westhuizen, Ewald / Niesler, Thomas et al. | 2020
- 3475
-
CLFD: A Novel Vectorization Technique and Its Application in Fake News Detection
Mersinias, Michail / Afantenos, Stergos / Chalkiadakis, Georgios et al. | 2020
- 3484
-
SimplifyUR: Unsupervised Lexical Text Simplification for Urdu
Qasmi, Namoos Hayat / Zia, Haris Bin / Athar, Awais / Raza, Agha Ali et al. | 2020
- 3490
-
Jamo Pair Encoding: Subcharacter Representation-based Extreme Korean Vocabulary Compression for Efficient Subword Tokenization
Moon, Sangwhan / Okazaki, Naoaki et al. | 2020
- 3498
-
Offensive Language and Hate Speech Detection for Danish
Sigurbergsson, Gudbjartur Ingi / Derczynski, Leon et al. | 2020
- 3509
-
Semi-supervised Deep Embedded Clustering with Anomaly Detection for Semantic Frame Induction
Yong, Zheng Xin / Timponi Torrent, Tiago et al. | 2020
- 3520
-
Search Query Language Identification Using Weak Labeling
Tambi, Ritiz / Kale, Ajinkya / King, Tracy Holloway et al. | 2020
- 3528
-
Automated Phonological Transcription of Akkadian Cuneiform Text
Sahala, Aleksi / Silfverberg, Miikka / Arppe, Antti / Lindén, Krister et al. | 2020
- 3535
-
COSTRA 1.0: A Dataset of Complex Sentence Transformations
Barancikova, Petra / Bojar, Ondřej et al. | 2020
- 3542
-
Automatic In-the-wild Dataset Annotation with Deep Generalized Multiple Instance Learning
Correia, Joana / Trancoso, Isabel / Raj, Bhiksha et al. | 2020
- 3551
-
How Much Data Do You Need? About the Creation of a Ground Truth for Black Letter and the Effectiveness of Neural OCR
Ströbel, Phillip Benjamin / Clematide, Simon / Volk, Martin et al. | 2020
- 3560
-
Dirichlet-Smoothed Word Embeddings for Low-Resource Settings
Jungmaier, Jakob / Kassner, Nora / Roth, Benjamin et al. | 2020
- 3566
-
On The Performance of Time-Pooling Strategies for End-to-End Spoken Language Identification
Monteiro, Joao / Alam, Md Jahangir / Falk, Tiago et al. | 2020
- 3573
-
Neural Disambiguation of Lemma and Part of Speech in Morphologically Rich Languages
Hoya Quecedo, José María / Koppatz, Maximilian / Yangarber, Roman et al. | 2020
- 3583
-
Non-Linearity in Mapping Based Cross-Lingual Word Embeddings
Zhao, Jiawei / Gilman, Andrew et al. | 2020
- 3590
-
LibriVoxDeEn: A Corpus for German-to-English Speech Translation and German Speech Recognition
Beilharz, Benjamin / Sun, Xin / Karimova, Sariya / Riezler, Stefan et al. | 2020
- 3595
-
SEDAR: a Large Scale French-English Financial Domain Parallel Corpus
Ghaddar, Abbas / Langlais, Phillippe et al. | 2020
- 3603
-
JParaCrawl: A Large Scale Web-Based English-Japanese Parallel Corpus
Morishita, Makoto / Suzuki, Jun / Nagata, Masaaki et al. | 2020
- 3610
-
Neural Machine Translation for Low-Resourced Indian Languages
Choudhary, Himanshu / Rao, Shivansh / Rohilla, Rajesh et al. | 2020
- 3616
-
Content-Equivalent Translated Parallel News Corpus and Extension of Domain Adaptation for NMT
Mino, Hideya / Tanaka, Hideki / Ito, Hitoshi / Goto, Isao / Yamada, Ichiro / Tokunaga, Takenobu et al. | 2020
- 3623
-
NMT and PBSMT Error Analyses in English to Brazilian Portuguese Automatic Translations
Caseli, Helena / Inácio, Marcio et al. | 2020
- 3630
-
Evaluation Dataset for Zero Pronoun in Japanese to English Translation
Shimazu, Sho / Takase, Sho / Nakazawa, Toshiaki / Okazaki, Naoaki et al. | 2020
- 3635
-
Better Together: Modern Methods Plus Traditional Thinking in NP Alignment
Kovács, Ádám / Ács, Judit / Kornai, Andras / Recski, Gábor et al. | 2020
- 3640
-
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
Song, Haiyue / Dabre, Raj / Fujita, Atsushi / Kurohashi, Sadao et al. | 2020
- 3650
-
Being Generous with Sub-Words towards Small NMT Children
Defauw, Arne / Vanallemeersch, Tom / Van Winckel, Koen / Szoc, Sara / Van den Bogaert, Joachim et al. | 2020
- 3657
-
Document Sub-structure in Neural Machine Translation
Dobreva, Radina / Zhou, Jie / Bawden, Rachel et al. | 2020
- 3668
-
An Evaluation Benchmark for Testing the Word Sense Disambiguation Capabilities of Machine Translation Systems
Raganato, Alessandro / Scherrer, Yves / Tiedemann, Jörg et al. | 2020
- 3676
-
MEDLINE as a Parallel Corpus: a Survey to Gain Insight on French-, Spanish- and Portuguese-speaking Authors’ Abstract Writing Practice
Névéol, Aurélie / Jimeno Yepes, Antonio / Neves, Mariana et al. | 2020
- 3683
-
JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation
Mao, Zhuoyuan / Cromieres, Fabien / Dabre, Raj / Song, Haiyue / Kurohashi, Sadao et al. | 2020
- 3692
-
A Post-Editing Dataset in the Legal Domain: Do we Underestimate Neural Machine Translation Quality?
Ive, Julia / Specia, Lucia / Szoc, Sara / Vanallemeersch, Tom / Van den Bogaert, Joachim / Farah, Eduardo / Maroti, Christine / Ventura, Artur / Khalilov, Maxim et al. | 2020
- 3698
-
Linguistically Informed Hindi-English Neural Machine Translation
Goyal, Vikrant / Mishra, Pruthwik / Sharma, Dipti Misra et al. | 2020
- 3704
-
A Test Set for Discourse Translation from Japanese to English
Nagata, Masaaki / Morishita, Makoto et al. | 2020
- 3710
-
An Analysis of Massively Multilingual Neural Machine Translation for Low-Resource Languages
Mueller, Aaron / Nicolai, Garrett / McCarthy, Arya D. / Lewis, Dylan / Wu, Winston / Yarowsky, David et al. | 2020
- 3719
-
TDDC: Timely Disclosure Documents Corpus
Doi, Nobushige / Oda, Yusuke / Nakazawa, Toshiaki et al. | 2020
- 3727
-
MuST-Cinema: a Speech-to-Subtitles corpus
Karakanta, Alina / Negri, Matteo / Turchi, Marco et al. | 2020
- 3735
-
On Context Span Needed for Machine Translation Evaluation
Castilho, Sheila / Popović, Maja / Way, Andy et al. | 2020
- 3743
-
A Multilingual Parallel Corpora Collection Effort for Indian Languages
Siripragrada, Shashank / Philip, Jerin / Namboodiri, Vinay P. / Jawahar, C V et al. | 2020
- 3752
-
To Case or not to case: Evaluating Casing Methods for Neural Machine Translation
Etchegoyhen, Thierry / Gete, Harritxu et al. | 2020
- 3761
-
The MARCELL Legislative Corpus
Váradi, Tamás / Koeva, Svetla / Yamalov, Martin / Tadić, Marko / Sass, Bálint / Nitoń, Bartłomiej / Ogrodniczuk, Maciej / Pęzik, Piotr / Barbu Mititelu, Verginica / Ion, Radu et al. | 2020
- 3769
-
ParaPat: The Multi-Million Sentences Parallel Corpus of Patents Abstracts
Soares, Felipe / Stevenson, Mark / Bartolome, Diego / Zaretskaya, Anna et al. | 2020
- 3775
-
Corpora for Document-Level Neural Machine Translation
Liu, Siyou / Zhang, Xiaojun et al. | 2020
- 3782
-
OpusTools and Parallel Corpus Diagnostics
Aulamo, Mikko / Sulubacak, Umut / Virpioja, Sami / Tiedemann, Jörg et al. | 2020
- 3790
-
Literary Machine Translation under the Magnifying Glass: Assessing the Quality of an NMT-Translated Detective Novel on Document Level
Fonteyne, Margot / Tezcan, Arda / Macken, Lieve et al. | 2020
- 3799
-
Handle with Care: A Case Study in Comparable Corpora Exploitation for Neural Machine Translation
Etchegoyhen, Thierry / Gete, Harritxu et al. | 2020
- 3808
-
The FISKMÖ Project: Resources and Tools for Finnish-Swedish Machine Translation and Cross-Linguistic Research
Tiedemann, Jörg / Nieminen, Tommi / Aulamo, Mikko / Kanerva, Jenna / Leino, Akseli / Ginter, Filip / Papula, Niko et al. | 2020
- 3816
-
Multiword Expression aware Neural Machine Translation
Zaninello, Andrea / Birch, Alexandra et al. | 2020
- 3826
-
An Enhanced Mapping Scheme of the Universal Part-Of-Speech for Korean
Kim, Myung Hee / Colineau, Nathalie et al. | 2020
- 3834
-
Finite State Machine Pattern-Root Arabic Morphological Generator, Analyzer and Diacritizer
Alkhairy, Maha / Jafri, Afshan / Smith, David et al. | 2020
- 3842
-
An Unsupervised Method for Weighting Finite-state Morphological Analyzers
Keleg, Amr / Tyers, Francis / Howell, Nick / Pirinen, Tommi et al. | 2020
- 3851
-
Language-Independent Tokenisation Rivals Language-Specific Tokenisation for Word Similarity Prediction
Bollegala, Danushka / Kiryo, Ryuichi / Tsujino, Kosuke / Yukawa, Haruki et al. | 2020
- 3861
-
A Supervised Part-Of-Speech Tagger for the Greek Language of the Social Web
Nikiforos, Maria Nefeli / Kermanidis, Katia Lida et al. | 2020
- 3868
-
Bag & Tag'em - A New Dutch Stemmer
Jonker, Anne / de Ruijt, Corné / de Gruijl, Jornt et al. | 2020
- 3877
-
Glawinette: a Linguistically Motivated Derivational Description of French Acquired from GLAWI
Hathout, Nabil / Sajous, Franck / Calderone, Basilio / Namer, Fiammetta et al. | 2020
- 3886
-
BabyFST - Towards a Finite-State Based Computational Model of Ancient Babylonian
Sahala, Aleksi / Silfverberg, Miikka / Arppe, Antti / Lindén, Krister et al. | 2020
- 3895
-
Morphological Analysis and Disambiguation for Gulf Arabic: The Interplay between Resources and Methods
Khalifa, Salam / Zalmout, Nasser / Habash, Nizar et al. | 2020
- 3905
-
Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus
Metheniti, Eleni / Neumann, Guenter et al. | 2020
- 3913
-
Introducing a Large-Scale Dataset for Vietnamese POS Tagging on Conversational Texts
Tran, Oanh / Pham, Tu / Dang, Vu / Nguyen, Bang et al. | 2020
- 3922
-
UniMorph 3.0: Universal Morphology
McCarthy, Arya D. / Kirov, Christo / Grella, Matteo / Nidhi, Amrit / Xia, Patrick / Gorman, Kyle / Vylomova, Ekaterina / Mielke, Sabrina J. / Nicolai, Garrett / Silfverberg, Miikka et al. | 2020
- 3932
-
Building the Spanish-Croatian Parallel Corpus
Mikelenić, Bojana / Tadić, Marko et al. | 2020
- 3937
-
DerivBase.Ru: a Derivational Morphology Resource for Russian
Vodolazsky, Daniil et al. | 2020
- 3944
-
Morfessor EM+Prune: Improved Subword Segmentation with Expectation Maximization and Pruning
Grönroos, Stig-Arne / Virpioja, Sami / Kurimo, Mikko et al. | 2020
- 3954
-
Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian
Stankovic, Ranka / Šandrih, Branislava / Krstev, Cvetana / Utvić, Miloš / Skoric, Mihailo et al. | 2020
- 3963
-
Fine-grained Morphosyntactic Analysis and Generation Tools for More Than One Thousand Languages
Nicolai, Garrett / Lewis, Dylan / McCarthy, Arya D. / Mueller, Aaron / Wu, Winston / Yarowsky, David et al. | 2020
- 3973
-
Cairo Student Code-Switch (CSCS) Corpus: An Annotated Egyptian Arabic-English Corpus
Balabel, Mohamed / Hamed, Injy / Abdennadher, Slim / Vu, Ngoc Thang / Çetinoğlu, Özlem et al. | 2020
- 3978
-
Getting More Data for Low-resource Morphological Inflection: Language Models and Data Augmentation
Sorokin, Alexey et al. | 2020
- 3984
-
Visual Modeling of Turkish Morphology
Özenç, Berke / Solak, Ercan et al. | 2020
- 3991
-
Kvistur 2.0: a BiLSTM Compound Splitter for Icelandic
Daðason, Jón / Mollberg, David / Loftsson, Hrafn / Bjarnadóttir, Kristín et al. | 2020
- 3996
-
Morphological Segmentation for Low Resource Languages
Mott, Justin / Bies, Ann / Strassel, Stephanie / Kodner, Jordan / Richter, Caitlin / Xu, Hongzhi / Marcus, Mitchell et al. | 2020
- 4003
-
CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data
Wenzek, Guillaume / Lachaux, Marie-Anne / Conneau, Alexis / Chaudhary, Vishrav / Guzmán, Francisco / Joulin, Armand / Grave, Edouard et al. | 2020
- 4013
-
On the Robustness of Unsupervised and Semi-supervised Cross-lingual Word Embedding Learning
Doval, Yerai / Camacho-Collados, Jose / Espinosa Anke, Luis / Schockaert, Steven et al. | 2020
- 4024
-
Building an English-Chinese Parallel Corpus Annotated with Sub-sentential Translation Techniques
Zhai, Yuming / Liu, Lufei / Zhong, Xinyi / Illouz, Gabriel / Vilnat, Anne et al. | 2020
- 4034
-
Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection
Nivre, Joakim / de Marneffe, Marie-Catherine / Ginter, Filip / Hajic, Jan / Manning, Christopher D. / Pyysalo, Sampo / Schuster, Sebastian / Tyers, Francis / Zeman, Daniel et al. | 2020
- 4044
-
EMPAC: an English-Spanish Corpus of Institutional Subtitles
Serrat Roozen, Iris / Martínez Martínez, José Manuel et al. | 2020
- 4054
-
Cross-Lingual Word Embeddings for Turkic Languages
Kuriyozov, Elmurod / Doval, Yerai / Gómez-Rodríguez, Carlos et al. | 2020
- 4063
-
How Universal are Universal Dependencies? Exploiting Syntax for Multilingual Clause-level Sentiment Detection
Kanayama, Hiroshi / Iwamoto, Ran et al. | 2020