A Major Wordnet for a Minority Language: Scottish Gaelic (English)
Free access
- New search for: Bella, Gábor
- New search for: McNeill, Fiona
- New search for: Gorman, Rody
- New search for: O Donnaile, Caoimhin
- New search for: MacDonald, Kirsty
- New search for: Chandrashekar, Yamini
- New search for: Freihat, Abed Alhakim
- New search for: Giunchiglia, Fausto
- New search for: Bella, Gábor
- New search for: McNeill, Fiona
- New search for: Gorman, Rody
- New search for: O Donnaile, Caoimhin
- New search for: MacDonald, Kirsty
- New search for: Chandrashekar, Yamini
- New search for: Freihat, Abed Alhakim
- New search for: Giunchiglia, Fausto
In:
LREC 2020 Marseille
; 2812-2818
;
2020
- Conference paper / Electronic Resource
-
Title:A Major Wordnet for a Minority Language: Scottish Gaelic
-
Contributors:Bella, Gábor ( author ) / McNeill, Fiona ( author ) / Gorman, Rody ( author ) / O Donnaile, Caoimhin ( author ) / MacDonald, Kirsty ( author ) / Chandrashekar, Yamini ( author ) / Freihat, Abed Alhakim ( author ) / Giunchiglia, Fausto ( author )
-
Conference:International Conference on Language Resources and Evaluation ; 12. ; 2020 ; Marseille
-
Published in:LREC 2020 Marseille ; 2812-2818
-
Publisher:
- New search for: The European Language Resources Association (ELRA)
-
Place of publication:Paris
-
Publication date:2020
-
Type of media:Conference paper
-
Type of material:Electronic Resource
-
Language:English
- New search for: 17.46 / 18.00 / 54.75
- Further information on Basic classification
-
Classification:
-
Source:
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- 1
-
Neural Mention DetectionYu, Juntao / Bohnet, Bernd / Poesio, Massimo et al. | 2020
- 11
-
A Cluster Ranking Model for Full Anaphora ResolutionYu, Juntao / Uma, Alexandra / Poesio, Massimo et al. | 2020
- 21
-
Mandarinograd: A Chinese Collection of Winograd SchemasBernard, Timothée / Han, Ting et al. | 2020
- 27
-
On the Influence of Coreference Resolution on Word Embeddings in Lexical-semantic Evaluation TasksHenlein, Alexander / Mehler, Alexander et al. | 2020
- 34
-
NoEl: An Annotated Corpus for Noun Ellipsis in EnglishKhullar, Payal / Majmundar, Kushal / Shrivastava, Manish et al. | 2020
- 44
-
An Annotated Dataset of Coreference in English LiteratureBamman, David / Lewke, Olivia / Mansoor, Anya et al. | 2020
- 55
-
GerDraCor-Coref: A Coreference Corpus for Dramatic Texts in GermanPagel, Janis / Reiter, Nils et al. | 2020
- 65
-
A Study on Entity Resolution for Email ConversationsDakle, Parag Pravin / Desai, Takshak / Moldovan, Dan et al. | 2020
- 74
-
Model-based Annotation of CoreferenceAralikatte, Rahul / Søgaard, Anders et al. | 2020
- 80
-
French Coreference for Spoken and Written LanguageWilkens, Rodrigo / Oberle, Bruno / Landragin, Frédéric / Todirascu, Amalia et al. | 2020
- 90
-
Cross-lingual Zero Pronoun ResolutionAloraini, Abdulrahman / Poesio, Massimo et al. | 2020
- 99
-
Exploiting Cross-Lingual Hints to Discover Event PronounsLoáiciga, Sharid / Hardmeier, Christian / Sayeed, Asad et al. | 2020
- 104
-
MuDoCo: Corpus for Multidomain Coreference Resolution and Referring Expression GenerationMartin, Scott / Poddar, Shivani / Upasani, Kartikeya et al. | 2020
- 112
-
Affection Driven Neural Networks for Sentiment AnalysisXiang, Rong / Long, Yunfei / Wan, Mingyu / Gu, Jinghang / Lu, Qin / Huang, Chu-Ren et al. | 2020
- 120
-
The Alice Datasets: fMRI & EEG Observations of Natural Language ComprehensionBhattasali, Shohini / Brennan, Jonathan / Luh, Wen-Ming / Franzluebbers, Berta / Hale, John et al. | 2020
- 126
-
Modelling Narrative Elements in a Short Story: A Study on Annotation Schemes and GuidelinesMikhalkova, Elena / Protasov, Timofei / Sokolova, Polina / Bashmakova, Anastasiia / Drozdova, Anastasiia et al. | 2020
- 133
-
Cortical Speech Databases For Deciphering the Articulatory CodeHöge, Harald et al. | 2020
- 138
-
ZuCo 2.0: A Dataset of Physiological Recordings During Natural Reading and AnnotationHollenstein, Nora / Troendle, Marius / Zhang, Ce / Langer, Nicolas et al. | 2020
- 147
-
Linguistic, Kinematic and Gaze Information in Task Descriptions: The LKG-CorpusReinboth, Tim / Gross, Stephanie / Bishop, Laura / Krenn, Brigitte et al. | 2020
- 156
-
The ACQDIV Corpus Database and Aggregation PipelineJancso, Anna / Moran, Steven / Stoll, Sabine et al. | 2020
- 166
-
Providing Semantic Knowledge to a Set of Pictograms for People with Disabilities: a Set of Links between WordNet and Arasaac: Arasaac-WNSchwab, Didier / Trial, Pauline / Vaschalde, Céline / Vial, Loïc / Esperanca-Rodier, Emmanuelle / Lecouteux, Benjamin et al. | 2020
- 172
-
Orthographic Codes and the Neighborhood Effect: Lessons from Information TheoryTulkens, Stéphan / Sandra, Dominiek / Daelemans, Walter et al. | 2020
- 182
-
Understanding the Dynamics of Second Language Writing through Keystroke Logging and Complexity ContoursKerz, Elma / Pruneri, Fabio / Wiechmann, Daniel / Qiao, Yu / Ströbel, Marcus et al. | 2020
- 189
-
Design of BCCWJ-EEG: Balanced Corpus with Human ElectroencephalographyOseki, Yohei / Asahara, Masayuki et al. | 2020
- 195
-
Using the RUPEX Multichannel Corpus in a Pilot fMRI Study on Speech DisfluenciesSmirnova, Katerina / Korotaev, Nikolay / Panikratova, Yana / Lebedeva, Irina / Pechenkova, Ekaterina / Fedorova, Olga et al. | 2020
- 204
-
Construction of an Evaluation Corpus for Grammatical Error Correction for Learners of Japanese as a Second LanguageKoyama, Aomi / Kiyuna, Tomoshige / Kobayashi, Kenji / Arai, Mio / Komachi, Mamoru et al. | 2020
- 212
-
Effective Crowdsourcing of Multiple Tasks for Comprehensive Knowledge ExtractionNam, Sangha / Lee, Minho / Kim, Donghwan / Han, Kijong / Kim, Kuntae / Yoon, Sooji / Kim, Eun-kyung / Choi, Key-Sun et al. | 2020
- 220
-
Developing a Corpus of Indirect Speech Act SchemasRoque, Antonio / Tsuetaki, Alexander / Sarathy, Vasanth / Scheutz, Matthias et al. | 2020
- 229
-
Quality Estimation for Partially Subjective Classification Tasks via CrowdsourcingSato, Yoshinao / Miyazawa, Kouki et al. | 2020
- 236
-
Crowdsourcing in the Development of a Multilingual FrameNet: A Case Study of Korean FrameNetHahm, Younggyun / Noh, Youngbin / Han, Ji Yoon / Oh, Tae Hwan / Choe, Hyonsu / Kim, Hansaem / Choi, Key-Sun et al. | 2020
- 245
-
Towards a Reliable and Robust Methodology for Crowd-Based Subjective Quality Assessment of Query-Based Extractive Text SummarizationIskender, Neslihan / Polzehl, Tim / Möller, Sebastian et al. | 2020
- 254
-
A Seed Corpus of Hindu Temples in IndiaRadhakrishnan, Priya et al. | 2020
- 259
-
Do You Believe It Happened? Assessing Chinese Readers' Veridicality JudgmentsChang, Yu-Yun / Hsieh, Shu-Kai et al. | 2020
- 268
-
Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language LearningNicolas, Lionel / Lyding, Verena / Borg, Claudia / Forascu, Corina / Fort, Karën / Zdravkova, Katerina / Kosem, Iztok / Čibej, Jaka / Arhar Holdt, Špela / Millour, Alice et al. | 2020
- 279
-
MAGPIE: A Large Corpus of Potentially Idiomatic ExpressionsHaagsma, Hessel / Bos, Johan / Nissim, Malvina et al. | 2020
- 288
-
CRWIZ: A Framework for Crowdsourcing Real-Time Wizard-of-Oz DialoguesChiyah Garcia, Francisco Javier / Lopes, José / Liu, Xingkun / Hastie, Helen et al. | 2020
- 298
-
Effort Estimation in Named Entity Tagging TasksGomes, Inês / Correia, Rui / Ribeiro, Jorge / Freitas, João et al. | 2020
- 307
-
Using Crowdsourced Exercises for Vocabulary Training to Expand ConceptNetRodosthenous, Christos / Lyding, Verena / Sangati, Federico / König, Alexander / ul Hassan, Umair / Nicolas, Lionel / Horbacauskiene, Jolita / Katinskaia, Anisia / Aparaschivei, Lavinia et al. | 2020
- 317
-
Predicting Multidimensional Subjective Ratings of Children’ Readings from the Speech Signals for the Automatic Assessment of FluencyBailly, Gérard / Godde, Erika / Piat-Marchand, Anne-Laure / Bosse, Marie-Line et al. | 2020
- 323
-
Constructing Multimodal Language Learner Texts Using LARA: Experiences with Nine LanguagesAkhlaghi, Elham / Bédi, Branislav / Bektaş, Fatih / Berthelsen, Harald / Butterweck, Matthias / Chua, Cathy / Cucchiarin, Catia / Eryiğit, Gülşen / Gerlach, Johanna / Habibi, Hanieh et al. | 2020
- 332
-
A Dataset for Investigating the Impact of Feedback on Student Revision OutcomePilan, Ildiko / Lee, John / Yeung, Chak Yan / Webster, Jonathan et al. | 2020
- 340
-
Creating Corpora for Research in Feedback Comment GenerationNagata, Ryo / Inui, Kentaro / Ishikawa, Shin'ichiro et al. | 2020
- 346
-
Using Multilingual Resources to Evaluate CEFRLex for Learner ApplicationsGraën, Johannes / Alfter, David / Schneider, Gerold et al. | 2020
- 356
-
Immersive Language Exploration with Object Recognition and Augmented RealityPlatte, Benny / Platte, Anett / Roschke, Christian / Thomanek, Rico / Rolletschke, Thony / Zimmer, Frank / Ritter, Marc et al. | 2020
- 363
-
A Process-oriented Dataset of Revisions during WritingConijn, Rianne / Dux Speltz, Emily / van Zaanen, Menno / Van Waes, Luuk / Chukharev-Hudilainen, Evgeny et al. | 2020
- 369
-
Automated Writing Support Using Deep Linguistic ParsersMorgado da Costa, Luís / V P Winder, Roger / Li, Shu Yun / Lin Tzer Liang, Benedict Christopher / Mackinnon, Joseph / Bond, Francis et al. | 2020
- 378
-
TLT-school: a Corpus of Non Native Children SpeechGretter, Roberto / Matassoni, Marco / Bannò, Stefano / Daniele, Falavigna et al. | 2020
- 386
-
Toward a Paradigm Shift in Collection of Learner CorporaKatinskaia, Anisia / Ivanova, Sardana / Yangarber, Roman et al. | 2020
- 392
-
Quality Focused Approach to a Learner Corpus DevelopmentDarģis, Roberts / Auziņa, Ilze / Levāne-Petrova, Kristīne / Kaija, Inga et al. | 2020
- 397
-
An Exploratory Study into Automated Précis GradingDe Clercq, Orphee / Van Hoecke, Senne et al. | 2020
- 405
-
Adjusting Image Attributes of Localized Regions with Low-level DialogueLin, Tzu-Hsiang / Rudnicky, Alexander / Bui, Trung / Kim, Doo Soon / Oh, Jean et al. | 2020
- 413
-
Alignment Annotation for Clinic Visit Dialogue to Clinical Note Sentence Language GenerationYim, Wen-wai / Yetisgen, Meliha / Huang, Jenny / Grossman, Micah et al. | 2020
- 422
-
MultiWOZ 2.1: A Consolidated Multi-Domain Dialogue Dataset with State Corrections and State Tracking BaselinesEric, Mihail / Goel, Rahul / Paul, Shachi / Sethi, Abhishek / Agarwal, Sanchit / Gao, Shuyang / Kumar, Adarsh / Goyal, Anuj / Ku, Peter / Hakkani-Tur, Dilek et al. | 2020
- 429
-
A Comparison of Explicit and Implicit Proactive Dialogue Strategies for Conversational RecommendationKraus, Matthias / Fischbach, Fabian / Jansen, Pascal / Minker, Wolfgang et al. | 2020
- 436
-
Conversational Question Answering in Low Resource Scenarios: A Dataset and Case Study for BasqueOtegi, Arantxa / Agirre, Aitor / Campos, Jon Ander / Soroa, Aitor / Agirre, Eneko et al. | 2020
- 443
-
Construction and Analysis of a Multimodal Chat-talk Corpus for Dialog Systems Considering Interpersonal ClosenessYamazaki, Yoshihiro / Chiba, Yuya / Nose, Takashi / Ito, Akinori et al. | 2020
- 449
-
BLISS: An Agent for Collecting Spoken Dialogue Data about Health and Well-beingvan Waterschoot, Jelte / Hendrickx, Iris / Khan, Arif / Klabbers, Esther / de Korte, Marcel / Strik, Helmer / Cucchiarini, Catia / Theune, Mariët et al. | 2020
- 459
-
The JDDC Corpus: A Large-Scale Multi-Turn Chinese Dialogue Dataset for E-commerce Customer ServiceChen, Meng / Liu, Ruixue / Shen, Lei / Yuan, Shaozu / Zhou, Jingyan / Wu, Youzheng / He, Xiaodong / Zhou, Bowen et al. | 2020
- 467
-
"Cheese!": a Corpus of Face-to-face French Interactions. A Case Study for Analyzing Smiling and Conversational HumorPriego-Valverde, Béatrice / Bigi, Brigitte / Amoyal, Mary et al. | 2020
- 476
-
The Margarita Dialogue Corpus: A Data Set for Time-Offset Interactions and Unstructured Dialogue SystemsChierici, Alberto / Habash, Nizar / Bicec, Margarita et al. | 2020
- 485
-
How Users React to Proactive Voice Assistant Behavior While DrivingSchmidt, Maria / Minker, Wolfgang / Werner, Steffen et al. | 2020
- 491
-
Emotional Speech Corpus for Persuasive Dialogue SystemAsai, Sara / Yoshino, Koichiro / Shinagawa, Seitaro / Sakti, Sakriani / Nakamura, Satoshi et al. | 2020
- 498
-
Multimodal Analysis of Cohesion in Multi-party InteractionsBangalore Kantharaju, Reshmashree / Langlet, Caroline / Barange, Mukesh / Clavel, Chloé / Pelachaud, Catherine et al. | 2020
- 508
-
Treating Dialogue Quality Evaluation as an Anomaly Detection ProblemNedelchev, Rostislav / Usbeck, Ricardo / Lehmann, Jens et al. | 2020
- 513
-
Evaluation of Argument Search Approaches in the Context of Argumentative Dialogue SystemsRach, Niklas / Matsuda, Yuki / Daxenberger, Johannes / Ultes, Stefan / Yasumoto, Keiichi / Minker, Wolfgang et al. | 2020
- 523
-
PATE: A Corpus of Temporal Expressions for the In-car Voice Assistant DomainZarcone, Alessandra / Alam, Touhidul / Kolagar, Zahra et al. | 2020
- 531
-
Mapping the Dialog Act Annotations of the LEGO Corpus into ISO 24617-2 Communicative FunctionsRibeiro, Eugénio / Ribeiro, Ricardo / Martins de Matos, David et al. | 2020
- 540
-
Estimating User Communication Styles for Spoken Dialogue SystemsMiehle, Juliana / Feustel, Isabel / Hornauer, Julia / Minker, Wolfgang / Ultes, Stefan et al. | 2020
- 549
-
The ISO Standard for Dialogue Act Annotation, Second EditionBunt, Harry / Petukhova, Volha / Gilmartin, Emer / Pelachaud, Catherine / Fang, Alex / Keizer, Simon / Prévot, Laurent et al. | 2020
- 559
-
The AICO Multimodal Corpus - Data Collection and Preliminary AnalysesJokinen, Kristiina et al. | 2020
- 565
-
A Corpus of Controlled Opinionated and Knowledgeable Movie Discussions for Training Neural Conversation ModelsGaletzka, Fabian / Eneh, Chukwuemeka Uchenna / Schlangen, David et al. | 2020
- 574
-
A French Medical Conversations Corpus Annotated for a Virtual Patient Dialogue SystemLaleye, Fréjus A. A. / de Chalendar, Gaël / Blanié, Antonia / Brouquet, Antoine / Behnamou, Dan et al. | 2020
- 581
-
Getting To Know You: User Attribute Extraction from DialoguesWu, Chien-Sheng / Madotto, Andrea / Lin, Zhaojiang / Xu, Peng / Fung, Pascale et al. | 2020
- 590
-
Augmenting Small Data to Classify Contextualized Dialogue Acts for Exploratory VisualizationKumar, Abhinav / Di Eugenio, Barbara / Aurisano, Jillian / Johnson, Andrew et al. | 2020
- 600
-
RDG-Map: A Multimodal Corpus of Pedagogical Human-Agent Spoken Interactions.Paetzel, Maike / Karkada, Deepthi / Manuvinakurike, Ramesh et al. | 2020
- 610
-
MPDD: A Multi-Party Dialogue Dataset for Analysis of Emotions and Interpersonal RelationshipsChen, Yi-Ting / Huang, Hen-Hsen / Chen, Hsin-Hsi et al. | 2020
- 615
-
"Alexa in the wild" - Collecting Unconstrained Conversations with a Modern Voice Assistant in a Public EnvironmentSiegert, Ingo et al. | 2020
- 620
-
EDA: Enriching Emotional Dialogue Acts using an Ensemble of Neural AnnotatorsBothe, Chandrakant / Weber, Cornelius / Magg, Sven / Wermter, Stefan et al. | 2020
- 628
-
PACO: a Corpus to Analyze the Impact of Common Ground in Spontaneous Face-to-Face InteractionAmoyal, Mary / Priego-Valverde, Béatrice / Rauzy, Stephane et al. | 2020
- 634
-
Dialogue Act Annotation in a Multimodal Corpus of First Encounter DialoguesNavarretta, Costanza / Paggio, Patrizia et al. | 2020
- 644
-
A Conversation-Analytic Annotation of Turn-Taking Behavior in Japanese Multi-Party Conversation and its Preliminary AnalysisEnomoto, Mika / Den, Yasuharu / Ishimoto, Yuichi et al. | 2020
- 653
-
Understanding User Utterances in a Dialog System for CaregivingAsao, Yoshihiko / Kloetzer, Julien / Mizuno, Junta / Saiki, Dai / Kadowaki, Kazuma / Torisawa, Kentaro et al. | 2020
- 662
-
Designing Multilingual Interactive Agents using Small Dialogue CorporaLin, Donghui / Otani, Masayuki / Okuno, Ryosuke / Ishida, Toru et al. | 2020
- 668
-
Multimodal Corpus of Bidirectional Conversation of Human-human and Human-robot Interaction during fMRI ScanningRauchbauer, Birgit / Hmamouche, Youssef / Bigi, Brigitte / Prévot, Laurent / Ochs, Magalie / Chaminade, Thierry et al. | 2020
- 676
-
The Brain-IHM Dataset: a New Resource for Studying the Brain Basis of Human-Human and Human-Machine ConversationsOchs, Magalie / Bertrand, Roxane / Goujon, Aurélie / Bolger, Deirdre / Dubarry, Anne-Sophie / Blache, Philippe et al. | 2020
- 684
-
Dialogue-AMR: Abstract Meaning Representation for DialogueBonial, Claire / Donatelli, Lucia / Abrams, Mitchell / Lukin, Stephanie M. / Tratz, Stephen / Marge, Matthew / Artstein, Ron / Traum, David / Voss, Clare et al. | 2020
- 696
-
Relation between Degree of Empathy for Narrative Speech and Type of Responsive Utterance in Attentive ListeningIto, Koichiro / Murata, Masaki / Ohno, Tomohiro / Matsubara, Shigeki et al. | 2020
- 702
-
Intent Recognition in Doctor-Patient InterviewsRojowiec, Robin / Roth, Benjamin / Fink, Maximilian et al. | 2020
- 710
-
BrainPredict: a Tool for Predicting and Visualising Local Brain ActivityHmamouche, Youssef / Prévot, Laurent / Ochs, Magalie / Chaminade, Thierry et al. | 2020
- 717
-
MTSI-BERT: A Session-aware Knowledge-based Conversational AgentSenese, Matteo Antonio / Rizzo, Giuseppe / Dragoni, Mauro / Morisio, Maurizio et al. | 2020
- 726
-
Predicting Ratings of Real Dialogue Participants from Artificial Data and Ratings of Human Dialogue ObserversGeorgila, Kallirroi / Gordon, Carla / Yanov, Volodymyr / Traum, David et al. | 2020
- 735
-
Which Model Should We Use for a Real-World Conversational Dialogue System? a Cross-Language Relevance Model or a Deep Neural Net?Alavi, Seyed Hossein / Leuski, Anton / Traum, David et al. | 2020
- 743
-
Chinese Whispers: A Multimodal Dataset for Embodied Language GroundingKontogiorgos, Dimosthenis / Sibirtseva, Elena / Gustafson, Joakim et al. | 2020
- 750
-
AMUSED: A Multi-Stream Vector Representation Method for Use in Natural DialogueKumar, Gaurav / Joshi, Rishabh / Singh, Jaspreet / Yenigalla, Promod et al. | 2020
- 759
-
An Annotation Approach for Social and Referential Gaze in DialogueSomashekarappa, Vidya / Howes, Christine / Sayeed, Asad et al. | 2020
- 766
-
A Penn-style Treebank of Middle Low GermanBooth, Hannah / Breitbarth, Anne / Ecay, Aaron / Farasyn, Melissa et al. | 2020
- 776
-
Books of Hours. the First Liturgical Data Set for Text Segmentation.Hazem, Amir / Daille, Beatrice / Kermorvant, Christopher / Stutzmann, Dominique / Bonhomme, Marie-Laurence / Maarand, Martin / Boillet, Mélodie et al. | 2020
- 785
-
Corpus of Chinese Dynastic Histories: Gender Analysis over Two MillenniaZinin, Sergey / Xu, Yang et al. | 2020
- 794
-
The Royal Society Corpus 6.0: Providing 300+ Years of Scientific Writing for Humanistic StudyFischer, Stefan / Knappen, Jörg / Menzel, Katrin / Teich, Elke et al. | 2020
- 803
-
Corpus REDEWIEDERGABEBrunner, Annelen / Engelberg, Stefan / Jannidis, Fotis / Tu, Ngoc Duyen Tanja / Weimer, Lukas et al. | 2020
- 813
-
WeDH - a Friendly Tool for Building Literary Corpora Enriched with Encyclopedic MetadataEgloff, Mattia / Picca, Davide et al. | 2020
- 817
-
Automatic Section Recognition in ObituariesSabbatino, Valentino / Bostan, Laura Ana Maria / Klinger, Roman et al. | 2020
- 826
-
SLäNDa: An Annotated Corpus of Narrative and Dialogue in Swedish Literary FictionStymne, Sara / Östman, Carin et al. | 2020
- 835
-
RiQuA: A Corpus of Rich Quotation Annotation for English Literary TextPapay, Sean / Padó, Sebastian et al. | 2020
- 842
-
A Corpus Linguistic Perspective on Contemporary German Pop Lyrics with the Multi-Layer Annotated "Songkorpus"Schneider, Roman et al. | 2020
- 849
-
The BDCamões Collection of Portuguese Literary Documents: a Research Resource for Digital Humanities and Language TechnologyGrilo, Sara / Bolrinha, Márcia / Silva, João / Vaz, Rui / Branco, António et al. | 2020
- 855
-
Dataset for Temporal Analysis of English-French CognatesFrossard, Esteban / Coustaty, Mickael / Doucet, Antoine / Jatowt, Adam / Hengchen, Simon et al. | 2020
- 860
-
Material Philology Meets Digital Onomastic Lexicography: The NordiCon Database of Medieval Nordic Personal Names in Continental SourcesWaldispühl, Michelle / Dannells, Dana / Borin, Lars et al. | 2020
- 868
-
NLP Scholar: A Dataset for Examining the State of NLP ResearchMohammad, Saif M. et al. | 2020
- 878
-
The DReaM Corpus: A Multilingual Annotated Corpus of Grammars for the World’s LanguagesVirk, Shafqat Mumtaz / Hammarström, Harald / Forsberg, Markus / Wichmann, Søren et al. | 2020
- 885
-
LiViTo: Linguistic and Visual Features Tool for Assisted Analysis of Historic ManuscriptsMüller, Klaus / Tikhonov, Aleksej / Meyer, Roland et al. | 2020
- 891
-
TextAnnotator: A UIMA Based Tool for the Simultaneous and Collaborative Annotation of TextsAbrami, Giuseppe / Stoeckel, Manuel / Mehler, Alexander et al. | 2020
- 901
-
Deduplication of Scholarly Documents using Locality Sensitive Hashing and Word EmbeddingsGyawali, Bikash / Anastasiou, Lucas / Knoth, Petr et al. | 2020
- 911
-
"Voices of the Great War": A Richly Annotated Corpus of Italian Texts on the First World WarBoschetti, Federico / de felice, irene / Dei Rossi, Stefano / Dell'Orletta, Felice / Di Giorgio, Michele / Miliani, Martina / Passaro, Lucia C. / Puddu, Angelica / Venturi, Giulia / Labanca, Nicola et al. | 2020
- 919
-
DEbateNet-mig15:Tracing the 2015 Immigration Debate in Germany Over TimeLapesa, Gabriella / Blessing, Andre / Blokker, Nico / Dayanik, Erenay / Haunss, Sebastian / Kuhn, Jonas / Padó, Sebastian et al. | 2020
- 928
-
A Corpus of Spanish Political Speeches from 1937 to 2019Álvarez-Mellado, Elena et al. | 2020
- 933
-
A New Latin Treebank for Universal Dependencies: Charters between Ancient Latin and Romance LanguagesCecchini, Flavio Massimiliano / Korkiakangas, Timo / Passarotti, Marco et al. | 2020
- 943
-
Identification of Indigenous Knowledge Concepts through Semantic Networks, Spelling Tools and Word EmbeddingsRocha Souza, Renato / Dorn, Amelie / Piringer, Barbara / Wandl-Vogt, Eveline et al. | 2020
- 948
-
A Multi-Orthography Parallel Corpus of Yiddish NounsSaleva, Jonne et al. | 2020
- 953
-
An Annotated Corpus of Adjective-Adverb Interfaces in Romance LanguagesGerhalter, Katharina / Schneider, Gerlinde / Pollin, Christopher / Hummel, Martin et al. | 2020
- 958
-
Language Resources for Historical Newspapers: the Impresso CollectionEhrmann, Maud / Romanello, Matteo / Clematide, Simon / Ströbel, Phillip Benjamin / Barman, Raphaël et al. | 2020
- 969
-
Allgemeine Musikalische Zeitung as a Searchable Online CorpusKampe, Bernd / Duan, Tinghui / Hahn, Udo et al. | 2020
- 977
-
Stylometry in a Bilingual SetupCinkova, Silvie / Rybicki, Jan et al. | 2020
- 985
-
Dialect Clustering with Character-Based Metrics: in Search of the Boundary of Language and DialectSato, Yo / Heffernan, Kevin et al. | 2020
- 991
-
DiscSense: Automated Semantic Analysis of Discourse MarkersSileo, Damien / Van de Cruys, Tim / Pradel, Camille / Muller, Philippe et al. | 2020
- 1000
-
ThemePro: A Toolkit for the Analysis of Thematic ProgressionDominguez, Monica / Soler, Juan / Wanner, Leo et al. | 2020
- 1008
-
Machine-Aided Annotation for Fine-Grained Proposition Types in ArgumentationJo, Yohan / Mayfield, Elijah / Reed, Chris / Hovy, Eduard et al. | 2020
- 1019
-
Chinese Discourse Parsing: Model and EvaluationChuan-An, Lin / Hung, Shyh-Shiun / Huang, Hen-Hsen / Chen, Hsin-Hsi et al. | 2020
- 1025
-
Shallow Discourse Annotation for Chinese TED TalksLong, Wanqiu / Cai, Xinyi / Reid, James / Webber, Bonnie / Xiong, Deyi et al. | 2020
- 1033
-
The Discussion Tracker Corpus of Collaborative ArgumentationOlshefski, Christopher / Lugini, Luca / Singh, Ravneet / Litman, Diane / Godley, Amanda et al. | 2020
- 1044
-
Shallow Discourse Parsing for Under-Resourced Languages: Combining Machine Translation and Annotation ProjectionSluyter-Gäthje, Henny / Bourgonje, Peter / Stede, Manfred et al. | 2020
- 1051
-
A Corpus of Encyclopedia Articles with Logical FormsRasmussen, Nathan / Schuler, William et al. | 2020
- 1061
-
The Potsdam Commentary Corpus 2.2: Extending Annotations for Shallow Discourse ParsingBourgonje, Peter / Stede, Manfred et al. | 2020
- 1067
-
On the Creation of a Corpus for Coherence Evaluation of Discursive UnitsMohammadi, Elham / Beiko, Timothe / Kosseim, Leila et al. | 2020
- 1073
-
Joint Learning of Syntactic Features Helps Discourse SegmentationDesai, Takshak / Dakle, Parag Pravin / Moldovan, Dan et al. | 2020
- 1081
-
Creating a Corpus of Gestures and Predicting the Audience Response based on Gestures in Speeches of Donald TrumpRuf, Verena / Navarretta, Costanza et al. | 2020
- 1089
-
GeCzLex: Lexicon of Czech and German Anaphoric ConnectivesPoláková, Lucie / Rysová, Kateřina / Rysová, Magdaléna / Mírovský, Jiří et al. | 2020
- 1097
-
DiMLex-Bangla: A Lexicon of Bangla Discourse ConnectivesDas, Debopam / Stede, Manfred / Ghosh, Soumya Sankar / Chatterjee, Lahari et al. | 2020
- 1103
-
Semi-Supervised Tri-Training for Explicit Discourse Argument ExpansionKnaebel, Rene / Stede, Manfred et al. | 2020
- 1110
-
WikiPossessions: Possession Timeline Generation as an Evaluation Benchmark for Machine Reading Comprehension of Long TextsChinnappa, Dhivya / Palmer, Alexis / Blanco, Eduardo et al. | 2020
- 1118
-
TED-Q: TED Talks and the Questions they EvokeWestera, Matthijs / Mayol, Laia / Rohde, Hannah et al. | 2020
- 1128
-
CzeDLex 0.6 and its Representation in the PML-TQMírovský, Jiří / Poláková, Lucie / Synková, Pavlína et al. | 2020
- 1135
-
Corpus for Modeling User Interactions in Online Persuasive DiscussionsEgawa, Ryo / Morio, Gaku / Fujita, Katsuhide et al. | 2020
- 1142
-
Simplifying Coreference Chains for Dyslexic ChildrenWilkens, Rodrigo / Todirascu, Amalia et al. | 2020
- 1152
-
Adapting BERT to Implicit Discourse Relation Classification with a Focus on Discourse ConnectivesKishimoto, Yudai / Murawaki, Yugo / Kurohashi, Sadao et al. | 2020
- 1159
-
What Speakers really Mean when they Ask Questions: Classification of Intentions with a Supervised ApproachBarbedette, Angèle / Eshkol-Taravella, Iris et al. | 2020
- 1167
-
Modeling Dialogue in Conversational Cognitive Health Screening InterviewsFarzana, Shahla / Valizadeh, Mina / Parde, Natalie et al. | 2020
- 1178
-
Stigma Annotation Scheme and Stigmatized Language Detection in Health-Care Discussions on Social MediaStraton, Nadiya / Jang, Hyeju / Ng, Raymond et al. | 2020
- 1191
-
An Annotated Dataset of Discourse Modes in Hindi StoriesDhanwal, Swapnil / Dutta, Hritwik / Nankani, Hitesh / Shrivastava, Nilay / Kumar, Yaman / Li, Junyi Jessy / Mahata, Debanjan / Gosangi, Rakesh / Zhang, Haimin / Shah, Rajiv Ratn et al. | 2020
- 1197
-
Multi-class Multilingual Classification of Wikipedia Articles Using Extended Named Entity Tag SetShavarani, Hassan S. / Sekine, Satoshi et al. | 2020
- 1202
-
An Algerian Corpus and an Annotation Platform for Opinion and Emotion AnalysisMoudjari, Leila / Akli-Astouati, Karima / Benamara, Farah et al. | 2020
- 1211
-
Transfer Learning from Transformers to Fake News Challenge Stance Detection (FNC-1) TaskSlovikovskaya, Valeriya / Attardi, Giuseppe et al. | 2020
- 1219
-
Scientific Statement Classification over arXiv.orgGinev, Deyan / Miller, Bruce R et al. | 2020
- 1227
-
Cross-domain Author Gender Classification in Brazilian PortugueseDias, Rafael / Paraboni, Ivandré et al. | 2020
- 1235
-
LEDGAR: A Large-Scale Multi-label Corpus for Text Classification of Legal Provisions in ContractsTuggener, Don / von Däniken, Pius / Peetz, Thomas / Cieliebak, Mark et al. | 2020
- 1242
-
Online Near-Duplicate Detection of News ArticlesRodier, Simon / Carter, Dave et al. | 2020
- 1250
-
Automated Essay Scoring System for Nonnative Japanese LearnersHirao, Reo / Arai, Mio / Shimanaka, Hiroki / Katsumata, Satoru / Komachi, Mamoru et al. | 2020
- 1258
-
A Real-World Data Resource of Complex Sensitive Sentences Based on Documents from the Monsanto TrialNeerbek, Jan / Eskildsen, Morten / Dolog, Peter / Assent, Ira et al. | 2020
- 1268
-
Discovering Biased News Articles Leveraging Multiple Human AnnotationsLazaridou, Konstantina / Löser, Alexander / Mestre, Maria / Naumann, Felix et al. | 2020
- 1278
-
Corpora and Baselines for Humour Recognition in PortugueseGonçalo Oliveira, Hugo / Clemêncio, André / Alves, Ana et al. | 2020
- 1286
-
FactCorp: A Corpus of Dutch Fact-checks and its Multiple Usagesvan der Meulen, Marten / Reijnierse, W. Gudrun et al. | 2020
- 1293
-
Automatic Orality Identification in Historical TextsOrtmann, Katrin / Dipper, Stefanie et al. | 2020
- 1303
-
Using Deep Neural Networks with Intra- and Inter-Sentence Context to Classify Suicidal BehaviourSong, Xingyi / Downs, Johnny / Velupillai, Sumithra / Holden, Rachel / Kikoler, Maxim / Bontcheva, Kalina / Dutta, Rina / Roberts, Angus et al. | 2020
- 1311
-
A First Dataset for Film Age Appropriateness InvestigationMohamed, Emad / Ha, Le An et al. | 2020
- 1318
-
Habibi - a multi Dialect multi National Arabic Song Lyrics CorpusEl-Haj, Mahmoud et al. | 2020
- 1327
-
Age Suitability Rating: Predicting the MPAA Rating Based on Movie DialoguesShafaei, Mahsa / Safi Samghabadi, Niloofar / Kar, Sudipta / Solorio, Thamar et al. | 2020
- 1336
-
Email Classification Incorporating Social Networks and Thread StructureAlkhereyf, Sakhar / Rambow, Owen et al. | 2020
- 1346
-
Development and Validation of a Corpus for Machine Humor ComprehensionTseng, Yuen-Hsien / Wu, Wun-Syuan / Chang, Chia-Yueh / Chen, Hsueh-Chih / Hsu, Wei-Lun et al. | 2020
- 1353
-
Alector: A Parallel Corpus of Simplified French Texts with Alignments of Misreadings by Poor and Dyslexic ReadersGala, Núria / Tack, Anaïs / Javourey-Drevet, Ludivine / François, Thomas / Ziegler, Johannes C. et al. | 2020
- 1362
-
A Corpus for Detecting High-Context Medical Conditions in Intensive Care Patient Notes Focusing on Frequently Readmitted PatientsMoseley, Edward T. / Wu, Joy T. / Welt, Jonathan / Foote, John / Tyler, Patrick D. / Grant, David W. / Carlson, Eric T. / Gehrmann, Sebastian / Dernoncourt, Franck / Celi, Leo Anthony et al. | 2020
- 1368
-
Multilingual Stance Detection in Tweets: The Catalonia Independence CorpusZotova, Elena / Agerri, Rodrigo / Nuñez, Manuel / Rigau, German et al. | 2020
- 1376
-
An Evaluation of Progressive Neural Networksfor Transfer Learning in Natural Language ProcessingMoeed, Abdul / Hagerer, Gerhard / Dugar, Sumit / Gupta, Sarthak / Ghosh, Mainak / Danner, Hannah / Mitevski, Oliver / Nawroth, Andreas / Groh, Georg et al. | 2020
- 1382
-
WAC: A Corpus of Wikipedia Conversations for Online Abuse DetectionCécillon, Noé / Labatut, Vincent / Dufour, Richard / Linarès, Georges et al. | 2020
- 1391
-
FloDusTA: Saudi Tweets Dataset for Flood, Dust Storm, and Traffic Accident EventsHamoui, Btool / Mars, Mourad / Almotairi, Khaled et al. | 2020
- 1397
-
An Annotated Corpus for Sexism Detection in French TweetsChiril, Patricia / Moriceau, Véronique / Benamara, Farah / Mari, Alda / Origgi, Gloria / Coulomb-Gully, Marlène et al. | 2020
- 1404
-
Measuring the Impact of Readability Features in Fake News DetectionSantos, Roney / Pedro, Gabriela / Leal, Sidney / Vale, Oto / Pardo, Thiago / Bontcheva, Kalina / Scarton, Carolina et al. | 2020
- 1414
-
When Shallow is Good Enough: Automatic Assessment of Conceptual Text Complexity using Shallow Semantic FeaturesStajner, Sanja / Hulpuș, Ioana et al. | 2020
- 1423
-
DecOp: A Multilingual and Multi-domain Corpus For Detecting Deception In Typed TextCapuozzo, Pasquale / Lauriola, Ivano / Strapparava, Carlo / Aiolli, Fabio / Sartori, Giuseppe et al. | 2020
- 1431
-
Age Recommendation for TextsBlandin, Alexis / Lecorvé, Gwénolé / Battistelli, Delphine / Étienne, Aline et al. | 2020
- 1440
-
Multilingual Twitter Corpus and Baselines for Evaluating Demographic Bias in Hate Speech RecognitionHuang, Xiaolei / Xing, Linzi / Dernoncourt, Franck / Paul, Michael J. et al. | 2020
- 1449
-
VICTOR: a Dataset for Brazilian Legal Documents ClassificationLuz de Araujo, Pedro Henrique / de Campos, Teófilo Emídio / Ataides Braz, Fabricio / Correia da Silva, Nilton et al. | 2020
- 1459
-
Dynamic Classification in Web Archiving CollectionsPatel, Krutarth / Caragea, Cornelia / Phillips, Mark et al. | 2020
- 1469
-
Aspect Flow Representation and Audio Inspired Analysis for TextsVasconcelos, Larissa / Campelo, Claudio / Jeronimo, Caio et al. | 2020
- 1478
-
Annotating and Analyzing Biased Sentences in News Articles using CrowdsourcingLim, Sora / Jatowt, Adam / Färber, Michael / Yoshikawa, Masatoshi et al. | 2020
- 1485
-
Evaluation of Deep Gaussian Processes for Text ClassificationJayashree, P. / Srijith, P. K. et al. | 2020
- 1492
-
EmoEvent: A Multilingual Emotion Corpus based on different EventsPlaza del Arco, Flor Miriam / Strapparava, Carlo / Urena Lopez, L. Alfonso / Martin, Maite et al. | 2020
- 1499
-
MuSE: a Multimodal Dataset of Stressed EmotionJaiswal, Mimansa / Bara, Cristian-Paul / Luo, Yuanhang / Burzo, Mihai / Mihalcea, Rada / Provost, Emily Mower et al. | 2020
- 1511
-
Affect inTweets: A Transfer Learning ApproachZhang, Linrui / Huang, Hsin-Lun / Yu, Yang / Moldovan, Dan et al. | 2020
- 1517
-
Annotation of Emotion Carriers in Personal NarrativesTammewar, Aniruddha / Cervone, Alessandra / Messner, Eva-Maria / Riccardi, Giuseppe et al. | 2020
- 1526
-
Towards Interactive Annotation for Hesitation in Conversational SpeechWottawa, Jane / Tahon, Marie / Marin, Apolline / Audibert, Nicolas et al. | 2020
- 1533
-
Abusive language in Spanish children and young teenager’s conversations: data preparation and short text classification with contextual word embeddingsCosta-jussà, Marta R. / González, Esther / Moreno, Asuncion / Cumalat, Eudald et al. | 2020
- 1538
-
IIIT-H TEMD Semi-Natural Emotional Speech Database from Professional Actors and Non-ActorsRambabu, Banothu / Botsa, Kishore Kumar / Paidi, Gangamohan / Gangashetty, Suryakanth V et al. | 2020
- 1546
-
The POTUS Corpus, a Database of Weekly Addresses for the Study of Stance in Politics and Virtual AgentsJanssoone, Thomas / Bailly, Kévin / Richard, Gaël / Clavel, Chloé et al. | 2020
- 1554
-
GoodNewsEveryone: A Corpus of News Headlines Annotated with Emotions, Semantic Roles, and Reader PerceptionBostan, Laura Ana Maria / Kim, Evgeny / Klinger, Roman et al. | 2020
- 1567
-
SOLO: A Corpus of Tweets for Examining the State of Being AloneKiritchenko, Svetlana / Hipson, Will / Coplan, Robert / Mohammad, Saif M. et al. | 2020
- 1578
-
PoKi: A Large Dataset of Poems by ChildrenHipson, Will / Mohammad, Saif M. et al. | 2020
- 1590
-
AlloSat: A New Call Center French Corpus for Satisfaction and Frustration AnalysisMacary, Manon / Tahon, Marie / Estève, Yannick / Rousseau, Anthony et al. | 2020
- 1598
-
Learning the Human Judgment for the Automatic Evaluation of ChatbotWu, Shih-Hung / Chien, Sheng-Lun et al. | 2020
- 1603
-
Korean-Specific Emotion Annotation Procedure Using N-Gram-Based Distant Supervision and Korean-Specific-Feature-Based Distant SupervisionLee, Young-Jun / Lim, Chae-Gyun / Choi, Ho-Jin et al. | 2020
- 1611
-
Semi-Automatic Construction and Refinement of an Annotated Corpus for a Deep Learning Framework for Emotion ClassificationXu, Jiajun / Masuda, Kyosuke / Nishizaki, Hiromitsu / Fukumoto, Fumiyo / Suzuki, Yoshimi et al. | 2020
- 1618
-
CEASE, a Corpus of Emotion Annotated Suicide notes in EnglishGhosh, Soumitra / Ekbal, Asif / Bhattacharyya, Pushpak et al. | 2020
- 1627
-
Training a Broad-Coverage German Sentiment Classification Model for Dialog SystemsGuhr, Oliver / Schumann, Anne-Kathrin / Bahrmann, Frank / Böhme, Hans Joachim et al. | 2020
- 1633
-
An Event-comment Social Media Corpus for Implicit Emotion AnalysisLee, Sophia Yat Mei / Lau, Helena Yan Ping et al. | 2020
- 1643
-
An Emotional Mess! Deciding on a Framework for Building a Dutch Emotion-Annotated CorpusDe Bruyne, Luna / De Clercq, Orphee / Hoste, Veronique et al. | 2020
- 1652
-
PO-EMO: Conceptualization, Annotation, and Modeling of Aesthetic Emotions in German and English PoetryHaider, Thomas / Eger, Steffen / Kim, Evgeny / Klinger, Roman / Menninghaus, Winfried et al. | 2020
- 1664
-
Learning Word Ratings for Empathy and Distress from Document-Level User ResponsesSedoc, João / Buechel, Sven / Nachmany, Yehonathan / Buffone, Anneke / Ungar, Lyle et al. | 2020
- 1674
-
Evaluation of Sentence Representations in PolishDadas, Slawomir / Perełkiewicz, Michał / Poświata, Rafał et al. | 2020
- 1681
-
Identification of Primary and Collateral Tracks in Stuttered SpeechRiad, Rachid / Bachoud-Lévi, Anne-Catherine / Rudzicz, Frank / Dupoux, Emmanuel et al. | 2020
- 1689
-
How to Compare Automatically Two Phonological Strings: Application to Intelligibility Measurement in the Case of Atypical SpeechGhio, Alain / Lalain, Muriel / Giusti, Laurence / Fredouille, Corinne / Woisard, Virginie et al. | 2020
- 1695
-
Evaluating Text Coherence at Sentence and Paragraph LevelsLiu, Sennan / Zeng, Shuang / Li, Sujian et al. | 2020
- 1704
-
HardEval: Focusing on Challenging Tokens to Assess Robustness of NERBernier-Colborne, Gabriel / Langlais, Phillippe et al. | 2020
- 1712
-
An Evaluation Dataset for Identifying Communicative Functions of Sentences in English Scholarly PapersIwatsuki, Kenichi / Boudin, Florian / Aizawa, Akiko et al. | 2020
- 1721
-
An Automatic Tool For Language EvaluationFassetti, Fabio / Fassetti, Ilaria et al. | 2020
- 1727
-
Which Evaluations Uncover Sense Representations that Actually Make Sense?Boyd-Graber, Jordan / Guo, Fenfei / Findlater, Leah / Iyyer, Mohit et al. | 2020
- 1739
-
Diversity, Density, and Homogeneity: Quantitative Characteristic Metrics for Text CollectionsLai, Yi-An / Zhu, Xuan / Zhang, Yi / Diab, Mona et al. | 2020
- 1747
-
Towards Few-Shot Event Mention Retrieval: An Evaluation Framework and A Siamese Network ApproachMin, Bonan / Chan, Yee Seng / Zhao, Lingjun et al. | 2020
- 1753
-
Linguistic Appropriateness and Pedagogic Usefulness of Reading Comprehension QuestionsHorbach, Andrea / Aldabe, Itziar / Bexte, Marie / Lopez de Lacalle, Oier / Maritxalar, Montse et al. | 2020
- 1763
-
Dataset Reproducibility and IR Methods in Timeline SummarizationBorn, Leo / Bacher, Maximilian / Markert, Katja et al. | 2020
- 1772
-
Database Search vs. Information Retrieval: A Novel Method for Studying Natural Language Querying of Semi-Structured DataNadig, Stefanie / Braschler, Martin / Stockinger, Kurt et al. | 2020
- 1780
-
Why Attention is Not Explanation: Surgical Intervention and Causal Reasoning about Neural ModelsGrimsley, Christopher / Mayfield, Elijah / R.S. Bursten, Julia et al. | 2020
- 1791
-
Have a Cake and Eat it Too: Assessing Discriminating Performance of an Intelligibility Index Obtained from a Reduced Sample SizeMarczyk, Anna / Ghio, Alain / Lalain, Muriel / Rebourg, Marie / Fredouille, Corinne / Woisard, Virginie et al. | 2020
- 1796
-
Evaluation Metrics for Headline Generation Using Deep Pre-Trained EmbeddingsMoeed, Abdul / An, Yang / Hagerer, Gerhard / Groh, Georg et al. | 2020
- 1803
-
LinCE: A Centralized Benchmark for Linguistic Code-switching EvaluationAguilar, Gustavo / Kar, Sudipta / Solorio, Thamar et al. | 2020
- 1814
-
Paraphrase Generation and Evaluation on Colloquial-Style SentencesSjöblom, Eetu / Creutz, Mathias / Scherrer, Yves et al. | 2020
- 1823
-
Analyzing Word Embedding Through Structural Equation ModelingHan, Namgi / Hayashi, Katsuhiko / Miyao, Yusuke et al. | 2020
- 1833
-
Evaluation of Lifelong Learning SystemsProkopalo, Yevhenii / Meignier, Sylvain / Galibert, Olivier / Barrault, Loic / Larcher, Anthony et al. | 2020
- 1842
-
Interannotator Agreement for Lexico-Semantic Annotation of a CorpusHajnicz, Elżbieta et al. | 2020
- 1849
-
An In-Depth Comparison of 14 Spelling Correction Tools on a Common BenchmarkNäther, Markus et al. | 2020
- 1858
-
Sentence Level Human Translation Quality Estimation with Attention-based Neural NetworksYuan, Yu / Sharoff, Serge et al. | 2020
- 1866
-
Evaluating Language Tools for Fifteen EU-official Under-resourced LanguagesAlves, Diego / Thakkar, Gaurish / Tadić, Marko et al. | 2020
- 1874
-
Word Embedding Evaluation for SinhalaLakmal, Dimuthu / Ranathunga, Surangika / Peramuna, Saman / Herath, Indu et al. | 2020
- 1882
-
Stress Test Evaluation of Transformer-based Models in Natural Language Understanding TasksAspillaga, Carlos / Carvallo, Andrés / Araujo, Vladimir et al. | 2020
- 1895
-
Brand-Product Relation Extraction Using Heterogeneous Vector Space RepresentationsJanz, Arkadiusz / Kopociński, Łukasz / Piasecki, Maciej / Pluwak, Agnieszka et al. | 2020
- 1902
-
A Tale of Three Parsers: Towards Diagnostic Evaluation for Meaning Representation ParsingBuljan, Maja / Nivre, Joakim / Oepen, Stephan / Øvrelid, Lilja et al. | 2020
- 1910
-
Headword-Oriented Entity Linking: A Special Entity Linking Task with Dataset and BaselineYang, Mu / Chen, Chi-Yen / Lee, Yi-Hui / Zeng, Qian-hui / Ma, Wei-Yun / Shih, Chen-Yang / Chen, Wei-Jhih et al. | 2020
- 1918
-
TableBank: Table Benchmark for Image-based Table Detection and RecognitionLi, Minghao / Cui, Lei / Huang, Shaohan / Wei, Furu / Zhou, Ming / Li, Zhoujun et al. | 2020
- 1926
-
WIKIR: A Python Toolkit for Building a Large-scale Wikipedia-based English Information Retrieval DatasetFrej, Jibril / Schwab, Didier / Chevallet, Jean-Pierre et al. | 2020
- 1934
-
Constructing a Public Meeting CorpusTanaka, Koji / Chu, Chenhui / Ren, Haolin / Renoust, Benjamin / Nakashima, Yuta / Takemura, Noriko / Nagahara, Hajime / Fujikawa, Takao et al. | 2020
- 1941
-
Annotating and Extracting Synthesis Process of All-Solid-State Batteries from Scientific LiteratureKuniyoshi, Fusataka / Makino, Kohei / Ozawa, Jun / Miwa, Makoto et al. | 2020
- 1951
-
WEXEA: Wikipedia EXhaustive Entity AnnotationStrobl, Michael / Trabelsi, Amine / Zaiane, Osmar et al. | 2020
- 1959
-
Handling Entity Normalization with no Annotated Corpus: Weakly Supervised Methods Based on Distributional Representation and Ontological InformationFerré, Arnaud / Bossy, Robert / Ba, Mouhamadou / Deléger, Louise / Lavergne, Thomas / Zweigenbaum, Pierre / Nédellec, Claire et al. | 2020
- 1967
-
HBCP Corpus: A New Resource for the Analysis of Behavioural Change Intervention ReportsBonin, Francesca / Gleize, Martin / Finnerty, Ailbhe / Moore, Candice / Jochim, Charles / Norris, Emma / Hou, Yufang / Wright, Alison J. / Ganguly, Debasis / Hayes, Emily et al. | 2020
- 1976
-
Cross-lingual Structure Transfer for Zero-resource Event ExtractionLu, Di / Subburathinam, Ananya / Ji, Heng / May, Jonathan / Chang, Shih-Fu / Sil, Avi / Voss, Clare et al. | 2020
- 1982
-
Cross-Domain Evaluation of Edge Detection for Biomedical Event ExtractionRamponi, Alan / Plank, Barbara / Lombardo, Rosario et al. | 2020
- 1990
-
Semantic Annotation for Improved Safety in Construction WorkThompson, Paul / Yates, Tim / Inan, Emrah / Ananiadou, Sophia et al. | 2020
- 2000
-
Social Web Observatory: A Platform and Method for Gathering Knowledge on Entities from Different Textual SourcesTsekouras, Leonidas / Petasis, Georgios / Giannakopoulos, George / Kosmopoulos, Aris et al. | 2020
- 2009
-
Development of a Corpus Annotated with Medications and their Attributes in Psychiatric Health RecordsChaturvedi, Jaya / Viani, Natalia / Sanyal, Jyoti / Tytherleigh, Chloe / Hasan, Idil / Baird, Kate / Velupillai, Sumithra / Stewart, Robert / Roberts, Angus et al. | 2020
- 2017
-
Do not let the history haunt you: Mitigating Compounding Errors in Conversational Question AnsweringMandya, Angrosh / O' Neill, James / Bollegala, Danushka / Coenen, Frans et al. | 2020
- 2026
-
CLEEK: A Chinese Long-text Corpus for Entity LinkingZeng, Weixin / Zhao, Xiang / Tang, Jiuyang / Tan, Zhen / Huang, Xuqian et al. | 2020
- 2036
-
The Medical Scribe: Corpus Development and Model Performance AnalysesShafran, Izhak / Du, Nan / Tran, Linh / Perry, Amanda / Keyes, Lauren / Knichel, Mark / Domin, Ashley / Huang, Lei / Chen, Yu-hui / Li, Gang et al. | 2020
- 2045
-
A Contract Corpus for Recognizing Rights and ObligationsFunaki, Ruka / Nagata, Yusuke / Suenaga, Kohei / Mori, Shinsuke et al. | 2020
- 2054
-
Recognition of Implicit Geographic Movement in TextPezanowski, Scott / Mitra, Prasenjit et al. | 2020
- 2064
-
Extraction of the Argument Structure of Tokyo Metropolitan Assembly Minutes: Segmentation of Question-and-Answer SetsTakamaru, Keiichi / Kimura, Yasutomo / Shibuki, Hideyuki / Ototake, Hokuto / Uchida, Yuzu / Sakamoto, Kotaro / Ishioroshi, Madoka / Mitamura, Teruko / Kando, Noriko et al. | 2020
- 2069
-
A Term Extraction Approach to Survey Analysis in Health CareRobin, Cécile / Isazad Mashinchi, Mona / Ahmadi Zeleti, Fatemeh / Ojo, Adegboyega / Buitelaar, Paul et al. | 2020
- 2078
-
A Scientific Information Extraction Dataset for Nature Inspired EngineeringKruiper, Ruben / Vincent, Julian F.V. / Chen-Burger, Jessica / Desmulliez, Marc P.Y. / Konstas, Ioannis et al. | 2020
- 2086
-
Automated Discovery of Mathematical Definitions in TextVanetik, Natalia / Litvak, Marina / Shevchuk, Sergey / Reznik, Lior et al. | 2020
- 2095
-
WN-Salience: A Corpus of News Articles with Entity Salience AnnotationsWu, Chuan / Kanoulas, Evangelos / de Rijke, Maarten / Lu, Wei et al. | 2020
- 2103
-
Event Extraction from Unstructured Amharic TextTadesse, Ephrem / Tsegaye, Rosa / Qaqqabaa, Kuulaa et al. | 2020
- 2110
-
Comparing Machine Learning and Deep Learning Approaches on NLP Tasks for the Italian LanguageMagnini, Bernardo / Lavelli, Alberto / Magnolini, Simone et al. | 2020
- 2120
-
MyFixit: An Annotated Dataset, Annotation Tool, and Baseline Methods for Information Extraction from Repair ManualsNabizadeh, Nima / Kolossa, Dorothea / Heckmann, Martin et al. | 2020
- 2129
-
Towards Entity Spacesvan Erp, Marieke / Groth, Paul et al. | 2020
- 2138
-
Love Me, Love Me, Say (and Write!) that You Love Me: Enriching the WASABI Song Corpus with Lyrics AnnotationsFell, Michael / Cabrio, Elena / Korfed, Elmahdi / Buffa, Michel / Gandon, Fabien et al. | 2020
- 2148
-
Evaluating Information Loss in Temporal Dependency TreesOcal, Mustafa / Finlayson, Mark et al. | 2020
- 2157
-
Populating Legal Ontologies using Semantic Role LabelingHumphreys, Llio / Boella, Guido / Di Caro, Luigi / Robaldo, Livio / van der Torre, Leon / Ghanavati, Sepideh / Muthuri, Robert et al. | 2020
- 2167
-
PST 2.0 - Corpus of Polish Spatial TextsMarcińczuk, Michał / Oleksy, Marcin / Wieczorek, Jan et al. | 2020
- 2175
-
Natural Language Premise Selection: Finding Supporting Statements for Mathematical TextFerreira, Deborah / Freitas, André et al. | 2020
- 2183
-
Odinson: A Fast Rule-based Information Extraction FrameworkValenzuela-Escárcega, Marco A. / Hahn-Powell, Gus / Bell, Dane et al. | 2020
- 2192
-
The STEM-ECR Dataset: Grounding Scientific Entity References in STEM Scholarly Content to Authoritative Encyclopedic and Lexicographic SourcesD'Souza, Jennifer / Hoppe, Anett / Brack, Arthur / Jaradeh, Mohmad Yaser / Auer, Sören / Ewerth, Ralph et al. | 2020
- 2204
-
MathAlign: Linking Formula Identifiers to their Contextual Natural Language DescriptionsAlexeeva, Maria / Sharp, Rebecca / Valenzuela-Escárcega, Marco A. / Kadowaki, Jennifer / Pyarelal, Adarsh / Morrison, Clayton et al. | 2020
- 2213
-
Domain Adapted Distant Supervision for Pedagogically Motivated Relation ExtractionSainz, Oscar / Lopez de Lacalle, Oier / Aldabe, Itziar / Maritxalar, Montse et al. | 2020
- 2223
-
Temporal Histories of Epidemic Events (THEE): A Case Study in Temporal Annotation for Public HealthNiu, Jingcheng / Ng, Victoria / Penn, Gerald / Rees, Erin E. et al. | 2020
- 2231
-
Exploiting Citation Knowledge in Personalised Recommendation of Recent Scientific PublicationsKhadka, Anita / Cantador, Iván / Fernandez, Miriam et al. | 2020
- 2241
-
A Platform for Event Extraction in HindiSahoo, Sovan Kumar / Saha, Saumajit / Ekbal, Asif / Bhattacharyya, Pushpak et al. | 2020
- 2251
-
Rad-SpatialNet: A Frame-based Resource for Fine-Grained Spatial Relations in Radiology ReportsDatta, Surabhi / Ulinski, Morgan / Godfrey-Stovall, Jordan / Khanpara, Shekhar / Riascos-Castaneda, Roy F. / Roberts, Kirk et al. | 2020
- 2261
-
NLP Analytics in Finance with DoRe: A French 250M Tokens Corpus of Corporate Annual ReportsMasson, Corentin / Paroubek, Patrick et al. | 2020
- 2268
-
The Language of Brain Signals: Natural Language Processing of Electroencephalography ReportsMaldonado, Ramon / Harabagiu, Sanda et al. | 2020
- 2276
-
Humans Keep It One Hundred: an Overview of AI JourneyShavrina, Tatiana / Emelyanov, Anton / Fenogenova, Alena / Fomin, Vadim / Mikhailov, Vladislav / Evlampiev, Andrey / Malykh, Valentin / Larin, Vladimir / Natekin, Alex / Vatulin, Aleksandr et al. | 2020
- 2285
-
Towards Data-driven Ontologies: a Filtering Approach using Keywords and Natural Language Constructsde Boer, Maaike / Verhoosel, Jack P. C. et al. | 2020
- 2293
-
A French Corpus and Annotation Schema for Named Entity Recognition and Relation Extraction of Financial NewsJabbari, Ali / Sauvage, Olivier / Zeine, Hamada / Chergui, Hamza et al. | 2020
- 2300
-
Inferences for Lexical Semantic Resource Building with Less SupervisionBebeshina, Nadia / Lafourcade, Mathieu et al. | 2020
- 2306
-
Acquiring Social Knowledge about Personality and Driving-related BehaviorIwai, Ritsuko / Kawahara, Daisuke / Kumada, Takatsune / Kurohashi, Sadao et al. | 2020
- 2316
-
Implicit knowledge in argumentative texts : an annotated corpusBecker, Maria / Korfhage, Katharina / Frank, Anette et al. | 2020
- 2325
-
Multiple Knowledge GraphDB (MKGDB)Faralli, Stefano / Velardi, Paola / Yusifli, Farid et al. | 2020
- 2332
-
Orchestrating NLP Services for the Legal DomainMoreno-Schneider, Julian / Rehm, Georg / Montiel-Ponsoda, Elena / Rodriguez-Doncel, Víctor / Revenko, Artem / Karampatakis, Sotirios / Khvalchik, Maria / Sageder, Christian / Gracia, Jorge / Maganza, Filippo et al. | 2020
- 2341
-
Evaluation Dataset and Methodology for Extracting Application-Specific Taxonomies from the Wikipedia Knowledge GraphBordea, Georgeta / Faralli, Stefano / Mougin, Fleur / Buitelaar, Paul / Diallo, Gayo et al. | 2020
- 2348
-
Subjective Evaluation of Comprehensibility in Movie InteractionsRandria, Estelle / Fontan, Lionel / Le Coz, Maxime / Ferrané, Isabelle / Pinquier, Julien et al. | 2020
- 2358
-
Representing Multiword Term Variation in a Terminological Knowledge Base: a Corpus-Based StudyLeón-Araúz, Pilar / Reimerink, Arianne / Cabezas-García, Melania et al. | 2020
- 2368
-
Understanding Spatial Relations through Multiple ModalitiesDan, Soham / He, Hangfeng / Roth, Dan et al. | 2020
- 2373
-
A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource LanguagesRoy, Dwaipayan / Bhatia, Sumit / Jain, Prateek et al. | 2020
- 2381
-
Pártélet: A Hungarian Corpus of Propaganda Texts from the Hungarian Socialist EraKmetty, Zoltán / Vincze, Veronika / Demszky, Dorottya / Ring, Orsolya / Nagy, Balázs / Szabó, Martina Katalin et al. | 2020
- 2389
-
KORE 50^DYWC: An Evaluation Data Set for Entity Linking Based on DBpedia, YAGO, Wikidata, and CrunchbaseNoullet, Kristian / Mix, Rico / Färber, Michael et al. | 2020
- 2396
-
Eye4Ref: A Multimodal Eye Movement Dataset of Referentially Complex SituationsAlacam, Özge / Ruppert, Eugen / Salama, Amr Rekaby / Staron, Tobias / Menzel, Wolfgang et al. | 2020
- 2405
-
SiBert: Enhanced Chinese Pre-trained Language Model with Sentence InsertionChen, Jiahao / Cao, Chenjie / Jiang, Xiuyan et al. | 2020
- 2413
-
Processing South Asian Languages Written in the Latin Script: the Dakshina DatasetRoark, Brian / Wolf-Sonkin, Lawrence / Kirov, Christo / Mielke, Sabrina J. / Johny, Cibu / Demirsahin, Isin / Hall, Keith et al. | 2020
- 2424
-
GM-RKB WikiText Error Correction Task and BaselinesMelli, Gabor / Eldallal, Abdelrhman / Lazem, Bassim / Moreira, Olga et al. | 2020
- 2431
-
Embedding Space Correlation as a Measure of Domain SimilarityBeyer, Anne / Kauermann, Göran / Schütze, Hinrich et al. | 2020
- 2440
-
Wiki-40B: Multilingual Language Model DatasetGuo, Mandy / Dai, Zihang / Vrandečić, Denny / Al-Rfou, Rami et al. | 2020
- 2453
-
Know thy Corpus! Robust Methods for Digital Curation of Web corporaSharoff, Serge et al. | 2020
- 2461
-
Evaluating Approaches to Personalizing Language ModelsKing, Milton / Cook, Paul et al. | 2020
- 2470
-
Class-based LSTM Russian Language Model with Linguistic InformationKipyatkova, Irina / Karpov, Alexey et al. | 2020
- 2475
-
Adaptation of Deep Bidirectional Transformers for Afrikaans LanguageRalethe, Sello et al. | 2020
- 2479
-
FlauBERT: Unsupervised Language Model Pre-training for FrenchLe, Hang / Vial, Loïc / Frej, Jibril / Segonne, Vincent / Coavoux, Maximin / Lecouteux, Benjamin / Allauzen, Alexandre / Crabbé, Benoit / Besacier, Laurent / Schwab, Didier et al. | 2020
- 2491
-
Accelerated High-Quality Mutual-Information Based Word ClusteringCiosici, Manuel R. / Assent, Ira / Derczynski, Leon et al. | 2020
- 2497
-
Rhythmic Proximity Between Natives And Learners Of French - Evaluation of a metric based on the CEFC corpusCoulange, Sylvain / Rossato, Solange et al. | 2020
- 2503
-
From Linguistic Resources to Ontology-Aware Terminologies: Minding the Representation GapSperanza, Giulia / di Buono, Maria Pia / Monti, Johanna / Sangati, Federico et al. | 2020
- 2511
-
Modeling Factual Claims with Semantic FramesArslan, Fatma / Caraballo, Josue / Jimenez, Damian / Li, Chengkai et al. | 2020
- 2521
-
Automatic Transcription Challenges for Inuktitut, a Low-Resource Polysynthetic LanguageGupta, Vishwa / Boulianne, Gilles et al. | 2020
- 2528
-
Geographically-Balanced Gigaword Corpora for 50 Language VarietiesDunn, Jonathan / Adams, Ben et al. | 2020
- 2537
-
Data Augmentation using Machine Translation for Fake News Detection in the Urdu LanguageAmjad, Maaz / Sidorov, Grigori / Zhila, Alisa et al. | 2020
- 2543
-
Evaluation of Greek Word EmbeddingsOutsios, Stamatis / Karatsalos, Christos / Skianis, Konstantinos / Vazirgiannis, Michalis et al. | 2020
- 2552
-
A Dataset of Mycenaean Linear B SequencesPapavassiliou, Katerina / Owens, Gareth / Kosmopoulos, Dimitrios et al. | 2020
- 2562
-
The Nunavut Hansard Inuktitut-English Parallel Corpus 3.0 with Preliminary Machine Translation ResultsJoanis, Eric / Knowles, Rebecca / Kuhn, Roland / Larkin, Samuel / Littell, Patrick / Lo, Chi-kiu / Stewart, Darlene / Micher, Jeffrey et al. | 2020
- 2573
-
Exploring Bilingual Word Embeddings for Hiligaynon, a Low-Resource LanguageMichel, Leah / Hangya, Viktor / Fraser, Alexander et al. | 2020
- 2581
-
A Finite-State Morphological Analyser for EvenkiZueva, Anna / Kuznetsova, Anastasia / Tyers, Francis et al. | 2020
- 2590
-
Morphology-rich Alphasyllabary EmbeddingsMersha, Amanuel / Wu, Stephen et al. | 2020
- 2596
-
Localization of Fake News Detection via Multitask Transfer LearningCruz, Jan Christian Blaise / Tan, Julianne Agatha / Cheng, Charibeth et al. | 2020
- 2605
-
Evaluating Sentence Segmentation in Different Datasets of Neuropsychological Language Tests in Brazilian PortugueseCasanova, Edresson / Treviso, Marcos / Hübner, Lilian / Aluísio, Sandra et al. | 2020
- 2615
-
Jejueo Datasets for Machine Translation and Speech SynthesisPark, Kyubyong / Choe, Yo Joong / Ham, Jiyeon et al. | 2020
- 2622
-
Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu LanguageMatsuura, Kohei / Ueno, Sei / Mimura, Masato / Sakai, Shinsuke / Kawahara, Tatsuya et al. | 2020
- 2629
-
Development of a Guarani - Spanish Parallel CorpusChiruzzo, Luis / Amarilla, Pedro / Ríos, Adolfo / Giménez Lugo, Gustavo et al. | 2020
- 2634
-
AR-ASAG An ARabic Dataset for Automatic Short Answer Grading EvaluationOuahrani, Leila / Bennouar, Djamal et al. | 2020
- 2644
-
Processing Language Resources of Under-Resourced and Endangered Languages for the Generation of Augmentative Alternative Communication BoardsFerger, Anne et al. | 2020
- 2649
-
The Nisvai Corpus of Oral Narrative Practices from Malekula (Vanuatu) and its Associated Language ResourcesAznar, Jocelyn / Gala, Núria et al. | 2020
- 2657
-
Building a Time-Aligned Cross-Linguistic Reference Corpus from Language Documentation Data (DoReCo)Paschen, Ludger / Delafontaine, François / Draxler, Christoph / Fuchs, Susanne / Stave, Matthew / Seifart, Frank et al. | 2020
- 2667
-
Benchmarking Neural and Statistical Machine Translation on Low-Resource African LanguagesDuh, Kevin / McNamee, Paul / Post, Matt / Thompson, Brian et al. | 2020
- 2676
-
Improved Finite-State Morphological Analysis for St. Lawrence Island Yupik Using Paradigm Function MorphologyChen, Emily / Park, Hyunji Hayley / Schwartz, Lane et al. | 2020
- 2685
-
Towards a Spell Checker for Zamboanga Chavacano OrthographyHimoro, Marcelo Yuji / Pareja-Lora, Antonio et al. | 2020
- 2698
-
Identifying Sentiments in Algerian Code-switched User-generated CommentsAdouane, Wafia / Touileb, Samia / Bernardy, Jean-Philippe et al. | 2020
- 2706
-
Automatic Creation of Text Corpora for Low-Resource Languages from the Internet: The Case of Swiss GermanLinder, Lucy / Jungo, Michael / Hennebert, Jean / Musat, Claudiu Cristian / Fischer, Andreas et al. | 2020
- 2712
-
Evaluating Sub-word Embeddings in Cross-lingual ModelsHakimi Parizi, Ali / Cook, Paul et al. | 2020
- 2720
-
A Swiss German Dictionary: Variation in Speech and WritingSchmidt, Larissa / Linder, Lucy / Djambazovska, Sandra / Lazaridis, Alexandros / Samardžić, Tanja / Musat, Claudiu et al. | 2020
- 2726
-
Towards a Corsican Basic Language Resource KitKevers, Laurent / Retali-Medori, Stella et al. | 2020
- 2736
-
Evaluating the Impact of Sub-word Information and Cross-lingual Word Embeddings on Mi'kmaq Language ModellingBoudreau, Jeremie / Patra, Akankshya / Suvarna, Ashima / Cook, Paul et al. | 2020
- 2746
-
Exploring a Choctaw Language Corpus with Word Vectors and Minimum Distance LengthBrixey, Jacqueline / Sides, David / Vizthum, Timothy / Traum, David / Iskarous, Khalil et al. | 2020
- 2754
-
Massive vs. Curated Embeddings for Low-Resourced Languages: the Case of Yorùbá and TwiAlabi, Jesujoba / Amponsah-Kaakyire, Kwabena / Adelani, David / España-Bonet, Cristina et al. | 2020
- 2763
-
TRopBank: Turkish PropBank V2.0Kara, Neslihan / Aslan, Deniz Baran / Marşan, Büşra / Bakay, Özge / Ak, Koray / Yıldız, Olcay Taner et al. | 2020
- 2773
-
Collection and Annotation of the Romanian Legal CorpusTufiș, Dan / Mitrofan, Maria / Păiș, Vasile / Ion, Radu / Coman, Andrei et al. | 2020
- 2778
-
An Empirical Evaluation of Annotation Practices in Corpora from Language Documentationvon Prince, Kilu / Nordhoff, Sebastian et al. | 2020
- 2788
-
Annotated Corpus for Sentiment Analysis in Odia LanguageMohanty, Gaurav / Mishra, Pruthwik / Mamidi, Radhika et al. | 2020
- 2796
-
Building a Task-oriented Dialog System for Languages with no Training Data: the Case for BasqueLópez de Lacalle, Maddalen / Saralegi, Xabier / San Vicente, Iñaki et al. | 2020
- 2803
-
SENCORPUS: A French-Wolof Parallel CorpusNguer, Elhadji Mamadou / Lo, Alla / Dione, Cheikh M. Bamba / Ba, Sileye O. / Lo, Moussa et al. | 2020
- 2812
-
A Major Wordnet for a Minority Language: Scottish GaelicBella, Gábor / McNeill, Fiona / Gorman, Rody / O Donnaile, Caoimhin / MacDonald, Kirsty / Chandrashekar, Yamini / Freihat, Abed Alhakim / Giunchiglia, Fausto et al. | 2020
- 2819
-
Crowdsourcing Speech Data for Low-Resource Languages from Low-Income WorkersAbraham, Basil / Goel, Danish / Siddarth, Divya / Bali, Kalika / Chopra, Manu / Choudhury, Monojit / Joshi, Pratik / Jyoti, Preethi / Sitaram, Sunayana / Seshadri, Vivek et al. | 2020
- 2827
-
A Resource for Studying Chatino Verbal MorphologyCruz, Hilaria / Anastasopoulos, Antonios / Stump, Gregory et al. | 2020
- 2832
-
Learnings from Technological Interventions in a Low Resource Language: A Case-Study on GondiMehta, Devansh / Santy, Sebastin / Mothilal, Ramaravind Kommiya / Srivastava, Brij Mohan Lal / Sharma, Alok / Shukla, Anurag / Prasad, Vishnu / U, Venkanna / Sharma, Amit / Bali, Kalika et al. | 2020
- 2839
-
Irony Detection in Persian Language: A Transfer Learning Approach Using Emoji PredictionGolazizian, Preni / Sabeti, Behnam / Ashrafi Asli, Seyed Arad / Majdabadi, Zahra / Momenzadeh, Omid / Fahmi, Reza et al. | 2020
- 2846
-
Towards Computational Resource Grammars for Runyankore and RukigaBamutura, David / Ljunglöf, Peter / Nebende, Peter et al. | 2020
- 2855
-
Optimizing Annotation Effort Using Active Learning Strategies: A Sentiment Analysis Case Study in PersianAshrafi Asli, Seyed Arad / Sabeti, Behnam / Majdabadi, Zahra / Golazizian, Preni / Fahmi, Reza / Momenzadeh, Omid et al. | 2020
- 2862
-
BanFakeNews: A Dataset for Detecting Fake News in BanglaHossain, Md Zobaer / Rahman, Md Ashraful / Islam, Md Saiful / Kar, Sudipta et al. | 2020
- 2872
-
A Resource for Computational Experiments on MapudungunDuan, Mingjun / Fasola, Carlos / Rallabandi, Sai Krishna / Vega, Rodolfo / Anastasopoulos, Antonios / Levin, Lori / Black, Alan W et al. | 2020
- 2878
-
Automated Parsing of Interlinear Glossed Text from Page Images of Grammatical DescriptionsRound, Erich / Ellison, Mark / Macklin-Cordes, Jayden / Beniamine, Sacha et al. | 2020
- 2884
-
The Johns Hopkins University Bible Corpus: 1600+ Tongues for Typological ExplorationMcCarthy, Arya D. / Wicks, Rachel / Lewis, Dylan / Mueller, Aaron / Wu, Winston / Adams, Oliver / Nicolai, Garrett / Post, Matt / Yarowsky, David et al. | 2020
- 2893
-
Towards Building an Automatic Transcription System for Language Documentation: Experiences from MuyuZahrer, Alexander / Zgank, Andrej / Schuppler, Barbara et al. | 2020
- 2901
-
Towards Flexible Cross-Resource Exploitation of Heterogeneous Language Documentation DataJettka, Daniel / Lehmberg, Timm et al. | 2020
- 2906
-
CantoMap: a Hong Kong Cantonese MapTask CorpusWinterstein, Grégoire / Tang, Carmen / Lai, Regine et al. | 2020
- 2914
-
No Data to Crawl? Monolingual Corpus Creation from PDF Files of Truly low-Resource Languages in PeruBustamante, Gina / Oncevay, Arturo / Zariquiey, Roberto et al. | 2020
- 2924
-
Creating a Parallel Icelandic Dependency Treebank from Raw Text to Universal DependenciesJónsdóttir, Hildur / Ingason, Anton Karl et al. | 2020
- 2932
-
Building a Universal Dependencies Treebank for OccitanMiletic, Aleksandra / Bras, Myriam / Vergez-Couret, Marianne / Esher, Louise / Poujade, Clamença / Sibille, Jean et al. | 2020
- 2940
-
Building the Old Javanese WordnetMoeljadi, David / Aminullah, Zakariya Pamuji et al. | 2020
- 2947
-
CPLM, a Parallel Corpus for Mexican Languages: Development and InterfaceSierra Martínez, Gerardo / Montaño, Cynthia / Bel-Enguix, Gemma / Córdova, Diego / Mota Montoya, Margarita et al. | 2020
- 2953
-
SiNER: A Large Dataset for Sindhi Named Entity RecognitionAli, Wazir / Lu, Junyu / Xu, Zenglin et al. | 2020
- 2962
-
Construct a Sense-Frame Aligned Predicate Lexicon for Chinese AMR CorpusSong, Li / Dai, Yuling / Liu, Yihuan / Li, Bin / Qu, Weiguang et al. | 2020
- 2970
-
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel CorporaHan, Lifeng / Jones, Gareth / Smeaton, Alan et al. | 2020
- 2980
-
A Myanmar (Burmese)-English Named Entity Transliteration DictionaryMyat Mon, Aye / Ding, Chenchen / Kaing, Hour / Mar Soe, Khin / Utiyama, Masao / Sumita, Eiichiro et al. | 2020
- 2984
-
CA-EHN: Commonsense Analogy from E-HowNetLi, Peng-Hsuan / Yang, Tsan-Yu / Ma, Wei-Yun et al. | 2020
- 2991
-
Building Semantic Grams of Human KnowledgeLeone, Valentina / Siragusa, Giovanni / Di Caro, Luigi / Navigli, Roberto et al. | 2020
- 3001
-
Automatically Building a Multilingual Lexicon of False Friends With No SupervisionUban, Ana Sabina / Dinu, Liviu P. et al. | 2020
- 3008
-
A Parallel WordNet for English, Swedish and BulgarianAngelov, Krasimir et al. | 2020
- 3016
-
ENGLAWI: From Human- to Machine-Readable WiktionarySajous, Franck / Calderone, Basilio / Hathout, Nabil et al. | 2020
- 3027
-
Opening the Romance Verbal Inflection Dataset 2.0: A CLDF lexiconBeniamine, Sacha / Maiden, Martin / Round, Erich et al. | 2020
- 3036
-
word2word: A Collection of Bilingual Lexicons for 3,564 Language PairsChoe, Yo Joong / Park, Kyubyong / Kim, Dongwoo et al. | 2020
- 3046
-
Introducing Lexical Masks: a New Representation of Lexical Entries for Better Evaluation and Exchange of LexiconsCartoni, Bruno / Calvelo Aros, Daniel / Vrandecic, Denny / Lertpradit, Saran et al. | 2020
- 3053
-
A Large-Scale Leveled Readability Lexicon for Standard ArabicAl Khalil, Muhamed / Habash, Nizar / Jiang, Zhengyang et al. | 2020
- 3063
-
Preserving Semantic Information from Old Dictionaries: Linking Senses of the 'Altfranzösisches Wörterbuch' to WordNetStein, Achim et al. | 2020
- 3069
-
Cifu: a Frequency Lexicon of Hong Kong CantoneseLai, Regine / Winterstein, Grégoire et al. | 2020
- 3078
-
Odi et Amo. Creating, Evaluating and Extending Sentiment Lexicons for Latin.Sprugnoli, Rachele / Passarotti, Marco / Corbetta, Daniela / Peverelli, Andrea et al. | 2020
- 3087
-
WordWars: A Dataset to Examine the Natural Selection of WordsMohammad, Saif M. et al. | 2020
- 3096
-
Challenge Dataset of Cognates and False Friend Pairs from Indian LanguagesKanojia, Diptesh / Kulkarni, Malhar / Bhattacharyya, Pushpak / Haffari, Gholamreza et al. | 2020
- 3103
-
Development of a Japanese Personality Dictionary based on Psychological MethodsIwai, Ritsuko / Kawahara, Daisuke / Kumada, Takatsune / Kurohashi, Sadao et al. | 2020
- 3109
-
A Lexicon-Based Approach for Detecting Hedges in Informal TextIslam, Jumayel / Xiao, Lu / Mercer, Robert E. et al. | 2020
- 3114
-
Word Complexity Estimation for Japanese Lexical SimplificationNishihara, Daiki / Kajiwara, Tomoyuki et al. | 2020
- 3121
-
Inducing Universal Semantic Tag VectorsHuo, Da / de Melo, Gerard et al. | 2020
- 3128
-
LexiDB: Patterns & Methods for Corpus Linguistic Database ManagementCoole, Matthew / Rayson, Paul / Mariani, John et al. | 2020
- 3136
-
Towards a Semi-Automatic Detection of Reflexive and Reciprocal Constructions and Their Representation in a Valency LexiconKettnerová, Václava / Lopatkova, Marketa / Vernerová, Anna / Barancikova, Petra et al. | 2020
- 3145
-
Languages Resources for Poorly Endowed Languages : The Case Study of Classical ArmenianVidal-Gorène, Chahan / Decours-Perez, Aliénor et al. | 2020
- 3153
-
Constructing Web-Accessible Semantic Role Labels and Frames for Japanese as Additions to the NPCMJ Parsed CorpusTakeuchi, Koichi / Butler, Alastair / Nagasaki, Iku / Okamura, Takuya / Pardeshi, Prashant et al. | 2020
- 3162
-
Large-scale Cross-lingual Language Resources for Referencing and FramingVossen, Piek / Ilievski, Filip / Postma, Marten / Fokkens, Antske / Minnema, Gosse / Remijnse, Levi et al. | 2020
- 3172
-
Modelling Etymology in LMF/TEI: The Grande Dicionário Houaiss da Língua Portuguesa Dictionary as a Use CaseKhan, Fahad / Romary, Laurent / Salgado, Ana / Bowers, Jack / Khemakhem, Mohamed / Tasovac, Toma et al. | 2020
- 3181
-
Linking the TUFS Basic Vocabulary to the Open Multilingual WordnetBond, Francis / Nomoto, Hiroki / Morgado da Costa, Luís / Bond, Arthur et al. | 2020
- 3189
-
Some Issues with Building a Multilingual WordnetBond, Francis / Morgado da Costa, Luis / Goodman, Michael Wayne / McCrae, John Philip / Lohk, Ahti et al. | 2020
- 3198
-
Collocations in Russian Lexicography and Russian Collocations DatabaseKhokhlova, Maria et al. | 2020
- 3207
-
Methodological Aspects of Developing and Managing an Etymological Lexical Resource: Introducing EtymDB-2.0Fourrier, Clémentine / Sagot, Benoît et al. | 2020
- 3217
-
OFrLex: A Computational Morphological and Syntactic Lexicon for Old FrenchGuibon, Gaël / Sagot, Benoît et al. | 2020
- 3226
-
Automatic Reconstruction of Missing Romanian Cognates and Unattested Latin WordsCiobanu, Alina Maria / Dinu, Liviu P. / Zoicas, Laurentiu et al. | 2020
- 3232
-
A Multilingual Evaluation Dataset for Monolingual Word Sense AlignmentAhmadi, Sina / McCrae, John Philip / Nimb, Sanni / Khan, Fahad / Monachini, Monica / Pedersen, Bolette / Declerck, Thierry / Wissik, Tanja / Bellandi, Andrea / Pisani, Irene et al. | 2020
- 3243
-
A Broad-Coverage Deep Semantic Lexicon for VerbsAllen, James / An, Hannah / Bose, Ritwik / de Beaumont, Will / Teng, Choh Man et al. | 2020
- 3252
-
Computational Etymology and Word EmergenceWu, Winston / Yarowsky, David et al. | 2020
- 3260
-
A Dataset of Translational Equivalents Built on the Basis of plWordNet-Princeton WordNet Synset MappingRudnicka, Ewa / Naskręt, Tomasz et al. | 2020
- 3265
-
TRANSLIT: A Large-scale Name Transliteration ResourceBenites, Fernando / Duivesteijn, Gilbert François / von Däniken, Pius / Cieliebak, Mark et al. | 2020
- 3272
-
Computing with Subjectivity LexiconsL. M. Jeronimo, Caio / E. C. Campelo, Claudio / Balby Marinho, Leandro / Sales, Allan / Veloso, Adriano / Viola, Roberta et al. | 2020
- 3281
-
The ACoLi Dictionary GraphChiarcos, Christian / Fäth, Christian / Ionov, Maxim et al. | 2020
- 3291
-
Resources in Underrepresented Languages: Building a Representative Romanian CorpusMidrigan - Ciochina, Ludmila / Boyd, Victoria / Sanchez-Ortega, Lucila / Malancea_Malac, Diana / Midrigan, Doina / Corina, David P. et al. | 2020
- 3297
-
World Class Language Technology - Developing a Language Technology Strategy for DanishKirchmeier, Sabine / Pedersen, Bolette / Nimb, Sanni / Diderichsen, Philip / Henrichsen, Peter Juel et al. | 2020
- 3302
-
A Corpus for Automatic Readability Assessment and Text Simplification of GermanBattisti, Alessia / Pfütze, Dominik / Säuberli, Andreas / Kostrzewa, Marek / Ebling, Sarah et al. | 2020
- 3312
-
The CLARIN Knowledge Centre for Atypical Communication Expertisevan den Heuvel, Henk / Oostdijk, Nelleke / Rowland, Caroline / Trilsbeek, Paul et al. | 2020
- 3317
-
Corpora of Disordered Speech in the Light of the GDPR: Two Use Cases from the DELAD Initiativevan den Heuvel, Henk / Kelli, Aleksei / Klessa, Katarzyna / Salaasti, Satu et al. | 2020
- 3322
-
The European Language Technology Landscape in 2020: Language-Centric and Human-Centric AI for Cross-Cultural Communication in Multilingual EuropeRehm, Georg / Marheinecke, Katrin / Hegele, Stefanie / Piperidis, Stelios / Bontcheva, Kalina / Hajic, Jan / Choukri, Khalid / Vasiļjevs, Andrejs / Backfried, Gerhard / Prinz, Christoph et al. | 2020
- 3333
-
A Framework for Shared Agreement of Language Tags beyond ISO 639Gillis-Webber, Frances / Tittel, Sabine et al. | 2020
- 3340
-
Gigafida 2.0: The Reference Corpus of Written Standard SloveneKrek, Simon / Arhar Holdt, Špela / Erjavec, Tomaž / Čibej, Jaka / Repar, Andraz / Gantar, Polona / Ljubešić, Nikola / Kosem, Iztok / Dobrovoljc, Kaja et al. | 2020
- 3346
-
Corpus Query Lingua Franca part II: OntologyEvert, Stefan / Harlamov, Oleg / Heinrich, Philipp / Banski, Piotr et al. | 2020
- 3353
-
A CLARIN Transcription Portal for Interview DataDraxler, Christoph / van den Heuvel, Henk / van Hessen, Arjan / Calamai, Silvia / Corti, Louise et al. | 2020
- 3360
-
Ellogon Casual Annotation InfrastructurePetasis, Georgios / Tsekouras, Leonidas et al. | 2020
- 3366
-
European Language Grid: An OverviewRehm, Georg / Berger, Maria / Elsholz, Ela / Hegele, Stefanie / Kintzel, Florian / Marheinecke, Katrin / Piperidis, Stelios / Deligiannis, Miltos / Galanis, Dimitris / Gkirtzou, Katerina et al. | 2020
- 3381
-
The Competitiveness Analysis of the European Language Technology MarketVasiļjevs, Andrejs / Skadina, Inguna / Samite, Indra / Kauliņš, Kaspars / Ajausks, Ēriks / Meļņika, Jūlija / Bērziņš, Aivars et al. | 2020
- 3390
-
Constructing a Bilingual Hadith Corpus Using a Segmentation ToolAltammami, Shatha / Atwell, Eric / Alsalka, Ammar et al. | 2020
- 3399
-
Facilitating Corpus Usage: Making Icelandic Corpora More Accessible for Researchers and Language UsersSteingrímsson, Steinþór / Barkarson, Starkaður / Örnólfsson, Gunnar Thor et al. | 2020
- 3406
-
Interoperability in an Infrastructure Enabling Multidisciplinary Research: The case of CLARINde Jong, Franciska / Maegaard, Bente / Fišer, Darja / van Uytvanck, Dieter / Witt, Andreas et al. | 2020
- 3414
-
Language Technology Programme for Icelandic 2019-2023Nikulásdóttir, Anna / Guðnason, Jón / Ingason, Anton Karl / Loftsson, Hrafn / Rögnvaldsson, Eiríkur / Sigurðsson, Einar Freyr / Steingrímsson, Steinþór et al. | 2020
- 3423
-
Privacy by Design and Language ResourcesKamocki, Pawel / Witt, Andreas et al. | 2020
- 3428
-
Making Metadata Fit for Next Generation Language Technology Platforms: The Metadata Schema of the European Language GridLabropoulou, Penny / Gkirtzou, Katerina / Gavriilidou, Maria / Deligiannis, Miltos / Galanis, Dimitris / Piperidis, Stelios / Rehm, Georg / Berger, Maria / Mapelli, Valérie / Rigault, Michael et al. | 2020
- 3438
-
Related Works in the Linguistic Data Consortium CatalogJaquette, Daniel / Cieri, Christopher / DiPersio, Denise et al. | 2020
- 3443
-
Language Data Sharing in European Public Services - Overcoming Obstacles and Creating Sustainable Data Sharing InfrastructuresSmal, Lilli / Lösch, Andrea / van Genabith, Josef / Giagkou, Maria / Declerck, Thierry / Busemann, Stephan et al. | 2020
- 3449
-
A Progress Report on Activities at the Linguistic Data Consortium Benefitting the LREC CommunityCieri, Christopher / Fiumara, James / Strassel, Stephanie / Wright, Jonathan / DiPersio, Denise / Liberman, Mark et al. | 2020
- 3457
-
Digital Language Infrastructures - Documenting Language ActorsLyding, Verena / König, Alexander / Pretti, Monica et al. | 2020
- 3463
-
Samrómur: Crowd-sourcing Data Collection for Icelandic Speech RecognitionMollberg, David Erik / Jónsson, Ólafur Helgi / Þorsteinsdóttir, Sunneva / Steingrímsson, Steinþór / Magnúsdóttir, Eydís Huld / Gudnason, Jon et al. | 2020
- 3468
-
Semi-supervised Development of ASR Systems for Multilingual Code-switched Speech in Under-resourced LanguagesBiswas, Astik / Yilmaz, Emre / De Wet, Febe / Van der westhuizen, Ewald / Niesler, Thomas et al. | 2020
- 3475
-
CLFD: A Novel Vectorization Technique and Its Application in Fake News DetectionMersinias, Michail / Afantenos, Stergos / Chalkiadakis, Georgios et al. | 2020
- 3484
-
SimplifyUR: Unsupervised Lexical Text Simplification for UrduQasmi, Namoos Hayat / Zia, Haris Bin / Athar, Awais / Raza, Agha Ali et al. | 2020
- 3490
-
Jamo Pair Encoding: Subcharacter Representation-based Extreme Korean Vocabulary Compression for Efficient Subword TokenizationMoon, Sangwhan / Okazaki, Naoaki et al. | 2020
- 3498
-
Offensive Language and Hate Speech Detection for DanishSigurbergsson, Gudbjartur Ingi / Derczynski, Leon et al. | 2020
- 3509
-
Semi-supervised Deep Embedded Clustering with Anomaly Detection for Semantic Frame InductionYong, Zheng Xin / Timponi Torrent, Tiago et al. | 2020
- 3520
-
Search Query Language Identification Using Weak LabelingTambi, Ritiz / Kale, Ajinkya / King, Tracy Holloway et al. | 2020
- 3528
-
Automated Phonological Transcription of Akkadian Cuneiform TextSahala, Aleksi / Silfverberg, Miikka / Arppe, Antti / Lindén, Krister et al. | 2020
- 3535
-
COSTRA 1.0: A Dataset of Complex Sentence TransformationsBarancikova, Petra / Bojar, Ondřej et al. | 2020
- 3542
-
Automatic In-the-wild Dataset Annotation with Deep Generalized Multiple Instance LearningCorreia, Joana / Trancoso, Isabel / Raj, Bhiksha et al. | 2020
- 3551
-
How Much Data Do You Need? About the Creation of a Ground Truth for Black Letter and the Effectiveness of Neural OCRStröbel, Phillip Benjamin / Clematide, Simon / Volk, Martin et al. | 2020
- 3560
-
Dirichlet-Smoothed Word Embeddings for Low-Resource SettingsJungmaier, Jakob / Kassner, Nora / Roth, Benjamin et al. | 2020
- 3566
-
On The Performance of Time-Pooling Strategies for End-to-End Spoken Language IdentificationMonteiro, Joao / Alam, Md Jahangir / Falk, Tiago et al. | 2020
- 3573
-
Neural Disambiguation of Lemma and Part of Speech in Morphologically Rich LanguagesHoya Quecedo, José María / Maximilian, Koppatz / Yangarber, Roman et al. | 2020
- 3583
-
Non-Linearity in Mapping Based Cross-Lingual Word EmbeddingsZhao, Jiawei / Gilman, Andrew et al. | 2020
- 3590
-
LibriVoxDeEn: A Corpus for German-to-English Speech Translation and German Speech RecognitionBeilharz, Benjamin / Sun, Xin / Karimova, Sariya / Riezler, Stefan et al. | 2020
- 3595
-
SEDAR: a Large Scale French-English Financial Domain Parallel CorpusGhaddar, Abbas / Langlais, Phillippe et al. | 2020
- 3603
-
JParaCrawl: A Large Scale Web-Based English-Japanese Parallel CorpusMorishita, Makoto / Suzuki, Jun / Nagata, Masaaki et al. | 2020
- 3610
-
Neural Machine Translation for Low-Resourced Indian LanguagesChoudhary, Himanshu / Rao, Shivansh / Rohilla, Rajesh et al. | 2020
- 3616
-
Content-Equivalent Translated Parallel News Corpus and Extension of Domain Adaptation for NMTMino, Hideya / Tanaka, Hideki / Ito, Hitoshi / Goto, Isao / Yamada, Ichiro / Tokunaga, Takenobu et al. | 2020
- 3623
-
NMT and PBSMT Error Analyses in English to Brazilian Portuguese Automatic TranslationsCaseli, Helena / Inácio, Marcio et al. | 2020
- 3630
-
Evaluation Dataset for Zero Pronoun in Japanese to English TranslationShimazu, Sho / Takase, Sho / Nakazawa, Toshiaki / Okazaki, Naoaki et al. | 2020
- 3635
-
Better Together: Modern Methods Plus Traditional Thinking in NP AlignmentKovács, Ádám / Ács, Judit / Kornai, Andras / Recski, Gábor et al. | 2020
- 3640
-
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures TranslationSong, Haiyue / Dabre, Raj / Fujita, Atsushi / Kurohashi, Sadao et al. | 2020
- 3650
-
Being Generous with Sub-Words towards Small NMT ChildrenDefauw, Arne / Vanallemeersch, Tom / Van Winckel, Koen / Szoc, Sara / Van den Bogaert, Joachim et al. | 2020
- 3657
-
Document Sub-structure in Neural Machine TranslationDobreva, Radina / Zhou, Jie / Bawden, Rachel et al. | 2020
- 3668
-
An Evaluation Benchmark for Testing the Word Sense Disambiguation Capabilities of Machine Translation SystemsRaganato, Alessandro / Scherrer, Yves / Tiedemann, Jörg et al. | 2020
- 3676
-
MEDLINE as a Parallel Corpus: a Survey to Gain Insight on French-, Spanish- and Portuguese-speaking Authors’ Abstract Writing PracticeNévéol, Aurélie / Jimeno Yepes, Antonio / Neves, Mariana et al. | 2020
- 3683
-
JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine TranslationMao, Zhuoyuan / Cromieres, Fabien / Dabre, Raj / Song, Haiyue / Kurohashi, Sadao et al. | 2020
- 3692
-
A Post-Editing Dataset in the Legal Domain: Do we Underestimate Neural Machine Translation Quality?Ive, Julia / Specia, Lucia / Szoc, Sara / Vanallemeersch, Tom / Van den Bogaert, Joachim / Farah, Eduardo / Maroti, Christine / Ventura, Artur / Khalilov, Maxim et al. | 2020
- 3698
-
Linguistically Informed Hindi-English Neural Machine TranslationGoyal, Vikrant / Mishra, Pruthwik / Sharma, Dipti Misra et al. | 2020
- 3704
-
A Test Set for Discourse Translation from Japanese to EnglishNagata, Masaaki / Morishita, Makoto et al. | 2020
- 3710
-
An Analysis of Massively Multilingual Neural Machine Translation for Low-Resource LanguagesMueller, Aaron / Nicolai, Garrett / McCarthy, Arya D. / Lewis, Dylan / Wu, Winston / Yarowsky, David et al. | 2020
- 3719
-
TDDC: Timely Disclosure Documents CorpusDoi, Nobushige / Oda, Yusuke / Nakazawa, Toshiaki et al. | 2020
- 3727
-
MuST-Cinema: a Speech-to-Subtitles corpusKarakanta, Alina / Negri, Matteo / Turchi, Marco et al. | 2020
- 3735
-
On Context Span Needed for Machine Translation EvaluationCastilho, Sheila / Popović, Maja / Way, Andy et al. | 2020
- 3743
-
A Multilingual Parallel Corpora Collection Effort for Indian LanguagesSiripragrada, Shashank / Philip, Jerin / Namboodiri, Vinay P. / Jawahar, C V et al. | 2020
- 3752
-
To Case or not to case: Evaluating Casing Methods for Neural Machine TranslationEtchegoyhen, Thierry / Gete, Harritxu et al. | 2020
- 3761
-
The MARCELL Legislative CorpusVáradi, Tamás / Koeva, Svetla / Yamalov, Martin / Tadić, Marko / Sass, Bálint / Nitoń, Bartłomiej / Ogrodniczuk, Maciej / Pęzik, Piotr / Barbu Mititelu, Verginica / Ion, Radu et al. | 2020
- 3769
-
ParaPat: The Multi-Million Sentences Parallel Corpus of Patents AbstractsSoares, Felipe / Stevenson, Mark / Bartolome, Diego / Zaretskaya, Anna et al. | 2020
- 3775
-
Corpora for Document-Level Neural Machine TranslationLiu, Siyou / Zhang, Xiaojun et al. | 2020
- 3782
-
OpusTools and Parallel Corpus DiagnosticsAulamo, Mikko / Sulubacak, Umut / Virpioja, Sami / Tiedemann, Jörg et al. | 2020
- 3790
-
Literary Machine Translation under the Magnifying Glass: Assessing the Quality of an NMT-Translated Detective Novel on Document LevelFonteyne, Margot / Tezcan, Arda / Macken, Lieve et al. | 2020
- 3799
-
Handle with Care: A Case Study in Comparable Corpora Exploitation for Neural Machine TranslationEtchegoyhen, Thierry / Gete, Harritxu et al. | 2020
- 3808
-
The FISKMÖ Project: Resources and Tools for Finnish-Swedish Machine Translation and Cross-Linguistic ResearchTiedemann, Jörg / Nieminen, Tommi / Aulamo, Mikko / Kanerva, Jenna / Leino, Akseli / Ginter, Filip / Papula, Niko et al. | 2020
- 3816
-
Multiword Expression aware Neural Machine TranslationZaninello, Andrea / Birch, Alexandra et al. | 2020
- 3826
-
An Enhanced Mapping Scheme of the Universal Part-Of-Speech for KoreanKim, Myung Hee / Colineau, Nathalie et al. | 2020
- 3834
-
Finite State Machine Pattern-Root Arabic Morphological Generator, Analyzer and DiacritizerAlkhairy, Maha / Jafri, Afshan / Smith, David et al. | 2020
- 3842
-
An Unsupervised Method for Weighting Finite-state Morphological AnalyzersKeleg, Amr / Tyers, Francis / Howell, Nick / Pirinen, Tommi et al. | 2020
- 3851
-
Language-Independent Tokenisation Rivals Language-Specific Tokenisation for Word Similarity PredictionBollegala, Danushka / Kiryo, Ryuichi / Tsujino, Kosuke / Yukawa, Haruki et al. | 2020
- 3861
-
A Supervised Part-Of-Speech Tagger for the Greek Language of the Social WebNikiforos, Maria Nefeli / Kermanidis, Katia Lida et al. | 2020
- 3868
-
Bag & Tag'em - A New Dutch StemmerJonker, Anne / de Ruijt, Corné / de Gruijl, Jornt et al. | 2020
- 3877
-
Glawinette: a Linguistically Motivated Derivational Description of French Acquired from GLAWIHathout, Nabil / Sajous, Franck / Calderone, Basilio / Namer, Fiammetta et al. | 2020
- 3886
-
BabyFST - Towards a Finite-State Based Computational Model of Ancient BabylonianSahala, Aleksi / Silfverberg, Miikka / Arppe, Antti / Lindén, Krister et al. | 2020
- 3895
-
Morphological Analysis and Disambiguation for Gulf Arabic: The Interplay between Resources and MethodsKhalifa, Salam / Zalmout, Nasser / Habash, Nizar et al. | 2020
- 3905
-
Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional CorpusMetheniti, Eleni / Neumann, Guenter et al. | 2020
- 3913
-
Introducing a Large-Scale Dataset for Vietnamese POS Tagging on Conversational TextsTran, Oanh / Pham, Tu / Dang, Vu / Nguyen, Bang et al. | 2020
- 3922
-
UniMorph 3.0: Universal MorphologyMcCarthy, Arya D. / Kirov, Christo / Grella, Matteo / Nidhi, Amrit / Xia, Patrick / Gorman, Kyle / Vylomova, Ekaterina / Mielke, Sabrina J. / Nicolai, Garrett / Silfverberg, Miikka et al. | 2020
- 3932
-
Building the Spanish-Croatian Parallel CorpusMikelenić, Bojana / Tadić, Marko et al. | 2020
- 3937
-
DerivBase.Ru: a Derivational Morphology Resource for RussianVodolazsky, Daniil et al. | 2020
- 3944
-
Morfessor EM+Prune: Improved Subword Segmentation with Expectation Maximization and PruningGrönroos, Stig-Arne / Virpioja, Sami / Kurimo, Mikko et al. | 2020
- 3954
-
Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for SerbianStankovic, Ranka / Šandrih, Branislava / Krstev, Cvetana / Utvić, Miloš / Skoric, Mihailo et al. | 2020
- 3963
-
Fine-grained Morphosyntactic Analysis and Generation Tools for More Than One Thousand LanguagesNicolai, Garrett / Lewis, Dylan / McCarthy, Arya D. / Mueller, Aaron / Wu, Winston / Yarowsky, David et al. | 2020
- 3973
-
Cairo Student Code-Switch (CSCS) Corpus: An Annotated Egyptian Arabic-English CorpusBalabel, Mohamed / Hamed, Injy / Abdennadher, Slim / Vu, Ngoc Thang / Çetinoğlu, Özlem et al. | 2020
- 3978
-
Getting More Data for Low-resource Morphological Inflection: Language Models and Data AugmentationSorokin, Alexey et al. | 2020
- 3984
-
Visual Modeling of Turkish MorphologyÖzenç, Berke / Solak, Ercan et al. | 2020
- 3991
-
Kvistur 2.0: a BiLSTM Compound Splitter for IcelandicDaðason, Jón / Mollberg, David / Loftsson, Hrafn / Bjarnadóttir, Kristín et al. | 2020
- 3996
-
Morphological Segmentation for Low Resource LanguagesMott, Justin / Bies, Ann / Strassel, Stephanie / Kodner, Jordan / Richter, Caitlin / Xu, Hongzhi / Marcus, Mitchell et al. | 2020
- 4003
-
CCNet: Extracting High Quality Monolingual Datasets from Web Crawl DataWenzek, Guillaume / Lachaux, Marie-Anne / Conneau, Alexis / Chaudhary, Vishrav / Guzmán, Francisco / Joulin, Armand / Grave, Edouard et al. | 2020
- 4013
-
On the Robustness of Unsupervised and Semi-supervised Cross-lingual Word Embedding LearningDoval, Yerai / Camacho-Collados, Jose / Espinosa Anke, Luis / Schockaert, Steven et al. | 2020
- 4024
-
Building an English-Chinese Parallel Corpus Annotated with Sub-sentential Translation TechniquesZhai, Yuming / Liu, Lufei / Zhong, Xinyi / Illouz, Gabriel / Vilnat, Anne et al. | 2020
- 4034
-
Universal Dependencies v2: An Evergrowing Multilingual Treebank CollectionNivre, Joakim / de Marneffe, Marie-Catherine / Ginter, Filip / Hajic, Jan / Manning, Christopher D. / Pyysalo, Sampo / Schuster, Sebastian / Tyers, Francis / Zeman, Daniel et al. | 2020
- 4044
-
EMPAC: an English-Spanish Corpus of Institutional SubtitlesSerrat Roozen, Iris / Martínez Martínez, José Manuel et al. | 2020
- 4054
-
Cross-Lingual Word Embeddings for Turkic LanguagesKuriyozov, Elmurod / Doval, Yerai / Gómez-Rodríguez, Carlos et al. | 2020
- 4063
-
How Universal are Universal Dependencies? Exploiting Syntax for Multilingual Clause-level Sentiment DetectionKanayama, Hiroshi / Iwamoto, Ran et al. | 2020