A Major Wordnet for a Minority Language: Scottish Gaelic (English)

Free access

Bella, Gábor / McNeill, Fiona / Gorman, Rody / O Donnaile, Caoimhin / MacDonald, Kirsty / Chandrashekar, Yamini / Freihat, Abed Alhakim / Giunchiglia, Fausto

In: LREC 2020 Marseille ; 2812-2818 ; 2020

Conference paper / Electronic Resource

How to get this title?

Download

Export, share and cite

Title:

A Major Wordnet for a Minority Language: Scottish Gaelic
Contributors:

Bella, Gábor ( author ) / McNeill, Fiona ( author ) / Gorman, Rody ( author ) / O Donnaile, Caoimhin ( author ) / MacDonald, Kirsty ( author ) / Chandrashekar, Yamini ( author ) / Freihat, Abed Alhakim ( author ) / Giunchiglia, Fausto ( author )
Conference:

International Conference on Language Resources and Evaluation ; 12. ; 2020 ; Marseille
Published in:

LREC 2020 Marseille ; 2812-2818
Publisher:

The European Language Resources Association (ELRA)

Place of publication:

Paris
Publication date:

2020
Type of media:

Conference paper
Type of material:

Electronic Resource
Language:

English

Classification:

BKL:

17.46 Mathematische Linguistik / 18.00 Einzelne Sprachen und Literaturen allgemein / 54.75 Sprachverarbeitung

Source:

TIBKAT

The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.

1: Neural Mention Detection
Yu, Juntao / Bohnet, Bernd / Poesio, Massimo et al. | 2020
digital version
11: A Cluster Ranking Model for Full Anaphora Resolution
Yu, Juntao / Uma, Alexandra / Poesio, Massimo et al. | 2020
digital version
21: Mandarinograd: A Chinese Collection of Winograd Schemas
Bernard, Timothée / Han, Ting et al. | 2020
digital version
27: On the Influence of Coreference Resolution on Word Embeddings in Lexical-semantic Evaluation Tasks
Henlein, Alexander / Mehler, Alexander et al. | 2020
digital version
34: NoEl: An Annotated Corpus for Noun Ellipsis in English
Khullar, Payal / Majmundar, Kushal / Shrivastava, Manish et al. | 2020
digital version
44: An Annotated Dataset of Coreference in English Literature
Bamman, David / Lewke, Olivia / Mansoor, Anya et al. | 2020
digital version
55: GerDraCor-Coref: A Coreference Corpus for Dramatic Texts in German
Pagel, Janis / Reiter, Nils et al. | 2020
digital version
65: A Study on Entity Resolution for Email Conversations
Dakle, Parag Pravin / Desai, Takshak / Moldovan, Dan et al. | 2020
digital version
74: Model-based Annotation of Coreference
Aralikatte, Rahul / Søgaard, Anders et al. | 2020
digital version
80: French Coreference for Spoken and Written Language
Wilkens, Rodrigo / Oberle, Bruno / Landragin, Frédéric / Todirascu, Amalia et al. | 2020
digital version
90: Cross-lingual Zero Pronoun Resolution
Aloraini, Abdulrahman / Poesio, Massimo et al. | 2020
digital version
99: Exploiting Cross-Lingual Hints to Discover Event Pronouns
Loáiciga, Sharid / Hardmeier, Christian / Sayeed, Asad et al. | 2020
digital version
104: MuDoCo: Corpus for Multidomain Coreference Resolution and Referring Expression Generation
Martin, Scott / Poddar, Shivani / Upasani, Kartikeya et al. | 2020
digital version
112: Affection Driven Neural Networks for Sentiment Analysis
Xiang, Rong / Long, Yunfei / Wan, Mingyu / Gu, Jinghang / Lu, Qin / Huang, Chu-Ren et al. | 2020
digital version
120: The Alice Datasets: fMRI & EEG Observations of Natural Language Comprehension
Bhattasali, Shohini / Brennan, Jonathan / Luh, Wen-Ming / Franzluebbers, Berta / Hale, John et al. | 2020
digital version
126: Modelling Narrative Elements in a Short Story: A Study on Annotation Schemes and Guidelines
Mikhalkova, Elena / Protasov, Timofei / Sokolova, Polina / Bashmakova, Anastasiia / Drozdova, Anastasiia et al. | 2020
digital version
133: Cortical Speech Databases For Deciphering the Articulatory Code
Höge, Harald et al. | 2020
digital version
138: ZuCo 2.0: A Dataset of Physiological Recordings During Natural Reading and Annotation
Hollenstein, Nora / Troendle, Marius / Zhang, Ce / Langer, Nicolas et al. | 2020
digital version
147: Linguistic, Kinematic and Gaze Information in Task Descriptions: The LKG-Corpus
Reinboth, Tim / Gross, Stephanie / Bishop, Laura / Krenn, Brigitte et al. | 2020
digital version
156: The ACQDIV Corpus Database and Aggregation Pipeline
Jancso, Anna / Moran, Steven / Stoll, Sabine et al. | 2020
digital version
166: Providing Semantic Knowledge to a Set of Pictograms for People with Disabilities: a Set of Links between WordNet and Arasaac: Arasaac-WN
Schwab, Didier / Trial, Pauline / Vaschalde, Céline / Vial, Loïc / Esperanca-Rodier, Emmanuelle / Lecouteux, Benjamin et al. | 2020
digital version
172: Orthographic Codes and the Neighborhood Effect: Lessons from Information Theory
Tulkens, Stéphan / Sandra, Dominiek / Daelemans, Walter et al. | 2020
digital version
182: Understanding the Dynamics of Second Language Writing through Keystroke Logging and Complexity Contours
Kerz, Elma / Pruneri, Fabio / Wiechmann, Daniel / Qiao, Yu / Ströbel, Marcus et al. | 2020
digital version
189: Design of BCCWJ-EEG: Balanced Corpus with Human Electroencephalography
Oseki, Yohei / Asahara, Masayuki et al. | 2020
digital version
195: Using the RUPEX Multichannel Corpus in a Pilot fMRI Study on Speech Disfluencies
Smirnova, Katerina / Korotaev, Nikolay / Panikratova, Yana / Lebedeva, Irina / Pechenkova, Ekaterina / Fedorova, Olga et al. | 2020
digital version
204: Construction of an Evaluation Corpus for Grammatical Error Correction for Learners of Japanese as a Second Language
Koyama, Aomi / Kiyuna, Tomoshige / Kobayashi, Kenji / Arai, Mio / Komachi, Mamoru et al. | 2020
digital version
212: Effective Crowdsourcing of Multiple Tasks for Comprehensive Knowledge Extraction
Nam, Sangha / Lee, Minho / Kim, Donghwan / Han, Kijong / Kim, Kuntae / Yoon, Sooji / Kim, Eun-kyung / Choi, Key-Sun et al. | 2020
digital version
220: Developing a Corpus of Indirect Speech Act Schemas
Roque, Antonio / Tsuetaki, Alexander / Sarathy, Vasanth / Scheutz, Matthias et al. | 2020
digital version
229: Quality Estimation for Partially Subjective Classification Tasks via Crowdsourcing
Sato, Yoshinao / Miyazawa, Kouki et al. | 2020
digital version
236: Crowdsourcing in the Development of a Multilingual FrameNet: A Case Study of Korean FrameNet
Hahm, Younggyun / Noh, Youngbin / Han, Ji Yoon / Oh, Tae Hwan / Choe, Hyonsu / Kim, Hansaem / Choi, Key-Sun et al. | 2020
digital version
245: Towards a Reliable and Robust Methodology for Crowd-Based Subjective Quality Assessment of Query-Based Extractive Text Summarization
Iskender, Neslihan / Polzehl, Tim / Möller, Sebastian et al. | 2020
digital version
254: A Seed Corpus of Hindu Temples in India
Radhakrishnan, Priya et al. | 2020
digital version
259: Do You Believe It Happened? Assessing Chinese Readers' Veridicality Judgments
Chang, Yu-Yun / Hsieh, Shu-Kai et al. | 2020
digital version
268: Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning
Nicolas, Lionel / Lyding, Verena / Borg, Claudia / Forascu, Corina / Fort, Karën / Zdravkova, Katerina / Kosem, Iztok / Čibej, Jaka / Arhar Holdt, Špela / Millour, Alice et al. | 2020
digital version
279: MAGPIE: A Large Corpus of Potentially Idiomatic Expressions
Haagsma, Hessel / Bos, Johan / Nissim, Malvina et al. | 2020
digital version
288: CRWIZ: A Framework for Crowdsourcing Real-Time Wizard-of-Oz Dialogues
Chiyah Garcia, Francisco Javier / Lopes, José / Liu, Xingkun / Hastie, Helen et al. | 2020
digital version
298: Effort Estimation in Named Entity Tagging Tasks
Gomes, Inês / Correia, Rui / Ribeiro, Jorge / Freitas, João et al. | 2020
digital version
307: Using Crowdsourced Exercises for Vocabulary Training to Expand ConceptNet
Rodosthenous, Christos / Lyding, Verena / Sangati, Federico / König, Alexander / ul Hassan, Umair / Nicolas, Lionel / Horbacauskiene, Jolita / Katinskaia, Anisia / Aparaschivei, Lavinia et al. | 2020
digital version
317: Predicting Multidimensional Subjective Ratings of Children’ Readings from the Speech Signals for the Automatic Assessment of Fluency
Bailly, Gérard / Godde, Erika / Piat-Marchand, Anne-Laure / Bosse, Marie-Line et al. | 2020
digital version
323: Constructing Multimodal Language Learner Texts Using LARA: Experiences with Nine Languages
Akhlaghi, Elham / Bédi, Branislav / Bektaş, Fatih / Berthelsen, Harald / Butterweck, Matthias / Chua, Cathy / Cucchiarin, Catia / Eryiğit, Gülşen / Gerlach, Johanna / Habibi, Hanieh et al. | 2020
digital version
332: A Dataset for Investigating the Impact of Feedback on Student Revision Outcome
Pilan, Ildiko / Lee, John / Yeung, Chak Yan / Webster, Jonathan et al. | 2020
digital version
340: Creating Corpora for Research in Feedback Comment Generation
Nagata, Ryo / Inui, Kentaro / Ishikawa, Shin'ichiro et al. | 2020
digital version
346: Using Multilingual Resources to Evaluate CEFRLex for Learner Applications
Graën, Johannes / Alfter, David / Schneider, Gerold et al. | 2020
digital version
356: Immersive Language Exploration with Object Recognition and Augmented Reality
Platte, Benny / Platte, Anett / Roschke, Christian / Thomanek, Rico / Rolletschke, Thony / Zimmer, Frank / Ritter, Marc et al. | 2020
digital version
363: A Process-oriented Dataset of Revisions during Writing
Conijn, Rianne / Dux Speltz, Emily / van Zaanen, Menno / Van Waes, Luuk / Chukharev-Hudilainen, Evgeny et al. | 2020
digital version
369: Automated Writing Support Using Deep Linguistic Parsers
Morgado da Costa, Luís / V P Winder, Roger / Li, Shu Yun / Lin Tzer Liang, Benedict Christopher / Mackinnon, Joseph / Bond, Francis et al. | 2020
digital version
378: TLT-school: a Corpus of Non Native Children Speech
Gretter, Roberto / Matassoni, Marco / Bannò, Stefano / Daniele, Falavigna et al. | 2020
digital version
386: Toward a Paradigm Shift in Collection of Learner Corpora
Katinskaia, Anisia / Ivanova, Sardana / Yangarber, Roman et al. | 2020
digital version
392: Quality Focused Approach to a Learner Corpus Development
Darģis, Roberts / Auziņa, Ilze / Levāne-Petrova, Kristīne / Kaija, Inga et al. | 2020
digital version
397: An Exploratory Study into Automated Précis Grading
De Clercq, Orphee / Van Hoecke, Senne et al. | 2020
digital version
405: Adjusting Image Attributes of Localized Regions with Low-level Dialogue
Lin, Tzu-Hsiang / Rudnicky, Alexander / Bui, Trung / Kim, Doo Soon / Oh, Jean et al. | 2020
digital version
413: Alignment Annotation for Clinic Visit Dialogue to Clinical Note Sentence Language Generation
Yim, Wen-wai / Yetisgen, Meliha / Huang, Jenny / Grossman, Micah et al. | 2020
digital version
422: MultiWOZ 2.1: A Consolidated Multi-Domain Dialogue Dataset with State Corrections and State Tracking Baselines
Eric, Mihail / Goel, Rahul / Paul, Shachi / Sethi, Abhishek / Agarwal, Sanchit / Gao, Shuyang / Kumar, Adarsh / Goyal, Anuj / Ku, Peter / Hakkani-Tur, Dilek et al. | 2020
digital version
429: A Comparison of Explicit and Implicit Proactive Dialogue Strategies for Conversational Recommendation
Kraus, Matthias / Fischbach, Fabian / Jansen, Pascal / Minker, Wolfgang et al. | 2020
digital version
436: Conversational Question Answering in Low Resource Scenarios: A Dataset and Case Study for Basque
Otegi, Arantxa / Agirre, Aitor / Campos, Jon Ander / Soroa, Aitor / Agirre, Eneko et al. | 2020
digital version
443: Construction and Analysis of a Multimodal Chat-talk Corpus for Dialog Systems Considering Interpersonal Closeness
Yamazaki, Yoshihiro / Chiba, Yuya / Nose, Takashi / Ito, Akinori et al. | 2020
digital version
449: BLISS: An Agent for Collecting Spoken Dialogue Data about Health and Well-being
van Waterschoot, Jelte / Hendrickx, Iris / Khan, Arif / Klabbers, Esther / de Korte, Marcel / Strik, Helmer / Cucchiarini, Catia / Theune, Mariët et al. | 2020
digital version
459: The JDDC Corpus: A Large-Scale Multi-Turn Chinese Dialogue Dataset for E-commerce Customer Service
Chen, Meng / Liu, Ruixue / Shen, Lei / Yuan, Shaozu / Zhou, Jingyan / Wu, Youzheng / He, Xiaodong / Zhou, Bowen et al. | 2020
digital version
467: "Cheese!": a Corpus of Face-to-face French Interactions. A Case Study for Analyzing Smiling and Conversational Humor
Priego-Valverde, Béatrice / Bigi, Brigitte / Amoyal, Mary et al. | 2020
digital version
476: The Margarita Dialogue Corpus: A Data Set for Time-Offset Interactions and Unstructured Dialogue Systems
Chierici, Alberto / Habash, Nizar / Bicec, Margarita et al. | 2020
digital version
485: How Users React to Proactive Voice Assistant Behavior While Driving
Schmidt, Maria / Minker, Wolfgang / Werner, Steffen et al. | 2020
digital version
491: Emotional Speech Corpus for Persuasive Dialogue System
Asai, Sara / Yoshino, Koichiro / Shinagawa, Seitaro / Sakti, Sakriani / Nakamura, Satoshi et al. | 2020
digital version
498: Multimodal Analysis of Cohesion in Multi-party Interactions
Bangalore Kantharaju, Reshmashree / Langlet, Caroline / Barange, Mukesh / Clavel, Chloé / Pelachaud, Catherine et al. | 2020
digital version
508: Treating Dialogue Quality Evaluation as an Anomaly Detection Problem
Nedelchev, Rostislav / Usbeck, Ricardo / Lehmann, Jens et al. | 2020
digital version
513: Evaluation of Argument Search Approaches in the Context of Argumentative Dialogue Systems
Rach, Niklas / Matsuda, Yuki / Daxenberger, Johannes / Ultes, Stefan / Yasumoto, Keiichi / Minker, Wolfgang et al. | 2020
digital version
523: PATE: A Corpus of Temporal Expressions for the In-car Voice Assistant Domain
Zarcone, Alessandra / Alam, Touhidul / Kolagar, Zahra et al. | 2020
digital version
531: Mapping the Dialog Act Annotations of the LEGO Corpus into ISO 24617-2 Communicative Functions
Ribeiro, Eugénio / Ribeiro, Ricardo / Martins de Matos, David et al. | 2020
digital version
540: Estimating User Communication Styles for Spoken Dialogue Systems
Miehle, Juliana / Feustel, Isabel / Hornauer, Julia / Minker, Wolfgang / Ultes, Stefan et al. | 2020
digital version
549: The ISO Standard for Dialogue Act Annotation, Second Edition
Bunt, Harry / Petukhova, Volha / Gilmartin, Emer / Pelachaud, Catherine / Fang, Alex / Keizer, Simon / Prévot, Laurent et al. | 2020
digital version
559: The AICO Multimodal Corpus - Data Collection and Preliminary Analyses
Jokinen, Kristiina et al. | 2020
digital version
565: A Corpus of Controlled Opinionated and Knowledgeable Movie Discussions for Training Neural Conversation Models
Galetzka, Fabian / Eneh, Chukwuemeka Uchenna / Schlangen, David et al. | 2020
digital version
574: A French Medical Conversations Corpus Annotated for a Virtual Patient Dialogue System
Laleye, Fréjus A. A. / de Chalendar, Gaël / Blanié, Antonia / Brouquet, Antoine / Behnamou, Dan et al. | 2020
digital version
581: Getting To Know You: User Attribute Extraction from Dialogues
Wu, Chien-Sheng / Madotto, Andrea / Lin, Zhaojiang / Xu, Peng / Fung, Pascale et al. | 2020
digital version
590: Augmenting Small Data to Classify Contextualized Dialogue Acts for Exploratory Visualization
Kumar, Abhinav / Di Eugenio, Barbara / Aurisano, Jillian / Johnson, Andrew et al. | 2020
digital version
600: RDG-Map: A Multimodal Corpus of Pedagogical Human-Agent Spoken Interactions.
Paetzel, Maike / Karkada, Deepthi / Manuvinakurike, Ramesh et al. | 2020
digital version
610: MPDD: A Multi-Party Dialogue Dataset for Analysis of Emotions and Interpersonal Relationships
Chen, Yi-Ting / Huang, Hen-Hsen / Chen, Hsin-Hsi et al. | 2020
digital version
615: "Alexa in the wild" - Collecting Unconstrained Conversations with a Modern Voice Assistant in a Public Environment
Siegert, Ingo et al. | 2020
digital version
620: EDA: Enriching Emotional Dialogue Acts using an Ensemble of Neural Annotators
Bothe, Chandrakant / Weber, Cornelius / Magg, Sven / Wermter, Stefan et al. | 2020
digital version
628: PACO: a Corpus to Analyze the Impact of Common Ground in Spontaneous Face-to-Face Interaction
Amoyal, Mary / Priego-Valverde, Béatrice / Rauzy, Stephane et al. | 2020
digital version
634: Dialogue Act Annotation in a Multimodal Corpus of First Encounter Dialogues
Navarretta, Costanza / Paggio, Patrizia et al. | 2020
digital version
644: A Conversation-Analytic Annotation of Turn-Taking Behavior in Japanese Multi-Party Conversation and its Preliminary Analysis
Enomoto, Mika / Den, Yasuharu / Ishimoto, Yuichi et al. | 2020
digital version
653: Understanding User Utterances in a Dialog System for Caregiving
Asao, Yoshihiko / Kloetzer, Julien / Mizuno, Junta / Saiki, Dai / Kadowaki, Kazuma / Torisawa, Kentaro et al. | 2020
digital version
662: Designing Multilingual Interactive Agents using Small Dialogue Corpora
Lin, Donghui / Otani, Masayuki / Okuno, Ryosuke / Ishida, Toru et al. | 2020
digital version
668: Multimodal Corpus of Bidirectional Conversation of Human-human and Human-robot Interaction during fMRI Scanning
Rauchbauer, Birgit / Hmamouche, Youssef / Bigi, Brigitte / Prévot, Laurent / Ochs, Magalie / Chaminade, Thierry et al. | 2020
digital version
676: The Brain-IHM Dataset: a New Resource for Studying the Brain Basis of Human-Human and Human-Machine Conversations
Ochs, Magalie / Bertrand, Roxane / Goujon, Aurélie / Bolger, Deirdre / Dubarry, Anne-Sophie / Blache, Philippe et al. | 2020
digital version
684: Dialogue-AMR: Abstract Meaning Representation for Dialogue
Bonial, Claire / Donatelli, Lucia / Abrams, Mitchell / Lukin, Stephanie M. / Tratz, Stephen / Marge, Matthew / Artstein, Ron / Traum, David / Voss, Clare et al. | 2020
digital version
696: Relation between Degree of Empathy for Narrative Speech and Type of Responsive Utterance in Attentive Listening
Ito, Koichiro / Murata, Masaki / Ohno, Tomohiro / Matsubara, Shigeki et al. | 2020
digital version
702: Intent Recognition in Doctor-Patient Interviews
Rojowiec, Robin / Roth, Benjamin / Fink, Maximilian et al. | 2020
digital version
710: BrainPredict: a Tool for Predicting and Visualising Local Brain Activity
Hmamouche, Youssef / Prévot, Laurent / Ochs, Magalie / Chaminade, Thierry et al. | 2020
digital version
717: MTSI-BERT: A Session-aware Knowledge-based Conversational Agent
Senese, Matteo Antonio / Rizzo, Giuseppe / Dragoni, Mauro / Morisio, Maurizio et al. | 2020
digital version
726: Predicting Ratings of Real Dialogue Participants from Artificial Data and Ratings of Human Dialogue Observers
Georgila, Kallirroi / Gordon, Carla / Yanov, Volodymyr / Traum, David et al. | 2020
digital version
735: Which Model Should We Use for a Real-World Conversational Dialogue System? a Cross-Language Relevance Model or a Deep Neural Net?
Alavi, Seyed Hossein / Leuski, Anton / Traum, David et al. | 2020
digital version
743: Chinese Whispers: A Multimodal Dataset for Embodied Language Grounding
Kontogiorgos, Dimosthenis / Sibirtseva, Elena / Gustafson, Joakim et al. | 2020
digital version
750: AMUSED: A Multi-Stream Vector Representation Method for Use in Natural Dialogue
Kumar, Gaurav / Joshi, Rishabh / Singh, Jaspreet / Yenigalla, Promod et al. | 2020
digital version
759: An Annotation Approach for Social and Referential Gaze in Dialogue
Somashekarappa, Vidya / Howes, Christine / Sayeed, Asad et al. | 2020
digital version
766: A Penn-style Treebank of Middle Low German
Booth, Hannah / Breitbarth, Anne / Ecay, Aaron / Farasyn, Melissa et al. | 2020
digital version
776: Books of Hours. the First Liturgical Data Set for Text Segmentation.
Hazem, Amir / Daille, Beatrice / Kermorvant, Christopher / Stutzmann, Dominique / Bonhomme, Marie-Laurence / Maarand, Martin / Boillet, Mélodie et al. | 2020
digital version
785: Corpus of Chinese Dynastic Histories: Gender Analysis over Two Millennia
Zinin, Sergey / Xu, Yang et al. | 2020
digital version
794: The Royal Society Corpus 6.0: Providing 300+ Years of Scientific Writing for Humanistic Study
Fischer, Stefan / Knappen, Jörg / Menzel, Katrin / Teich, Elke et al. | 2020
digital version
803: Corpus REDEWIEDERGABE
Brunner, Annelen / Engelberg, Stefan / Jannidis, Fotis / Tu, Ngoc Duyen Tanja / Weimer, Lukas et al. | 2020
digital version
813: WeDH - a Friendly Tool for Building Literary Corpora Enriched with Encyclopedic Metadata
Egloff, Mattia / Picca, Davide et al. | 2020
digital version
817: Automatic Section Recognition in Obituaries
Sabbatino, Valentino / Bostan, Laura Ana Maria / Klinger, Roman et al. | 2020
digital version
826: SLäNDa: An Annotated Corpus of Narrative and Dialogue in Swedish Literary Fiction
Stymne, Sara / Östman, Carin et al. | 2020
digital version
835: RiQuA: A Corpus of Rich Quotation Annotation for English Literary Text
Papay, Sean / Padó, Sebastian et al. | 2020
digital version
842: A Corpus Linguistic Perspective on Contemporary German Pop Lyrics with the Multi-Layer Annotated "Songkorpus"
Schneider, Roman et al. | 2020
digital version
849: The BDCamões Collection of Portuguese Literary Documents: a Research Resource for Digital Humanities and Language Technology
Grilo, Sara / Bolrinha, Márcia / Silva, João / Vaz, Rui / Branco, António et al. | 2020
digital version
855: Dataset for Temporal Analysis of English-French Cognates
Frossard, Esteban / Coustaty, Mickael / Doucet, Antoine / Jatowt, Adam / Hengchen, Simon et al. | 2020
digital version
860: Material Philology Meets Digital Onomastic Lexicography: The NordiCon Database of Medieval Nordic Personal Names in Continental Sources
Waldispühl, Michelle / Dannells, Dana / Borin, Lars et al. | 2020
digital version
868: NLP Scholar: A Dataset for Examining the State of NLP Research
Mohammad, Saif M. et al. | 2020
digital version
878: The DReaM Corpus: A Multilingual Annotated Corpus of Grammars for the World’s Languages
Virk, Shafqat Mumtaz / Hammarström, Harald / Forsberg, Markus / Wichmann, Søren et al. | 2020
digital version
885: LiViTo: Linguistic and Visual Features Tool for Assisted Analysis of Historic Manuscripts
Müller, Klaus / Tikhonov, Aleksej / Meyer, Roland et al. | 2020
digital version
891: TextAnnotator: A UIMA Based Tool for the Simultaneous and Collaborative Annotation of Texts
Abrami, Giuseppe / Stoeckel, Manuel / Mehler, Alexander et al. | 2020
digital version
901: Deduplication of Scholarly Documents using Locality Sensitive Hashing and Word Embeddings
Gyawali, Bikash / Anastasiou, Lucas / Knoth, Petr et al. | 2020
digital version
911: "Voices of the Great War": A Richly Annotated Corpus of Italian Texts on the First World War
Boschetti, Federico / de felice, irene / Dei Rossi, Stefano / Dell'Orletta, Felice / Di Giorgio, Michele / Miliani, Martina / Passaro, Lucia C. / Puddu, Angelica / Venturi, Giulia / Labanca, Nicola et al. | 2020
digital version
919: DEbateNet-mig15:Tracing the 2015 Immigration Debate in Germany Over Time
Lapesa, Gabriella / Blessing, Andre / Blokker, Nico / Dayanik, Erenay / Haunss, Sebastian / Kuhn, Jonas / Padó, Sebastian et al. | 2020
digital version
928: A Corpus of Spanish Political Speeches from 1937 to 2019
Álvarez-Mellado, Elena et al. | 2020
digital version
933: A New Latin Treebank for Universal Dependencies: Charters between Ancient Latin and Romance Languages
Cecchini, Flavio Massimiliano / Korkiakangas, Timo / Passarotti, Marco et al. | 2020
digital version
943: Identification of Indigenous Knowledge Concepts through Semantic Networks, Spelling Tools and Word Embeddings
Rocha Souza, Renato / Dorn, Amelie / Piringer, Barbara / Wandl-Vogt, Eveline et al. | 2020
digital version
948: A Multi-Orthography Parallel Corpus of Yiddish Nouns
Saleva, Jonne et al. | 2020
digital version
953: An Annotated Corpus of Adjective-Adverb Interfaces in Romance Languages
Gerhalter, Katharina / Schneider, Gerlinde / Pollin, Christopher / Hummel, Martin et al. | 2020
digital version
958: Language Resources for Historical Newspapers: the Impresso Collection
Ehrmann, Maud / Romanello, Matteo / Clematide, Simon / Ströbel, Phillip Benjamin / Barman, Raphaël et al. | 2020
digital version
969: Allgemeine Musikalische Zeitung as a Searchable Online Corpus
Kampe, Bernd / Duan, Tinghui / Hahn, Udo et al. | 2020
digital version
977: Stylometry in a Bilingual Setup
Cinkova, Silvie / Rybicki, Jan et al. | 2020
digital version
985: Dialect Clustering with Character-Based Metrics: in Search of the Boundary of Language and Dialect
Sato, Yo / Heffernan, Kevin et al. | 2020
digital version
991: DiscSense: Automated Semantic Analysis of Discourse Markers
Sileo, Damien / Van de Cruys, Tim / Pradel, Camille / Muller, Philippe et al. | 2020
digital version
1000: ThemePro: A Toolkit for the Analysis of Thematic Progression
Dominguez, Monica / Soler, Juan / Wanner, Leo et al. | 2020
digital version
1008: Machine-Aided Annotation for Fine-Grained Proposition Types in Argumentation
Jo, Yohan / Mayfield, Elijah / Reed, Chris / Hovy, Eduard et al. | 2020
digital version
1019: Chinese Discourse Parsing: Model and Evaluation
Chuan-An, Lin / Hung, Shyh-Shiun / Huang, Hen-Hsen / Chen, Hsin-Hsi et al. | 2020
digital version
1025: Shallow Discourse Annotation for Chinese TED Talks
Long, Wanqiu / Cai, Xinyi / Reid, James / Webber, Bonnie / Xiong, Deyi et al. | 2020
digital version
1033: The Discussion Tracker Corpus of Collaborative Argumentation
Olshefski, Christopher / Lugini, Luca / Singh, Ravneet / Litman, Diane / Godley, Amanda et al. | 2020
digital version
1044: Shallow Discourse Parsing for Under-Resourced Languages: Combining Machine Translation and Annotation Projection
Sluyter-Gäthje, Henny / Bourgonje, Peter / Stede, Manfred et al. | 2020
digital version
1051: A Corpus of Encyclopedia Articles with Logical Forms
Rasmussen, Nathan / Schuler, William et al. | 2020
digital version
1061: The Potsdam Commentary Corpus 2.2: Extending Annotations for Shallow Discourse Parsing
Bourgonje, Peter / Stede, Manfred et al. | 2020
digital version
1067: On the Creation of a Corpus for Coherence Evaluation of Discursive Units
Mohammadi, Elham / Beiko, Timothe / Kosseim, Leila et al. | 2020
digital version
1073: Joint Learning of Syntactic Features Helps Discourse Segmentation
Desai, Takshak / Dakle, Parag Pravin / Moldovan, Dan et al. | 2020
digital version
1081: Creating a Corpus of Gestures and Predicting the Audience Response based on Gestures in Speeches of Donald Trump
Ruf, Verena / Navarretta, Costanza et al. | 2020
digital version
1089: GeCzLex: Lexicon of Czech and German Anaphoric Connectives
Poláková, Lucie / Rysová, Kateřina / Rysová, Magdaléna / Mírovský, Jiří et al. | 2020
digital version
1097: DiMLex-Bangla: A Lexicon of Bangla Discourse Connectives
Das, Debopam / Stede, Manfred / Ghosh, Soumya Sankar / Chatterjee, Lahari et al. | 2020
digital version
1103: Semi-Supervised Tri-Training for Explicit Discourse Argument Expansion
Knaebel, Rene / Stede, Manfred et al. | 2020
digital version
1110: WikiPossessions: Possession Timeline Generation as an Evaluation Benchmark for Machine Reading Comprehension of Long Texts
Chinnappa, Dhivya / Palmer, Alexis / Blanco, Eduardo et al. | 2020
digital version
1118: TED-Q: TED Talks and the Questions they Evoke
Westera, Matthijs / Mayol, Laia / Rohde, Hannah et al. | 2020
digital version
1128: CzeDLex 0.6 and its Representation in the PML-TQ
Mírovský, Jiří / Poláková, Lucie / Synková, Pavlína et al. | 2020
digital version
1135: Corpus for Modeling User Interactions in Online Persuasive Discussions
Egawa, Ryo / Morio, Gaku / Fujita, Katsuhide et al. | 2020
digital version
1142: Simplifying Coreference Chains for Dyslexic Children
Wilkens, Rodrigo / Todirascu, Amalia et al. | 2020
digital version
1152: Adapting BERT to Implicit Discourse Relation Classification with a Focus on Discourse Connectives
Kishimoto, Yudai / Murawaki, Yugo / Kurohashi, Sadao et al. | 2020
digital version
1159: What Speakers really Mean when they Ask Questions: Classification of Intentions with a Supervised Approach
Barbedette, Angèle / Eshkol-Taravella, Iris et al. | 2020
digital version
1167: Modeling Dialogue in Conversational Cognitive Health Screening Interviews
Farzana, Shahla / Valizadeh, Mina / Parde, Natalie et al. | 2020
digital version
1178: Stigma Annotation Scheme and Stigmatized Language Detection in Health-Care Discussions on Social Media
Straton, Nadiya / Jang, Hyeju / Ng, Raymond et al. | 2020
digital version
1191: An Annotated Dataset of Discourse Modes in Hindi Stories
Dhanwal, Swapnil / Dutta, Hritwik / Nankani, Hitesh / Shrivastava, Nilay / Kumar, Yaman / Li, Junyi Jessy / Mahata, Debanjan / Gosangi, Rakesh / Zhang, Haimin / Shah, Rajiv Ratn et al. | 2020
digital version
1197: Multi-class Multilingual Classification of Wikipedia Articles Using Extended Named Entity Tag Set
Shavarani, Hassan S. / Sekine, Satoshi et al. | 2020
digital version
1202: An Algerian Corpus and an Annotation Platform for Opinion and Emotion Analysis
Moudjari, Leila / Akli-Astouati, Karima / Benamara, Farah et al. | 2020
digital version
1211: Transfer Learning from Transformers to Fake News Challenge Stance Detection (FNC-1) Task
Slovikovskaya, Valeriya / Attardi, Giuseppe et al. | 2020
digital version
1219: Scientific Statement Classification over arXiv.org
Ginev, Deyan / Miller, Bruce R et al. | 2020
digital version
1227: Cross-domain Author Gender Classification in Brazilian Portuguese
Dias, Rafael / Paraboni, Ivandré et al. | 2020
digital version
1235: LEDGAR: A Large-Scale Multi-label Corpus for Text Classification of Legal Provisions in Contracts
Tuggener, Don / von Däniken, Pius / Peetz, Thomas / Cieliebak, Mark et al. | 2020
digital version
1242: Online Near-Duplicate Detection of News Articles
Rodier, Simon / Carter, Dave et al. | 2020
digital version
1250: Automated Essay Scoring System for Nonnative Japanese Learners
Hirao, Reo / Arai, Mio / Shimanaka, Hiroki / Katsumata, Satoru / Komachi, Mamoru et al. | 2020
digital version
1258: A Real-World Data Resource of Complex Sensitive Sentences Based on Documents from the Monsanto Trial
Neerbek, Jan / Eskildsen, Morten / Dolog, Peter / Assent, Ira et al. | 2020
digital version
1268: Discovering Biased News Articles Leveraging Multiple Human Annotations
Lazaridou, Konstantina / Löser, Alexander / Mestre, Maria / Naumann, Felix et al. | 2020
digital version
1278: Corpora and Baselines for Humour Recognition in Portuguese
Gonçalo Oliveira, Hugo / Clemêncio, André / Alves, Ana et al. | 2020
digital version
1286: FactCorp: A Corpus of Dutch Fact-checks and its Multiple Usages
van der Meulen, Marten / Reijnierse, W. Gudrun et al. | 2020
digital version
1293: Automatic Orality Identification in Historical Texts
Ortmann, Katrin / Dipper, Stefanie et al. | 2020
digital version
1303: Using Deep Neural Networks with Intra- and Inter-Sentence Context to Classify Suicidal Behaviour
Song, Xingyi / Downs, Johnny / Velupillai, Sumithra / Holden, Rachel / Kikoler, Maxim / Bontcheva, Kalina / Dutta, Rina / Roberts, Angus et al. | 2020
digital version
1311: A First Dataset for Film Age Appropriateness Investigation
Mohamed, Emad / Ha, Le An et al. | 2020
digital version
1318: Habibi - a multi Dialect multi National Arabic Song Lyrics Corpus
El-Haj, Mahmoud et al. | 2020
digital version
1327: Age Suitability Rating: Predicting the MPAA Rating Based on Movie Dialogues
Shafaei, Mahsa / Safi Samghabadi, Niloofar / Kar, Sudipta / Solorio, Thamar et al. | 2020
digital version
1336: Email Classification Incorporating Social Networks and Thread Structure
Alkhereyf, Sakhar / Rambow, Owen et al. | 2020
digital version
1346: Development and Validation of a Corpus for Machine Humor Comprehension
Tseng, Yuen-Hsien / Wu, Wun-Syuan / Chang, Chia-Yueh / Chen, Hsueh-Chih / Hsu, Wei-Lun et al. | 2020
digital version
1353: Alector: A Parallel Corpus of Simplified French Texts with Alignments of Misreadings by Poor and Dyslexic Readers
Gala, Núria / Tack, Anaïs / Javourey-Drevet, Ludivine / François, Thomas / Ziegler, Johannes C. et al. | 2020
digital version
1362: A Corpus for Detecting High-Context Medical Conditions in Intensive Care Patient Notes Focusing on Frequently Readmitted Patients
Moseley, Edward T. / Wu, Joy T. / Welt, Jonathan / Foote, John / Tyler, Patrick D. / Grant, David W. / Carlson, Eric T. / Gehrmann, Sebastian / Dernoncourt, Franck / Celi, Leo Anthony et al. | 2020
digital version
1368: Multilingual Stance Detection in Tweets: The Catalonia Independence Corpus
Zotova, Elena / Agerri, Rodrigo / Nuñez, Manuel / Rigau, German et al. | 2020
digital version
1376: An Evaluation of Progressive Neural Networksfor Transfer Learning in Natural Language Processing
Moeed, Abdul / Hagerer, Gerhard / Dugar, Sumit / Gupta, Sarthak / Ghosh, Mainak / Danner, Hannah / Mitevski, Oliver / Nawroth, Andreas / Groh, Georg et al. | 2020
digital version
1382: WAC: A Corpus of Wikipedia Conversations for Online Abuse Detection
Cécillon, Noé / Labatut, Vincent / Dufour, Richard / Linarès, Georges et al. | 2020
digital version
1391: FloDusTA: Saudi Tweets Dataset for Flood, Dust Storm, and Traffic Accident Events
Hamoui, Btool / Mars, Mourad / Almotairi, Khaled et al. | 2020
digital version
1397: An Annotated Corpus for Sexism Detection in French Tweets
Chiril, Patricia / Moriceau, Véronique / Benamara, Farah / Mari, Alda / Origgi, Gloria / Coulomb-Gully, Marlène et al. | 2020
digital version
1404: Measuring the Impact of Readability Features in Fake News Detection
Santos, Roney / Pedro, Gabriela / Leal, Sidney / Vale, Oto / Pardo, Thiago / Bontcheva, Kalina / Scarton, Carolina et al. | 2020
digital version
1414: When Shallow is Good Enough: Automatic Assessment of Conceptual Text Complexity using Shallow Semantic Features
Stajner, Sanja / Hulpuș, Ioana et al. | 2020
digital version
1423: DecOp: A Multilingual and Multi-domain Corpus For Detecting Deception In Typed Text
Capuozzo, Pasquale / Lauriola, Ivano / Strapparava, Carlo / Aiolli, Fabio / Sartori, Giuseppe et al. | 2020
digital version
1431: Age Recommendation for Texts
Blandin, Alexis / Lecorvé, Gwénolé / Battistelli, Delphine / Étienne, Aline et al. | 2020
digital version
1440: Multilingual Twitter Corpus and Baselines for Evaluating Demographic Bias in Hate Speech Recognition
Huang, Xiaolei / Xing, Linzi / Dernoncourt, Franck / Paul, Michael J. et al. | 2020
digital version
1449: VICTOR: a Dataset for Brazilian Legal Documents Classification
Luz de Araujo, Pedro Henrique / de Campos, Teófilo Emídio / Ataides Braz, Fabricio / Correia da Silva, Nilton et al. | 2020
digital version
1459: Dynamic Classification in Web Archiving Collections
Patel, Krutarth / Caragea, Cornelia / Phillips, Mark et al. | 2020
digital version
1469: Aspect Flow Representation and Audio Inspired Analysis for Texts
Vasconcelos, Larissa / Campelo, Claudio / Jeronimo, Caio et al. | 2020
digital version
1478: Annotating and Analyzing Biased Sentences in News Articles using Crowdsourcing
Lim, Sora / Jatowt, Adam / Färber, Michael / Yoshikawa, Masatoshi et al. | 2020
digital version
1485: Evaluation of Deep Gaussian Processes for Text Classification
Jayashree, P. / Srijith, P. K. et al. | 2020
digital version
1492: EmoEvent: A Multilingual Emotion Corpus based on different Events
Plaza del Arco, Flor Miriam / Strapparava, Carlo / Urena Lopez, L. Alfonso / Martin, Maite et al. | 2020
digital version
1499: MuSE: a Multimodal Dataset of Stressed Emotion
Jaiswal, Mimansa / Bara, Cristian-Paul / Luo, Yuanhang / Burzo, Mihai / Mihalcea, Rada / Provost, Emily Mower et al. | 2020
digital version
1511: Affect inTweets: A Transfer Learning Approach
Zhang, Linrui / Huang, Hsin-Lun / Yu, Yang / Moldovan, Dan et al. | 2020
digital version
1517: Annotation of Emotion Carriers in Personal Narratives
Tammewar, Aniruddha / Cervone, Alessandra / Messner, Eva-Maria / Riccardi, Giuseppe et al. | 2020
digital version
1526: Towards Interactive Annotation for Hesitation in Conversational Speech
Wottawa, Jane / Tahon, Marie / Marin, Apolline / Audibert, Nicolas et al. | 2020
digital version
1533: Abusive language in Spanish children and young teenager’s conversations: data preparation and short text classification with contextual word embeddings
Costa-jussà, Marta R. / González, Esther / Moreno, Asuncion / Cumalat, Eudald et al. | 2020
digital version
1538: IIIT-H TEMD Semi-Natural Emotional Speech Database from Professional Actors and Non-Actors
Rambabu, Banothu / Botsa, Kishore Kumar / Paidi, Gangamohan / Gangashetty, Suryakanth V et al. | 2020
digital version
1546: The POTUS Corpus, a Database of Weekly Addresses for the Study of Stance in Politics and Virtual Agents
Janssoone, Thomas / Bailly, Kévin / Richard, Gaël / Clavel, Chloé et al. | 2020
digital version
1554: GoodNewsEveryone: A Corpus of News Headlines Annotated with Emotions, Semantic Roles, and Reader Perception
Bostan, Laura Ana Maria / Kim, Evgeny / Klinger, Roman et al. | 2020
digital version
1567: SOLO: A Corpus of Tweets for Examining the State of Being Alone
Kiritchenko, Svetlana / Hipson, Will / Coplan, Robert / Mohammad, Saif M. et al. | 2020
digital version
1578: PoKi: A Large Dataset of Poems by Children
Hipson, Will / Mohammad, Saif M. et al. | 2020
digital version
1590: AlloSat: A New Call Center French Corpus for Satisfaction and Frustration Analysis
Macary, Manon / Tahon, Marie / Estève, Yannick / Rousseau, Anthony et al. | 2020
digital version
1598: Learning the Human Judgment for the Automatic Evaluation of Chatbot
Wu, Shih-Hung / Chien, Sheng-Lun et al. | 2020
digital version
1603: Korean-Specific Emotion Annotation Procedure Using N-Gram-Based Distant Supervision and Korean-Specific-Feature-Based Distant Supervision
Lee, Young-Jun / Lim, Chae-Gyun / Choi, Ho-Jin et al. | 2020
digital version
1611: Semi-Automatic Construction and Refinement of an Annotated Corpus for a Deep Learning Framework for Emotion Classification
Xu, Jiajun / Masuda, Kyosuke / Nishizaki, Hiromitsu / Fukumoto, Fumiyo / Suzuki, Yoshimi et al. | 2020
digital version
1618: CEASE, a Corpus of Emotion Annotated Suicide notes in English
Ghosh, Soumitra / Ekbal, Asif / Bhattacharyya, Pushpak et al. | 2020
digital version
1627: Training a Broad-Coverage German Sentiment Classification Model for Dialog Systems
Guhr, Oliver / Schumann, Anne-Kathrin / Bahrmann, Frank / Böhme, Hans Joachim et al. | 2020
digital version
1633: An Event-comment Social Media Corpus for Implicit Emotion Analysis
Lee, Sophia Yat Mei / Lau, Helena Yan Ping et al. | 2020
digital version
1643: An Emotional Mess! Deciding on a Framework for Building a Dutch Emotion-Annotated Corpus
De Bruyne, Luna / De Clercq, Orphee / Hoste, Veronique et al. | 2020
digital version
1652: PO-EMO: Conceptualization, Annotation, and Modeling of Aesthetic Emotions in German and English Poetry
Haider, Thomas / Eger, Steffen / Kim, Evgeny / Klinger, Roman / Menninghaus, Winfried et al. | 2020
digital version
1664: Learning Word Ratings for Empathy and Distress from Document-Level User Responses
Sedoc, João / Buechel, Sven / Nachmany, Yehonathan / Buffone, Anneke / Ungar, Lyle et al. | 2020
digital version
1674: Evaluation of Sentence Representations in Polish
Dadas, Slawomir / Perełkiewicz, Michał / Poświata, Rafał et al. | 2020
digital version
1681: Identification of Primary and Collateral Tracks in Stuttered Speech
Riad, Rachid / Bachoud-Lévi, Anne-Catherine / Rudzicz, Frank / Dupoux, Emmanuel et al. | 2020
digital version
1689: How to Compare Automatically Two Phonological Strings: Application to Intelligibility Measurement in the Case of Atypical Speech
Ghio, Alain / Lalain, Muriel / Giusti, Laurence / Fredouille, Corinne / Woisard, Virginie et al. | 2020
digital version
1695: Evaluating Text Coherence at Sentence and Paragraph Levels
Liu, Sennan / Zeng, Shuang / Li, Sujian et al. | 2020
digital version
1704: HardEval: Focusing on Challenging Tokens to Assess Robustness of NER
Bernier-Colborne, Gabriel / Langlais, Phillippe et al. | 2020
digital version
1712: An Evaluation Dataset for Identifying Communicative Functions of Sentences in English Scholarly Papers
Iwatsuki, Kenichi / Boudin, Florian / Aizawa, Akiko et al. | 2020
digital version
1721: An Automatic Tool For Language Evaluation
Fassetti, Fabio / Fassetti, Ilaria et al. | 2020
digital version
1727: Which Evaluations Uncover Sense Representations that Actually Make Sense?
Boyd-Graber, Jordan / Guo, Fenfei / Findlater, Leah / Iyyer, Mohit et al. | 2020
digital version
1739: Diversity, Density, and Homogeneity: Quantitative Characteristic Metrics for Text Collections
Lai, Yi-An / Zhu, Xuan / Zhang, Yi / Diab, Mona et al. | 2020
digital version
1747: Towards Few-Shot Event Mention Retrieval: An Evaluation Framework and A Siamese Network Approach
Min, Bonan / Chan, Yee Seng / Zhao, Lingjun et al. | 2020
digital version
1753: Linguistic Appropriateness and Pedagogic Usefulness of Reading Comprehension Questions
Horbach, Andrea / Aldabe, Itziar / Bexte, Marie / Lopez de Lacalle, Oier / Maritxalar, Montse et al. | 2020
digital version
1763: Dataset Reproducibility and IR Methods in Timeline Summarization
Born, Leo / Bacher, Maximilian / Markert, Katja et al. | 2020
digital version
1772: Database Search vs. Information Retrieval: A Novel Method for Studying Natural Language Querying of Semi-Structured Data
Nadig, Stefanie / Braschler, Martin / Stockinger, Kurt et al. | 2020
digital version
1780: Why Attention is Not Explanation: Surgical Intervention and Causal Reasoning about Neural Models
Grimsley, Christopher / Mayfield, Elijah / R.S. Bursten, Julia et al. | 2020
digital version
1791: Have a Cake and Eat it Too: Assessing Discriminating Performance of an Intelligibility Index Obtained from a Reduced Sample Size
Marczyk, Anna / Ghio, Alain / Lalain, Muriel / Rebourg, Marie / Fredouille, Corinne / Woisard, Virginie et al. | 2020
digital version
1796: Evaluation Metrics for Headline Generation Using Deep Pre-Trained Embeddings
Moeed, Abdul / An, Yang / Hagerer, Gerhard / Groh, Georg et al. | 2020
digital version
1803: LinCE: A Centralized Benchmark for Linguistic Code-switching Evaluation
Aguilar, Gustavo / Kar, Sudipta / Solorio, Thamar et al. | 2020
digital version
1814: Paraphrase Generation and Evaluation on Colloquial-Style Sentences
Sjöblom, Eetu / Creutz, Mathias / Scherrer, Yves et al. | 2020
digital version
1823: Analyzing Word Embedding Through Structural Equation Modeling
Han, Namgi / Hayashi, Katsuhiko / Miyao, Yusuke et al. | 2020
digital version
1833: Evaluation of Lifelong Learning Systems
Prokopalo, Yevhenii / Meignier, Sylvain / Galibert, Olivier / Barrault, Loic / Larcher, Anthony et al. | 2020
digital version
1842: Interannotator Agreement for Lexico-Semantic Annotation of a Corpus
Hajnicz, Elżbieta et al. | 2020
digital version
1849: An In-Depth Comparison of 14 Spelling Correction Tools on a Common Benchmark
Näther, Markus et al. | 2020
digital version
1858: Sentence Level Human Translation Quality Estimation with Attention-based Neural Networks
Yuan, Yu / Sharoff, Serge et al. | 2020
digital version
1866: Evaluating Language Tools for Fifteen EU-official Under-resourced Languages
Alves, Diego / Thakkar, Gaurish / Tadić, Marko et al. | 2020
digital version
1874: Word Embedding Evaluation for Sinhala
Lakmal, Dimuthu / Ranathunga, Surangika / Peramuna, Saman / Herath, Indu et al. | 2020
digital version
1882: Stress Test Evaluation of Transformer-based Models in Natural Language Understanding Tasks
Aspillaga, Carlos / Carvallo, Andrés / Araujo, Vladimir et al. | 2020
digital version
1895: Brand-Product Relation Extraction Using Heterogeneous Vector Space Representations
Janz, Arkadiusz / Kopociński, Łukasz / Piasecki, Maciej / Pluwak, Agnieszka et al. | 2020
digital version
1902: A Tale of Three Parsers: Towards Diagnostic Evaluation for Meaning Representation Parsing
Buljan, Maja / Nivre, Joakim / Oepen, Stephan / Øvrelid, Lilja et al. | 2020
digital version
1910: Headword-Oriented Entity Linking: A Special Entity Linking Task with Dataset and Baseline
Yang, Mu / Chen, Chi-Yen / Lee, Yi-Hui / Zeng, Qian-hui / Ma, Wei-Yun / Shih, Chen-Yang / Chen, Wei-Jhih et al. | 2020
digital version
1918: TableBank: Table Benchmark for Image-based Table Detection and Recognition
Li, Minghao / Cui, Lei / Huang, Shaohan / Wei, Furu / Zhou, Ming / Li, Zhoujun et al. | 2020
digital version
1926: WIKIR: A Python Toolkit for Building a Large-scale Wikipedia-based English Information Retrieval Dataset
Frej, Jibril / Schwab, Didier / Chevallet, Jean-Pierre et al. | 2020
digital version
1934: Constructing a Public Meeting Corpus
Tanaka, Koji / Chu, Chenhui / Ren, Haolin / Renoust, Benjamin / Nakashima, Yuta / Takemura, Noriko / Nagahara, Hajime / Fujikawa, Takao et al. | 2020
digital version
1941: Annotating and Extracting Synthesis Process of All-Solid-State Batteries from Scientific Literature
Kuniyoshi, Fusataka / Makino, Kohei / Ozawa, Jun / Miwa, Makoto et al. | 2020
digital version
1951: WEXEA: Wikipedia EXhaustive Entity Annotation
Strobl, Michael / Trabelsi, Amine / Zaiane, Osmar et al. | 2020
digital version
1959: Handling Entity Normalization with no Annotated Corpus: Weakly Supervised Methods Based on Distributional Representation and Ontological Information
Ferré, Arnaud / Bossy, Robert / Ba, Mouhamadou / Deléger, Louise / Lavergne, Thomas / Zweigenbaum, Pierre / Nédellec, Claire et al. | 2020
digital version
1967: HBCP Corpus: A New Resource for the Analysis of Behavioural Change Intervention Reports
Bonin, Francesca / Gleize, Martin / Finnerty, Ailbhe / Moore, Candice / Jochim, Charles / Norris, Emma / Hou, Yufang / Wright, Alison J. / Ganguly, Debasis / Hayes, Emily et al. | 2020
digital version
1976: Cross-lingual Structure Transfer for Zero-resource Event Extraction
Lu, Di / Subburathinam, Ananya / Ji, Heng / May, Jonathan / Chang, Shih-Fu / Sil, Avi / Voss, Clare et al. | 2020
digital version
1982: Cross-Domain Evaluation of Edge Detection for Biomedical Event Extraction
Ramponi, Alan / Plank, Barbara / Lombardo, Rosario et al. | 2020
digital version
1990: Semantic Annotation for Improved Safety in Construction Work
Thompson, Paul / Yates, Tim / Inan, Emrah / Ananiadou, Sophia et al. | 2020
digital version
2000: Social Web Observatory: A Platform and Method for Gathering Knowledge on Entities from Different Textual Sources
Tsekouras, Leonidas / Petasis, Georgios / Giannakopoulos, George / Kosmopoulos, Aris et al. | 2020
digital version
2009: Development of a Corpus Annotated with Medications and their Attributes in Psychiatric Health Records
Chaturvedi, Jaya / Viani, Natalia / Sanyal, Jyoti / Tytherleigh, Chloe / Hasan, Idil / Baird, Kate / Velupillai, Sumithra / Stewart, Robert / Roberts, Angus et al. | 2020
digital version
2017: Do not let the history haunt you: Mitigating Compounding Errors in Conversational Question Answering
Mandya, Angrosh / O' Neill, James / Bollegala, Danushka / Coenen, Frans et al. | 2020
digital version
2026: CLEEK: A Chinese Long-text Corpus for Entity Linking
Zeng, Weixin / Zhao, Xiang / Tang, Jiuyang / Tan, Zhen / Huang, Xuqian et al. | 2020
digital version
2036: The Medical Scribe: Corpus Development and Model Performance Analyses
Shafran, Izhak / Du, Nan / Tran, Linh / Perry, Amanda / Keyes, Lauren / Knichel, Mark / Domin, Ashley / Huang, Lei / Chen, Yu-hui / Li, Gang et al. | 2020
digital version
2045: A Contract Corpus for Recognizing Rights and Obligations
Funaki, Ruka / Nagata, Yusuke / Suenaga, Kohei / Mori, Shinsuke et al. | 2020
digital version
2054: Recognition of Implicit Geographic Movement in Text
Pezanowski, Scott / Mitra, Prasenjit et al. | 2020
digital version
2064: Extraction of the Argument Structure of Tokyo Metropolitan Assembly Minutes: Segmentation of Question-and-Answer Sets
Takamaru, Keiichi / Kimura, Yasutomo / Shibuki, Hideyuki / Ototake, Hokuto / Uchida, Yuzu / Sakamoto, Kotaro / Ishioroshi, Madoka / Mitamura, Teruko / Kando, Noriko et al. | 2020
digital version
2069: A Term Extraction Approach to Survey Analysis in Health Care
Robin, Cécile / Isazad Mashinchi, Mona / Ahmadi Zeleti, Fatemeh / Ojo, Adegboyega / Buitelaar, Paul et al. | 2020
digital version
2078: A Scientific Information Extraction Dataset for Nature Inspired Engineering
Kruiper, Ruben / Vincent, Julian F.V. / Chen-Burger, Jessica / Desmulliez, Marc P.Y. / Konstas, Ioannis et al. | 2020
digital version
2086: Automated Discovery of Mathematical Definitions in Text
Vanetik, Natalia / Litvak, Marina / Shevchuk, Sergey / Reznik, Lior et al. | 2020
digital version
2095: WN-Salience: A Corpus of News Articles with Entity Salience Annotations
Wu, Chuan / Kanoulas, Evangelos / de Rijke, Maarten / Lu, Wei et al. | 2020
digital version
2103: Event Extraction from Unstructured Amharic Text
Tadesse, Ephrem / Tsegaye, Rosa / Qaqqabaa, Kuulaa et al. | 2020
digital version
2110: Comparing Machine Learning and Deep Learning Approaches on NLP Tasks for the Italian Language
Magnini, Bernardo / Lavelli, Alberto / Magnolini, Simone et al. | 2020
digital version
2120: MyFixit: An Annotated Dataset, Annotation Tool, and Baseline Methods for Information Extraction from Repair Manuals
Nabizadeh, Nima / Kolossa, Dorothea / Heckmann, Martin et al. | 2020
digital version
2129: Towards Entity Spaces
van Erp, Marieke / Groth, Paul et al. | 2020
digital version
2138: Love Me, Love Me, Say (and Write!) that You Love Me: Enriching the WASABI Song Corpus with Lyrics Annotations
Fell, Michael / Cabrio, Elena / Korfed, Elmahdi / Buffa, Michel / Gandon, Fabien et al. | 2020
digital version
2148: Evaluating Information Loss in Temporal Dependency Trees
Ocal, Mustafa / Finlayson, Mark et al. | 2020
digital version
2157: Populating Legal Ontologies using Semantic Role Labeling
Humphreys, Llio / Boella, Guido / Di Caro, Luigi / Robaldo, Livio / van der Torre, Leon / Ghanavati, Sepideh / Muthuri, Robert et al. | 2020
digital version
2167: PST 2.0 - Corpus of Polish Spatial Texts
Marcińczuk, Michał / Oleksy, Marcin / Wieczorek, Jan et al. | 2020
digital version
2175: Natural Language Premise Selection: Finding Supporting Statements for Mathematical Text
Ferreira, Deborah / Freitas, André et al. | 2020
digital version
2183: Odinson: A Fast Rule-based Information Extraction Framework
Valenzuela-Escárcega, Marco A. / Hahn-Powell, Gus / Bell, Dane et al. | 2020
digital version
2192: The STEM-ECR Dataset: Grounding Scientific Entity References in STEM Scholarly Content to Authoritative Encyclopedic and Lexicographic Sources
D'Souza, Jennifer / Hoppe, Anett / Brack, Arthur / Jaradeh, Mohmad Yaser / Auer, Sören / Ewerth, Ralph et al. | 2020
digital version
2204: MathAlign: Linking Formula Identifiers to their Contextual Natural Language Descriptions
Alexeeva, Maria / Sharp, Rebecca / Valenzuela-Escárcega, Marco A. / Kadowaki, Jennifer / Pyarelal, Adarsh / Morrison, Clayton et al. | 2020
digital version
2213: Domain Adapted Distant Supervision for Pedagogically Motivated Relation Extraction
Sainz, Oscar / Lopez de Lacalle, Oier / Aldabe, Itziar / Maritxalar, Montse et al. | 2020
digital version
2223: Temporal Histories of Epidemic Events (THEE): A Case Study in Temporal Annotation for Public Health
Niu, Jingcheng / Ng, Victoria / Penn, Gerald / Rees, Erin E. et al. | 2020
digital version
2231: Exploiting Citation Knowledge in Personalised Recommendation of Recent Scientific Publications
Khadka, Anita / Cantador, Iván / Fernandez, Miriam et al. | 2020
digital version
2241: A Platform for Event Extraction in Hindi
Sahoo, Sovan Kumar / Saha, Saumajit / Ekbal, Asif / Bhattacharyya, Pushpak et al. | 2020
digital version
2251: Rad-SpatialNet: A Frame-based Resource for Fine-Grained Spatial Relations in Radiology Reports
Datta, Surabhi / Ulinski, Morgan / Godfrey-Stovall, Jordan / Khanpara, Shekhar / Riascos-Castaneda, Roy F. / Roberts, Kirk et al. | 2020
digital version
2261: NLP Analytics in Finance with DoRe: A French 250M Tokens Corpus of Corporate Annual Reports
Masson, Corentin / Paroubek, Patrick et al. | 2020
digital version
2268: The Language of Brain Signals: Natural Language Processing of Electroencephalography Reports
Maldonado, Ramon / Harabagiu, Sanda et al. | 2020
digital version
2276: Humans Keep It One Hundred: an Overview of AI Journey
Shavrina, Tatiana / Emelyanov, Anton / Fenogenova, Alena / Fomin, Vadim / Mikhailov, Vladislav / Evlampiev, Andrey / Malykh, Valentin / Larin, Vladimir / Natekin, Alex / Vatulin, Aleksandr et al. | 2020
digital version
2285: Towards Data-driven Ontologies: a Filtering Approach using Keywords and Natural Language Constructs
de Boer, Maaike / Verhoosel, Jack P. C. et al. | 2020
digital version
2293: A French Corpus and Annotation Schema for Named Entity Recognition and Relation Extraction of Financial News
Jabbari, Ali / Sauvage, Olivier / Zeine, Hamada / Chergui, Hamza et al. | 2020
digital version
2300: Inferences for Lexical Semantic Resource Building with Less Supervision
Bebeshina, Nadia / Lafourcade, Mathieu et al. | 2020
digital version
2306: Acquiring Social Knowledge about Personality and Driving-related Behavior
Iwai, Ritsuko / Kawahara, Daisuke / Kumada, Takatsune / Kurohashi, Sadao et al. | 2020
digital version
2316: Implicit knowledge in argumentative texts : an annotated corpus
Becker, Maria / Korfhage, Katharina / Frank, Anette et al. | 2020
digital version
2325: Multiple Knowledge GraphDB (MKGDB)
Faralli, Stefano / Velardi, Paola / Yusifli, Farid et al. | 2020
digital version
2332: Orchestrating NLP Services for the Legal Domain
Moreno-Schneider, Julian / Rehm, Georg / Montiel-Ponsoda, Elena / Rodriguez-Doncel, Víctor / Revenko, Artem / Karampatakis, Sotirios / Khvalchik, Maria / Sageder, Christian / Gracia, Jorge / Maganza, Filippo et al. | 2020
digital version
2341: Evaluation Dataset and Methodology for Extracting Application-Specific Taxonomies from the Wikipedia Knowledge Graph
Bordea, Georgeta / Faralli, Stefano / Mougin, Fleur / Buitelaar, Paul / Diallo, Gayo et al. | 2020
digital version
2348: Subjective Evaluation of Comprehensibility in Movie Interactions
Randria, Estelle / Fontan, Lionel / Le Coz, Maxime / Ferrané, Isabelle / Pinquier, Julien et al. | 2020
digital version
2358: Representing Multiword Term Variation in a Terminological Knowledge Base: a Corpus-Based Study
León-Araúz, Pilar / Reimerink, Arianne / Cabezas-García, Melania et al. | 2020
digital version
2368: Understanding Spatial Relations through Multiple Modalities
Dan, Soham / He, Hangfeng / Roth, Dan et al. | 2020
digital version
2373: A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages
Roy, Dwaipayan / Bhatia, Sumit / Jain, Prateek et al. | 2020
digital version
2381: Pártélet: A Hungarian Corpus of Propaganda Texts from the Hungarian Socialist Era
Kmetty, Zoltán / Vincze, Veronika / Demszky, Dorottya / Ring, Orsolya / Nagy, Balázs / Szabó, Martina Katalin et al. | 2020
digital version
2389: KORE 50^DYWC: An Evaluation Data Set for Entity Linking Based on DBpedia, YAGO, Wikidata, and Crunchbase
Noullet, Kristian / Mix, Rico / Färber, Michael et al. | 2020
digital version
2396: Eye4Ref: A Multimodal Eye Movement Dataset of Referentially Complex Situations
Alacam, Özge / Ruppert, Eugen / Salama, Amr Rekaby / Staron, Tobias / Menzel, Wolfgang et al. | 2020
digital version
2405: SiBert: Enhanced Chinese Pre-trained Language Model with Sentence Insertion
Chen, Jiahao / Cao, Chenjie / Jiang, Xiuyan et al. | 2020
digital version
2413: Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
Roark, Brian / Wolf-Sonkin, Lawrence / Kirov, Christo / Mielke, Sabrina J. / Johny, Cibu / Demirsahin, Isin / Hall, Keith et al. | 2020
digital version
2424: GM-RKB WikiText Error Correction Task and Baselines
Melli, Gabor / Eldallal, Abdelrhman / Lazem, Bassim / Moreira, Olga et al. | 2020
digital version
2431: Embedding Space Correlation as a Measure of Domain Similarity
Beyer, Anne / Kauermann, Göran / Schütze, Hinrich et al. | 2020
digital version
2440: Wiki-40B: Multilingual Language Model Dataset
Guo, Mandy / Dai, Zihang / Vrandečić, Denny / Al-Rfou, Rami et al. | 2020
digital version
2453: Know thy Corpus! Robust Methods for Digital Curation of Web corpora
Sharoff, Serge et al. | 2020
digital version
2461: Evaluating Approaches to Personalizing Language Models
King, Milton / Cook, Paul et al. | 2020
digital version
2470: Class-based LSTM Russian Language Model with Linguistic Information
Kipyatkova, Irina / Karpov, Alexey et al. | 2020
digital version
2475: Adaptation of Deep Bidirectional Transformers for Afrikaans Language
Ralethe, Sello et al. | 2020
digital version
2479: FlauBERT: Unsupervised Language Model Pre-training for French
Le, Hang / Vial, Loïc / Frej, Jibril / Segonne, Vincent / Coavoux, Maximin / Lecouteux, Benjamin / Allauzen, Alexandre / Crabbé, Benoit / Besacier, Laurent / Schwab, Didier et al. | 2020
digital version
2491: Accelerated High-Quality Mutual-Information Based Word Clustering
Ciosici, Manuel R. / Assent, Ira / Derczynski, Leon et al. | 2020
digital version
2497: Rhythmic Proximity Between Natives And Learners Of French - Evaluation of a metric based on the CEFC corpus
Coulange, Sylvain / Rossato, Solange et al. | 2020
digital version
2503: From Linguistic Resources to Ontology-Aware Terminologies: Minding the Representation Gap
Speranza, Giulia / di Buono, Maria Pia / Monti, Johanna / Sangati, Federico et al. | 2020
digital version
2511: Modeling Factual Claims with Semantic Frames
Arslan, Fatma / Caraballo, Josue / Jimenez, Damian / Li, Chengkai et al. | 2020
digital version
2521: Automatic Transcription Challenges for Inuktitut, a Low-Resource Polysynthetic Language
Gupta, Vishwa / Boulianne, Gilles et al. | 2020
digital version
2528: Geographically-Balanced Gigaword Corpora for 50 Language Varieties
Dunn, Jonathan / Adams, Ben et al. | 2020
digital version
2537: Data Augmentation using Machine Translation for Fake News Detection in the Urdu Language
Amjad, Maaz / Sidorov, Grigori / Zhila, Alisa et al. | 2020
digital version
2543: Evaluation of Greek Word Embeddings
Outsios, Stamatis / Karatsalos, Christos / Skianis, Konstantinos / Vazirgiannis, Michalis et al. | 2020
digital version
2552: A Dataset of Mycenaean Linear B Sequences
Papavassiliou, Katerina / Owens, Gareth / Kosmopoulos, Dimitrios et al. | 2020
digital version
2562: The Nunavut Hansard Inuktitut-English Parallel Corpus 3.0 with Preliminary Machine Translation Results
Joanis, Eric / Knowles, Rebecca / Kuhn, Roland / Larkin, Samuel / Littell, Patrick / Lo, Chi-kiu / Stewart, Darlene / Micher, Jeffrey et al. | 2020
digital version
2573: Exploring Bilingual Word Embeddings for Hiligaynon, a Low-Resource Language
Michel, Leah / Hangya, Viktor / Fraser, Alexander et al. | 2020
digital version
2581: A Finite-State Morphological Analyser for Evenki
Zueva, Anna / Kuznetsova, Anastasia / Tyers, Francis et al. | 2020
digital version
2590: Morphology-rich Alphasyllabary Embeddings
Mersha, Amanuel / Wu, Stephen et al. | 2020
digital version
2596: Localization of Fake News Detection via Multitask Transfer Learning
Cruz, Jan Christian Blaise / Tan, Julianne Agatha / Cheng, Charibeth et al. | 2020
digital version
2605: Evaluating Sentence Segmentation in Different Datasets of Neuropsychological Language Tests in Brazilian Portuguese
Casanova, Edresson / Treviso, Marcos / Hübner, Lilian / Aluísio, Sandra et al. | 2020
digital version
2615: Jejueo Datasets for Machine Translation and Speech Synthesis
Park, Kyubyong / Choe, Yo Joong / Ham, Jiyeon et al. | 2020
digital version
2622: Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Matsuura, Kohei / Ueno, Sei / Mimura, Masato / Sakai, Shinsuke / Kawahara, Tatsuya et al. | 2020
digital version
2629: Development of a Guarani - Spanish Parallel Corpus
Chiruzzo, Luis / Amarilla, Pedro / Ríos, Adolfo / Giménez Lugo, Gustavo et al. | 2020
digital version
2634: AR-ASAG An ARabic Dataset for Automatic Short Answer Grading Evaluation
Ouahrani, Leila / Bennouar, Djamal et al. | 2020
digital version
2644: Processing Language Resources of Under-Resourced and Endangered Languages for the Generation of Augmentative Alternative Communication Boards
Ferger, Anne et al. | 2020
digital version
2649: The Nisvai Corpus of Oral Narrative Practices from Malekula (Vanuatu) and its Associated Language Resources
Aznar, Jocelyn / Gala, Núria et al. | 2020
digital version
2657: Building a Time-Aligned Cross-Linguistic Reference Corpus from Language Documentation Data (DoReCo)
Paschen, Ludger / Delafontaine, François / Draxler, Christoph / Fuchs, Susanne / Stave, Matthew / Seifart, Frank et al. | 2020
digital version
2667: Benchmarking Neural and Statistical Machine Translation on Low-Resource African Languages
Duh, Kevin / McNamee, Paul / Post, Matt / Thompson, Brian et al. | 2020
digital version
2676: Improved Finite-State Morphological Analysis for St. Lawrence Island Yupik Using Paradigm Function Morphology
Chen, Emily / Park, Hyunji Hayley / Schwartz, Lane et al. | 2020
digital version
2685: Towards a Spell Checker for Zamboanga Chavacano Orthography
Himoro, Marcelo Yuji / Pareja-Lora, Antonio et al. | 2020
digital version
2698: Identifying Sentiments in Algerian Code-switched User-generated Comments
Adouane, Wafia / Touileb, Samia / Bernardy, Jean-Philippe et al. | 2020
digital version
2706: Automatic Creation of Text Corpora for Low-Resource Languages from the Internet: The Case of Swiss German
Linder, Lucy / Jungo, Michael / Hennebert, Jean / Musat, Claudiu Cristian / Fischer, Andreas et al. | 2020
digital version
2712: Evaluating Sub-word Embeddings in Cross-lingual Models
Hakimi Parizi, Ali / Cook, Paul et al. | 2020
digital version
2720: A Swiss German Dictionary: Variation in Speech and Writing
Schmidt, Larissa / Linder, Lucy / Djambazovska, Sandra / Lazaridis, Alexandros / Samardžić, Tanja / Musat, Claudiu et al. | 2020
digital version
2726: Towards a Corsican Basic Language Resource Kit
Kevers, Laurent / Retali-Medori, Stella et al. | 2020
digital version
2736: Evaluating the Impact of Sub-word Information and Cross-lingual Word Embeddings on Mi'kmaq Language Modelling
Boudreau, Jeremie / Patra, Akankshya / Suvarna, Ashima / Cook, Paul et al. | 2020
digital version
2746: Exploring a Choctaw Language Corpus with Word Vectors and Minimum Distance Length
Brixey, Jacqueline / Sides, David / Vizthum, Timothy / Traum, David / Iskarous, Khalil et al. | 2020
digital version
2754: Massive vs. Curated Embeddings for Low-Resourced Languages: the Case of Yorùbá and Twi
Alabi, Jesujoba / Amponsah-Kaakyire, Kwabena / Adelani, David / España-Bonet, Cristina et al. | 2020
digital version
2763: TRopBank: Turkish PropBank V2.0
Kara, Neslihan / Aslan, Deniz Baran / Marşan, Büşra / Bakay, Özge / Ak, Koray / Yıldız, Olcay Taner et al. | 2020
digital version
2773: Collection and Annotation of the Romanian Legal Corpus
Tufiș, Dan / Mitrofan, Maria / Păiș, Vasile / Ion, Radu / Coman, Andrei et al. | 2020
digital version
2778: An Empirical Evaluation of Annotation Practices in Corpora from Language Documentation
von Prince, Kilu / Nordhoff, Sebastian et al. | 2020
digital version
2788: Annotated Corpus for Sentiment Analysis in Odia Language
Mohanty, Gaurav / Mishra, Pruthwik / Mamidi, Radhika et al. | 2020
digital version
2796: Building a Task-oriented Dialog System for Languages with no Training Data: the Case for Basque
López de Lacalle, Maddalen / Saralegi, Xabier / San Vicente, Iñaki et al. | 2020
digital version
2803: SENCORPUS: A French-Wolof Parallel Corpus
Nguer, Elhadji Mamadou / Lo, Alla / Dione, Cheikh M. Bamba / Ba, Sileye O. / Lo, Moussa et al. | 2020
digital version
2812: A Major Wordnet for a Minority Language: Scottish Gaelic
Bella, Gábor / McNeill, Fiona / Gorman, Rody / O Donnaile, Caoimhin / MacDonald, Kirsty / Chandrashekar, Yamini / Freihat, Abed Alhakim / Giunchiglia, Fausto et al. | 2020
digital version
2819: Crowdsourcing Speech Data for Low-Resource Languages from Low-Income Workers
Abraham, Basil / Goel, Danish / Siddarth, Divya / Bali, Kalika / Chopra, Manu / Choudhury, Monojit / Joshi, Pratik / Jyoti, Preethi / Sitaram, Sunayana / Seshadri, Vivek et al. | 2020
digital version
2827: A Resource for Studying Chatino Verbal Morphology
Cruz, Hilaria / Anastasopoulos, Antonios / Stump, Gregory et al. | 2020
digital version
2832: Learnings from Technological Interventions in a Low Resource Language: A Case-Study on Gondi
Mehta, Devansh / Santy, Sebastin / Mothilal, Ramaravind Kommiya / Srivastava, Brij Mohan Lal / Sharma, Alok / Shukla, Anurag / Prasad, Vishnu / U, Venkanna / Sharma, Amit / Bali, Kalika et al. | 2020
digital version
2839: Irony Detection in Persian Language: A Transfer Learning Approach Using Emoji Prediction
Golazizian, Preni / Sabeti, Behnam / Ashrafi Asli, Seyed Arad / Majdabadi, Zahra / Momenzadeh, Omid / Fahmi, Reza et al. | 2020
digital version
2846: Towards Computational Resource Grammars for Runyankore and Rukiga
Bamutura, David / Ljunglöf, Peter / Nebende, Peter et al. | 2020
digital version
2855: Optimizing Annotation Effort Using Active Learning Strategies: A Sentiment Analysis Case Study in Persian
Ashrafi Asli, Seyed Arad / Sabeti, Behnam / Majdabadi, Zahra / Golazizian, Preni / Fahmi, Reza / Momenzadeh, Omid et al. | 2020
digital version
2862: BanFakeNews: A Dataset for Detecting Fake News in Bangla
Hossain, Md Zobaer / Rahman, Md Ashraful / Islam, Md Saiful / Kar, Sudipta et al. | 2020
digital version
2872: A Resource for Computational Experiments on Mapudungun
Duan, Mingjun / Fasola, Carlos / Rallabandi, Sai Krishna / Vega, Rodolfo / Anastasopoulos, Antonios / Levin, Lori / Black, Alan W et al. | 2020
digital version
2878: Automated Parsing of Interlinear Glossed Text from Page Images of Grammatical Descriptions
Round, Erich / Ellison, Mark / Macklin-Cordes, Jayden / Beniamine, Sacha et al. | 2020
digital version
2884: The Johns Hopkins University Bible Corpus: 1600+ Tongues for Typological Exploration
McCarthy, Arya D. / Wicks, Rachel / Lewis, Dylan / Mueller, Aaron / Wu, Winston / Adams, Oliver / Nicolai, Garrett / Post, Matt / Yarowsky, David et al. | 2020
digital version
2893: Towards Building an Automatic Transcription System for Language Documentation: Experiences from Muyu
Zahrer, Alexander / Zgank, Andrej / Schuppler, Barbara et al. | 2020
digital version
2901: Towards Flexible Cross-Resource Exploitation of Heterogeneous Language Documentation Data
Jettka, Daniel / Lehmberg, Timm et al. | 2020
digital version
2906: CantoMap: a Hong Kong Cantonese MapTask Corpus
Winterstein, Grégoire / Tang, Carmen / Lai, Regine et al. | 2020
digital version
2914: No Data to Crawl? Monolingual Corpus Creation from PDF Files of Truly low-Resource Languages in Peru
Bustamante, Gina / Oncevay, Arturo / Zariquiey, Roberto et al. | 2020
digital version
2924: Creating a Parallel Icelandic Dependency Treebank from Raw Text to Universal Dependencies
Jónsdóttir, Hildur / Ingason, Anton Karl et al. | 2020
digital version
2932: Building a Universal Dependencies Treebank for Occitan
Miletic, Aleksandra / Bras, Myriam / Vergez-Couret, Marianne / Esher, Louise / Poujade, Clamença / Sibille, Jean et al. | 2020
digital version
2940: Building the Old Javanese Wordnet
Moeljadi, David / Aminullah, Zakariya Pamuji et al. | 2020
digital version
2947: CPLM, a Parallel Corpus for Mexican Languages: Development and Interface
Sierra Martínez, Gerardo / Montaño, Cynthia / Bel-Enguix, Gemma / Córdova, Diego / Mota Montoya, Margarita et al. | 2020
digital version
2953: SiNER: A Large Dataset for Sindhi Named Entity Recognition
Ali, Wazir / Lu, Junyu / Xu, Zenglin et al. | 2020
digital version
2962: Construct a Sense-Frame Aligned Predicate Lexicon for Chinese AMR Corpus
Song, Li / Dai, Yuling / Liu, Yihuan / Li, Bin / Qu, Weiguang et al. | 2020
digital version
2970: MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora
Han, Lifeng / Jones, Gareth / Smeaton, Alan et al. | 2020
digital version
2980: A Myanmar (Burmese)-English Named Entity Transliteration Dictionary
Myat Mon, Aye / Ding, Chenchen / Kaing, Hour / Mar Soe, Khin / Utiyama, Masao / Sumita, Eiichiro et al. | 2020
digital version
2984: CA-EHN: Commonsense Analogy from E-HowNet
Li, Peng-Hsuan / Yang, Tsan-Yu / Ma, Wei-Yun et al. | 2020
digital version
2991: Building Semantic Grams of Human Knowledge
Leone, Valentina / Siragusa, Giovanni / Di Caro, Luigi / Navigli, Roberto et al. | 2020
digital version
3001: Automatically Building a Multilingual Lexicon of False Friends With No Supervision
Uban, Ana Sabina / Dinu, Liviu P. et al. | 2020
digital version
3008: A Parallel WordNet for English, Swedish and Bulgarian
Angelov, Krasimir et al. | 2020
digital version
3016: ENGLAWI: From Human- to Machine-Readable Wiktionary
Sajous, Franck / Calderone, Basilio / Hathout, Nabil et al. | 2020
digital version
3027: Opening the Romance Verbal Inflection Dataset 2.0: A CLDF lexicon
Beniamine, Sacha / Maiden, Martin / Round, Erich et al. | 2020
digital version
3036: word2word: A Collection of Bilingual Lexicons for 3,564 Language Pairs
Choe, Yo Joong / Park, Kyubyong / Kim, Dongwoo et al. | 2020
digital version
3046: Introducing Lexical Masks: a New Representation of Lexical Entries for Better Evaluation and Exchange of Lexicons
Cartoni, Bruno / Calvelo Aros, Daniel / Vrandecic, Denny / Lertpradit, Saran et al. | 2020
digital version
3053: A Large-Scale Leveled Readability Lexicon for Standard Arabic
Al Khalil, Muhamed / Habash, Nizar / Jiang, Zhengyang et al. | 2020
digital version
3063: Preserving Semantic Information from Old Dictionaries: Linking Senses of the 'Altfranzösisches Wörterbuch' to WordNet
Stein, Achim et al. | 2020
digital version
3069: Cifu: a Frequency Lexicon of Hong Kong Cantonese
Lai, Regine / Winterstein, Grégoire et al. | 2020
digital version
3078: Odi et Amo. Creating, Evaluating and Extending Sentiment Lexicons for Latin.
Sprugnoli, Rachele / Passarotti, Marco / Corbetta, Daniela / Peverelli, Andrea et al. | 2020
digital version
3087: WordWars: A Dataset to Examine the Natural Selection of Words
Mohammad, Saif M. et al. | 2020
digital version
3096: Challenge Dataset of Cognates and False Friend Pairs from Indian Languages
Kanojia, Diptesh / Kulkarni, Malhar / Bhattacharyya, Pushpak / Haffari, Gholamreza et al. | 2020
digital version
3103: Development of a Japanese Personality Dictionary based on Psychological Methods
Iwai, Ritsuko / Kawahara, Daisuke / Kumada, Takatsune / Kurohashi, Sadao et al. | 2020
digital version
3109: A Lexicon-Based Approach for Detecting Hedges in Informal Text
Islam, Jumayel / Xiao, Lu / Mercer, Robert E. et al. | 2020
digital version
3114: Word Complexity Estimation for Japanese Lexical Simplification
Nishihara, Daiki / Kajiwara, Tomoyuki et al. | 2020
digital version
3121: Inducing Universal Semantic Tag Vectors
Huo, Da / de Melo, Gerard et al. | 2020
digital version
3128: LexiDB: Patterns & Methods for Corpus Linguistic Database Management
Coole, Matthew / Rayson, Paul / Mariani, John et al. | 2020
digital version
3136: Towards a Semi-Automatic Detection of Reflexive and Reciprocal Constructions and Their Representation in a Valency Lexicon
Kettnerová, Václava / Lopatkova, Marketa / Vernerová, Anna / Barancikova, Petra et al. | 2020
digital version
3145: Languages Resources for Poorly Endowed Languages : The Case Study of Classical Armenian
Vidal-Gorène, Chahan / Decours-Perez, Aliénor et al. | 2020
digital version
3153: Constructing Web-Accessible Semantic Role Labels and Frames for Japanese as Additions to the NPCMJ Parsed Corpus
Takeuchi, Koichi / Butler, Alastair / Nagasaki, Iku / Okamura, Takuya / Pardeshi, Prashant et al. | 2020
digital version
3162: Large-scale Cross-lingual Language Resources for Referencing and Framing
Vossen, Piek / Ilievski, Filip / Postma, Marten / Fokkens, Antske / Minnema, Gosse / Remijnse, Levi et al. | 2020
digital version
3172: Modelling Etymology in LMF/TEI: The Grande Dicionário Houaiss da Língua Portuguesa Dictionary as a Use Case
Khan, Fahad / Romary, Laurent / Salgado, Ana / Bowers, Jack / Khemakhem, Mohamed / Tasovac, Toma et al. | 2020
digital version
3181: Linking the TUFS Basic Vocabulary to the Open Multilingual Wordnet
Bond, Francis / Nomoto, Hiroki / Morgado da Costa, Luís / Bond, Arthur et al. | 2020
digital version
3189: Some Issues with Building a Multilingual Wordnet
Bond, Francis / Morgado da Costa, Luis / Goodman, Michael Wayne / McCrae, John Philip / Lohk, Ahti et al. | 2020
digital version
3198: Collocations in Russian Lexicography and Russian Collocations Database
Khokhlova, Maria et al. | 2020
digital version
3207: Methodological Aspects of Developing and Managing an Etymological Lexical Resource: Introducing EtymDB-2.0
Fourrier, Clémentine / Sagot, Benoît et al. | 2020
digital version
3217: OFrLex: A Computational Morphological and Syntactic Lexicon for Old French
Guibon, Gaël / Sagot, Benoît et al. | 2020
digital version
3226: Automatic Reconstruction of Missing Romanian Cognates and Unattested Latin Words
Ciobanu, Alina Maria / Dinu, Liviu P. / Zoicas, Laurentiu et al. | 2020
digital version
3232: A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment
Ahmadi, Sina / McCrae, John Philip / Nimb, Sanni / Khan, Fahad / Monachini, Monica / Pedersen, Bolette / Declerck, Thierry / Wissik, Tanja / Bellandi, Andrea / Pisani, Irene et al. | 2020
digital version
3243: A Broad-Coverage Deep Semantic Lexicon for Verbs
Allen, James / An, Hannah / Bose, Ritwik / de Beaumont, Will / Teng, Choh Man et al. | 2020
digital version
3252: Computational Etymology and Word Emergence
Wu, Winston / Yarowsky, David et al. | 2020
digital version
3260: A Dataset of Translational Equivalents Built on the Basis of plWordNet-Princeton WordNet Synset Mapping
Rudnicka, Ewa / Naskręt, Tomasz et al. | 2020
digital version
3265: TRANSLIT: A Large-scale Name Transliteration Resource
Benites, Fernando / Duivesteijn, Gilbert François / von Däniken, Pius / Cieliebak, Mark et al. | 2020
digital version
3272: Computing with Subjectivity Lexicons
L. M. Jeronimo, Caio / E. C. Campelo, Claudio / Balby Marinho, Leandro / Sales, Allan / Veloso, Adriano / Viola, Roberta et al. | 2020
digital version
3281: The ACoLi Dictionary Graph
Chiarcos, Christian / Fäth, Christian / Ionov, Maxim et al. | 2020
digital version
3291: Resources in Underrepresented Languages: Building a Representative Romanian Corpus
Midrigan - Ciochina, Ludmila / Boyd, Victoria / Sanchez-Ortega, Lucila / Malancea_Malac, Diana / Midrigan, Doina / Corina, David P. et al. | 2020
digital version
3297: World Class Language Technology - Developing a Language Technology Strategy for Danish
Kirchmeier, Sabine / Pedersen, Bolette / Nimb, Sanni / Diderichsen, Philip / Henrichsen, Peter Juel et al. | 2020
digital version
3302: A Corpus for Automatic Readability Assessment and Text Simplification of German
Battisti, Alessia / Pfütze, Dominik / Säuberli, Andreas / Kostrzewa, Marek / Ebling, Sarah et al. | 2020
digital version
3312: The CLARIN Knowledge Centre for Atypical Communication Expertise
van den Heuvel, Henk / Oostdijk, Nelleke / Rowland, Caroline / Trilsbeek, Paul et al. | 2020
digital version
3317: Corpora of Disordered Speech in the Light of the GDPR: Two Use Cases from the DELAD Initiative
van den Heuvel, Henk / Kelli, Aleksei / Klessa, Katarzyna / Salaasti, Satu et al. | 2020
digital version
3322: The European Language Technology Landscape in 2020: Language-Centric and Human-Centric AI for Cross-Cultural Communication in Multilingual Europe
Rehm, Georg / Marheinecke, Katrin / Hegele, Stefanie / Piperidis, Stelios / Bontcheva, Kalina / Hajic, Jan / Choukri, Khalid / Vasiļjevs, Andrejs / Backfried, Gerhard / Prinz, Christoph et al. | 2020
digital version
3333: A Framework for Shared Agreement of Language Tags beyond ISO 639
Gillis-Webber, Frances / Tittel, Sabine et al. | 2020
digital version
3340: Gigafida 2.0: The Reference Corpus of Written Standard Slovene
Krek, Simon / Arhar Holdt, Špela / Erjavec, Tomaž / Čibej, Jaka / Repar, Andraz / Gantar, Polona / Ljubešić, Nikola / Kosem, Iztok / Dobrovoljc, Kaja et al. | 2020
digital version
3346: Corpus Query Lingua Franca part II: Ontology
Evert, Stefan / Harlamov, Oleg / Heinrich, Philipp / Banski, Piotr et al. | 2020
digital version
3353: A CLARIN Transcription Portal for Interview Data
Draxler, Christoph / van den Heuvel, Henk / van Hessen, Arjan / Calamai, Silvia / Corti, Louise et al. | 2020
digital version
3360: Ellogon Casual Annotation Infrastructure
Petasis, Georgios / Tsekouras, Leonidas et al. | 2020
digital version
3366: European Language Grid: An Overview
Rehm, Georg / Berger, Maria / Elsholz, Ela / Hegele, Stefanie / Kintzel, Florian / Marheinecke, Katrin / Piperidis, Stelios / Deligiannis, Miltos / Galanis, Dimitris / Gkirtzou, Katerina et al. | 2020
digital version
3381: The Competitiveness Analysis of the European Language Technology Market
Vasiļjevs, Andrejs / Skadina, Inguna / Samite, Indra / Kauliņš, Kaspars / Ajausks, Ēriks / Meļņika, Jūlija / Bērziņš, Aivars et al. | 2020
digital version
3390: Constructing a Bilingual Hadith Corpus Using a Segmentation Tool
Altammami, Shatha / Atwell, Eric / Alsalka, Ammar et al. | 2020
digital version
3399: Facilitating Corpus Usage: Making Icelandic Corpora More Accessible for Researchers and Language Users
Steingrímsson, Steinþór / Barkarson, Starkaður / Örnólfsson, Gunnar Thor et al. | 2020
digital version
3406: Interoperability in an Infrastructure Enabling Multidisciplinary Research: The case of CLARIN
de Jong, Franciska / Maegaard, Bente / Fišer, Darja / van Uytvanck, Dieter / Witt, Andreas et al. | 2020
digital version
3414: Language Technology Programme for Icelandic 2019-2023
Nikulásdóttir, Anna / Guðnason, Jón / Ingason, Anton Karl / Loftsson, Hrafn / Rögnvaldsson, Eiríkur / Sigurðsson, Einar Freyr / Steingrímsson, Steinþór et al. | 2020
digital version
3423: Privacy by Design and Language Resources
Kamocki, Pawel / Witt, Andreas et al. | 2020
digital version
3428: Making Metadata Fit for Next Generation Language Technology Platforms: The Metadata Schema of the European Language Grid
Labropoulou, Penny / Gkirtzou, Katerina / Gavriilidou, Maria / Deligiannis, Miltos / Galanis, Dimitris / Piperidis, Stelios / Rehm, Georg / Berger, Maria / Mapelli, Valérie / Rigault, Michael et al. | 2020
digital version
3438: Related Works in the Linguistic Data Consortium Catalog
Jaquette, Daniel / Cieri, Christopher / DiPersio, Denise et al. | 2020
digital version
3443: Language Data Sharing in European Public Services - Overcoming Obstacles and Creating Sustainable Data Sharing Infrastructures
Smal, Lilli / Lösch, Andrea / van Genabith, Josef / Giagkou, Maria / Declerck, Thierry / Busemann, Stephan et al. | 2020
digital version
3449: A Progress Report on Activities at the Linguistic Data Consortium Benefitting the LREC Community
Cieri, Christopher / Fiumara, James / Strassel, Stephanie / Wright, Jonathan / DiPersio, Denise / Liberman, Mark et al. | 2020
digital version
3457: Digital Language Infrastructures - Documenting Language Actors
Lyding, Verena / König, Alexander / Pretti, Monica et al. | 2020
digital version
3463: Samrómur: Crowd-sourcing Data Collection for Icelandic Speech Recognition
Mollberg, David Erik / Jónsson, Ólafur Helgi / Þorsteinsdóttir, Sunneva / Steingrímsson, Steinþór / Magnúsdóttir, Eydís Huld / Gudnason, Jon et al. | 2020
digital version
3468: Semi-supervised Development of ASR Systems for Multilingual Code-switched Speech in Under-resourced Languages
Biswas, Astik / Yilmaz, Emre / De Wet, Febe / Van der westhuizen, Ewald / Niesler, Thomas et al. | 2020
digital version
3475: CLFD: A Novel Vectorization Technique and Its Application in Fake News Detection
Mersinias, Michail / Afantenos, Stergos / Chalkiadakis, Georgios et al. | 2020
digital version
3484: SimplifyUR: Unsupervised Lexical Text Simplification for Urdu
Qasmi, Namoos Hayat / Zia, Haris Bin / Athar, Awais / Raza, Agha Ali et al. | 2020
digital version
3490: Jamo Pair Encoding: Subcharacter Representation-based Extreme Korean Vocabulary Compression for Efficient Subword Tokenization
Moon, Sangwhan / Okazaki, Naoaki et al. | 2020
digital version
3498: Offensive Language and Hate Speech Detection for Danish
Sigurbergsson, Gudbjartur Ingi / Derczynski, Leon et al. | 2020
digital version
3509: Semi-supervised Deep Embedded Clustering with Anomaly Detection for Semantic Frame Induction
Yong, Zheng Xin / Timponi Torrent, Tiago et al. | 2020
digital version
3520: Search Query Language Identification Using Weak Labeling
Tambi, Ritiz / Kale, Ajinkya / King, Tracy Holloway et al. | 2020
digital version
3528: Automated Phonological Transcription of Akkadian Cuneiform Text
Sahala, Aleksi / Silfverberg, Miikka / Arppe, Antti / Lindén, Krister et al. | 2020
digital version
3535: COSTRA 1.0: A Dataset of Complex Sentence Transformations
Barancikova, Petra / Bojar, Ondřej et al. | 2020
digital version
3542: Automatic In-the-wild Dataset Annotation with Deep Generalized Multiple Instance Learning
Correia, Joana / Trancoso, Isabel / Raj, Bhiksha et al. | 2020
digital version
3551: How Much Data Do You Need? About the Creation of a Ground Truth for Black Letter and the Effectiveness of Neural OCR
Ströbel, Phillip Benjamin / Clematide, Simon / Volk, Martin et al. | 2020
digital version
3560: Dirichlet-Smoothed Word Embeddings for Low-Resource Settings
Jungmaier, Jakob / Kassner, Nora / Roth, Benjamin et al. | 2020
digital version
3566: On The Performance of Time-Pooling Strategies for End-to-End Spoken Language Identification
Monteiro, Joao / Alam, Md Jahangir / Falk, Tiago et al. | 2020
digital version
3573: Neural Disambiguation of Lemma and Part of Speech in Morphologically Rich Languages
Hoya Quecedo, José María / Maximilian, Koppatz / Yangarber, Roman et al. | 2020
digital version
3583: Non-Linearity in Mapping Based Cross-Lingual Word Embeddings
Zhao, Jiawei / Gilman, Andrew et al. | 2020
digital version
3590: LibriVoxDeEn: A Corpus for German-to-English Speech Translation and German Speech Recognition
Beilharz, Benjamin / Sun, Xin / Karimova, Sariya / Riezler, Stefan et al. | 2020
digital version
3595: SEDAR: a Large Scale French-English Financial Domain Parallel Corpus
Ghaddar, Abbas / Langlais, Phillippe et al. | 2020
digital version
3603: JParaCrawl: A Large Scale Web-Based English-Japanese Parallel Corpus
Morishita, Makoto / Suzuki, Jun / Nagata, Masaaki et al. | 2020
digital version
3610: Neural Machine Translation for Low-Resourced Indian Languages
Choudhary, Himanshu / Rao, Shivansh / Rohilla, Rajesh et al. | 2020
digital version
3616: Content-Equivalent Translated Parallel News Corpus and Extension of Domain Adaptation for NMT
Mino, Hideya / Tanaka, Hideki / Ito, Hitoshi / Goto, Isao / Yamada, Ichiro / Tokunaga, Takenobu et al. | 2020
digital version
3623: NMT and PBSMT Error Analyses in English to Brazilian Portuguese Automatic Translations
Caseli, Helena / Inácio, Marcio et al. | 2020
digital version
3630: Evaluation Dataset for Zero Pronoun in Japanese to English Translation
Shimazu, Sho / Takase, Sho / Nakazawa, Toshiaki / Okazaki, Naoaki et al. | 2020
digital version
3635: Better Together: Modern Methods Plus Traditional Thinking in NP Alignment
Kovács, Ádám / Ács, Judit / Kornai, Andras / Recski, Gábor et al. | 2020
digital version
3640: Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
Song, Haiyue / Dabre, Raj / Fujita, Atsushi / Kurohashi, Sadao et al. | 2020
digital version
3650: Being Generous with Sub-Words towards Small NMT Children
Defauw, Arne / Vanallemeersch, Tom / Van Winckel, Koen / Szoc, Sara / Van den Bogaert, Joachim et al. | 2020
digital version
3657: Document Sub-structure in Neural Machine Translation
Dobreva, Radina / Zhou, Jie / Bawden, Rachel et al. | 2020
digital version
3668: An Evaluation Benchmark for Testing the Word Sense Disambiguation Capabilities of Machine Translation Systems
Raganato, Alessandro / Scherrer, Yves / Tiedemann, Jörg et al. | 2020
digital version
3676: MEDLINE as a Parallel Corpus: a Survey to Gain Insight on French-, Spanish- and Portuguese-speaking Authors’ Abstract Writing Practice
Névéol, Aurélie / Jimeno Yepes, Antonio / Neves, Mariana et al. | 2020
digital version
3683: JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation
Mao, Zhuoyuan / Cromieres, Fabien / Dabre, Raj / Song, Haiyue / Kurohashi, Sadao et al. | 2020
digital version
3692: A Post-Editing Dataset in the Legal Domain: Do we Underestimate Neural Machine Translation Quality?
Ive, Julia / Specia, Lucia / Szoc, Sara / Vanallemeersch, Tom / Van den Bogaert, Joachim / Farah, Eduardo / Maroti, Christine / Ventura, Artur / Khalilov, Maxim et al. | 2020
digital version
3698: Linguistically Informed Hindi-English Neural Machine Translation
Goyal, Vikrant / Mishra, Pruthwik / Sharma, Dipti Misra et al. | 2020
digital version
3704: A Test Set for Discourse Translation from Japanese to English
Nagata, Masaaki / Morishita, Makoto et al. | 2020
digital version
3710: An Analysis of Massively Multilingual Neural Machine Translation for Low-Resource Languages
Mueller, Aaron / Nicolai, Garrett / McCarthy, Arya D. / Lewis, Dylan / Wu, Winston / Yarowsky, David et al. | 2020
digital version
3719: TDDC: Timely Disclosure Documents Corpus
Doi, Nobushige / Oda, Yusuke / Nakazawa, Toshiaki et al. | 2020
digital version
3727: MuST-Cinema: a Speech-to-Subtitles corpus
Karakanta, Alina / Negri, Matteo / Turchi, Marco et al. | 2020
digital version
3735: On Context Span Needed for Machine Translation Evaluation
Castilho, Sheila / Popović, Maja / Way, Andy et al. | 2020
digital version
3743: A Multilingual Parallel Corpora Collection Effort for Indian Languages
Siripragrada, Shashank / Philip, Jerin / Namboodiri, Vinay P. / Jawahar, C V et al. | 2020
digital version
3752: To Case or not to case: Evaluating Casing Methods for Neural Machine Translation
Etchegoyhen, Thierry / Gete, Harritxu et al. | 2020
digital version
3761: The MARCELL Legislative Corpus
Váradi, Tamás / Koeva, Svetla / Yamalov, Martin / Tadić, Marko / Sass, Bálint / Nitoń, Bartłomiej / Ogrodniczuk, Maciej / Pęzik, Piotr / Barbu Mititelu, Verginica / Ion, Radu et al. | 2020
digital version
3769: ParaPat: The Multi-Million Sentences Parallel Corpus of Patents Abstracts
Soares, Felipe / Stevenson, Mark / Bartolome, Diego / Zaretskaya, Anna et al. | 2020
digital version
3775: Corpora for Document-Level Neural Machine Translation
Liu, Siyou / Zhang, Xiaojun et al. | 2020
digital version
3782: OpusTools and Parallel Corpus Diagnostics
Aulamo, Mikko / Sulubacak, Umut / Virpioja, Sami / Tiedemann, Jörg et al. | 2020
digital version
3790: Literary Machine Translation under the Magnifying Glass: Assessing the Quality of an NMT-Translated Detective Novel on Document Level
Fonteyne, Margot / Tezcan, Arda / Macken, Lieve et al. | 2020
digital version
3799: Handle with Care: A Case Study in Comparable Corpora Exploitation for Neural Machine Translation
Etchegoyhen, Thierry / Gete, Harritxu et al. | 2020
digital version
3808: The FISKMÖ Project: Resources and Tools for Finnish-Swedish Machine Translation and Cross-Linguistic Research
Tiedemann, Jörg / Nieminen, Tommi / Aulamo, Mikko / Kanerva, Jenna / Leino, Akseli / Ginter, Filip / Papula, Niko et al. | 2020
digital version
3816: Multiword Expression aware Neural Machine Translation
Zaninello, Andrea / Birch, Alexandra et al. | 2020
digital version
3826: An Enhanced Mapping Scheme of the Universal Part-Of-Speech for Korean
Kim, Myung Hee / Colineau, Nathalie et al. | 2020
digital version
3834: Finite State Machine Pattern-Root Arabic Morphological Generator, Analyzer and Diacritizer
Alkhairy, Maha / Jafri, Afshan / Smith, David et al. | 2020
digital version
3842: An Unsupervised Method for Weighting Finite-state Morphological Analyzers
Keleg, Amr / Tyers, Francis / Howell, Nick / Pirinen, Tommi et al. | 2020
digital version
3851: Language-Independent Tokenisation Rivals Language-Specific Tokenisation for Word Similarity Prediction
Bollegala, Danushka / Kiryo, Ryuichi / Tsujino, Kosuke / Yukawa, Haruki et al. | 2020
digital version
3861: A Supervised Part-Of-Speech Tagger for the Greek Language of the Social Web
Nikiforos, Maria Nefeli / Kermanidis, Katia Lida et al. | 2020
digital version
3868: Bag & Tag'em - A New Dutch Stemmer
Jonker, Anne / de Ruijt, Corné / de Gruijl, Jornt et al. | 2020
digital version
3877: Glawinette: a Linguistically Motivated Derivational Description of French Acquired from GLAWI
Hathout, Nabil / Sajous, Franck / Calderone, Basilio / Namer, Fiammetta et al. | 2020
digital version
3886: BabyFST - Towards a Finite-State Based Computational Model of Ancient Babylonian
Sahala, Aleksi / Silfverberg, Miikka / Arppe, Antti / Lindén, Krister et al. | 2020
digital version
3895: Morphological Analysis and Disambiguation for Gulf Arabic: The Interplay between Resources and Methods
Khalifa, Salam / Zalmout, Nasser / Habash, Nizar et al. | 2020
digital version
3905: Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus
Metheniti, Eleni / Neumann, Guenter et al. | 2020
digital version
3913: Introducing a Large-Scale Dataset for Vietnamese POS Tagging on Conversational Texts
Tran, Oanh / Pham, Tu / Dang, Vu / Nguyen, Bang et al. | 2020
digital version
3922: UniMorph 3.0: Universal Morphology
McCarthy, Arya D. / Kirov, Christo / Grella, Matteo / Nidhi, Amrit / Xia, Patrick / Gorman, Kyle / Vylomova, Ekaterina / Mielke, Sabrina J. / Nicolai, Garrett / Silfverberg, Miikka et al. | 2020
digital version
3932: Building the Spanish-Croatian Parallel Corpus
Mikelenić, Bojana / Tadić, Marko et al. | 2020
digital version
3937: DerivBase.Ru: a Derivational Morphology Resource for Russian
Vodolazsky, Daniil et al. | 2020
digital version
3944: Morfessor EM+Prune: Improved Subword Segmentation with Expectation Maximization and Pruning
Grönroos, Stig-Arne / Virpioja, Sami / Kurimo, Mikko et al. | 2020
digital version
3954: Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian
Stankovic, Ranka / Šandrih, Branislava / Krstev, Cvetana / Utvić, Miloš / Skoric, Mihailo et al. | 2020
digital version
3963: Fine-grained Morphosyntactic Analysis and Generation Tools for More Than One Thousand Languages
Nicolai, Garrett / Lewis, Dylan / McCarthy, Arya D. / Mueller, Aaron / Wu, Winston / Yarowsky, David et al. | 2020
digital version
3973: Cairo Student Code-Switch (CSCS) Corpus: An Annotated Egyptian Arabic-English Corpus
Balabel, Mohamed / Hamed, Injy / Abdennadher, Slim / Vu, Ngoc Thang / Çetinoğlu, Özlem et al. | 2020
digital version
3978: Getting More Data for Low-resource Morphological Inflection: Language Models and Data Augmentation
Sorokin, Alexey et al. | 2020
digital version
3984: Visual Modeling of Turkish Morphology
Özenç, Berke / Solak, Ercan et al. | 2020
digital version
3991: Kvistur 2.0: a BiLSTM Compound Splitter for Icelandic
Daðason, Jón / Mollberg, David / Loftsson, Hrafn / Bjarnadóttir, Kristín et al. | 2020
digital version
3996: Morphological Segmentation for Low Resource Languages
Mott, Justin / Bies, Ann / Strassel, Stephanie / Kodner, Jordan / Richter, Caitlin / Xu, Hongzhi / Marcus, Mitchell et al. | 2020
digital version
4003: CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data
Wenzek, Guillaume / Lachaux, Marie-Anne / Conneau, Alexis / Chaudhary, Vishrav / Guzmán, Francisco / Joulin, Armand / Grave, Edouard et al. | 2020
digital version
4013: On the Robustness of Unsupervised and Semi-supervised Cross-lingual Word Embedding Learning
Doval, Yerai / Camacho-Collados, Jose / Espinosa Anke, Luis / Schockaert, Steven et al. | 2020
digital version
4024: Building an English-Chinese Parallel Corpus Annotated with Sub-sentential Translation Techniques
Zhai, Yuming / Liu, Lufei / Zhong, Xinyi / Illouz, Gabriel / Vilnat, Anne et al. | 2020
digital version
4034: Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection
Nivre, Joakim / de Marneffe, Marie-Catherine / Ginter, Filip / Hajic, Jan / Manning, Christopher D. / Pyysalo, Sampo / Schuster, Sebastian / Tyers, Francis / Zeman, Daniel et al. | 2020
digital version
4044: EMPAC: an English-Spanish Corpus of Institutional Subtitles
Serrat Roozen, Iris / Martínez Martínez, José Manuel et al. | 2020
digital version
4054: Cross-Lingual Word Embeddings for Turkic Languages
Kuriyozov, Elmurod / Doval, Yerai / Gómez-Rodríguez, Carlos et al. | 2020
digital version
4063: How Universal are Universal Dependencies? Exploiting Syntax for Multilingual Clause-level Sentiment Detection
Kanayama, Hiroshi / Iwamoto, Ran et al. | 2020
digital version

How to get this title?

Download

Quicklinks

Borrowing & Ordering

Quicklinks

Search & discover

Quicklinks

Learning & working

Quicklinks

Publishing & Archiving

Quicklinks

About the TIB

Quicklinks

Research & Development

A Major Wordnet for a Minority Language: Scottish Gaelic (English)

How to get this title?

Export, share and cite

More details on this result

Table of contents

Similar titles

How to get this title?

Export, share and cite