Lessons Learned from Creating a Balanced Corpus from Online Data (English)
- Darģis, Roberts
- Levāne-Petrova, Kristīne
- Poikāns, Ilmārs
In: Human language technologies - the Baltic perspective; pp. 127-134; 2020
- Article (conference) / Print
- Title: Lessons Learned from Creating a Balanced Corpus from Online Data
- Conference: International Conference Baltic HLT; 9th; 2020; Online
- Publisher: IOS Press
- Place of publication: Amsterdam
- Publication date: 2020
- Media type: Article (conference)
- Format: Print
- Language: English
- Classification (Basisklassifikation): 54.72 / 54.75
The tables of contents are generated automatically and are based on the individual records of the contained contributions that are available in the TIB portal index. The displayed tables of contents may therefore be incomplete or contain gaps.
- p. 3 | A Study in Estonian Pronominal Coreference Resolution | Barbu, Eduard / Muischnek, Kadri / Freienthal, Linda et al. | 2020
- p. 11 | Structural Models of Lithuanian Plosive Consonants in Different Word Positions | Dereškevičiūtė, Sigita / Kazlauskienė, Asta et al. | 2020
- p. 19 | Evaluating Multilingual BERT for Estonian | Kittask, Claudia / Milintsevich, Kirill / Sirts, Kairit et al. | 2020
- p. 27 | Similarities and Differences of Lithuanian Functional Styles: A Quantitative Perspective | Mandravickaitė, Justina / Krilavičius, Tomas et al. | 2020
- p. 32 | Targeted Aspect-Based Sentiment Analysis for Lithuanian Social Media Reviews | Petkevičius, Mažvydas / Vitkutė-Adžgauskienė, Daiva / Amilevičius, Darius et al. | 2020
- p. 39 | Automatic Extraction of Lithuanian Cybersecurity Terms Using Deep Learning Approaches | Rokas, Aivaras / Rackevičienė, Sigita / Utka, Andrius et al. | 2020
- p. 47 | Using Privacy-Transformed Speech in the Automatic Speech Recognition Acoustic Model Training | Salimbajevs, Askars et al. | 2020
- p. 55 | Pretraining and Fine-Tuning Strategies for Sentiment Analysis of Latvian Tweets | Thakkar, Gaurish / Pinnis, Mārcis et al. | 2020
- p. 62 | Large Language Models for Latvian Named Entity Recognition | Vīksna, Rinalds / Skadiņa, Inguna et al. | 2020
- p. 73 | Data Augmentation for Pipeline-Based Speech Translation | Alves, Diego / Salimbajevs, Askars / Pinnis, Mārcis et al. | 2020
- p. 80 | Robust Neural Machine Translation: Modeling Orthographic and Interpunctual Variation | Bergmanis, Toms / Stafanovičs, Artūrs / Pinnis, Mārcis et al. | 2020
- p. 87 | Interactive Learning of Dialog Scenarios from Examples | Deksne, Daiga / Skadiņš, Raivis et al. | 2020
- p. 95 | Intent Detection-Based Lithuanian Chatbot Created via Automatic DNN Hyper-Parameter Optimization | Kapočiūtė-Dzikienė, Jurgita et al. | 2020
- p. 103 | Towards Hybrid Model for Human-Computer Interaction in Latvian | Skadiņa, Inguna / Goško, Didzis et al. | 2020
- p. 111 | LVBERT: Transformer-Based Model for Latvian Language Understanding | Znotiņš, Artūrs / Barzdiņš, Guntis et al. | 2020
- p. 119 | An Online Linguistic Analyser for Scottish Gaelic | Boizou, Loïc / Lamb, William et al. | 2020
- p. 123 | Corpus-Based Methods for Assessment of Traditional Dictionaries | Dadurkevičius, Virginijus / Petrauskaitė, Rūta et al. | 2020
- p. 127 | Lessons Learned from Creating a Balanced Corpus from Online Data | Darģis, Roberts / Levāne-Petrova, Kristīne / Poikāns, Ilmārs et al. | 2020
- p. 135 | Creation of Language Resources for the Development of a Medical Speech Recognition System for Latvian | Darģis, Roberts / Grūzītis, Normunds / Auziņa, Ilze / Stepanovs, Kaspars et al. | 2020
- p. 142 | Towards the Development of Language Analysis Tools for the Written Latgalian Language | Deksne, Daiga / Vulāne, Anna et al. | 2020
- p. 150 | Adding Compound Splitting and Analysis to a Semantic Tagger of Modern Standard Finnish – On the Way to FiSTComp | Kettunen, Kimmo et al. | 2020
- p. 158 | Lexicon-Enhanced Neural Lemmatization for Estonian | Milintsevich, Kirill / Sirts, Kairit et al. | 2020
- p. 166 | Berri Corpus Manager: A Corpus Analysis Tool Using MongoDB Technology | Sanjurjo-González, Hugo et al. | 2020
- p. 174 | Evaluating Sentence Segmentation and Word Tokenization Systems on Estonian Web Texts | Sirts, Kairit / Peekman, Kairit et al. | 2020
- p. 182 | Language Technology Platform for Public Administration | Skadiņš, Raivis / Pinnis, Mārcis / Vasiļevskis, Artūrs / Vasiļjevs, Andrejs / Šics, Valters / Rozis, Roberts / Lagzdiņš, Andis et al. | 2020
- p. 191 | What Can We Learn from Almost a Decade of Food Tweets | Sproģis, Uga / Rikters, Matīss et al. | 2020
- p. 199 | OCR Challenges for a Latvian Pronunciation Dictionary | Strankale, Laine / Paikens, Pēteris et al. | 2020
- p. 207 | Morfio – A Corpus-Based Perspective on Latvian Morphology | Škrabal, Michal / Vondřička, Pavel / Cvrček, Václav et al. | 2020
- p. 215 | Development and Research in Lithuanian Language Technologies (2016-2020) | Utka, Andrius / Vaičenonienė, Jurgita / Briedienė, Monika / Krilavičius, Tomas et al. | 2020
- p. 225 | Quantitative Analysis of Language Competence vs. Performance in Russian- and Lithuanian-Speaking 6 Year-Olds | Balčiūnienė, Ingrida / Kornev, Aleksandr N. et al. | 2020
- p. 233 | Lithuanian Pedagogic Corpus: Correlations Between Linguistic Features and Text Complexity | Boizou, Loïc / Kovalevskaitė, Jolanta / Rimkutė, Erika et al. | 2020
- p. 241 | Detailed Error Annotation for Morphologically Rich Languages: Latvian Use Case | Darģis, Roberts / Auziņa, Ilze / Levāne-Petrova, Kristīne / Kaija, Inga et al. | 2020
- p. 245 | The First Corpus-Driven Lexical Database of Lithuanian as L2 | Kovalevskaitė, Jolanta / Boizou, Loïc / Bielinskienė, Agnė / Jancaitė, Laima / Rimkutė, Erika et al. | 2020
- p. 253 | Error Tagging in the Lithuanian Learner Corpus | Ruzaitė, Jūratė / Dereškevičiūtė, Sigita / Kavaliauskaitė-Vilkinienė, Viktorija / Krivickaitė-Leišienė, Eglė et al. | 2020