👷 Readings in Digital Typography, Scientific Visualization, Information Retrieval and Machine Learning

[Michal Štefánik and Vítek Novotný]: Poster session for the ALPS NLP winter school, January 14, 2021

Poster sessions at ALPS 2021 will take place on the gather.town virtual platform.
At this final seminar of fall 2020, Vítek and Michal will wander around gather.town and present their ALPS 2021 posters.

Michal Štefánik: Abstract

Development of current state-of-the-art language models is heavily focused on the Transformer architecture, which can be pre-trained on vast corpora and then relatively quickly fine-tuned to a specific task. In the pre-training stage, these models are usually trained on Masked Language Modeling, while in the fine-tuning stage, they are trained to minimise cross-entropy on either token-level or sequence-level tasks.
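
To make the two objectives concrete, here is a minimal PyTorch sketch (not code from the poster) contrasting the token-level masked-language-modeling loss with a sequence-level fine-tuning cross-entropy; the model, the data, and the MASK_ID token are toy placeholders.

```python
import torch
import torch.nn.functional as F

vocab_size, hidden, seq_len, batch = 1000, 64, 16, 4
MASK_ID = 0  # hypothetical [MASK] token id

encoder = torch.nn.TransformerEncoder(
    torch.nn.TransformerEncoderLayer(d_model=hidden, nhead=4, batch_first=True),
    num_layers=2,
)
embed = torch.nn.Embedding(vocab_size, hidden)
lm_head = torch.nn.Linear(hidden, vocab_size)   # token-level head (MLM)
cls_head = torch.nn.Linear(hidden, 2)           # sequence-level head

tokens = torch.randint(1, vocab_size, (batch, seq_len))

# Pre-training: Masked Language Modeling.
mask = torch.rand(tokens.shape) < 0.15          # mask ~15% of positions
inputs = tokens.masked_fill(mask, MASK_ID)
mlm_logits = lm_head(encoder(embed(inputs)))    # (batch, seq, vocab)
mlm_loss = F.cross_entropy(                     # token-level cross-entropy,
    mlm_logits[mask], tokens[mask])             # only on masked positions

# Fine-tuning: sequence-level cross-entropy, e.g. binary classification.
labels = torch.randint(0, 2, (batch,))
pooled = encoder(embed(tokens)).mean(dim=1)     # simple mean pooling
clf_loss = F.cross_entropy(cls_head(pooled), labels)

print(f"MLM loss: {mlm_loss:.3f}, classification loss: {clf_loss:.3f}")
```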

While these fine-tuning objectives can be successfully optimised, this often comes at the price of a loss of generality of the network. Significantly, such generality loss often goes unnoticed during the training process, as both the loss and the measured performance are tightly bound to the optimised objective. As a consequence, the network becomes prone to what Kahneman calls the availability heuristic in humans: the system seeks any shortcut heuristic that would allow it to cut down the loss further.

Our experiments show this problem with neural translators, where the model overfits to the specific lengths of the parallel corpus pairs, which cannot be observed from the BLEU reported on its own validation data set, and with summarization, where the model learns to write syntactically coherent output but fails to frame and propagate the key points of the input. Again, being able to properly match declension and other morphology has a beneficial, yet misleading impact on validation ROUGE.
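
As an illustration of the first failure mode, a diagnostic along these lines (a sketch with placeholder data, using the sacrebleu library; not the poster's evaluation code) shows how an aggregate validation BLEU can hide length overfitting that a per-length-bucket breakdown reveals.

```python
from collections import defaultdict
import sacrebleu

hypotheses = ["the cat sat on the mat", "hello world"]       # model outputs
references = ["a cat sat on the mat", "hello to the world"]  # gold targets

# Aggregate BLEU -- the number usually reported on the validation set.
print("corpus BLEU:", sacrebleu.corpus_bleu(hypotheses, [references]).score)

# BLEU broken down by reference length: a model that overfits a specific
# length range will score well in its bucket and poorly elsewhere.
buckets = defaultdict(lambda: ([], []))
for hyp, ref in zip(hypotheses, references):
    bucket = len(ref.split()) // 10  # 0-9 tokens, 10-19 tokens, ...
    buckets[bucket][0].append(hyp)
    buckets[bucket][1].append(ref)

for bucket, (hyps, refs) in sorted(buckets.items()):
    score = sacrebleu.corpus_bleu(hyps, [refs]).score
    print(f"length {bucket * 10}-{bucket * 10 + 9}: BLEU = {score:.1f}")
```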

This poster aims to familiarise the audience with this common flaw, analyse its causes, and outline and promote possible research directions that could help our language systems move towards higher levels of generality.

Michal Štefánik: On Eliminating Inductive Biases of Deep Language Models
Download PDF

Vítek Novotný: Abstract

Since the seminal work of Mikolov and colleagues, word vectors of log-bilinear models have found their way into many NLP applications. Later, Mikolov and colleagues equipped their log-bilinear model with positional weighting, which allowed them to reach state-of-the-art performance on the word analogy task.
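
For readers unfamiliar with the positional model, the following numpy sketch (reconstructed from the description in Mikolov et al., 2018, not the authors' released code) shows the core idea: each context position p receives a learned weight vector d_p that reweights the context word vector element-wise before averaging, whereas plain CBOW treats all positions identically.

```python
import numpy as np

rng = np.random.default_rng(0)
dim, window = 8, 2                       # toy dimensionality and window size
vocab = ["the", "quick", "brown", "fox", "jumps"]
U = {w: rng.normal(size=dim) for w in vocab}   # input word vectors
D = rng.normal(size=(2 * window, dim))         # learned positional vectors d_p

def cbow_context(words):
    """Plain CBOW: the context is the average of the word vectors."""
    return np.mean([U[w] for w in words], axis=0)

def positional_context(words):
    """Positional model: average of element-wise products d_p * u_w."""
    return np.mean([D[p] * U[w] for p, w in enumerate(words)], axis=0)

context = ["the", "quick", "fox", "jumps"]     # window around "brown"
print(cbow_context(context)[:4])
print(positional_context(context)[:4])
```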

Although the positional model improves accuracy on the intrinsic word analogy task, prior work has neglected qualitative evaluation of its linguistic properties as well as quantitative evaluation on extrinsic end tasks. We open-source the positional model and evaluate it using qualitative and quantitative tasks.

We show that the positional model captures information about parts of speech and self-information. We also show that the positional model consistently outperforms non-positional models on text classification and language modeling.

Vítek Novotný: When FastText Pays Attention
Efficient Estimation of Word Representations Using Positional Weighting
Download PDF

Google Slides link