Benjamin Roth

News:

July, 2025: Happy to host ACL 2025 in Vienna as a local Co-Chair
June, 2025: The research group data mining and machine learning is looking for a PhD student (funded) in Natural Language Processing (Deadline: 24 June 2025)
April, 2024: The research group data mining and machine learning is looking for a Postdoctoral Researcher in Natural Language Processing (filled)
April 24, 2024: Invited Lecture on Open LLMs at the Digital Humanities lecture series, University of Vienna
April 15, 2024: Tutorial on Open LLMs at the Digital Innovation Lab, University of Vienna
September 29, 2023: Tutorial on Weak Supervision, Aalborg University (Copenhagen, several group members)
July 17, 2023: “How to evaluate language-based AI systems”, invited talk at the Center for Ad- vanced Studies Munich (Benjamin Roth)
March 29, 2023: Snorkel ML Research Talk: Weak Supervision at the University of Vienna (online)
November 5, 2022: AKBC 2022 Workshop on Weak, Indirect and Self Supervision for Knowledge Extraction
October 19, 2022: Invited talk at the Austrian Research Institute for Artificial Intelligence
September 13, 2022: Invited talk at SemAI 2022: First Workshop on Semantic AI
July 2, 2022, Invited talk at the CesIfo Venice Summer Institute for Microeconomic Research
November 10, 2021: Invited talk at the Third European Language Resource Coordination (ELRC) workshop in Austria, Vienna
August 12, 2021: Vienna Workshop on Weak Supervision and Natural Language Processing (VieWSNLP)
May 7, 2021: ICLR 2021 Workshop on Weakly Supervised Learning, WeaSuL
April, 2021: PhD position in machine learning for natural language processing (call for applications, deadline: May 5, 2021)
April, 2021: Check out Knodle, a framework for weakly supervised deep learning: knodle.cc (on github)
April 27, 2021: Invited talk on Weakly supervised machine learning for text analysis at the German Society for Computational Linguistics and Language Technology
January 19, 2021: Invited talk on Automating linguistic tests using pre-trained language models at the Wiener Sprachgesellschaft
November 2020: I am looking for a PhD student in deep learning for natural language processing at the University of Vienna. (filled)
June 2020: I am very happy to announce that I have accepted a professorship offer at the University of Vienna, starting this fall! Information about PhD and PostDoc positions will follow soon.
September 9, 2019: Invited talk at the Quebec Artificial Intelligence Institute (Mila) on Interpretable Question Answering
May 24, 2019: Invited talk at the Workshop on Representation Learning for Complex Data, Université Lumière Lyon 2
April 2019: PhD Position in NLP and Deep Learning (closed)
December 2018: NAACL Workshop on extracting structured knowledge from scientific publications accepted! Workshop papers due: Wednesday February 27, 2019
November 2018: DFG Project on “Representing Sets in Embeddings of Relational Information” with one PhD position got approved
October 11, 2018: Invited Talk on Relational information extraction at Munich NLP Meetup (slides)
October 8, 2018: Invited Talk on Relation extraction for non-standard agruments at Apple Siri, Cambridge/UK

Current PhD students

Loris Schönegger (2024-)
Lukas Thoma (2022-)
Pedro Henrique Luz de Araujo (2022-)
Yuxi Xia (2022-)
Vasiliki Kougia (2021-)
Andreas Stephan (2021-)
Anastasiia Sedova (2020-)
Luisa März (2019-2023)

Y Xia, PHL de Araujo, K Zaporojets, B Roth
Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles
Accepted to ACL 2025

B Roth, PHL de Araujo, Y Xia, S Kaltenbrunner, C Korab
Specification Overfitting in Artificial Intelligence
Artificial Intelligence Review, 2025

A Sedova, R Litschko, D Frassinelli, B Roth, B Plank
To Know or Not To Know? Analyzing Self-Consistency of Large Language Models under Ambiguity
EMNLP Findings 2024

PHL De Araujo, B Roth
Functionality learning through specification instructions
EMNLP Findings 2024

M Aßenmacher, A Stephan, L Weissweiler, E Çano, I Ziegler, M Härttrich, B Bischl, B Roth, C Heumann, H Schütze
Collaborative Development of Modular Open Source Educational Resources for Natural Language Processing
ACL 2024 Workshop on Teaching NLP

V Kougia, A Sedova, A Stephan, K Zaporojets, B Roth
Analysing zero-shot temporal relation extraction on clinical notes using temporal consistency
ACL 2024 Workshop on Biomedical NLP

P Dolog, Y Sadikaj, Y Velaj, A Stephan, B Roth, C Plant
The Impact of Cluster Centroid and Text Review Embeddings on Recommendation Methods
ACM Web Conference (WWW) 2024

L Zellinger, A Stephan, B Roth
Counterfactual Reasoning with Knowledge Graph Embeddings
EACL 2024

A Stephan, L Miklautz, K Sidak, JP Wahle, B Gipp, C Plant, B Roth
Text-Guided Image Clustering
EACL 2024

A Baumann, A Stephan, B Roth
Seeing through the mess: evolutionary dynamics of lexical polysemy
EMNLP 2023

A Sedova, B Roth
ULF: Unsupervised Labeling Function Correction using Cross-Validation for Weak Supervision
EMNLP 2023

PHL de Araujo, B Roth
Cross-functional Analysis of Generalisation in Behavioural Learning
TACL 2023

A Sedova, B Roth
ACTC: Active Threshold Calibration for Cold-Start Knowledge Graph Completion
ACL 2023

A Sedova, L Zellinger, B Roth
Learning with Noisy Labels by Adaptive Gradient-Based Outlier Removal
ECML PKDD 2023

V Kougia, S Fetzel, T Kirchmair, E Çano, S Moayed Baharlou, S Sharifzadeh, B Roth
MemeGraphs: Linking Memes to Knowledge Graphs
ICDAR 2023

L März, E Asgari, F Braune, F Zimmermann, B Roth
XPASC: Measuring Generalization in Weak Supervision by Explainability and Association
Preprint

A Stephan, V Kougia, B Roth
SepLL: Separating Latent Class Labels from Weak Supervision Noise
Findings of EMNLP 2022

PHL de Araujo, B Roth
Checking HateCheck: a cross-functional analysis of behaviour-aware learning for hate speech detection
ACL 2022 Workshop on Efficient Benchmarking in NLP (NLP Power!)

A Stephan, B Roth
WeaNF: Weak Supervision with Normalizing Flows
RepL4NLP 2022

B Roth, E Çano
Focused Contrastive Training for Test-based Constituency Analysis
NeurIPS 2021 Workshop on Self-Supervised Learning

L März, E Asgari, F Braune, F Zimmermann, B Roth
KnowMAN: Weakly Supervised Multinomial Adversarial Networks
EMNLP 2021

A Sedova, A Stephan, M Speranskaya, B Roth
Knodle: Modular Weakly Supervised Learning with PyTorch
RepL4NLP 2021

L März, S Schweter, N Poerner, B Roth, H Schütze
Data Centric Domain Adaptation for Historical Text with OCR Errors
ICDAR 2021

MA Hedderich, B Roth, K Kann, B Plank, A Ratner, D Klakow (editors)
Proceedings of the First Workshop on Weakly Supervised Learning (WeaSuL)
ICLR 2021 Workshop on Weakly Supervised Learning

B Roth, M Wiegand
Python for Linguists (book review)
Computational Linguistics 2021

M Speranskaya, M Schmitt, B Roth
Ranking vs. Classifying: Measuring Knowledge Base Completion Quality
AKBC 2020

J Jungmaier, N Kassner, B Roth
Dirichlet-Smoothed Word Embeddings for Low-Resource Settings
LREC 2020

E Asgari, F Braune, B Roth, C Ringlstetter, M Mofrad
UniSent: Universal Adaptable Sentiment Lexica for 1000+ Languages
LREC 2020

R Rojowiec, M Fink, B Roth
Intent Recognition in Doctor-Patient Interviews
LREC 2020

A Sydorova, N Poerner, B Roth
Interpretable Question Answering on Knowledge Bases and Text
ACL 2019

L März, D Trautmann, B Roth
Domain adaptation for part-of-speech tagging of noisy user-generated text
NAACL 2019

B Roth, C Conforti, N Poerner, S Karn and H Schütze.
Neural Architectures for Open-Type Relation Argument Extraction.
Natural Language Engineering, 2019 (preprint)

M Schmitt, S Steinheber, K Schreiber, B Roth.
Joint Aspect and Polarity Classification for Aspect-based Sentiment Analysis with End-to-End Neural Networks.
EMNLP 2018.

N Poerner, H Schütze, B Roth.
Evaluating neural network explanation methods using hybrid documents and morphological prediction.
ACL 2018.

P Gupta, B Roth, H Schütze.
Joint Bootstrapping Machines for High Confidence Relation Extraction.
NAACL 2018.

M Schulder, M Wiegand, J Ruppenhofer and B Roth.
Towards Bootstrapping a Polarity Shifter Lexicon using Linguistic Features.
IJCNLP 2017.

H Adel, B Roth and H Schütze.
Comparing Convolutional Neural Networks to Traditional Models for Slot Filling.
NAACL 2016.

P Verga, D Belanger, E Strubell, B Roth, and A McCallum.
Multilingual Relation Extraction using Compositional Universal Schema.
NAACL, 2016.

M Schuhmacher, B Roth, S Ponzetto and L Dietz.
Finding Relevant Relations in Relevant Documents.
ECIR 2016.

B Roth, N Monath, D Belanger, E Strubell, P Verga, A McCallum.
Building Knowledge Bases with Universal Schema: Cold Start and Slot-Filling Approaches.
NIST Text Analysis Conference 2015.

A Neelakantan, B Roth and A McCallum.
Compositional Vector Space Models for Knowledge Base Completion.
ACL 2015.

M Wiegand, B Roth and D Klakow.
Combining Pattern-based and Distributional Similarity for Graph-based Noun Categorization.
NLDB 2015.

B Roth, E Strubell, K Silverstein and A McCallum.
Minimally Supervised Event Argument Extraction using Universal Schema.
NIPS 2014 Workshop on Knowledge Extraction (AKBC).

A Neelakantan, B Roth and A McCallum.
Knowledge Base Completion using Compositional Vector Space Models.
NIPS 2014 Workshop on Knowledge Extraction (AKBC).

M Wiegand, B Roth and D Klakow.
Automatic Food Categorization from Large Unlabeled Corpora and Its Impact on Relation Extraction.
EACL 2014.

B Roth, T Barth, G Chrupala, M Gropp, D Klakow.
RelationFactory: A Fast, Modular and Effective System for Knowledge Base Population.
EACL 2014 (software demo).

J Illig, B Roth and D Klakow.
Unsupervised Parsing for Generating Surface-Based Relation Extraction Patterns.
EACL 2014.

B Roth, T Barth, M Wiegand, M Singh, D Klakow.
Effective Slot Filling Based on Shallow Distant Supervision Methods.
NIST Text Analysis Conference 2013.

B Roth, D Klakow.
Combining Generative and Discriminative Model Scores for Distant Supervision.
EMNLP 2013.

B Roth, D Klakow.
Feature-Based Models for Improving the Quality of Noisy Training Data for Relation Extraction.
CIKM 2013.

B Roth, T Barth, M Wiegand, D Klakow.
A Survey of Noise Reduction Methods for Distant Supervision.
CIKM 2013 Workshop on Knowledge Extraction (AKBC).

M Wiegand, B Roth, D Klakow.
Web-based Relation Extraction for the Food Domain.
NLDB 2012.

B Roth, G Chrupala, M Wiegand, M Singh, D Klakow.
Generalizing from Freebase and Patterns using Distant Supervision for Slot Filling.
NIST Text Analysis Conference 2012.

M Wiegand, B Roth, D Klakow.
Knowledge Acquisition with Natural Language Processing in the Food Domain: Potential and Challenges.
ECAI 2012 Workshop: Cooking with Computers.

M Wiegand, B Roth, D Klakow.
Data-driven Knowledge Extraction for the Food Domain.
KONVENS 2012.

M Wiegand, B Roth, E Lasarcyk, S Köser, D Klakow.
A Gold Standard for Relation Extraction in the Food Domain.
LREC 2012.

B Roth, A McCallum, M Dymetman and N Cancedda.
Machine Translation Using Overlapping Alignments and SampleRank.
AMTA 2010.

G Chrupala, G Dinu and B Roth.
Enriched Syntax-based Meaning Representation for Answer Extraction.
SIGIR 2010 Workshop: Query Representation and Understanding.

B Roth and D Klakow.
Cross-Language Retrieval Using Link-Based Language Models.
SIGIR 2010.

L Li, B Roth and C Sporleder.
Topic Models for Word Sense Disambiguation and Token-Based Idiom Detection.
ACL 2010.

M Wiegand, A Balahur, B Roth, D Klakow and A Montoyo.
A Survey on the Role of Negation in Sentiment Analysis.
2010 Workshop on Negation and Speculation in NLP.

B Roth and D Klakow.
Combining Wikipedia-Based Concept Models for Cross-Language Retrieval.
IRF Conference 2010.

Benjamin Roth

Publications