Academic Work
Learning From Free-Text Human Feedback – Collect New Datasets Or Extend Existing Ones?
Dominic Petrak, Nafise Sadat Moosavi, Ye Tian, Nikolai Rozanov, Iryna Gurevych. 2023. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 16259–16279, Singapore. Association for Computational Linguistics.
Continuous learning from free-text human feedback, such as error corrections, new knowledge, or alternative responses, is essential for today’s chatbots and virtual assistants to stay up-to-date, engaging, and socially acceptable. However, annotated data for research on methods for learning from such feedback is scarce. To address this, we examine the error and user response types of six popular dialogue datasets of various types, including MultiWoZ, PersonaChat, Wizards-of-Wikipedia, and others, to assess their extendibility with the needed annotations. For this corpus study, we manually annotate a subset of each dataset with error and user response types using an improved version of the Integrated Error Taxonomy and a newly proposed user response type taxonomy. We provide the resulting dataset (EURTAD) to the community. Our findings provide new insights into dataset composition, including error types, user response types, and the relations between them.
Lessons Learned from a Citizen Science Project for Natural Language Processing
Jan-Christoph Klie, Ji-Ung Lee, Kevin Stowe, Gözde Şahin, Nafise Sadat Moosavi, Luke Bates, Dominic Petrak, Richard Eckart De Castilho, Iryna Gurevych. 2023. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, pages 3594–3608, Dubrovnik, Croatia. Association for Computational Linguistics.
Many Natural Language Processing (NLP) systems use annotated corpora for training and evaluation. However, labeled data is often costly to obtain and scaling annotation projects is difficult, which is why annotation tasks are often outsourced to paid crowdworkers. Citizen Science is an alternative to crowdsourcing that is relatively unexplored in the context of NLP. To investigate whether and how well Citizen Science can be applied in this setting, we conduct an exploratory study into engaging different groups of volunteers in Citizen Science for NLP by re-annotating parts of a pre-existing crowdsourced dataset. Our results show that this can yield high-quality annotations and attract motivated volunteers, but also requires considering factors such as scalability, participation over time, and legal and ethical issues. We summarize lessons learned in the form of guidelines and provide our code and data to aid future work on Citizen Science.
Arithmetic-Based Pretraining - Improving Numeracy of Pretrained Language Models
Dominic Petrak, Nafise Sadat Moosavi, Iryna Gurevych. 2023. In Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023), pages 477–493, Toronto, Canada. Association for Computational Linguistics.
State-of-the-art pretrained language models tend to perform below their capabilities when applied out-of-the-box on tasks that require understanding and working with numbers (usually referred to as numeracy). Recent work suggests two main reasons for this: (1) popular tokenisation algorithms have limited expressiveness for numbers, and (2) common pretraining objectives do not target numeracy. Approaches that address these shortcomings usually require architectural changes or pretraining from scratch. In this paper, we propose a new extended pretraining approach called Arithmetic-Based Pretraining that jointly addresses both in one extended pretraining step without requiring architectural changes or pretraining from scratch. Arithmetic-Based Pretraining combines contrastive learning to improve the number representation, and a novel extended pretraining objective called Inferable Number Prediction Task to improve numeracy. Our experiments show the effectiveness of Arithmetic-Based Pretraining in three different tasks that require improved numeracy, i.e., reading comprehension in the DROP dataset, inference-on-tables in the InfoTabs dataset, and table-to-text generation in the WikiBio and SciGen datasets.
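The abstract only names the two components; as a minimal sketch of the contrastive part, the snippet below implements an InfoNCE-style loss that pulls together embeddings of surface variants of the same number (e.g. "2.0" and "2.00") and pushes apart embeddings of different values. The function name, the temperature value, and the choice of PyTorch are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def contrastive_number_loss(anchor, positive, negatives, temperature=0.1):
    """InfoNCE-style loss (sketch, not the paper's exact objective):
    embeddings of surface variants of the same number are pulled together,
    embeddings of different values are pushed apart.
    anchor, positive: (batch, hidden); negatives: (n_neg, hidden)."""
    anchor = F.normalize(anchor, dim=-1)
    positive = F.normalize(positive, dim=-1)
    negatives = F.normalize(negatives, dim=-1)
    pos_sim = (anchor * positive).sum(dim=-1, keepdim=True)  # (batch, 1)
    neg_sim = anchor @ negatives.T                           # (batch, n_neg)
    logits = torch.cat([pos_sim, neg_sim], dim=1) / temperature
    # The positive always sits at index 0 of the logits
    labels = torch.zeros(anchor.size(0), dtype=torch.long, device=anchor.device)
    return F.cross_entropy(logits, labels)
```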
Relations Extraction using Indicators (Master Thesis)
Dominic Petrak. 2021. RheinMain University of Applied Sciences, Wiesbaden, Germany.
Relations between entities are key to understanding semantic context in natural language, and therefore to popular tasks such as question answering, information extraction, or knowledge graph generation. State-of-the-art approaches to machine learning-based relation classification encode the entire sentence using pretrained transformer models, without further consideration of syntactic indicators, such as certain phrases, words, or prepositions, which are more informative than other words and may be beneficial for identifying semantic relations. This thesis investigates the effect of additionally using these indicators for relation extraction.
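As a rough illustration of the idea (not the thesis implementation), one way to surface such indicators to a pretrained encoder is to wrap them in dedicated marker tokens before tokenisation; the marker strings and the model below are assumptions made for this sketch.

```python
from transformers import AutoTokenizer, AutoModel

# Illustrative only: marker tokens and model are assumptions,
# not the configuration used in the thesis.
MODEL = "bert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(MODEL)
tokenizer.add_special_tokens({"additional_special_tokens": ["[IND]", "[/IND]"]})
model = AutoModel.from_pretrained(MODEL)
model.resize_token_embeddings(len(tokenizer))

def mark_indicators(sentence: str, indicators: list[str]) -> str:
    """Wrap each indicator phrase in marker tokens so the encoder can
    attend to it explicitly, instead of treating it like any other word."""
    for phrase in indicators:
        sentence = sentence.replace(phrase, f"[IND] {phrase} [/IND]")
    return sentence

text = mark_indicators("The company was founded in Berlin.", ["founded in"])
inputs = tokenizer(text, return_tensors="pt")
outputs = model(**inputs)  # sentence representation feeds a relation classifier
```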
Semantic Code Search with Neural Bag-of-Words and Graph Convolutional Networks (Paper)
Anna Abad Sieper, Omar Amarkhel, Savina Diez, Dominic Petrak. 2020. SKILL Student Conference @ INFORMATIK 2020 (awarded best paper).
An approach to semantic code search. We investigated two ideas for retrieving the code that best matches a natural language query. The first was to extend a neural Bag-of-Words encoder with TF-IDF weighting. The second was to additionally exploit call hierarchies by using a Graph Convolutional Network trained on the corresponding caller graphs. The Java and Python datasets from GitHub's CodeSearchNet challenge served as the data basis; call hierarchies were added to the Java datasets.
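The paper's exact encoder is not given here, but the first idea can be sketched as follows: a neural Bag-of-Words encoder whose token embeddings are averaged with TF-IDF weights rather than uniformly, so that rare, informative tokens dominate the sequence representation. Class and parameter names are hypothetical.

```python
import torch
import torch.nn as nn

class TfidfNeuralBow(nn.Module):
    """Neural Bag-of-Words encoder (illustrative sketch, not the paper's
    exact architecture): token embeddings are combined by a TF-IDF-weighted
    average instead of a uniform one."""

    def __init__(self, vocab_size: int, dim: int = 128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim, padding_idx=0)

    def forward(self, token_ids: torch.Tensor, tfidf: torch.Tensor) -> torch.Tensor:
        # token_ids: (batch, seq_len), tfidf: (batch, seq_len)
        emb = self.embed(token_ids)                # (batch, seq_len, dim)
        weights = tfidf.unsqueeze(-1)              # (batch, seq_len, 1)
        summed = (emb * weights).sum(dim=1)        # weighted sum over tokens
        norm = weights.sum(dim=1).clamp(min=1e-8)  # avoid division by zero
        return summed / norm                       # TF-IDF-weighted average
```

In such a setup, queries and code snippets would be encoded with encoders of this kind and matched by cosine similarity of the resulting vectors.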
Bug Localization (Bachelor Thesis)
Dominic Petrak. 2019. RheinMain University of Applied Sciences, Wiesbaden, Germany.
Localizing faulty source code based on human-written bug reports (Static Bug Localization) has long been a subject of research in information retrieval and machine learning. In 2018, Bench4BL, a benchmark dataset and evaluation framework for this task, was proposed, and its authors used it to compare state-of-the-art approaches. In this thesis, those approaches were studied and the most promising ideas combined into a new approach, which was then trained and evaluated on Bench4BL and compared against the existing approaches.
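As background on the kind of approach compared in Bench4BL, the sketch below shows the classic information-retrieval baseline: ranking source files by TF-IDF cosine similarity to the bug report text. It is a minimal illustration under assumed names, not the combined approach developed in the thesis.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def rank_files(bug_report: str, files: dict[str, str]) -> list[tuple[str, float]]:
    """Rank source files by TF-IDF cosine similarity to a bug report --
    the classic IR baseline underlying many Bench4BL approaches.
    (Minimal sketch; the thesis combines several richer ideas.)"""
    paths = list(files)
    vectorizer = TfidfVectorizer(token_pattern=r"[A-Za-z_]\w+")
    # Fit on report + files so both share one vocabulary
    matrix = vectorizer.fit_transform([bug_report] + [files[p] for p in paths])
    scores = cosine_similarity(matrix[0], matrix[1:]).ravel()
    return sorted(zip(paths, scores), key=lambda x: -x[1])

ranking = rank_files(
    "NullPointerException when saving the user profile",
    {"UserProfile.java": "// persists the user profile\nclass UserProfile { void save() {} }",
     "Login.java": "// handles authentication\nclass Login { void login() {} }"},
)
# ranking[0][0] == "UserProfile.java"
```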