CV

Contact Information

Name Tobias Eder
Professional Title Doctoral Researcher in NLP & Trustworthy AI
Email tobias@eder.ai

Professional Summary

My research focuses on building and evaluating trustworthy NLP systems, particularly in high-stakes and multimodal settings. I investigate how data selection, model introspection, and evaluation design can improve reliability beyond standard metrics and across domains including legal text processing, social media analysis, misinformation detection, and content moderation. I am interested in where current systems fail and what that tells us about the gap between benchmark performance and real-world deployment.

Experience

  • 2020 - 2025

    Munich, Germany

    Doctoral Researcher
    Technical University of Munich (TUM), Social Computing Group
    Research, teaching, and project leadership at the intersection of NLP, multimodal learning, and trustworthy AI. Responsible for the full cycle from project acquisition and design through research execution to publication and student mentorship.
    • Led the COLEX research project as PI (Software Campus / BMBF, 2022–2024), managing a team of 5. Delivered a working prototype demonstrated at the Software Campus Summit and handed off to industry partner DATEV.
    • Conducted the technical track of the EMF project (2023–2025): collected and analyzed approximately 400k social media posts across platforms, producing a misinformation landscape analysis that informed the BfS public communication strategy and resulted in a recommendation action plan for misinformation handling.
    • Contributed to the HUILE project (Siemens collaboration, 2021–2023) on human-in-the-loop techniques for enterprise NLP, delivering an internal prototype handed off to Siemens.
    • Designed and taught six courses and lab courses across NLP, explainability, and AI ethics (2020–2025).
    • Supervised approximately 30 master’s theses and 6 bachelor’s theses spanning content moderation, legal NLP, misinformation detection, and explainability.
  • 2018 - 2019

    Munich, Germany

    Software Developer
    Ablacon Inc.
    Full-stack development at a medical AI startup building a mapping system for atrial fibrillation treatment. The company has since been acquired by Boston Scientific.
    • Built the web frontend and backend (Django, Flask) for Ablacon’s Electrographic Flow visualization platform as part of a 6-person technical team.
    • Prototype I contributed to entered the clinical trial pipeline for pre-clinical validation.
  • 2018 - 2018

    Munich, Germany

    NLP Intern
    Siemens Corporate Technology
    • NLP and semantic modeling for enterprise applications.

Education

  • 2021 - 2026

    Munich, Germany

    Doctoral Candidate
    Technical University of Munich (TUM)
    Computer Science
    • Topic: Trustworthy NLP — investigating data quality, model robustness, and evaluation design across high-stakes domains.
    • Supervisor: Prof. Georg Groh, Social Computing Group.
    • Expected graduation: 2026.
  • 2017 - 2020

    Munich, Germany

    M.Sc.
    Technical University of Munich (TUM)
    Data Engineering and Analytics
    • Thesis: Bias Detection in Hate Speech Datasets.
  • 2015 - 2017

    Munich, Germany

    B.Sc.
    Technical University of Munich (TUM)
    Computational Linguistics and Computer Science
  • 2010 - 2015

    Munich, Germany

    M.A.
    Ludwig-Maximilians-Universität München (LMU)
    German Literature, Political Science, and Law
    • ERASMUS exchange year at Queen Mary University of London (2013–2014).

Research Projects

  • 2022 - 2024
    COLEX: Automatic Analysis and Summarization of German Legal Correspondence
    Principal Investigator (Software Campus / BMBF / DATEV, 2022–2024). Developed a graph-based retrieval augmentation system leveraging German legal citation structure for improved context in legal document analysis. Created a multi-perspective summarization corpus for court rulings with distinct summary targets. Built an end-to-end prototype with web frontend integrating all components.
    • Funded by BMBF via Software Campus.
    • Role: Principal Investigator — responsible for project design, hiring, budget, and research direction.
  • 2023 - 2025
    Misinformation Analysis for EMF and 5G Communication
    Technical lead for the computational track (Bundesamt für Strahlenschutz, 2023–2025). Responsible for social media data collection, quantitative analysis of public discourse on electromagnetic fields, and exploratory work on misinformation detection methods.
    • Funded by the German Federal Office for Radiation Protection (BfS).
    • Role: Technical lead for data collection and quantitative analysis.
  • 2021 - 2023
    HUILE: Human-in-the-Loop Techniques for Enterprise Use Cases
    Research on human-model collaboration and feedback-driven learning for enterprise NLP systems (Siemens AG collaboration, 2021–2023).
    • Industry collaboration with Siemens AG.

Publications

  • 2026
    Safer Reasoning Traces: Measuring and Mitigating Chain-of-Thought Leakage in LLMs
    arXiv preprint

  • 2026
    Beyond Hate: Differentiating Uncivil and Intolerant Speech in Multimodal Content Moderation
    CySoc Workshop at ICWSM 2026

  • 2026
    More Than Debunking Misinformation: A Holistic Communication Model for Public Authorities — The Case of Electromagnetic Fields and 5G in Germany
    Research Handbook on Public Communication (Edward Elgar), forthcoming

  • 2023
    Retrieving Users’ Opinions on Social Media with Multimodal Aspect-Based Sentiment Analysis
    IEEE 17th International Conference on Semantic Computing (ICSC 2023)

    Best Paper Award.

  • 2022
    Comparative Analysis of Cross-Lingual Contextualized Word Embeddings
    2nd Workshop on Multi-lingual Representation Learning (MRL) at EMNLP 2022

  • 2022
    Long Input Dialogue Summarization with Sketch Supervision for Summarization of Primetime Television Transcripts
    Workshop on Automatic Summarization for Creative Writing at COLING 2022

  • 2022
    Explaining Neural NLP Models for the Joint Analysis of Open- and Closed-Ended Survey Answers
    2nd Workshop on Trustworthy NLP (TrustNLP) at ACL 2022

  • 2022
    Introducing an Abusive Language Classification Framework for Telegram to Investigate the German Hater Community
    Proceedings of the International AAAI Conference on Web and Social Media (ICWSM 2022)

  • 2022
    Bias and Comparison Framework for Abusive Language Datasets
    AI and Ethics, 2(1), 79–101

  • 2022
    Mediale Hasssprache und technologische Entscheidbarkeit: Zur ethischen Bedeutung subjektiv-perzeptiver Datenannotationen in der Hate Speech Detection
    Medien–Demokratie–Bildung (Springer), 295–310

  • 2021
    Anchor-Based Bilingual Word Embeddings for Low-Resource Languages
    Proceedings of ACL-IJCNLP 2021 (Volume 2: Short Papers), 227–232

  • 2021
    A Scenario-Based Approach to the Design and Use of Ethical AI Models in Managing a Health Pandemic
    Institute for Ethics in Artificial Intelligence, Munich

  • 2018
    Evaluating Bilingual Word Embeddings on the Long Tail
    Proceedings of NAACL-HLT 2018 (Volume 2: Short Papers), 188–193

Teaching

  • 2021 - 2025
    Lectures
    Technical University of Munich
    • Advanced NLP (lecture block on data, bias, and ethics, 2023–2025).
    • Natural Language Processing (tutorial sessions, WS 2021).
  • 2020 - 2025
    Lab Courses
    Technical University of Munich
    • NLP Lab Course (2021–2025): continuous project-based NLP course.
    • Ethical AI Lab Course (2022–2025, summer term): tackling ethically challenging problems in ML and AI.
    • XAI Lab Course (2020–2023, winter term): explainability methods for machine learning.
  • 2021 - 2025
    Seminars
    Technical University of Munich
    • Advanced NLP Seminar (2022–2025, winter term): recent developments in NLP.
    • Ethics for Nerds Seminar (2021–2024, bachelor level): general ethical issues in computer science.
    • Ethics in NLP Seminar (2022–2023, summer term): ethical challenges specific to NLP.
  • 2021 - 2025
    Thesis Supervision
    Technical University of Munich
    • Supervised approximately 30 master’s theses and 6 bachelor’s theses across NLP, content moderation, legal text processing, explainability, and misinformation detection.

Professional Service

  • 2023 - 2026
    ACL Rolling Review (since 2023), NeurIPS (2026), AISTATS, ICWSM, and various workshops
    Peer Review
  • 2021 - 2026
    Association for Computational Linguistics (ACL)
    Professional Membership

Awards

  • 2023

    IEEE 17th International Conference on Semantic Computing (ICSC 2023). For: Retrieving Users’ Opinions on Social Media with Multimodal Aspect-Based Sentiment Analysis.

Skills

Programming: Python (primary), C++/Cython, SQL, JavaScript
ML / NLP: PyTorch, scikit-learn, HuggingFace Transformers/Trainer/Accelerate/Evaluate, Sentence-Transformers, spaCy, gensim, LIME, SHAP
LLM & Fine-tuning: PEFT/LoRA, bitsandbytes, vLLM, Ollama, LangChain
Data & Visualization: pandas, NumPy, matplotlib, seaborn, Jupyter
Web & Prototyping: Django, Flask, React, Gradio, Streamlit
Infrastructure: Docker, Git, Linux, RunPod, PostgreSQL, MongoDB, Weights & Biases, TensorBoard, Hydra
Languages: German (native), English (fluent, academic working language)