You are invited to join us for a day full of machine learning.
A great space for learning and networking.
The conference will take place at B-Part, Berlin. The exact program will be announced soon. Like last year, food will be provided throughout the day. Since capacity is limited, we encourage you to register early.
This year we introduce the concept of a thematic stream: in 2024, the Public Sector stream.
In light of our recent uptick in public sector machine learning projects and of legislative changes, we're launching a stream dedicated to the public sector. The details of the two relevant program points are outlined below; these program points are marked with a public sector badge.
Please register to reserve a place. Limited places are available. We look forward to seeing you in May!
Conference Schedule
Doors open
Arrive and connect with others.
Introduction
Welcome to the dida conference.
Decision Process Automation with Large Language Models
Large Language Models impress with their adeptness in context-aware text generation and reasoning. Downstream models fine-tuned on chat data have the remarkable ability to be directed towards solving tasks described in natural language, without further explicit weight adaptation. In relevant applications, interesting use cases often relate multiple external data sources to each other and are characterized by a complex, multi-step decision process. In this talk, we discuss how predefining decision steps and integrating external data filtering can break down multifaceted problems into manageable, self-contained language processing tasks that can readily be solved by LLMs.
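The decomposition idea can be sketched as a pipeline of predefined steps, each a self-contained language task whose answer feeds into later steps. This is an illustrative sketch only: the `llm` function is a hypothetical rule-based stand-in for a real chat-model call, and the step names and prompts are invented.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

def llm(prompt: str) -> str:
    """Hypothetical stand-in for a call to a chat-tuned LLM.
    A toy rule-based stub keeps the sketch self-contained and runnable."""
    if "classify" in prompt:
        return "invoice"
    if "extract" in prompt:
        return "2024-05-01"
    return "approve"

@dataclass
class DecisionStep:
    name: str                            # key under which the answer is stored
    build_prompt: Callable[[Dict], str]  # turns the shared context into a prompt

def run_pipeline(steps: List[DecisionStep], context: Dict) -> Dict:
    """Run the predefined decision steps in order; each step is a
    self-contained language task whose answer is written back into
    the shared context, so later steps can build on earlier answers."""
    for step in steps:
        context[step.name] = llm(step.build_prompt(context))
    return context

steps = [
    DecisionStep("doc_type", lambda c: f"classify this document: {c['text']}"),
    DecisionStep("due_date", lambda c: f"extract the due date: {c['text']}"),
    DecisionStep("decision", lambda c: f"decide, given a {c['doc_type']} due {c['due_date']}"),
]

result = run_pipeline(steps, {"text": "Invoice, payable by 2024-05-01."})
print(result["decision"])  # → approve
```

The same pattern extends naturally to external data filtering: a step's `build_prompt` can query a database or document store before constructing its prompt.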
Pretraining AI models for earth observation: transfer-learning and meta-learning
Pretraining involves training an AI model on a large dataset to learn general features, which can then be fine-tuned on specific tasks with smaller datasets. This decreases the need for time-intensive dataset acquisition and training efforts for each new use case, reducing the costs of application development. While pretrained models are widely used in computer vision and natural language processing, their adoption for satellite data and earth observation applications remains limited. Our investigation focuses on comparing the capabilities of transfer-learning and meta-learning approaches for the pretraining of AI models in earth observation tasks, particularly crop type classification, and their potential to generalize insights across different geographical regions.
Coffee break
Anomaly Detection in Track Scenes
Within the sector initiative “Digitale Schiene Deutschland”, our client Deutsche Bahn is developing an automated driving system for trains. As part of the efforts towards such a system, we developed, together with Deutsche Bahn, a machine learning solution that detects anomalous and hazardous objects on and around the tracks using onboard RGB cameras. The system is intentionally required not simply to detect objects within a given collection of classes (such as people, signals or vehicles), but to be able to detect any object and rank objects by how anomalous they are. This presentation explains the challenges encountered, presents several approaches explored, and provides an overview of the final solution: in order to detect objects of possibly unknown classes, we developed a unique pipeline containing multiple machine learning components, including a monocular depth estimation model, a segmentation stage, image embedding models and an anomaly detection model. As a dataset, Digitale Schiene Deutschland provided us with OSDAR23, an open dataset that contains 45 scenes. Each scene contains images taken by several RGB and infrared cameras, together with radar and lidar data. The dataset contains annotations for twenty classes of objects, which we use both for fine-tuning our model and for evaluating the final results. In addition, we were granted access to a larger amount of unannotated data, which was used for self-supervised learning.
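As a toy illustration of the embedding-based anomaly ranking idea, consider scoring each detected object by the distance of its embedding to the nearest embedding of ordinary scene content. The 2-D embeddings and object names below are invented for the sketch; the real pipeline uses high-dimensional learned image embeddings.

```python
import math

# Reference embeddings of ordinary scene content (invented 2-D toy values).
references = [(0.0, 0.0), (1.0, 0.0)]

def anomaly_score(embedding, reference_embeddings):
    """Score an object by the distance of its embedding to the nearest
    reference embedding: far from everything ordinary means anomalous."""
    return min(math.dist(embedding, ref) for ref in reference_embeddings)

# Invented detections from a track scene.
objects = {"signal": (0.9, 0.1), "stray object": (4.0, 3.0)}

# Rank detected objects from most to least anomalous.
ranked = sorted(objects, key=lambda name: anomaly_score(objects[name], references),
                reverse=True)
print(ranked)  # → ['stray object', 'signal']
```

Because the score is a distance rather than a class probability, objects of entirely unseen classes can still surface at the top of the ranking.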
Lunch Break
Food and drinks will be available for all guests.
Diffusion Models for Speech Enhancement
Diffusion models have emerged as a distinct class of generative models with an impressive ability to learn complex data distributions such as those of natural images, music, and human speech. In the context of speech enhancement, diffusion models can be used to learn the conditional distribution of clean speech given the noisy mixture. Following this idea, we have proposed the method “Score-based Generative Models for Speech Enhancement” (SGMSE), a continuous-time diffusion model based on an Ornstein-Uhlenbeck process. In our experiments, we show speech enhancement performance competitive with predictive baselines, and better generalization when evaluated in a mismatched training scenario. Subjective listening tests show that, on average, the enhanced speech is preferred over the predictive baselines and is often perceived as natural-sounding. However, for very challenging inputs, the model tends to hallucinate and generates speech-like sounds without semantic meaning. To address this problem, we have combined predictive and generative approaches and conditioned the model on visual input of the speaker’s lip movements. Moreover, to improve robustness and address the slow sampling speed of diffusion models, we have used a Brownian bridge as the underlying stochastic process and proposed a two-step training scheme for diffusion-based speech enhancement that enables single- and few-step generation.
Data Extraction in the Age of LLMs
In recent years, the advent of Large Language Models (LLMs) has changed the landscape of data extraction. These LLMs boast unparalleled text processing capabilities and come pre-trained on vast amounts of data, rendering them effective for information retrieval tasks. However, traditional methods such as graph neural networks and extractive models have historically been favored for their efficiency in resource utilization. Despite this, the question persists: how do LLMs compare with those models in practical data extraction applications? This presentation aims to delve into this inquiry, providing a comprehensive examination of LLMs' advantages and disadvantages compared to extractive models. Drawing from our project experiences and internal research, we aim to elucidate the practical implications of utilizing LLMs for data extraction, offering insights into their efficacy, resource requirements, and overall performance in real-world scenarios. Through this exploration, attendees will gain a deeper understanding of the role of LLMs in modern data extraction workflows and the considerations involved in their implementation.
Coffee break
On the Geometry of Images in Human and Neural Network Representation Spaces
The relationship between human cognition and neural network interpretation of the world is one of the most intriguing questions in modern AI research. This talk explores a recent line of inquiry into this question using images and addresses three key issues: How can we derive a geometric object that encapsulates human understandings of images? How does the geometry of these image representations compare to that derived from neural network representations? Can we realign a neural network representation to achieve a more human-like understanding of the world, thereby improving neural network performance?
New Opportunities: Applications of Machine Learning Technologies in the Public Sector
This presentation dives into the transformative potential of machine learning (ML) technologies in the public sector, highlighting opportunities for efficiency, transparency, and improved service delivery. Using a case example from practice, it illustrates how ML applications have already found their way into public processes.
Ambient DJ Set: Karolina
Check out Karolina on SoundCloud.
Poster Session
Jonas Golde (HU Berlin): fabricator - An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs
Fabio Barth (HU Berlin): Occiglot: A research collective for open-source development of Large Language Models by and for Europe
Jacek Wiland, Max Ploner (HU Berlin): BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models
Mark Budgen (dida): Defect detection and classification for unroasted coffee beans
Maximilian Trescher (dida): Artificial intelligence for detecting respiration in dairy cows
William Clemens (dida): LaserSKI - Defect Detection in Laser Diodes
Konrad Mundinger (Zuse Institute Berlin): Neural Parameter Regression for Explicit Representations of PDE Solution Operators
Julius Richter (Universität Hamburg): Diffusion Models for Audio-Visual Speech Enhancement
Jakob Wagner (appliedAI): Neural Operators: Solutions with the Continuity Framework
Faried Abu Zaid (appliedAI): Applications of Uniformly Scaling Flows
Robert Müller (appliedAI): Imitation Actor Critic
Dinner & Networking
Food and drinks will be available for all guests.
Workshops: Room 1
Generative AI and legal implications
Generative AI is the talk of the town at the moment. Every week, a new model is released that can be used to generate videos, images, music or text. Many companies are currently faced with the question of whether and how they can utilise such AI. The legal framework plays an important role here.
The EU is currently working on new rules for artificial intelligence, but existing laws also contain guidelines and specifications for the use of AI. It is important to keep the entire life cycle of AI in mind, from training to use in real life. Each of these steps can give rise to complex problems, some of which require creative solutions. Christian Dürschmied will introduce you to the potentials and risks associated with the use of AI. Learn all about the do's and don'ts in relation to Generative AI.
Who develops AI? Who benefits from AI?
We want to take a closer look at who the minds behind world-changing ML models are, and who actually benefits from the mined data and the deployed models. We will look at some worrying developments as well as some ideas on how to design participatory AI.
Fusion of Innovation and People – How are Intrapreneurship and Psychological Safety interconnected in modern work dynamics?
In the contemporary work landscape, the interaction between Intrapreneurship and Psychological Safety is crucial, as both concepts play integral roles in fostering an environment conducive to innovation and risk-taking within organizations. Intrapreneurship empowers employees to embrace entrepreneurial roles within the company, driving creativity and initiative, whereas Psychological Safety ensures that work teams feel safe taking interpersonal risks and sharing their ideas without fear of reprisal. We want to explore together which insights and methods lead to a fusion of innovation and the well-being of the people who define the organisation and its performance.
Ideation Workshop to explore potentials of computer vision and NLP in the public sector
Join us for a 45-minute workshop where we will explore the fields of application and potentials of computer vision and NLP in the public sector. We will connect with each other, co-work and share ideas on existing best practices and future ML ideas tailored to stakeholder needs.
Workshops: Room 2
Flexible training pipelines with Pytorch Lightning and Hydra
Training machine learning models requires rapid iteration and experimentation with hyperparameters, architectures and even entire approaches. At the same time, an extensive training codebase is difficult to maintain. In this workshop we introduce a modular training framework powered by PyTorch Lightning and Hydra that enables components to be swapped seamlessly, as a potential solution to these challenges.
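The core pattern such frameworks build on is Hydra's `_target_`-based instantiation, where a config entry names the class to construct and the remaining keys become constructor arguments. The sketch below re-implements that idea in a few lines of plain Python so it runs without the hydra-core dependency; a real setup would use `hydra.utils.instantiate` on an OmegaConf config.

```python
import importlib

def instantiate(cfg: dict):
    """Minimal re-implementation of the Hydra `_target_` pattern: the
    config names a class by its import path, and all other keys are
    passed to the constructor. (Real code: hydra.utils.instantiate.)"""
    module_name, cls_name = cfg["_target_"].rsplit(".", 1)
    cls = getattr(importlib.import_module(module_name), cls_name)
    kwargs = {k: v for k, v in cfg.items() if k != "_target_"}
    return cls(**kwargs)

# Swapping a component then only requires changing the config, e.g.
#   python train.py model.optimizer._target_=torch.optim.AdamW
# Here we instantiate a stdlib class so the sketch runs anywhere:
config = {"_target_": "collections.Counter", "red": 2, "blue": 1}
counter = instantiate(config)
print(counter["red"])  # → 2
```

Because the model, datamodule and trainer are all created this way, an experiment variant is just a different config file, not a code change.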
LangChain Expression Language for building LLM production pipelines
LangChain is widely known as a prominent Python library for interfacing with LLMs. However, it was primarily used for constructing proofs of concept (POCs), as it lacked the capability to build intricate, scalable applications. In this workshop, we provide an overview of the capabilities of the LangChain Expression Language (LCEL) to enhance the efficiency and flexibility of constructing chain components.
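The heart of LCEL is composing runnables with the `|` operator, as in `chain = prompt | model | output_parser`. The toy `Runnable` class below mimics that composition pattern without the LangChain dependency; the three components are invented stand-ins for a prompt template, a chat model and an output parser.

```python
class Runnable:
    """Toy stand-in for LCEL's composable runnables: `|` chains
    components so the output of one becomes the input of the next."""
    def __init__(self, fn):
        self.fn = fn

    def invoke(self, x):
        return self.fn(x)

    def __or__(self, other):
        return Runnable(lambda x: other.invoke(self.invoke(x)))

# Invented stand-ins for a prompt template, a chat model and an output parser.
prompt = Runnable(lambda topic: f"Tell me a joke about {topic}.")
model = Runnable(lambda p: f"  model answer to: {p}  ")
parser = Runnable(lambda s: s.strip())

chain = prompt | model | parser  # LCEL-style composition
answer = chain.invoke("bears")
print(answer)  # → model answer to: Tell me a joke about bears.
```

In LangChain itself, chains built this way also gain batching, streaming and async execution for free, which is what makes LCEL suitable for production pipelines rather than just POCs.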
Paper reading group
dida has a weekly internal reading group where our ML scientists discuss recent papers and how they can help with our projects. In this session we'll hold a live reading group meeting. We will discuss the papers LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images and KAN: Kolmogorov–Arnold Networks.
Ideation Workshop to explore potentials of computer vision and NLP in the public sector
Join us for a 45-minute workshop where we will explore the fields of application and potentials of computer vision and NLP in the public sector. We will connect with each other, co-work and share ideas on existing best practices and future ML ideas tailored to stakeholder needs.
Fabian Dechent
Fabian studied theoretical physics at the Humboldt University of Berlin and is a machine learning scientist at dida, currently specializing in LLMs.
Dr. Jan Macdonald
Jan holds a PhD in mathematics (TU Berlin), focusing on applied topics in optimization, functional analysis, and image processing. At dida he works as a machine learning scientist.
Dr. Maximilian Trescher
Max obtained his PhD in theoretical quantum and solid state physics from FU Berlin. At dida, he works as a machine learning scientist and project lead.
Julius Richter
Julius Richter is a PhD student in machine learning at Universität Hamburg, focusing on audio processing with diffusion-based generative modeling.
Axel Besinger
At dida, Axel works at the intersection of business development and customer engineering. He is the product lead of smartextract.
Dr. Augusto Stoffel
Augusto holds a PhD in mathematics (University of Notre Dame, USA) and did research in the field of algebraic topology and its application as a foundation of quantum field theory. At dida he works as a machine learning scientist.
Holger Pannhorst
Holger is an economist with a background in statistics. With more than 15 years of experience in the data and analytics world, he now leads the data and analytics department at the Bundesdruckerei.
Dr. Robert Vandermeulen
Robert is a machine learning postdoc at TU Berlin, focusing mainly on deep anomaly detection and nonparametric statistics.
Arne Doll
After graduating in psycholinguistics, Arne gained experience in various management and leadership positions. He loves working with people and has a passion for successful communication depending on specific contexts and situations.
Christian Dürschmied
Christian works as an attorney for Eversheds Sutherland, primarily focusing on privacy and data protection law and their connections to cybersecurity and data law in general. He is a regular speaker on these topics and an author in professional publications.
Dr. Liel Glaser
Liel holds a PhD in theoretical physics from the Niels Bohr Institute in Copenhagen. At dida, Liel works as a machine learning scientist.
Bela Baganz
Bela studies organizational and behavioral psychology. At dida, he supports all areas of personal and organizational development.
Ksenia Nuykina
Ksenia studies innovation and entrepreneurship at IU International University of Applied Sciences in Berlin. At dida, Ksenia supports business development, market analysis, and customer outreach.
Anton Shemyakov
Anton studied applied mathematics and focuses on building robust machine learning systems. At dida, he works as a machine learning scientist.
Thanh Long Phan
Long studied mathematics (HU Berlin) with a focus on differential geometry and functional analysis. At dida, he works as a machine learning scientist, with a special focus on LLMs.
Dr. William Clemens
Will holds a PhD in string theory and quantum chromodynamics from the University of Southampton. At dida, he works as a machine learning scientist.
Julius Lauenstein
Julius has a background in engineering and management, is interested in the bridge between technology and its application, and is motivated by digitally competent governance. Julius is responsible for dida’s activities in the public sector.