Different methods from the field of NLP helped us to create a software that spots errors in legal contracts
By Emilius Richter • July 18th, 2022
For a software provider, the project proposal is the first step toward meeting the needs of the customer. In this article, I will describe the most important modules in machine learning project proposals.
By Emilius Richter • May 21st, 2021
We discuss what questions should be considered and answered up front to launch a successful machine learning software project.
By Angela Maennel • April 26th, 2021
Here I show how open-domain question answering systems work and how they can enhance search engines. We will have closer look at one specific type of system, DrQA.
By How to extract text from PDF files • August 17th, 2020
In the following I want to present the open-source Python PDF tools PyPDF2, pdfminer and PyMuPDF that can be used to extract text from PDF files. I will compare their features and point out some drawbacks.
By Fabian Gringel • March 30th, 2020
In this blog post I present the three best free text annotation tools for the manual labeling of documents in NLP projects. You will learn how to install, configure and use them and find out which one of them suits your purposes best. The tools I'm going to present are brat, doccano and INCEpTION.
By Fabian Gringel • January 20th, 2020
This blog post presents the most common OCR tools, shows how to use them and assesses their strengths and weaknesses. In the end you will be able to choose and apply an OCR tool suiting the needs of your project.
By Temporal convolutional networks for sequence modeling • January 6th, 2020
This blog post presents a simple but powerful convolutional approach for sequences which is called Temporal Convolutional Network (TCN), originally proposed in Bai 2018, and tells you where to find implementations for Pytorch, Keras and Tensorflow.