International Consortium of Investigative Journalists: Machine Learning Engineer
We are looking for a qualified Machine Learning (ML) Engineer to join ICIJ’s technology team, working closely with our ML engineer in coordination with full stack and system engineers as well as a UX/UI designer. This is a full-time position on a 12-month contract.
With the help of your teammates, you will be responsible for designing, building and shipping ML related tasks and pipelines for processing from a large amount of documents through our open source software Datashare. Datashare ML pipelines cover a wide range of ML topics spanning Natural Language Understanding (NLU), computer vision and speech-processing with privacy, quality and scalability in mind.
We use vision algorithms in Optical Character Recognition (OCR) to extract text from documents, to detect passports in scanned documents, or to understand document layouts and better extract their content. We plan to leverage Automatic Speech Recognition (ASR) to transcribe various audio and video files. We use NLU to extract entities from documents and plan to perform entity resolution between documents. We intend to improve document search and retrieval, potentially using leveraging graphs and vector-embeddings.
ICIJ’s tech team collaborates with academic partners on cutting-edge ML topics; as an ML engineer you will also be actively involved in R&D projects.