About the Project:

Join our team as a Machine Learning Engineer and play a key role in the development and maintenance of our AI-based systems. Our projects predominantly focus on text and speech data, emphasizing Natural Language Processing (NLP) and speech processing, including automatic speech recognition (ASR), speech analysis, and denoising systems.

As an ML Engineer, your responsibilities encompass a broad spectrum of tasks:

- Conceiving and developing end-to-end AI systems, from data collection and processing to module development, testing, and post-deployment methods.
- Defining and implementing AI systems, creating data pipelines, and incorporating features based on infrastructure and performance requirements.
- Establishing a benchmarking system to compare successive versions of a system and select the best one, using tools like Hydra, Metaflow/MLflow, Weights & Biases, etc.
- Defining data collection, annotation, and processing protocols, developing processing pipelines, and creating datasets for specific applications.
- Managing data collection and processing for training and testing machine learning-based systems.
- Analyzing data to optimize products, adapting models, reconsidering modalities, and creating new user behavior metrics.
- (Optional) Contributing to publications such as blogs, scientific articles, and interviews.

What You Will Use:

- Python
- Machine learning libraries like TensorFlow, PyTorch, ONNX, and Hugging Face's Transformers library
- Database technologies like MongoDB and SQL
- MLOps platforms
- Containerization tools like Docker
- Version control tools: Git/GitHub

We Are Looking for Someone With:

Required Skills:

- Excellent proficiency in Python coding.
- Proven experience in implementing machine learning systems.
- Ability to work with popular machine learning frameworks and libraries such as TensorFlow, PyTorch, ONNX, and Hugging Face's Transformers library, as well as MLOps platforms. Proficiency with both TensorFlow and PyTorch is preferred.

Preferred Skills:

- Background in NLP or speech processing (ideally both).
- Strong theoretical foundation.
- Experience with NLP libraries and generative models.
- Familiarity with ASR, speaker recognition, speech synthesis, and speaker diarization.
- Proficiency in Docker for efficient deployment.
- Knowledge of MLOps methodology and implementation.
- Familiarity with database technologies (SQL or NoSQL) for effective data management.

At Diabolocom, diversity and inclusion are in our DNA. All qualified applicants will receive equal consideration for employment without regard to color, language, religion, sex, sexual orientation, gender identity, national or social origin, opinion, disability, or age.
