This job offer is unpublished.
EkimetricsPublished 12 months
Logo Ekimetrics

Stage 2025 - Data Science et Machine Learning/AI practitionner - Sujet : Causal Fairness Analysis: de la théorie à la pratique (H/F/N)

Juniors accepted
Internship
Data Scientist
Machine Learning
Deep learning
HTML5

Ekimetrics: Data Scientist Internship - Causal Fairness Analysis: From Theory to Practice 🤓

Ekimetrics, a leading data science and AI solutions provider, is seeking a passionate and talented intern to join their Causalité team in the Innovation department.

About Ekimetrics 📊

  • 400+ data science experts
  • 1000+ diverse projects for 350+ clients
  • 5 offices: Paris, Hong Kong, Shanghai, London, and New York
  • $1 billion in profits generated for clients since 2006

Our Mission 🌱

We aim to help our clients rethink their business models by aligning economic performance with sustainable goals through data science. To achieve this, we have a diverse team of experts, including Product Managers, Product Designers, Data Architects, Lead Tech, Data Engineers, DevOps Engineers, and Data Scientists.

Join the Data Science Team 🤝

Why are we recruiting?

Our Innovation department collaborates with industry and academic partners to conduct research in AI. We have several PhD experts in generative AI, deep learning, computer vision, time series, explainability, and causality. Two CIFRE theses are ongoing, and two more will start in 2025. Each expert leads a team responsible for testing state-of-the-art algorithms, adapting them to specific business problems, developing new methodologies or algorithms, and ensuring integration into Ekimetrics' industrial ecosystem.

Causalité Team

The Causalité team focuses on a wide range of causal tasks, including causal discovery, causal effect estimation, and counterfactual reasoning, through cutting-edge research in causal inference. We leverage the latest AI and statistical modeling techniques, such as machine learning, deep learning, and Bayesian modeling. By continuously testing new algorithms and auditing Ekimetrics' practices in causal reasoning, the Causalité team refines existing methodologies and develops new ones to address novel problems and expand Ekimetrics' expertise. This allows Ekimetrics to move beyond analyzing correlations and remain at the forefront of data-driven decision-making, empowering clients to utilize causal reasoning.

Internship Objectives: Causal Fairness Analysis: From Theory to Practice

Artificial intelligence (AI) is widely used in businesses and institutions as a decision-making tool in various applications, including sensitive sectors such as healthcare, politics, and finance. While AI systems can provide a competitive advantage, it is crucial to ensure that fundamental human rights are respected. As a result, AI systems are increasingly regulated, particularly in Europe with the AI Act, which requires all high-risk AI systems to assess their compliance and impact on fundamental rights, including the right to non-discrimination.

Fairness analysis in AI systems, commonly referred to as "fairness analysis," is based on two key concepts: "disparate treatment" and "disparate impact." Causal reasoning is a necessary perspective to consider, especially regarding disparate treatment and impact notions, although it is not the only way to derive a fairness analysis. Recently, Plečko and Bareinboim developed a framework for causal fairness analysis (CFA) based on these doctrines and translated them into mathematical language using counterfactual formulations. While the CFA framework is rich and theoretically sound, its practical implementation in real-use cases remains challenging (e.g., uncertainties related to the validity of assumptions and technical complexities in identifying and estimating counterfactuals in finite data regimes).

Therefore, this internship aims to implement and evaluate the CFA framework in real-use cases. The implementation will be done using Double Machine Learning (DML) approaches and integrating sensitivity analysis components. Regarding real-world applications, we will focus on the finance and utility sectors, starting with the COMPAS dataset.

This internship offers an opportunity to engage in cutting-edge AI research and potentially pursue a PhD program or a career as an AI Data Scientist with Ekimetrics.

Your Responsibilities

  • Develop knowledge in causal inference and fairness in AI
  • Conduct a comprehensive literature review of state-of-the-art models and techniques in causal AI fairness
  • Implement and experiment with double machine learning (DML) approaches enhanced by sensitivity analyses
  • Collaborate with your supervisor to design, implement, and evaluate prototype models on synthetic and real datasets
  • Share findings with internal teams and contribute to developing long-term strategies for responsible AI at Ekimetrics

Your Profile and Skills

  • Currently in the final year of a Master's degree or engineering degree, with a solid foundation in statistics, machine learning, and programming
  • Experience with Python and tabular data
  • Interest in multidisciplinary projects at the intersection of statistics, regulation, and economics
  • Strong analytical and problem-solving skills, with an interest in developing fair and responsible AI models
  • Curiosity and desire to invest in learning the fundamentals of causal inference
  • Fluent in English

Why Join Us?

  • Work in a dynamic start-up environment that encourages curiosity and excellence
  • Receive continuous training and development through a comprehensive learning experience that combines technical knowledge, soft skills, and practical experience
  • Be part of a welcoming and close-knit community that values fun and creativity
  • Imagine unexpected solutions and step out of your comfort zone to foster innovation
  • Enjoy access to a range of benefits, including a private gym, art exhibitions, video games, concerts, and CSR challenges

Our Recruitment Process

  • Technical test on HackerRank
  • HR interview with a Talent Acquisition specialist
  • Case study with a Consultant
  • Final interview with a Senior Consultant

We look forward to meeting you and learning more about your passion for data science. Apply now and let's explore the exciting world of AI together!

References:

[1] Section 2 - Article 8. Requirements for high-risk AI systems: Compliance with the requirements. Of Regulation (EU) 2024/1689 of the European Parliament and of the Council of 13 June 2024 laying down harmonised rules on artificial intelligence and amending Regulations (EC) No 300/2008, (EU) No 167/2013, (EU) No 168/2013, (EU) 2018/858, (EU) 2018/1139 and (EU) 2019/2144 and Directives 2014/90/EU, (EU) 2016/797 and (EU) 2020/1828 (Artificial Intelligence Act) https://eur-lex.europa.eu/eli/reg/2024/1689/oj
[2] Solon Barocas and Andrew D. Selbst. Big data’s disparate impact. California Law Review, 104-671. 2016

[3] Solon Barocas, Moritz Hardt, and Arvind Narayanan. Fairness and Machine Learning: Limitations and Opportunities. MIT Press. 2023

[4] Michael Moor. Causation in the Law. The Stanford Encyclopedia of Philosophy. 2019

[5] Joseph Y. Halpern. Actual Causality. MIT Press. 2016

[6] Drago Plečko and Elias Bareinboim. Causal Fairness Analysis. Foundations and Trends in Machine Learning: Vol. 17, No. 3, pp 1–238. 2024

[7] Victor Chernozhukov, Denis Chetverikov, Mert Demirer, Esther Duflo, Christian Hansen, Whitney Newey and James Robins. Double/Debiased Machine Learning for Treatment and Causal Parameters. ArXiv preprint. 2024. https://arxiv.org/abs/1608.00060
[8] Helmut Farbmacher, Martin Huber, Lukáš Lafférs, Henrika Langen and Martin Spindler. Causal mediation analysis with double machine learning. ArXiv preprint. 2021. https://arxiv.org/abs/2002.12710
[9] Philipp Bach, Victor Chernozhukov, Malte S. Kurz and Martin Spindler. DoubleML - An Object-Oriented Implementation of Double Machine Learning in Python, Journal of Machine Learning Research, 23(53): 1-6, 2022 https://www.jmlr.org/papers/v23/21-0862.html
[10] Maresa Schröder, Dennis Frauen, and Stefan Feuerriegel. Causal fairness under unobserved confounding: a neural sensitivity framework. 12th International Conference on Learning Representations, 2023.

[11] Julia Angwin, Jeff Larson, Surya Mattu, and Lauren Kirchner. Machine Bias: There’s software used across the country to predict future criminals. And it’s biased against blacks. ProPublica https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing. 2016

As an employer, Ekimetrics offers equal opportunities to all applicants regardless of gender, ethnicity, religion, sexual orientation, social status, disability, or age. Ekimetrics is committed to fostering an inclusive work environment that reflects diversity in its teams.

Reference :eki-data-lever+Ekimetrics-Stage-2025-Data-Science-et-Machine-LearningAI-practitionner-Sujet-Causal-Fairness-Analysis-de-la-theorie-a-la-pratique-HFN

Skills

Data
Machine Learning
Deep learning
D3
Backend
Python
Kong
Frontend
HTML5

Similar Jobs

brand cover
senior node.js fullstack and llm agent engineer
YeldaPermanent contract
YeldaPermanent contract
Paris, FR
& Remote
Hybrid remote
≥ 5 years experience
55k€ ➞ 70k€/year
Github
NodeJS
Machine Learning
1 day ago
brand cover
data scientist
VeepeePermanent contract
VeepeePermanent contract
100% Remote work
Juniors accepted
Machine Learning
Numpy
Pandas
3 hours ago
brand cover
senior data scientist
VeepeePermanent contract
VeepeePermanent contract
Paris, FR
& Remote
Hybrid remote
≥ 5 years experience
Machine Learning
Deep learning
Git
3 hours ago
brand cover
applied ai, technical lead, forward deployed ai engineer - emea
Mistral AIPermanent contract
Mistral AIPermanent contract
Amsterdam, NL& 5 others
& Remote
Hybrid remote
≥ 7 years experience
Azure
LangChain
Machine Learning
12 hours ago
brand cover
senior data engineer
NelsonPermanent contract
NelsonPermanent contract
Paris, FR
No remote work
Juniors accepted
FastAPI
Machine Learning
PostgreSQL
18 hours ago
brand cover
head of new business unit - cdi - paris
PapernestPermanent contract
PapernestPermanent contract
Paris, FR
& Remote
Hybrid remote
≥ 5 years experience
Make
Management
22 hours ago
brand cover
machine learning - end of studies internship - paris
StocklyInternship
StocklyInternship
Paris, FR
No remote work
Juniors accepted
Make
Pandas
Python
22 hours ago
brand cover
people scientist
MakiPeoplePermanent contract
MakiPeoplePermanent contract
Paris, FR
& Remote
Hybrid remote
Juniors accepted
Construct 3
Go
Machine Learning
22 hours ago
brand cover
ai engineer (genai & llm) – practice ia margo
MARGOPermanent contract
MARGOPermanent contract
Paris, FR
No remote work
Juniors accepted
50k€ ➞ 80k€/year
Docker
Intelligence Artificielle
Kubernetes
1 day ago