About Me

Mahdi Karabiben

Hi there 👋 I'm a data engineer based in Paris with more than six years of experience designing and building petabyte-scale data platforms. I'm very passionate about open-source projects and enjoy working with data and designing scalable solutions.

My Career

Zendesk

Working on Zendesk's core data platform.

Oct. 2022 - Present
Staff Data Engineer (promoted)

Zendesk

Building data products within the Enterprise Data & Analytics org. (Offering internal teams access to product data at scale, contributing to the design and implementation of data domains, enhancing the existing data stack)

Nov. 2021 - Oct. 2022
Senior Data Engineer

FactSet

Worked on multiple projects on the AWS cloud (desiged and implemented cloud-based data lakes, data pipelines, and ML Ops pipelines). Designed and implemented an Apache Superset environment used by multiple teams within the company.

Apr. 2020 - Nov. 2021
Data Engineer

Crédit Agricole CIB

Worked as a consultant with CA-CIB's data teams on implementing a Big Data architecture for the FRTB regulations. The project consisted of building a Hadoop data lake to store all of the bank's markets risk data.

Dec. 2018 - Apr. 2020
Big Data Consultant

Numberly

Worked at Numberly (1000mercis group) as a data engineer for my end-of-studies project. The project consisted of building data pipelines for a Big Data architecture using Apache Spark (PySpark), Apache Airflow, and Apache Zeppelin. The data was mainly stored on MSSQL and Apache Hive (on top of Apache Hadoop).

Jan. 2018 - Jul. 2018
Big Data Developer

Democracy International

Worked at Democracy International on a campaign to deliver useful information to Tunisian voters to motivate them to participate in the country's upcoming municipal elections. Turned the data provided by Tunisian ministries and associations into interactive visualizations that helped voters get a better understanding of their municipalities' data.

May. 2017 - Jan. 2018
Data Visualization Specialist

My Skills

My Projects

Author of End-to-End Batch Data Pipeline with Spark

I authored a four-project series for Manning Publications on building an end-to-end data pipeline. The series discusses technologies like Spark, Delta Lake, and Superset.

Celer, a traffic and navigation app

Conceived and implemented a three-layer data architecture (Lambda Architecture) for a traffic and navigation Android app.

An engine that generates notebook-based dynamic dashboards

Built a python-based engine that generates dynamic dashboards via a search page.

...   ...

Visualizing Electoral Data

Created a set of data visualizations for the 2018 tunisian municipal elections.

...   ...

An interactive dashboard to visualize the impact of natural disasters

Participated in designing and implementing an interactive dashboard that showcases the human and financial impacts of various types of disasters on the world. The dashboard was built in collaboration with the World Bank and is intended to be used by the World Bank's analysts.

...   ...

Monumental 3D Projection Mapping

Participated in the different phases of conceiving and creating a monumental 3D projection mapping during the 10th edition of the JAM SESSION, an event organized at INSAT by Ciné-Radio INSAT. Tools used are Blender, Resolume Arena 5, and Adobe After Effects.