Available for opportunities

Mujtaba Saqib

Data Engineer & Analytics Specialist

ETL Pipelines · Data Warehousing · AI Systems

01

About

I am a Data Engineer and Analytics Specialist with experience building scalable ETL pipelines, data warehouses, and analytical dashboards.

I specialize in transforming complex datasets into actionable insights using tools like Airflow, Snowflake, and dbt. Alongside data engineering, I have developed machine learning and AI-based solutions for real-world problems.

I focus on delivering efficient, production-ready data systems.

0+ Projects Built
0+ ETL Pipelines
0+ Technologies
02

Experience

Sep 2025 — Present

Data & AI Engineer Intern

UnitZero
  • Designing and implementing data pipelines for financial data ingestion
  • Developing AI agents for market analysis and information retrieval
  • Optimizing LLM workflows and evaluating performance on domain-specific datasets
Data Pipelines AI Agents LLMs
Aug 2025

Data Engineer & Analytics Intern

Data Pilot
  • Built ETL pipelines using Mage AI for structured data processing
  • Implemented data validation and quality checks using ClickHouse
  • Assisted in optimizing analytics workflows in a production-like environment
Mage AI ClickHouse ETL
03

Projects

US Earthquake ETL Pipeline

Built an automated ETL pipeline for earthquake data ingestion and transformation. Processed real-time and historical seismic data for analytics.

Python ETL APIs Data Pipelines

Berlin Bike Theft Data Engineering

Designed a complete ETL pipeline for theft data analysis. Built transformation layers and dashboards for insights.

Airflow dbt Snowflake Power BI

Uber Data Engineering & Analytics

Processed ride data and built analytical datasets. Generated insights on ride patterns and operational trends.

Python SQL Data Modeling

Olympics Dashboard Project

Developed data pipelines and dashboards for Olympic data analysis. Visualized performance trends and country-wise statistics.

Data Engineering BI Dashboards

Loan Portfolio & Borrower Risk Insights

Built ETL pipelines and risk segmentation models. Developed dashboards for credit risk and borrower analysis.

Airflow dbt Snowflake Power BI

Crypto Currency Time Series Analysis

Built forecasting models for cryptocurrency trends. Applied time-series analysis for predictive insights.

Python Time Series ML

Medicine Recommendation System

Developed a system to recommend medicines based on symptoms. Applied ML techniques for classification and prediction.

Machine Learning NLP

Gemini AI Projects

Built multiple projects leveraging Google Gemini APIs. Explored prompt engineering and AI integrations.

Generative AI LLMs

HEC Degree Requirement Verifier

Built a system to validate degree requirements automatically. Implemented logic-based validation workflows.

Python Rule-Based Systems

AI-Based Movie Recommendation

Developed a movie recommendation engine using AI algorithms. Implemented similarity-based and predictive models.

Machine Learning Algorithms
04

Skills

Languages

Python SQL PySpark Java C HTML CSS JavaScript

Data Engineering

ETL Pipelines Data Warehousing Data Modeling Batch & Stream Processing

Tools & Platforms

Airflow Mage AI dbt Snowflake PostgreSQL ClickHouse Docker AWS Apache Flink Dremio

Data Science & AI

Machine Learning Deep Learning Neural Networks NLP Computer Vision Time Series Analysis LLMs & Generative AI Agentic AI

BI & Visualization

Power BI Tableau

DevOps

Docker CI/CD Git Linux
05

Get in Touch

Interested in working together or have a question?
Feel free to reach out — I'm always open to discussing data engineering, AI, and new opportunities.

Say Hello