Hello, I'm

Muhammad Syafiq Farhan

Analytics Engineer

Building production-grade data pipelines, ETL systems, and regulatory reporting infrastructure. Turning raw data into reliable, scalable foundations.

About Me

Analytics Engineer with 1+ year of experience building production-grade data pipelines, ETL systems, and regulatory reporting infrastructure in the Malaysian financial services sector. Proficient in Python, PySpark, Databricks, Delta Lake, dbt, and Airflow. Experienced in environments regulated by Bank Negara Malaysia (BNM), with exposure to CCRIS reporting, enterprise Alteryx infrastructure, and data migration projects across financial institutions.

Open to Work
Kuala Lumpur, Malaysia
GMT+8

Experience

September 2025 – Present

Analytics Engineer

OR Technologies (ORTECH) • Kuala Lumpur, Malaysia

  • Migrated enterprise Alteryx Server infrastructure with zero workflow loss across 40+ production workflows, maintaining continuity of critical regulatory reporting pipelines.
  • Designed and optimised ETL pipelines supporting enterprise regulatory reporting for a BNM-regulated financial institution, improving processing reliability and auditability.
  • Resolved 20+ production data issues through root-cause analysis of pipeline logic, reducing recurring failures that affected downstream reporting cycles.
  • Rebuilt inefficient Alteryx workflows in collaboration with client stakeholders, reducing data processing time and improving system stability across reporting runs.
  • Built and maintained automated tax reporting pipelines, reducing manual workload by ~70% and enabling consistent, scheduled execution.
  • Improved pipeline documentation and logic consistency to support data governance and audit readiness under BNM compliance requirements.

October 2024 – April 2025

Artificial Intelligence Researcher

TM Research & Development (TMRND) • Selangor, Malaysia

  • Built end-to-end Python data pipelines ingesting from CSV and PostgreSQL sources, implementing structured schema validation to ensure data quality across pipeline stages.
  • Developed REST APIs using FastAPI to expose processed health metrics to downstream frontend and analytics applications.
  • Integrated AI-generated recommendations using LangChain and the OpenAI API, delivering contextual, prompt-engineered outputs for staff health reporting.
  • Collaborated across frontend, backend, and database teams to deliver an integrated health analytics platform from ingestion to API-based consumption.

Education

Bachelor of Computer Science (Honours)

Data Science and Computational Intelligence

International Islamic University Malaysia (IIUM)

Graduated August 2025

🥇 1st Place - InnovaTex 2024

Final Year Project: "Deep Learning-based Evaluation of the Relationship between Mandibular Third Molar and Mandibular Canal on CBCT"

Skills

Languages

Python, SQL

Data Engineering

PySpark, dbt, Apache Airflow, ETL Development, Medallion Architecture

Platforms & Storage

Databricks, PostgreSQL, Delta Lake, Docker, BigQuery

ML & Analytics

scikit-learn, SHAP, Plotly

APIs & Backend

FastAPI, REST APIs, LangChain, OpenAI API

Visualisation

Databricks AI/BI (Lakeview), Streamlit, Tableau

Tools

Git, GitHub, Alteryx

Let's Connect

Got a project in mind or just want to chat? Hit me up! 🚀