Sahil Gundu
@sahilgundu
Senior Data Engineer, GCP, Azure, Dataflow, Dataproc, PySpark, Databricks, Python, SQL
Language Breakdown
Lines of code distribution across 9 owned repositories
I-Shaped Developer
I-shaped: Specialist with deep expertise in Jupyter Notebook
Collaboration Network
Global Impact visualization
Repos: 9
PRs: 0
Growth: +18%
Top Collaborators
No collaborator data yet.
Coding Streak
Contribution activity over the past year
Top Repositories
Docs-only case study of a compliance & anomaly detection platform on Azure + Databricks (Streaming ETL + Batch ELT + ML).
Research on human-in-the-loop architectures for safe and explainable AI. Includes the Learning-Free Regulator (LFR) concept.
Docs-only case study – Compliance Reporting data platform on Azure for a Big-4 Audit & Consulting Firm (BFSI, healthcare-style datasets) using Streaming Pipeline (ETL) + Batch Pipeline (ELT) with Snowflake, Synapse, ADF, Power BI, ML risk scoring, DQ, governance, and lineage.
Sanitized ML-based risk scoring pipeline for a Tier-1 UK Retail Bank (GCP + BigQuery ML). Includes Batch/ETL ingestion, feature engineering, BQML training, scoring workflows, governance, lineage, and runbooks. No client code/data.
GCP-based Regulatory Reporting Lakehouse, Tier-1 Swiss Bank (Simulated Case Study): documentation-only repo illustrating a cloud-native data lakehouse architecture for regulatory reporting on Google Cloud Platform (GCS + BigQuery + Dataflow + Composer). Includes ADRs, runbooks, and compliance data contracts.
Sanitized case study — Tier-1 UK bank FX streaming on GCP (Pub/Sub → Dataflow → BigQuery, Composer, VPC-SC/CMEK). Patterns only; no client code/data.
python_de_learning_2025
Open Source Impact
Contributions to external projects
No external contributions found.