Identity Fraud Detection Pipeline (Real-time)
Designed a real-time fraud detection pipeline ingesting identity events via Kafka, executing PySpark operations, and writing outputs to Delta Lake.
Confidence S. is an accomplished data engineer with expertise in Python, SQL, and Spark, and a proven track record designing ELT and ETL pipelines across cloud platforms such as GCP (BigQuery, Cloud Storage, Dataflow), AWS (Redshift, S3, Glue, Lambda, EMR), and Azure (Synapse Analytics, Databricks, ADF, ADLS Gen2), as well as Microsoft Fabric. His technical proficiency extends to technologies like Apache Spark, Apache Kafka, RabbitMQ, Apache Airflow, DBT, Azure Data Factory, SSIS, Databricks Workflows, and Medallion…
Designed a real-time fraud detection pipeline ingesting identity events via Kafka, executing PySpark operations, and writing outputs to Delta Lake.
Delivered a scalable data ingestion platform capable of processing various file types stored in AWS S3 into BigQuery using Airflow and Docker.
Lead the migration to Google BigQuery, engineered partitioning and clustering of tables, and constructed Airflow DAGs for automated loads.

Confidence is available for hire
Schedule an interview14 days risk-free trial · No commitments · We handle contracts and payroll
Confidence S. is an accomplished data engineer with expertise in Python, SQL, and Spark, and a proven track record designing ELT and ETL pipelines across cloud platforms such as GCP (BigQuery, Cloud Storage, Dataflow), AWS (Redshift, S3, Glue, Lambda, EMR), and Azure (Synapse Analytics, Databricks, ADF, ADLS Gen2), as well as Microsoft Fabric. His technical proficiency extends to technologies like Apache Spark, Apache Kafka, RabbitMQ, Apache Airflow, DBT, Azure Data Factory, SSIS, Databricks Workflows, and Medallion…
Designed a real-time fraud detection pipeline ingesting identity events via Kafka, executing PySpark operations, and writing outputs to Delta Lake.
Delivered a scalable data ingestion platform capable of processing various file types stored in AWS S3 into BigQuery using Airflow and Docker.
Lead the migration to Google BigQuery, engineered partitioning and clustering of tables, and constructed Airflow DAGs for automated loads.
Other vetted developers with similar skills and experience