Data Engineer with 5+ years of professional experience helping businesses harness the power of their data to drive growth and success in industries like fintech and retail using Python, SQL, Databricks, Airflow, BigQuery, and Cloud Solutions. Strong understanding of the entire data pipeline and a sharp capability for translating business requirements into designing and implementing efficient and scalable solutions. Relevant projects include developing a Databricks workflow for a leading Brazilian payment solutions company using Spark and Python to automate Delta table cleanup that reduced GCS storage from 300TB to 123TB, and replacing a 2-hour Alteryx workflow with a 15-minute Airflow DAG using Python, SQL, and dbt to extract and incrementally transform Brazil Central Bank data, loading it into BigQuery.
Technologies: Databricks, BigQuery, GCS, Airflow, Docker, CI/CD, GitHub Actions, SQL, Python, PySpark, dbt, Datastream, Datafusion, Spark, Terraform, Cloud Functions, Amazon S3, Git
Technologies: SQL, Python, PySpark, Databricks, Azure Data Factory, Databricks, SQL Server, Power BI, Azure DevOps, CI/CD, Azure DevOps, Git, Azure Functions
Technologies: Redshift, SQL, Python, Amazon S3, Power BI, Looker
Technologies: SQL, Python, Airflow, BigQuery, GCS, Airbyte, Datastream, Git, CI/CD, Dataform, Datastream, Dataplex
IBM Data Science Professional Certificate