Administration of the Databricks platform across the company’s entire environment, managing access, optimizing cluster performance, and controlling costs for each project.
Maintenance and development of automation, bots and dashboards to monitor usage and costs into databricks environment (notebook, DBSQL, workflows, DLT).
Participation in projects focused on access permissions and auditing to ensure data security using Unity Catalog and AWS - IAM.
Development in SQL and Python, with a focus on PySpark and Pandas.
ETL development for tables and files using Databricks, Rancher, Airflow, and AWS, working with the medallion architecture, using delta and parquet.
Providing support to Databricks users.
CI/CD for bots using TensorFlow.
Code versioning and code review via GitHub.
Local Data Quality testing and debugging using Databricks, VS Code, Rancher, and Airflow.
Experience working on Kanban-based projects.
Maintenance and support of databases such as SAS and SingleStore.
Data Engineer
Grupo Boticario
05.2022 - 09.2023
Working with clients using Scrum methodology in two-week sprints.
Development in Python and SQL to create tables and automation according to business requirements, implementing medallion architecture via Databricks and orchestrating workflows with Azure Data Factory, including Data Quality tests before deployment.
Automated file delivery through the platform.
Version control management using Azure DevOps and GitHub.
Experience as a QA tester.
Data Engineer
Seara Alimentos LTDA
12.2021 - 03.2022
Collaboration on data science projects for ETL processing of tables.
Data manipulation using SQL and Python. Working with SQL databases, including Oracle.
Automation of SAS Business Objects database tables to improve usability and efficiency for end users, using GCP and Databricks.
BI Analyst
Seara Alimentos LTDA
01.2020 - 12.2021
Evaluated the success of implemented BI strategies through rigorous post-implementation analysis, ensuring alignment with business objectives.
Analysis of sales values across different business sectors, maintaining and developing new insights and dashboards for decision-making using Excel VBA and QlikView.
Annual monitoring of the company’s budget planning, creating presentations as requested by the leadership.
Coordination with the commercial team to rank, monitor, and support sales volume management.
Internship
Seara Alimentos LTDA
10.2019 - 12.2020
Supported staff members in their daily tasks, reducing workload burden and allowing for increased focus on higher-priority assignments.
Gained valuable experience working within the food industry, applying learned concepts directly into relevant work situations.
Sorted and organized files, spreadsheets, and reports.
Gained hands-on experience in various software programs, as Excel VBA, QlikView and SQL and Python development, increasing proficiency and expanding technical skill set.
User support to QlikView environment.
Education
Bachelor’s In Material Science Engineer - Engineering
Universidade Presbiteriana Mackenzie
Sao Paulo
12.2021
Skills
ETL Development
Data Warehousing & Data Lakehousing
Data Modeling
Data Pipeline Design
Python, PySpark, SQL
Databricks
CI/CD
Data Science
Cloud Experience: AWS & Azure
Certification
Databricks Academy Accreditation- Lakehouse Fundamentals
Databricks Academy Accreditation - Platform Administrator
Microsoft Certified: Azure AI Fundamentals AI-900
Microsoft Certified: Azure Fundamentals AZ-900
Microsoft Certified: Azure Data Fundamentals DP-900
Microsoft Certified: Azure Data Engineer Associate DP-203
Timeline
Data Engineer
PicPay
10.2023 - Current
Data Engineer
Grupo Boticario
05.2022 - 09.2023
Data Engineer
Seara Alimentos LTDA
12.2021 - 03.2022
BI Analyst
Seara Alimentos LTDA
01.2020 - 12.2021
Internship
Seara Alimentos LTDA
10.2019 - 12.2020
Bachelor’s In Material Science Engineer - Engineering