Enterprise ETL Pipeline Optimization
Architected medallion architecture data pipelines processing 50+ tables with Python/PySpark. Implemented schema mappings, filters, and join conditions for Parquet/Delta formats.
Transforming raw data into actionable insights through scalable ETL pipelines and advanced analytics solutions
📍 Bengaluru, Karnataka
Data Engineer with 1+ years of experience at Tredence Inc., specializing in Azure cloud ecosystem and large-scale ETL pipeline optimization. Successfully modified 50+ table schemas and achieved 60% runtime reduction in critical data workflows. Former Data Science Intern with expertise in CNN-based audio processing and computer vision solutions. Certified Databricks Data Engineer Associate with proven track record in medallion architecture implementation.
Architected medallion architecture data pipelines processing 50+ tables with Python/PySpark. Implemented schema mappings, filters, and join conditions for Parquet/Delta formats.
Developed CNN-based speech classifier with custom audio dataset. Extracted MFCC, delta, CQT spectrogram features for temporal sequence analysis. Achieved successful digit classification as foundation for multimodal system.
95% detection accuracy computer vision solution using CNN classifier and 68-point facial landmark localization. Outperformed existing solutions in asymmetric flash scenarios.
Processed compressed parquet datasets for neutrino detection insights. Created multi-dimensional 3D Earth visualization for particle location mapping.
Ready to tackle your next data challenge? I'm available for full-time opportunities, consulting projects, and technical collaborations.