Some Technologies
- Python, SQL, Java, Scala, C#
- Pandas, NumPy, PySpark
- scikit-learn, TensorFlow, PyTorch
- AWS, Azure, GCP, IBM Cloud
- Python, R, Java, Scala, Julia
- Airflow, Jenkins, Luigi, Argonaut
- Google Analytics, Tag Manager
- Databricks, Snowflake, dbt
- NLTK, spaCy, BERT, GPT
- SageMaker, Vertex AI, Azure ML
- MySQL, MongoDB, Cassandra
- Redshift, BigQuery
Highlights
Certified in Data Engineering
Certified in Data Engineering by Databricks. Master's in Data Engineering, plus other formal training.
Certified in Cloud Solutions
AWS Cloud Solutions Architect, with additional certifications in AWS, Azure, and GCP specializations.
Certified in Big Data
Certified in Big Data with Databricks, Snowflake, AWS, and Azure. Trained in distributed computing (MapReduce, Spark, and Hadoop).
Data Pipelines
Engineering robust data pipelines that ensure the seamless movement, transformation, and storage of data from various sources to analytical platforms.
Cloud Solutions
Designing and implementing cloud solutions, enabling scalable, secure, and cost-effective data storage and computing across diverse platforms.
Big Data Technologies
Harnessing the power of big data technologies, like Hadoop and Spark, to process, analyze, and derive insights from large datasets efficiently and accurately.
Data Lake & Warehouse
Building and managing data lakes and warehouses, ensuring organized, secure, and accessible storage of structured and unstructured data for analytical purposes.
Real-Time Processing
Implementing real-time data processing and analytics solutions, enabling businesses to gain instant insights and respond to changing conditions dynamically and effectively.
Serverless Computing
Leveraging serverless computing to build and run applications without server management, ensuring efficient resource use and simplifying development, deployment, and scaling.
Selected Projects
- MySQL database development for micropaleontological papers of interest for hydrocarbon modeling (OERA/Nova Scotia Government).
- Pipeline for ingesting, processing, cleaning, transforming, and analyzing textual data (NLP model for content analysis) from client reviews and other unstructured feedback (HMR).
- Airflow DAG development and management for scheduled data ingestion, with solutions tailored to specific clients (HMR).
- Under construction.