CV

A data scientist/machine learning engineer with a Ph.D. in Computer Science, specialized in leading and development of cutting-edge projects. Utilizing advanced MLops, statistical modeling, machine learning, and data analysis techniques to extract actionable insights and implement data-driven solutions in commercialized environments. Committed to data science and innovation in academia, actively publishing in prestigious conferences and journals like NeurIPS and Pattern Recognition.

Work experience

Senior Data Scientist, AGL Energy (October 2022 - April 2024)

Duties includes:

Achievements:

Data Scientist Researcher, RMIT University/ Ford Company (June 2021 - October 2022)

Duties included: Worked in a multi-disciplinary team of 5, including engineers from Ford company in the US and researchers from RMIT University, in implementing a versatile quality control technology applicable to various problems.
Achievements: Increased the company revenue via efficient detection of the defects in industrial parts using domain adaptation method utilizing ResNet50 as the deep neural network backbone, PyTorch, OpenCV, Torch-vision, Scikit-learn

Machine Learning Specialist, Keylead Health (January 2021 - June 2021)

Duties included: Collaborated in a 5-member team comprising data engineers and clinicians to analyze healthcare data, aiming to enhance the clinical trial and medical research processes.

Skills

Project Information

Selected Projects

ProjectDateLocationDescription
LLM-based AI Generated RBAC Roles03/2024Melbourne, VICAutomatically generated RBAC roles within Azure, employing a RAGbased Large Language Model (LLM), open-source vector databases (Faiss), GPT3.5, and Langchain in the process. Enhanced security by generating RBAC roles based on user activity, resulting in pruning the overestimated roles.
Document Understanding Using Multi-Modal Transformers01/2024Melbourne, VICI employed Donut, a multi-modal transformer, for OCR-free document understanding and precise information extraction. Additionally, I applied advanced prompt engineering techniques such as the cascade of thoughts and Iterative Refinement to enhance accuracy in this project. Increased accuracy in document knowledge extraction by 20% and overcame the OCR problems.
Auto Topic Generation11/2023Melbourne, VICDeveloped and deployed a topic generation and categorization model for customer conversations using GPT-3.5 and K-means clustering. Scaled the model in terms of cost and computation for 300k documents per month with a mean of 4000 tokens per conversation
Email Classification - Fine-tuning Foundational LLMs09/2023Melbourne, VICFine-tuned foundational models and transformers (Bert and Roberta) using methods such as transfer learning, on tasks such as email classification, sentiment analysis, and MTTR regression. Successful deployment of the email classification technique contributed to a 40% reduction in service desk time allocation.
Churn Model Using LSTM06/2023Melbourne, VICChurn model leveraging LSTM and LGBM models. Additionally, established a monitoring pipeline to track data and concept drift. Mitigated a 3% churn rate by deploying refactored churn models in production.
Parallel Programming for Model Improvement03/2023Melbourne, VICImproved a detection model by restructuring it to leverage parallel libraries, specifically Dask, for optimizing scalability. Reduced the runtime of the model by 94%.
Call volume prediction using Lstm and attention10/2022Melbourne, VICDesigned and developed a time series forecasting model featuring a multi-head attention layer and an LSTM head Successfully forecasted the call volume resulting in better budgeting for the call center.

Education

Publications

Recommandations
For more Info. check my LinkedIn account: (Link)