Passionate about building machine learning pipelines, exploring NLP and computer vision, and solving real-world problems with statistical rigor and deep learning.
I'm a data scientist and machine learning engineer with a strong foundation in statistics, probability, and experimental design. I believe that good ML starts with understanding the data — and I'm equally comfortable building regression models in R, training YOLO for computer vision, or experimenting with transformer architectures for NLP.
My interests span the full ML spectrum: from feature selection and hyperparameter tuning to cutting-edge retrieval-augmented generation (RAG) and vector databases. I'm always reading research papers and looking for ways to turn theory into practice.
Six core areas of my ML skill set — from statistics to deep learning.
Visual design inspirations that inform the neo-brutalist aesthetic — bold typography, hard borders, and editorial layouts.




Course projects and personal work spanning statistics, computer vision, and NLP.
Built a multi-linear regression model in R to predict population age distributions. Applied feature selection techniques and validated using statistical methods including ANOVA and residual analysis.
Developed a multi-modal system combining YOLO for object detection and CLIP for image-text alignment. Integrated audio and tabular data streams for comprehensive analysis.
End-to-end NLP pipeline with tokenization, embedding, transformer models, and RAG integration using vector databases.
Modular ML pipeline with automated feature selection, hyperparameter tuning, cross-validation, and model evaluation.
Implemented neural network fundamentals including backpropagation, activation functions, and loss optimization from the ground up.
Completed coursework in statistics, probability, and machine learning methods.
Core probability theory, distributions, hypothesis testing, confidence intervals, and Bayesian inference.
Advanced statistical modeling including regression, ANOVA, experimental design, and feature selection techniques.
Design of experiments, A/B testing methodology, confounding variables, and causal inference.
OOP principles in Python and Java — encapsulation, inheritance, polymorphism, and design patterns.
Relational database design, SQL, normalization, indexing, and basic query optimization.
Course project: predicted population age using MLR in R with feature selection and model diagnostics.
Building retrieval-augmented generation pipelines with vector embeddings, dense retrieval, and knowledge base integration.
Deep understanding of attention mechanisms, BERT/GPT architectures, fine-tuning strategies, and model interpretability.
Combining vision, language, and audio — inspired by CLIP and YOLO — for richer representation learning.
Have a project, research idea, or just want to chat about ML? Drop me a message.