I'm an AI Software Engineer with 2+ years of experience building production-ready Generative AI systems and resilient backend services on AWS, currently based in Colombo, Sri Lanka.
I specialize in designing RAG pipelines, orchestrating LLM workflows using LangChain & LangGraph, and deploying scalable REST APIs with Python (FastAPI). I'm passionate about bridging complex AI capabilities with real-world enterprise applications.
Currently pursuing a BSc in Information Systems at the University of Colombo School of Computing, with hands-on experience in AI safety evaluation, MLOps frameworks, and cloud-native infrastructure.
Automated document processing pipeline using Python, Tesseract OCR & OpenCV to digitize scanned National Identity Cards. Implements Regex & Named Entity Recognition to parse unstructured OCR output into structured JSON for identity verification.
AI prototype using Transformers (DistilBERT) for aspect-based sentiment analysis — extracting structured business insights from unstructured user text with multi-label classification and loss-function optimization.
End-to-end data pipeline orchestrated with Apache Airflow (Docker) for automated daily processing. Implements data extraction from AWS S3, transformation with Python/Pandas, quality checks, and structured loading to support downstream analytics and ML workloads.
Real-time logistics and scheduling automation for the Sri Lankan railway network. Features a concurrent seat reservation engine, automated warrant processing, and an FSM-inspired asset tracking module with automated conflict resolution logic.
A comprehensive breakdown of SLM architecture — covering Sparse Attention, Knowledge Distillation, LoRA fine-tuning, and real-world edge deployment scenarios. Featured in Towards AI with 54 claps.
Deep technical analysis of Alibaba's QwQ-32B reasoning model — benchmarked against DeepSeek-R1, OpenAI o1, and Claude 3.5 Sonnet. Covers RL training, efficiency gains, and real-world deployment considerations.
Technical breakdown of the January 2025 DeepSeek ClickHouse database breach — covering the misconfiguration, exposed data types, implications for AI infrastructure security, and actionable lessons for the industry.
A two-part beginner-to-intermediate guide demystifying deep learning — covering neural network architectures (CNNs, RNNs, LSTMs, Transformers, GANs), activation functions, loss functions, and optimization algorithms in plain English.
Designing and deploying LLM-powered applications and RAG pipelines using LangChain and vector embeddings. Building scalable AI-serving architectures on AWS (Lambda, EC2) with FastAPI. Contributing to AI safety evaluation — implementing hallucination detection metrics and MLOps practices in an Agile enterprise environment.
Built robust RESTful APIs using Python (FastAPI) integrating backend services with frontend platforms and external data ecosystems. Managed cloud infrastructure on AWS, implemented monitoring and alerting systems, and optimized async database logic for high-traffic applications.
Validated NLP datasets for low-resource languages (Tamil/English) to support emotion detection models. Built data preprocessing scripts to automate the cleaning and annotation of 1,000+ text entries and video datasets for computer vision training.
Interested in collaborating or have a project in mind? Feel free to reach out!