Hello
I’m Koliya,

 

A Data Scientist

With over 6 years of industry experience in applied data analysis and modelling, I have sound knowledge of machine learning, statistics, and applied mathematics, along with advanced programming skills. I am a fluent communicator and problem-solver known for producing innovative solutions to clients’ problems.
Work Experience

Sonder

Full-stack Data Scientist

December 2021 – Current
Sydney, Australia

  • Implemented GPT-based models using Hugging Face and OpenAI frameworks to generate coherent and contextually relevant responses in natural language understanding tasks.
  • Designed and implemented data pipeline workflows using Databricks and Snowflake on AWS, ensuring seamless data integration, transformation, and analysis for large-scale chat transcript datasets and enabling efficient model training and inference.
  • Developed accurate and scalable machine learning and time series models for prediction and forecasting purposes, enabling data-driven decision-making and proactive planning across various domains.
  • Leveraged natural language processing techniques to classify chat transcripts into multiple categories, enabling effective organization and retrieval of information for data-driven insights.
  • Developed advanced topic extraction and semantic analysis models to unlock patterns and semantic structures within texts, extracting valuable insights for strategic decision-making and business optimization.
  • Implemented a machine learning-based search engine utilizing state-of-the-art techniques, allowing efficient retrieval of relevant chat transcripts based on user search queries, enhancing user experience and information accessibility.

Smart Infrastructure Facility

Data Science Researcher

August 2018 – November 2021
Wollongong, Australia

  • Transformed raw data into efficient database structures through meticulous analysis, ensuring optimal data organization and accessibility.
  • Executed a comprehensive range of analyses, encompassing exploratory, descriptive, predictive, and prescriptive segments, to derive meaningful insights and support informed decision-making.
  • Communicated data-driven insights and key findings effectively by creating compelling visual representations and artifacts.
  • Established standardized ETL/ELT processes and tools, enabling seamless data discovery and extraction and facilitating robust model development and evaluation.
  • Developed advanced machine learning, deep learning, and time series models in Python, R, and Microsoft SQL Server environments to enable accurate prediction and forecasting, driving data-based decision-making.
  • Utilized a diverse array of scientific computing, data analytics, machine learning, and deep learning libraries, such as NumPy, Pandas, Matplotlib, Plotly, Scikit-Learn, Apache Spark, Keras, and TensorFlow, to effectively analyze, model, and extract insights from data.

nCinga Innovations

Data Scientist and Business Analyst

November 2016 – July 2018
Colombo, Sri Lanka

  • Analyzed business requirements and translated them into practical solutions, focusing on the product perspective to address complex data engineering and data science challenges.
  • Contributed to the development of cutting-edge products aimed at building smart factories, leveraging an industrial IoT platform to empower manufacturers with real-time operational insights, predictive analytics, and smart automation capabilities.
  • Performed diverse responsibilities including requirements gathering, validation, process mapping, gap analysis, UML diagramming, and wireframe modeling, as well as data querying, analysis, and modeling. Produced comprehensive technical documentation such as software requirement specifications, client release notes, and user manuals.
  • Demonstrated proficiency in a wide range of technologies relevant to data engineering and data science, including Cassandra, Apache Kafka, Elasticsearch, MongoDB, RethinkDB, InfluxDB, Redis, SQLite, Java Spring Boot, Python, R, VMware, and AWS, ensuring efficient and scalable data processing, storage, and analysis.
  • Collaborated closely with product managers as a Scrum Master in an agile environment, effectively managing project workflows and ensuring smooth progress. Utilized JIRA for issue tracking, backlog management, and work logging, streamlining overall project management processes.
Personal Growth

Education

University of Wollongong
Doctor of Philosophy in Data Science
August 2018 – November 2021
Wollongong, Australia
University of Colombo
Bachelor of Science in Physical Science
February 2014 – November 2017
Colombo, Sri Lanka
Sri Lanka Institute of Information Technology
Bachelor of Science (Honors) in Information Technology
November 2012 – November 2016
Colombo, Sri Lanka

Technical Skills

Languages
Python, R, Java, C++, C#, C, MATLAB, HTML/CSS, JavaScript, AngularJS, React, AJAX, Bash
Databases
MSSQL, MySQL, Oracle, MongoDB, Cassandra, Hadoop, Kafka, Spark, Elasticsearch, InfluxDB
Business Intelligence
SAS Viya, Microsoft Power BI, Tableau, Microsoft Excel Analysis ToolPak
Cloud Services
Amazon, Azure, Google Cloud, VMware

Leadership / Extracurricular

IEEE Student Branch
Vice President
November 2019 – November 2021
University of Wollongong
Sri Lankan Student Society
President
August 2019 – September 2020
University of Wollongong
Colombo Rugby
University National Colors Holder
February 2014 – November 2017
University of Colombo
Projects and Research

Uncovering Trends in Healthcare Data

Python, R, Microsoft SQL Server
2018 – 2021

  • Focused on healthcare cost forecasting, leading indicators, and evaluation of the effectiveness of chronic disease management programs using ten years’ worth of health insurance claims data.
  • Used methods such as seasonal autoregressive integrated moving average (SARIMA) models, association rule mining, sequential rule mining, heuristic rules, k-means, hierarchical clustering, and propensity score matching.

Fog Assisted Industrial IoT Module for Apparel Industry

Java, RethinkDB, AngularJS
2017 – 2018

  • Created a cycle time capturing module for the apparel industry using fog computing and industrial IoT. It works offline to provide real-time analytics by tackling the most common issues in industrial IoT setups, such as high bandwidth consumption and high latency.
  • Designed mobile applications to gather data from operators and dashboards to display real-time analytics. Created a Java application to process messages sent between applications and manage the fog node’s local database to operate backend services.

nFactory Tracer

Java Play, Python, Elasticsearch, MongoDB, AngularJS, Node.js, SQLite
2017 – 2018

  • Developed an industrial IoT and analytics platform to automate processes and provide real-time insights on the go, increasing productivity and quality and reducing costs while retaining a motivated workforce.
  • Used MongoDB and Elasticsearch databases to store data collected from the devices on the production floor. Developed a dashboard application to present real-time analytics to operators and an analytics application to provide real-time insights and predictive analytics to higher-level management.
Hire Me

Supercharge Your Business with Expert Data Science Insights!

Ready to harness the power of data for your business? As a highly skilled data scientist, I offer the expertise you need to turn complex data into actionable insights. Let’s collaborate and unlock your data-driven success. Hire me today!

Teaching

Unlock Your Learning Potential: Experienced Tutor at Your Service!

Ready to take your learning to the next level? As a highly experienced tutor, I offer personalized instruction and guidance to help you achieve academic excellence. Let’s collaborate and unleash your full learning potential. Book a tutoring session today!