Kickstart 2021 with an AWS certification to validate your cloud expertise

Photo by JESHOOTS.COM on Unsplash


Introduction to AWS Certifications
Step 1: AWS E-learning Modules
Step 2: AWS Whitepapers and Website
Step 3: YouTube Practice Questions
Optional Resources
About the Examination

Introduction to AWS Certifications

Amazon Web Services (AWS) is one of the most popular cloud platforms out there, offering an extensive suite of fully featured services from data centers around the globe. In line with these services, AWS offers certifications to help professionals validate their in-demand cloud computing skills.

Reviewing the controversial implementation of Video Assistant Referees in English football using Python

Image by chiraphat phaungmala from Pixabay

Match Highlights

Kickoff: Background & Context
1st half: Data acquisition and preparation
2nd half: Analysis and Insights
Full-time whistle: Conclusion


  • Son Heung-Min was most frequently involved in VAR overturn incidents
  • VAR incidents tend to spike in the middle of each half and peak at the end of each half
  • No apparent bias of VAR decisions in favor of the Big 6 teams
  • Amongst teams present in both EPL seasons where VAR was implemented, Brighton had the highest proportion of overturn decisions in their favor (67.9%), while West Bromwich Albion had the lowest (25.0%)
  • Link to GitHub repo of this project here

Thoughts and Theory

Keep your neural network alive by understanding the downsides of ReLU

Photo by Solen Feyissa on Unsplash


1) What is ReLU and what are its advantages?
2) What’s the Dying ReLU problem?
3) What causes the Dying ReLU problem?
4) How to solve the Dying ReLU problem?

Activation functions are mathematical equations that define how the weighted sum of the input of a neural node is transformed into an output, and they are key parts of an artificial neural network (ANN) architecture.

Activation functions add non-linearity to a neural network, thereby allowing the network to learn complex patterns in the data. …
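As a rough illustration of the idea, the sketch below applies ReLU to the weighted sum of a single node's inputs. It is a minimal pure-Python example rather than code from the article: the helper names (`relu`, `relu_grad`, `neuron`) and the input values are hypothetical, chosen only to show that a node with a negative weighted sum outputs zero and receives a zero gradient, which is the behaviour usually associated with the dying ReLU problem.

```python
def relu(x: float) -> float:
    """ReLU activation: passes positive inputs through, clamps the rest to 0."""
    return max(0.0, x)


def relu_grad(x: float) -> float:
    """Derivative of ReLU with respect to its input (taken as 0 at x == 0)."""
    return 1.0 if x > 0 else 0.0


def neuron(inputs, weights, bias):
    """Weighted sum of the inputs plus a bias, followed by ReLU.

    Returns the activation and the local gradient of ReLU at that sum.
    """
    z = sum(w * i for w, i in zip(weights, inputs)) + bias
    return relu(z), relu_grad(z)


# Hypothetical node with a positive weighted sum: output and gradient are non-zero.
active_out, active_grad = neuron([1.0, 2.0], [0.5, 0.25], 0.0)

# Same node with a large negative bias: output and gradient are both zero,
# so gradient descent gets no signal with which to update its weights.
dead_out, dead_grad = neuron([1.0, 2.0], [0.5, 0.25], -5.0)
```

In the second call the weighted sum is negative, so both the activation and its gradient are zero. If that holds for every training example, the node stops learning.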

Using Python and Flourish to visualize rank and revenue trends of the world’s largest companies

Designed by Vectorarte / Freepik

Companies rise and fall amidst intense and ruthless global competition, so it would be fascinating to visualize the progress of the top global firms over the past few decades.

The Fortune Global 500 is an annual ranking of the top 500 corporations worldwide as measured by revenue, and it serves as a good source of data for running visual analysis. I figured it would also be an enriching experience to generate bar chart race animations using code (Python) and no-code (Flourish) solutions. Let’s get started!

Table of Contents

(1) Data Acquisition
(2) Data Preparation
(3) Bar Chart Race with Python bar-chart-race package
(4) Bar Chart…

Obtain a Tableau Specialist certification to showcase your data visualization skills and product knowledge

Photo by William Iven on Unsplash


Introduction to Tableau Certifications
Tableau Desktop Setup
Step 1: Tableau Free Training Videos
Step 2: Tableau Whitepaper
Step 3: Online Practice
About the Examination

Introduction to Tableau Certifications

Tableau is one of the most popular visual analytics platforms on the market. With its focus on business intelligence, Tableau makes it easy for users to explore and manage data, and to quickly discover and share insights. Given how commonly it is used across various industries, securing a Tableau certification will certainly help you differentiate yourself from the crowd.

Front-end users will most likely be using Tableau…

Simple trick to create a dynamic table of contents to allow easy scroll navigation for your readers

Photo by LinkedIn Sales Navigator on Unsplash

Table of Contents

1) Introduction
2) Concept of HTML Elements (id tag and # symbol)
3) Steps to create this kind of dynamic table of contents


By now you should be aware that the Medium platform does not allow writers to automatically generate a dynamic hyperlinked table of contents. This is an issue because writers frequently use sections to organize their stories, and what we see, more often than not, are static tables of contents that are just lists of text.

Having a dynamic table of contents significantly improves the user experience for readers, making it easier for them to scroll to sections…

Step-by-step sentiment analysis with NLP (Stanza, NLTK Vader and TextBlob) on COVID-19 vaccine tweets

Photo by CDC on Unsplash

The COVID-19 pandemic has presented itself as one of the gravest global threats, and is still very much an ongoing menace. In equal measure, we are in the midst of the biggest vaccination campaign in human history. According to Bloomberg, a staggering 68.1 million doses in 56 countries have been administered so far (as of 26 Jan 2021).

While the vaccine has offered renewed hope in the fight against COVID-19, it has also ignited aggressive anti-vaccine movements. It would thus be interesting to gauge the public’s perception towards the COVID-19 vaccine with sentiment analysis (in Python) on recent Twitter data.


Applying ensemble learning and multiple linear regression in Python for condominium rental price predictions

Photo by Burak Ceviz on Unsplash

This is Part 2 of a two-part series on the analysis of the Singapore condominium rental market. Do also check out Part 1 (Data Acquisition & EDA in Python).

Introduction and Motivation

As described in Part 1, I was keen to examine the condominium rental market in Singapore, which is relatively under-explored.

Having completed data acquisition and exploratory data analysis, it is time to run some predictions on the dataset. This article explores the use of Ensemble Learning models (Random Forest, XGBoost and LightGBM) and Multi-Linear Regression to predict condominium rental prices.

Data Pre-Processing before Prediction

As a recap, our dataset comprises 7,317 condominium listings extracted in…

Glean insights into the Singapore property market through data acquisition (with web scraping) and exploratory data analysis in Python

Photo by Mike Enerio on Unsplash

Introduction and Motivation

There has been plenty of analysis on public housing sales (especially the over-analyzed Boston housing dataset). Even for the Singapore context, there are numerous notebooks and datasets that involve HDB public housing.

Hence, I was keen to examine a segment of the property market that is less explored. Instead of public housing, I looked at private properties (specifically condominiums), and instead of sales, I delved into rentals.

In this article, I share how I acquired data with web scraping and performed exploratory data analysis to glean insights into the Singapore condo rental market. Let’s go!

Note: This is Part…

Hands-on Tutorials

Detect, recognize and extract car license plate numbers with the power of computer vision (A step by step tutorial)

Photo by Alexander Popov on Unsplash

Table of Contents

1) Motivation and Introduction
2) Getting Started
3) Car Plate Detection with OpenCV and Haar Cascade
4) Plate Number Recognition and Extraction with TesseractOCR

Click here to view the Jupyter Notebook, and here to visit the GitHub repo.

Motivation and Introduction

When we talk about AI, computer vision is definitely one of the top applications that comes to people’s minds. Hype aside, I have always been fascinated by it because it pertains to human sight, arguably the most important of the human senses.

I have come across plenty of computer vision projects involving the human face and/or body. As such, I…

Kenneth Leung

Data Scientist | MSc Business Analytics | Web Developer | Pharmacist
