Using Python and Flourish to visualize rank and revenue trends of the world’s largest companies

Designed by Vectorarte / Freepik

Companies rise and fall amid intense, ruthless global competition, so it would be fascinating to visualize the progress of the top global firms over the past few decades.

The Fortune Global 500 is an annual ranking of the top 500 corporations worldwide as measured by revenue, and it serves as a good source of data for running visual analysis. I figured it would also be an enriching experience to generate bar chart race animations using code (Python) and no-code (Flourish) solutions. Let’s get started!
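As a rough illustration of the data preparation such a chart needs, here is a minimal sketch that pivots long-format records into a wide table, with one row per company and one column per year. It assumes this wide layout is what the bar chart race tool expects. The company names, revenue figures, and the helper names `to_wide` and `to_csv` are hypothetical placeholders, not actual Fortune Global 500 data or code from the article.

```python
import csv
import io
from collections import defaultdict

# Placeholder records as (year, company, revenue); values are illustrative only.
records = [
    (2018, "Company A", 500.0),
    (2018, "Company B", 480.0),
    (2019, "Company A", 510.0),
    (2019, "Company B", 530.0),
]


def to_wide(rows):
    """Pivot long-format rows into one row per company, one column per year."""
    years = sorted({year for year, _, _ in rows})
    table = defaultdict(dict)
    for year, company, revenue in rows:
        table[company][year] = revenue
    header = ["Company"] + [str(year) for year in years]
    body = [
        # Leave the cell empty when a company has no entry for a given year.
        [company] + [table[company].get(year, "") for year in years]
        for company in sorted(table)
    ]
    return [header] + body


def to_csv(wide_rows):
    """Serialize the wide table as CSV text, ready to save or upload."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(wide_rows)
    return buffer.getvalue()


wide = to_wide(records)
csv_text = to_csv(wide)
print(csv_text)
```

The same wide table can feed either route: saved as a CSV for a no-code tool, or passed to a plotting library in Python.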



Obtain a Tableau Specialist certification to showcase your data visualization skills and product knowledge

Photo by William Iven on Unsplash


Introduction to Tableau Certifications

Tableau is one of the most popular visual analytics platforms on the market. With its focus on business intelligence, Tableau makes it easy for users to explore and manage data, and to quickly discover and share insights. Given how commonly it is used across industries, securing a Tableau certification will help you differentiate yourself from the crowd.

Front-end users will most likely be using Tableau…


Simple trick to create dynamic table of contents for easier scroll navigation for your readers

Photo by LinkedIn Sales Navigator on Unsplash


Introduction

By now you should be aware that the Medium platform does not allow writers to automatically generate a dynamic, hyperlinked table of contents. This is an issue because writers frequently use sections to organize their stories, and what we see, more often than not, are static tables of contents that are just lists of text.

Having a dynamic table of contents significantly improves the user experience for readers, making it easier for them to scroll to…


Step-by-step sentiment analysis with NLP (Stanza, NLTK Vader and TextBlob) on COVID-19 vaccine tweets

Photo by CDC on Unsplash

The COVID-19 pandemic has proven to be one of the gravest global threats and remains very much an ongoing menace. At the same time, we are in the midst of the biggest vaccination campaign in human history. According to Bloomberg, a staggering 68.1 million doses have been administered across 56 countries so far (as of 26 Jan 2021).

While the vaccine has offered renewed hope in the fight against COVID-19, it has also ignited aggressive anti-vaccine movements. It would thus be interesting to gauge the public’s perception towards the COVID-19 vaccine with sentiment analysis (in Python) on recent Twitter data.



Applying ensemble learning and multiple linear regression in Python for condominium rental price predictions

Photo by Burak Ceviz on Unsplash

This is Part 2 of a two-part series on the analysis of the Singapore condominium rental market. Do also check out Part 1 (Data Acquisition & EDA in Python).

Introduction and Motivation

As described in Part 1, I was keen to examine the condominium rental market in Singapore, which is relatively under-explored.

Having completed data acquisition and exploratory data analysis, it is time to run some predictions on the dataset. This article explores the use of Ensemble Learning models (Random Forest, XGBoost and LightGBM) and Multi-Linear Regression to predict condominium rental prices.

Data Pre-Processing before Prediction

As a recap, our dataset comprises 7,317 condominium listings extracted in…


Glean insights into the Singapore property market through data acquisition (with web scraping) and exploratory data analysis in Python

Photo by Mike Enerio on Unsplash

Introduction and Motivation

There has been plenty of analysis on public housing sales (especially the over-analyzed Boston housing dataset). Even in the Singapore context, there are numerous notebooks and datasets that involve HDB public housing.

Hence, I was keen to examine a segment of the property market that is less explored. Instead of public housing, I looked at private properties (specifically condominiums), and instead of sales, I delved into rentals.

In this article, I share how I acquired data with web scraping and performed exploratory data analysis to glean insights into the Singapore condo rental market. Let’s go!

Note: This is Part…


Kickstart 2021 with an AWS certification to validate your cloud expertise

Photo by JESHOOTS.COM on Unsplash


Introduction to AWS Certifications

Amazon Web Services (AWS) is one of the most popular cloud platforms, offering an extensive suite of fully featured services from data centers globally. In line with these services, AWS offers certifications to help professionals validate their in-demand cloud computing skills.


Hands-on Tutorials

Detect, recognize and extract car license plate numbers with the power of computer vision (A step by step tutorial)

Photo by Alexander Popov on Unsplash


Click here to view the Jupyter Notebook, and here to visit the GitHub repo.

Motivation and Introduction

When we talk about AI, computer vision is definitely one of the top applications that come to mind. Hype aside, I have always been fascinated by it because it pertains to human sight, arguably the most important of the human senses.

I have come across plenty of computer vision projects involving the human face and/or body. As such, I…


Hands-on Tutorials

Using fast.ai v2 and Google Colab to serve an intoxicating mix of data and dram

Photo by Q.U.I on Unsplash

Motivation

Having started immersing myself in the updated fast.ai v2 deep learning course, I felt it would be ideal to apply and document what I have learnt so far. In this article, I will share how I trained a deep learning (CNN) classifier to distinguish between different types of popular alcoholic beverages.

An important highlight of this walkthrough is that it details the latest instructions on how to utilize the updated Microsoft Azure Bing Search V7 API, since key changes were implemented as of 30 Oct 2020.



Leveraging the power of automation to generate data

Photo by Christopher Gower on Unsplash

Motivation

Analytics is impossible without data, and web scraping is one of the many tools out there for us to curate data. The concept of web scraping has always fascinated me, and I felt it would be fun to practice my coding chops while exploring the public information of healthcare professionals in Singapore.

I also noticed that most existing web scraping tutorials tend to be overly brief, so a more detailed walkthrough would be beneficial.

The Ministry of Health (MOH) Professional Registration System site provides open access to the public information of healthcare professionals, and there are different sites for…

Kenneth Leung

Data Scientist | NUS MSc Business Analytics | Pharmacist | https://linkedin.com/in/kennethleungty
