Compiler, the blog
Data Pipeline for Customer Success Dashboards
A retrospective
Nov. 17, 2020 • The Data Beetle
Background Over the past couple of months, I worked with an e-commerce startup with a B2B offering to build Customer Success (CS) Dashboards for their clients that they will use to assess this...
When less is more but bigger is better
Data Sufficiency Challenges
Sept. 6, 2020 • The Data Beetle
I was recently asked to summarize a Machine Learning paper from Google Brain Team's latest research output: Big Self-Supervised Models are Strong Semi-Supervised Learners . I guess that...
Of problems of data and problematic data
Matching old solutions with new problems based on a better understanding of the requirements and data
June 12, 2020 • The Data Beetle
As a data management & architecture consultant, my goal is to help businesses create the bedrock for data science and analytics. That said, I seldom miss a chance to take a stab at the analyt...
Of companies and businesses
Data Beetle Technologies Limited
April 3, 2020 • The Data Beetle
The Process I've registered my company in Hong Kong to provide Data Management Consultancy (hopefully) with a global outreach. There are two parts to the registration process - "Compan...
Of Data, Big Data, and Massive Data
Greenplum MPP & Talend ELT
Feb. 20, 2020 • The Data Beetle
Request I was recently approached by a friend who runs a growing company to help with Data Management. Their business involves hardware and software installation on certain entities within a loc...
Of Questions & Answers
Data Warehouse Design Questionnaire
Jan. 29, 2020 • The Data Beetle
Background Recently a friend reached out to me saying that his company generates lots of data regularly but it’s all disparate and unwieldy. A common enough predicament. So while perusing this w...
Of Radars & Spiders
Data Governance Assessment
Jan. 9, 2020 • The Data Beetle
Data Governance An important aspect of Data Management (DM) is Data Governance (DG). A DG framework stipulates how data must be acquired, transmitted, stored, accessed, shared, and destroyed. An...
Of Stars & Snowflakes
Dimensional Modelling
Jan. 2, 2020 • The Data Beetle
Data Warehousing Data Warehousing (DW) is the method for storing large amounts of data and making it accessible via convenient channels for analysis. It differs from Online Transaction Processin...
Data Management as a Service
Enabling Advanced Analytics & Data Science
Dec. 30, 2019 • The Data Beetle
Background As highlighted in the previous post , in my 15-year long career, I've enjoyed enabling and supporting data scientists and analysts. For the first nine years or so, I was a busin...
Home of the Data Beetle
Introduction
Dec. 29, 2019 • The Data Beetle
Name & Logo My two undying passions are Data and the Beatles. Hence the name. I didn't go with the spelling with an 'A', though, to avoid any potential copyright issues. Moreo...