Open source data visualization tool by Airbnb: benefits and limitations.

Open source data visualization tool by Airbnb: benefits and limitations.

In economies where the role of big data is ever-increasing, companies turn to business intelligence (BI) and reporting tools to have their data analyzed and presented in a precise and readable format.

As a data scientist, I’m always working with companies looking for actionable insights. Both large enterprises and startups today are ready to invest in BI tools and the relevant engineering work to explore and visualize their data.

In this article, I review an open source data visualization tool Superset designed by Airbnb. We used Superset in one of our recent projects and it served us really well in most cases. In this article, I will tell why we chose Superset among other BI tools, what are the main benefits and drawbacks of the platform.

We used Superset in our project for a fitness mobile app with a huge fast-growing customer base. On the one hand, a BI tool was requested by business stakeholders, who needed a number of specific reports to monitor trend changes in application usage and better understand customer behavior. On the other hand, a BI tool could be used by our data science team to perform exploratory data analysis in relation to different user cohorts before building Machine Learning models.

We needed a tool that would satisfy the following requirements:

After searching for available solutions, we selected SupersetandPentahofor further evaluation.

Superset was seen as a more attractive tool for us for the following reasons:

Superset is a data exploration platform designed to be visual, intuitive and interactive. Superset’s main goal is to make it easy to slice, dice and visualize data. Its developer claims that Superset can perform analytics at the speed of thought. As we have already mentioned the open source data visualization tool is written in pythonic web framework Flask.

This project was originally named Panoramix, was renamed to Caravel in March 2016, and is currently named Superset as of November 2016.Source.

Superset allows integration of multiple data sources but in our project, we created a single BI data warehouse that was the only data source for Superset.

2. After you’ve added a data source you should add tables from that DB and specify fields’ properties. You can specify if a field is groupable, filterable, temporal and so on. Also, you can add custom metrics (in addition to default ones like COUNT, COUNT DISTINCT, SUM, etc.).

3. Then you create slices. Slice is a single plot based on your data. Note that you can build slices for just one table at a time. But it’s not a problem because you can always create a view by joining as many tables as needed. Your future BI report can consist of one or more slices.

Superset worked perfectly for us when we used it to plot aggregated metrics like distribution of users grouped by OS (iOS, Android), age groups, gender and so on. But the tool appeared to be helpless when we wanted to plot aggregations of a higher level.

Share it:
Share it:

[Social9_Share class=”s9-widget-wrapper”]

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

You Might Be Interested In

How Generative AI Could Lead to a 10x Increase in Coding Productivity

26 Feb, 2023

In the recent “Big Ideas 2023” report by Ark Invest, the investment management firm forecasted that AI could lead to …

Read more

Why you should devote as much time to dark data as big data

27 Oct, 2016

IDC and EMC estimate that by 2020 the world will have 40 zettabytes of data, and a 2016 Veritas Global …

Read more

Key Trends To Dominate Big Data Analytics

12 Jan, 2017

Big data analytics has been in the forefront of providing insights and predictive power to businesses in recent times. Once …

Read more

Recent Jobs

Cyber Security Engineer – P2

Hybrid (Aurora, CO, USA)

5 Mar, 2024

Read More

Sr. Manager – Data and Analytics Technical Lead

Hybrid (Dedham, MA, USA)

5 Mar, 2024

Read More

Manager, Business Data and Analytics

Hybrid (Troy, OH, USA)

5 Mar, 2024

Read More

Do You Want to Share Your Story?

Bring your insights on Data, Visualization, Innovation or Business Agility to our community. Let them learn from your experience.

Get the 3 STEPS

To Drive Analytics Adoption
And manage change

3-steps-to-drive-analytics-adoption

Get Access to Event Discounts

Switch your 7wData account from Subscriber to Event Discount Member by clicking the button below and get access to event discounts. Learn & Grow together with us in a more profitable way!

Get Access to Event Discounts

Create a 7wData account and get access to event discounts. Learn & Grow together with us in a more profitable way!

Don't miss Out!

Stay in touch and receive in depth articles, guides, news & commentary of all things data.