Visualizing your data is a great way to deliver your message to your audience. Until now, data processing scripts and low-code functionality have enabled BI analysts to transform their data and keep only the most important parts of it. At the same time, many BI analysts are building up their Python (or R) skills to apply more advanced analytics to their data and visualize the results in a dashboard or low-code app. However, current BI platforms don’t always provide tools for integrating Python or R code, and they are especially unsuitable for hosting machine learning code. One solution could be to use one of the major cloud providers, but this is often easier said than done, as it requires specialized knowledge of a variety of services.
In this example, we wish to predict if a new Spotify song fits better with Wouter or Alexander, our two targets. To do so, we use their Spotify playlist data to uncover their music taste.
Alexander has developed a classification model that predicts whether a song belongs to Wouter’s Spotify playlist or to Alexander’s. The model is deployed on Kubernetes with an API endpoint (in UbiOps) and visualized using Tableau.
Rockfeather makes use of Data Visualization, Data Science, and Low-Code technology to build smart solutions. Their team creates state-of-the-art solutions for their customers in various industries and functions. This is how Rockfeather embraces technology and empowers ambitious people.
UbiOps is a deployment tool built for data scientists, which allows them to deploy their code easily, with knowledge of Python or R alone.
The first step in the process is training a predictive model that can find patterns in the data from the two targets. The Spotify datasets for the two targets include all characteristics of the songs they listen to. These characteristics are translated into numeric variables, so that we can compare the songs to each other. The following features are used:
Before Alexander starts the training process, he splits the dataset into a training set and a testing set. The training set is made available to multiple machine learning models so that they can recognize patterns in it. Based on this training set, the models look at the characteristics of the songs in order to differentiate the music styles of Alexander and Wouter. Once the training process is done, the performance is measured on the testing set. This is data that the model has never seen. The algorithm-generated predictions, that is, a classification of whether a song belongs to Alexander’s or Wouter’s playlist, are compared with the actual values. The predicted and actual values of the test set are visualized in the Tableau dashboard at the top of this page.
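The split-train-test loop described above can be sketched in a few lines of plain Python. This is only an illustration, not Alexander's actual code: the two numeric features, the synthetic songs, and the simple nearest-centroid classifier are all assumptions chosen to keep the example free of external libraries. In practice a library such as scikit-learn would typically handle these steps.

```python
import random

def train_test_split(rows, test_fraction=0.25, seed=42):
    """Shuffle the rows and hold out a fraction of them for testing."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

def fit_centroids(train_rows):
    """Train a nearest-centroid model: the mean feature vector per label."""
    sums, counts = {}, {}
    for features, label in train_rows:
        acc = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}

def predict(centroids, features):
    """Return the label whose centroid is closest to the given features."""
    def squared_distance(centroid):
        return sum((a - b) ** 2 for a, b in zip(features, centroid))
    return min(centroids, key=lambda label: squared_distance(centroids[label]))

def accuracy(centroids, rows):
    """Share of rows whose predicted label matches the actual label."""
    correct = sum(predict(centroids, features) == label for features, label in rows)
    return correct / len(rows)

def make_songs(label, center, n, rng):
    """Generate hypothetical songs scattered around a 'taste' center."""
    return [([rng.gauss(c, 0.05) for c in center], label) for _ in range(n)]

# Hypothetical data: two made-up numeric song features per song.
rng = random.Random(0)
songs = (make_songs("Alexander", [0.8, 0.7], 40, rng)
         + make_songs("Wouter", [0.3, 0.4], 40, rng))

train_rows, test_rows = train_test_split(songs)
model = fit_centroids(train_rows)          # the model only sees the training set
test_accuracy = accuracy(model, test_rows)  # evaluated on songs it has never seen
print(f"Accuracy on unseen songs: {test_accuracy:.2f}")
```

Comparing several candidate models then comes down to repeating the fit and accuracy steps for each one on the same split and keeping the best scorer.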
After comparing the performance of different models with different parameter settings, Alexander is satisfied with the best-performing model, which he saves as “rf_saved.pkl”. By doing so, he stores the model parameters and model weights, so that he can apply the model to any new data that he feeds it. The model and its predictions can now be used in practice.
Wouter explains how to do this in UbiOps’ technical blog. Would you like to know more about machine learning or data science? Then scroll down and sign up for a free discovery session, in which we will look at the most promising use cases for your business.
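Saving and reloading a model this way can be sketched with Python's standard pickle module. Only the file name “rf_saved.pkl” comes from the text above; the placeholder model object and the temporary directory are assumptions for illustration, and the actual model type is not stated here.

```python
import pickle
import tempfile
from pathlib import Path

# Placeholder standing in for the trained model (assumption: the real
# object would be whatever the training library returns after fitting).
trained_model = {"model_type": "placeholder", "parameters": {"n_trees": 100}}

# Write to a temporary directory so the example leaves no files behind
# in the working directory; the file name is the one used in the text.
model_path = Path(tempfile.mkdtemp()) / "rf_saved.pkl"

# Serialize the model, including its parameters, to disk.
with model_path.open("wb") as f:
    pickle.dump(trained_model, f)

# Later, for example inside a deployment, load it back and use it on new data.
with model_path.open("rb") as f:
    restored_model = pickle.load(f)

print(restored_model == trained_model)
```

Pickle files can execute arbitrary code when loaded, so only unpickle model files that come from a trusted source.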
This blog has been written by Alexander Mik (Rockfeather) and Wouter Hollander (UbiOps)