We’re breaking the species barrier in our mission to bring data science to everyone by using the power of data science to improve our pets’ lives.
In supervised learning, model training uses data with known outcomes, while in unsupervised learning, the data doesn’t have a known outcome. So which is best for your use case? Read on to find out!
Let’s become better data scientists by avoiding common pitfalls. Follow these basic principles to make your machine learning projects more impactful.
Resilience is the new accuracy in data science projects. Here’s why your “best” model might not be the best at all…
With Jupyter Notebooks baked into RapidMiner 9.6, coders have a powerful new tool to share projects with coworkers. Read on to find out all the details!
With our latest release, we’re letting anyone shape the future for the better, regardless of their background or skillset. Check out the highlights in this blog post.
Did you miss Wisdom 2020? Or do you want to relive all the fun? This blog post is for you!
What’s coming down the pipe for AI and machine learning in 2020 and beyond?
Natural language processing is changing how companies understand their data. See what it can do for you.
Digital twins are poised to be the next big thing in manufacturing.
With the holiday season upon us, we wanted to update you about three new features available today in RapidMiner Studio.
Thinking about coming to Boston for our 2020 user conference Wisdom? Here are six of the top things you’ll have FOMO about if you don’t attend.
Detecting model drift is a key component of model impact and maintenance. These tips will help you evaluate drift correctly.
Learn about RapidMiner Managed Server, our services offering to install, configure, and maintain a RapidMiner environment for you.
Learn about two phenomena: change of concept and drift of concept which demonstrate why models can’t just be put into deployment forever.
Organizations are struggling to deliver the promised benefits of data science. We call this the ‘model impact epidemic’ and this post examines the macro trends that allow the epidemic to spread freely.
We are proud to announce 5 new operators added across the Operator Toolbox and Smile extensions. Here’s an overview of these extensions and what’s new.
RapidMiner Server and Studio can now use the SAML protocol to interact with any identity provider, and incorporate RapidMiner users to the general user management of the company.
If you’ve spent a good bit of time replacing connections while moving a process to production, struggled with collaboration within your team, or have simply found the current feature set too rigid, we have good news for you.
In this article, we cover common issues we encounter when deploying ML models and how the combination of Talend and RapidMiner help overcome them.
Machine learning is constantly making every stage of manufacturing more efficient and lucrative. Learn how to harness its power for your business.
Learn how to connect RapidMiner Auto Model with other applications through Zapier, which has connectors to nearly every application that exists.
Data Science: Concepts and Practice (Second Edition) by Vijau Kotu and Bala Deshpande is now available. Order your copy today.
Learn how predictive marketing analytics can help engage your audience in all the different stages of a customer journey and maximize lead conversion.
Learn how artificial intelligence (AI), machine learning (ML), and big data are changing the renewable energy sector by taking advantage of collected data.
Learn how to structure and analyze customer reviews by sentiment and topic with machine learning and natural language processing.
Here’s a recap of the presentations from the second day of Wisdom 2018 in New Orleans. Wisdom is RapidMiner’s conference for users.
Here’s a recap of the presentations from the first day of Wisdom 2018 in New Orleans. Wisdom is RapidMiner’s conference for users.
What makes data prep so difficult and tedious? Ingo shares his thoughts on this and how RapidMiner addresses this issue with a new data prep approach.
Machine learning and data science have become an intrinsic part of business. Learn how to avoid common data science mistakes that can ruin your business.
RapidMiner regularly releases new versions of RapidMiner Studio, Server and Radoop. Read the top 10 reasons to upgrade to RapidMiner 9.
Read through a demonstration of Turbo Prep and Auto Model by Ingo Mierswa to see how RapidMiner makes data prep and machine learning fun, fast, and simple.
Check out these data science case studies produced by undergraduate students using RapidMiner in an annual data science competition.
One of the most frequent questions I get asked is: “Ingo, I am from Industry X and my data looks like Y and my colleague recommended to use model Z – what is your opinion on what model to use?” In this blog post, I explain a well-proven framework for model selection.
RapidMiner’s Real-Time Scoring Agent extends Server with a lightweight execution engine designed for specific use cases where speed and volume are critical.
I’m thrilled to announce that RapidMiner is nominated for the Company of the Year in AI, Machine Learning and Blockchain Technology at the NEVY Awards.
Today RapidMiner announced that we’re giving everyone a 30-day trial of Studio Large. Everyone will automatically receive the 30-day trial license.
We are excited to announce our partnership with MapR. This opens up data science possibilities for those who rely on MapR for managing their big data.
RapidMiner Auto Model automates machine learning and accelerates Data Science, making the platform more accessible to new users and more powerful for expert Data Scientists.
In Part 4 of this series we discuss multi-objective feature selection, which can be used for unsupervised learning & to identify best spaces for clusters.
Multi-objective optimization is great for feature selection because we can find all potentially good solutions without defining a trade-off factor.
Evolutionary algorithm is a generic optimization technique mimicking the ideas of natural evolution with the concepts of crossover, mutation, and selection.
Want to gain a better understanding of the data science and machine learning platforms market? Check out these review platforms reviews about RapidMiner.
Feature selection can greatly improve your machine learning models. Learn about it’s importance in part 1 of this blog series.
Remove obstacles to developing useful machine learning outputs and how to gain insights with the integration of RapidMiner and Tableau.
RapidMiner 8.0 will bring new features that offer improved reliability and horizontal scalability, to which we will expand the platform and revamp Server.
We’ve compiled the top ten most useful tips and tricks from our data science team to help you master our RapidMiner Studio.
We look at three use cases to demonstrate how RapidMiner Studio and RapidMiner Server can compliment each other and advance your call center operations.
Naïve Bayes is a powerful machine learning technique. Learn more about this classifier below and make it part of your standard toolbox.
RapidMiner was again voted as the most popular general data science platform and this is all thanks to our community of users!
What exactly are we doing with AI? Learn about what artificial intelligence and machine learning can do – and what it can’t do.
Delano Lima from Brazil won first place for his political tweet analysis project using the Rosette API and Rapidminer Studio in a Data Scientist Challenge.
k-Nearest Neighbors is one of the simplest machine learning algorithms. As for many others, human reasoning was the inspiration for this one as well.
What is data science? Have you read about the relationship between AI, machine learning, and deep learning? How do they relate to data science ?
There is hardly a day where there is no news on artificial intelligence in the media, and people know shockingly little about it. Read to learn more.
Learn more about time series forecasting in RapidMiner Studio and with R. R integrates well within RapidMiner in order to handle time series forecasting.
RapidMiner is a leader in both the Gartner’s Magic Quadrant for Data Science and the Forrester Wave: Predictive Analytics and Machine Learning Solutions.
Customer service centers are dominated by voice interactions between customers and service center agents, who are the face of the company.
Learn how k-fold cross-validation is the go-to method whenever you want to validate the future accuracy of a predictive model.
People often borrow ideas and apply them their situations. Start borrowing some great processes and use them for your own use case.
Learn how k-fold cross-validation is the go-to method whenever you want to validate the future accuracy of a predictive model.
Training errors can be dangerously misleading. Discover which practices will provide you with better estimation techniques for your model.
Data Prep series part 5: Outlier Detection. How to detect outliers and determine whether they are important or erroneous data that needs correction.
Learn how to prevent mistakes in model validation and the necessary components of a correct validation in regards to the training and test error.
A wrong validation leads to over-optimistic expectations for the model’s performance. Learn how to validate models correctly with our new blog series.
Using data visualization can tell a thousand words about your models to stakeholders. Discover how RapidMiner integrates with tools, like Qlik and Tableau.
Happy Holidays from RapidMiner. We would like to thank you for loving RapidMiner and supporting us on every step of this journey.
Feature Generation and Selection is the next step on transforming your data and we have some handy operators to help you make this process fast and easy.
After upgrading RapidMiner Studio, you might be wondering where your processes went. No need to worry, we’ve got you covered!
As Data Scientists, Engineers and Analysts, you have to routinely transform data from one type to another. RapidMiner makes converting data types easy.
The first companies to implement predictive maintenance and convert their vast data into actionable insights will gain a huge competitive advantage.
Data quality refers to the right type of data being in the right place. Learn how to improve the quality of your data by replacing missing values.
RapidMiner Radoop is a powerful tool for simplifying Big Data Hadoop Analytics and configuring it just got EASIER! Learn Now.
RapidMiner offers the option to export processes as scalable images in the Scalable Vector Graphics (svg) or Portable Document Format (pdf) file formats.
We kicked-off a special-purpose project, named the Data Core Project, to revise the core data management and processing core of RapidMiner.
Now that we have ported the cross-validation operator to make use of parallel execution, you can ultimately produce better results, faster.
You must spend time on data exploration; you must think about the problem you’re trying to solve, bring the right data together and then inspect it.
Learn how to connect with a Remote Desktop and finish the installation of RapidMiner Server on AWS and start running your processes in the cloud.
Learn how to take advantage of the AWS cloud infrastructure environment to put RapidMiner Server in a place where it can run 24/7.
Easily access Federal Reserve Economic Data or FRED API data in RapidMiner Studio using only two operators: Open File and Read XML.
Take advantage of building blocks, pre-built processes encapsulated inside a Subprocess meant to help speed up your analytics.
Learn how to configure Studio settings and add proper path properties to execute R and Python scripts in RapidMiner Server.
Exploring a new discipline is always a difficult task – that’s why we are proud to provide you with a Data Science Map to help you with this journey.
Start a Data Science Project with RapidMiner’s Data Science Expert Marketplace; helping you close the data science skills gap.
If you’re familiar with the Groovy script language, then the Execute Script Operator will quickly become a favorite. Learn more here.
Learn how to use data from MongoDB in RapidMiner to help website owners measure the successes of their online business goals.
In this post we’re going to continue with this theme but focus on authorizing groups and showcasing some of the native Web App (Dashboarding) capabilities.
The question isn’t RapidMiner vs R, it’s how to use them together. Learn tips and tricks for using RapidMiner with Python and R.
How to join data in RapidMiner. 7 easy ways to mash up your data in SQL fashion without writing SQL and using RapidMiner instead.
Learn how Hadoop big data in-Hadoop & in-memory approaches have positives factors when doing data science.
I wanted to use RapidMiner to tackle Kaggle Competitions and see if I could get in the Top 10% of the ML challenge called “Shelter Animal Outcomes”. – pt.2
I wanted to use RapidMiner to tackle Kaggle Competitions and see if I could get in the Top 10% of the ML challenge called “Shelter Animal Outcomes”.
Data science teams are an evolution of the marketing operations function, who are responsible for marketing technology, processes, and analytics.
How do we detect if there’s a problem in our infrastructure? RapidMiner explores the use of customer feedback to predict and reduce IT disruption.
Learn how to use RapidMiner Server to operationalize fraud models, push results to your BI engine, and seamlessly integrate machine learning in your company
Use Machine Learning to understand complex customer behavior patterns. Use RapidMiner to extract those relationships and drive customer retention quickly.
Use data science to predict qualified leads. Use our simple drag and drop data science platform to revolutionize sales and marketing processes.
Focus on predictive analytics to create more effective marketing. Here are just 5 ways you can use predictive analytics marketing.
See why organizations are investing in Qlik machine learning to be able to easily implement predictive analytics models into their business.
Data scientist Martin Schmitz talks about using RapidMiner Studio to do your own cluster analysis of Hearthstone card decks.
Today I’m excited to announce RapidMiner 7.2 is now available, and that we’ve launched free versions of Server & Radoop and introduced new pricing.
Introducing new 7.2 features such as gradient boosted trees, deep learning, generalized linear models, and a brand-new logistic regression.
Similar to Amazon recommendations, RapidMiner shows you which operator other data scientists would use next if they were building your process.
See how text mining with RapidMiner can help you determine customer sentiment, predict trends in adoption and make more informed business decisions.
There’s a lot of ways to use analytics in applications, but you won’t be able to do it efficiently unless you operationalize predictive analytics.
We are going to make a start providing an area where you can describe your projects and invite feedback on our Community site.
One of the most fun events at Wisdom is our competition, “Who Wants to be a Data Miner?” Participants must design RapidMiner processes for a given goal.
At Wisdom New York in February I was struck by the strong community feel in the room. Academics and customers sharing their time and knowledge.
Data science is the new computer science.The practical applications are clearer than ever. I came across articles that illustrate the impact of data science
RapidMiner makes it easy for organizations to get started with predictive analytics and begin extracting game-changing value.
We are embracing an open core business model to strive a good balance between the underlying concepts of open source and letting our organization grow.
Interestingly, a pure open source model has seldom been a successful commercial business model. In fact, maybe the only successful example is RedHat.
You see, things move very fast at RapidMiner. In the past few months we’ve hired 30 new people. And they all have a perspective on “who is RapidMiner.”
RapidMiner Boosts Security, Collaboration & Extensibility in Big Data with Latest Platform Innovations
We’re excited to announce updates to the RapidMiner portfolio. The latest innovations empowers those aiming to get more actionable insight from Big Data.
Amnesty International uses a CRM system to extend the relationship lifecycle. But it wanted to improve performance using new data analytics methods.
As an increasing number of enterprises move towards production deployments of Hadoop, security continues to be an important topic.
Building on the same processes from the Political Sentiment model, we redeploy them but update the Twitter Search terms for #Boston2024.
There are more than 250,000 RapidMiner users worldwide, brilliant minds who have had tasks similar to yours – building some analytical process.
Ingo Mierswa, RapidMiner’s CEO, Co-Founder & Data Scientist in Residence, explains how the K-nearest-neighbors algorithm is used to formulate ideas.
Ingo Mierswa, tells us that he doesn’t care about the past. That’s the realm of BI & reporting. He cares about the future – predictive analytics!
RapidMiner Academia provides free or substantially discounted licenses to students, professors, researchers and other academics.
On this episode, Ingo welcomes in the New Year by talking through the opportunities and challenges facing big data analytics professionals around the world.
CROC, Russia’s leading systems integration service provider, recently joined RapidMiner’s Partner Program. It’s estimated Russian IT market at $30B annually
Explore RapidMiner cloud use cases to help you understand real world applications for running data science processes backed by AWS.
An in-depth with Davide Weisman, Ph.D. on using RapidMiner in real world projects for text and social media data mining, enterprise-level strategy and more.
Predictive analytics continues to evolve and the ongoing quest is to build computer systems capable of understanding concepts rather than just keywords.
Radoop, a leading big data analytics solution, makes Hadoop implementations more powerful and easy to use with RapidMiner’s advanced analytics suite.
When you work with Predictive Analytics, you work these odds. You don’t look for the single prospect most likely to become a customer in the future.
Predictive Analytics and Big Data have been adopted by many industries, but Human Resources departments seem to be lagging this trend.
While businesses, and pro sports teams and scouts, will continue to make some decisions based on observation, Predictive Analytics is going mainstream.
Text Analytics and Why the purchase of WhatsApp was a Good Deal for Facebook.
We believe that our placement in the Leaders quadrant reflects our status as one of the world’s most frequently downloaded predictive analytics tools.
Previously, companies have just analyzed past data to discover what happened, when it happened, and why it happened.
What’s the big deal with big data? Sure, it’s got those 3 Vs Gartner talked about – Volume, Velocity, and Variety.
There is no better way to understand and learn more about predictive analytics and RapidMiner than by reading! We’ve compiled a list of our favorites.
With the news of Rapid-I now being RapidMiner we thought it would be helpful to reiterate that the core of RapidMiner stays open source.
We are excited to announce the top ranking of RapidMiner in KDNuggets’ 14th annual poll of Predictive Analytics and Data Science Software software use.
Hack/reduce brands itself as Boston’s Big Data hacking space. Backed by a who’s who of Boston tech powerhouses, ranging from Harvard to Google.
The Rapid-I team keeps on mining and we excavated two great books for our users. The first one, Data Mining for the Masses by Matthew North.
This year’s RCOMM live data mining challenge, “Who wants to be a data miner?”, to (partially) solve a Sudoku puzzle with RapidMiner.
The team is proud to announce the birth of a brand new plot component presenting you a powerful and flexible visualization of your data & process results.
MythMiner’s goal is to suggest TV programs based on the programs recorded by the user so far. This is similar to Tivo but it is based on MythTV.
The new super computer named Watson was created and trained during the last 4 years by 25 IBM engineers in order to play (and win!) at Jeopardy.
Being not only a company but even an open source company allows us to share this great feeling even more often.
I just stumbled upon this great blog post about some uncommon uses of regular expressions. RapidMiner also makes a lot use of those beasts.
One of the next versions of RapidMiner (5.0.011 or the upcoming version 5.1) will provide a nice extension of the expression parser which is for example used for the operator “Generate Attributes”.
You can find RapidMiner together with the latest information about RapidMiner and data science in general on Facebook, Twitter, and YouTube.