We’re excited to announce significant updates to the RapidMiner Modern Analytics Platform, the most comprehensive advanced analytics offering on the market today.
In a world where data lakes are often used solely as a repository for information, underutilized due to the state of the market and limits of technology, our aggressive advances turn the tide for data scientists and business users alike to extract business value from big data.
As you know, most traditional analytics vendors extract data from Hadoop to build and score analytic models. Moving big data out of Hadoop reintroduces bottlenecks and increases complexity. Only a few analytic vendors push down analytics computation to big data in Hadoop. RapidMiner pushes the computation of more than 250 machine learning models directly to the data in the cluster, making it easy to deploy powerful predictive analytics into production inside Hadoop.
With our pushdown Hadoop processing in RapidMiner Radoop, combined with our recent announcement of RapidMiner Streams, it’s easy to see that we are quickly turning dormant data lakes into money-making machines where enterprises can maximize the business value from their data. And now, predictive analytics is no longer a nice-to-have competitive advantage. It’s an absolute business necessity. Nobody else offers what RapidMiner does, and our latest release establishes us as the de-facto modern analytics platform.
The fact is RapidMiner is the only code-free advanced analytics platform available commercially that can execute analytical processes in-memory, in-Hadoop, in-Cloud, in-Stream and in-database. And, furthermore, our new in-Hadoop model scoring delivers up to 20x in performance compared to legacy Hadoop model scoring.
RapidMiner Radoop, which automatically creates an optimized analytic execution plan based on the unique Hadoop cluster configuration, now integrates machine learning algorithms from MLlib, Apache Spark’s machine learning library. This RapidMiner Radoop release includes push down processing for logistic regression and decision tree algorithms that can be trained natively in Hadoop, making use of the full distributed computation power of Spark in a Hadoop cluster.
We also recognize that big data security is top of mind for enterprises worldwide. This crucial business requirement typically delays analysis, but not with RapidMiner. RapidMiner Radoop easily integrates with Kerberos authentication securing Hadoop clusters, making it easy to perform large-scale data exploration, model building and model scoring while complying with well-adopted security standards.
We’ve also taken self-service analytics to the next level with “Guided Analytics”:
Wisdom of Crowds
RapidMiner continues to differentiate itself from other advanced analytics providers by offering a guided approach to building predictive analytics based on the wisdom gleaned from the 250,000 member strong RapidMiner community. The analytic best practice, or wisdom from the crowd, is mined via RapidMiner machine learning to recommend how to best build a predictive model. As users are always experimenting and learning, the latest innovations happening in the community are offered up as recommendations. This unique feature, which leverages the power of the RapidMiner community to create recommendations, makes it easier to develop more accurate predictive models, no matter how sophisticated the end user may be.
This latest platform release now includes context-aware recommendations, resulting in more relevant and focused guidance. This context awareness gives RapidMiners a better understanding of how other users are solving similar problems, and by tapping into that wisdom, offers up better recommendations that accelerate their time-to-value.
This latest platform release now includes recommendations for parameter settings. Tuning parameters is critical when developing analytic work flows and is a notoriously tedious and difficult task, especially for beginners. The RapidMiner parameter recommender uses the knowledge and experience of the community to recommend and fine tune parameters which improve the model accuracy and results.
Our latest platform will be showcased in Booth #1421 at this week’s Strata + Hadoop World taking place through February 20 in San Jose, California. If you plan to attend, we’d like to encourage you to take the RapidMiner Challenge to build and deploy a machine learning model in-Hadoop in 10 minutes or less. We’ll have a lot more going on at the booth as well, so we hope you’ll plan to join us!
I look forward to your comments and feedback about this exciting announcement.