This article is the third in a series that explores a high-level view of how and why many companies are deploying Apache Spark as a solution for their big data technology requirements.
IBM (NYSE: IBM) today announced that its machine learning technology, SystemML, has been accepted into the Apache Incubator as an open source project. Originally developed by IBM Research, and now used in IBM’s BigInsights data analytics platform, SystemML is a machine learning algorithm translator.
In this talk, Xiangrui Meng of Databricks shares his experience in developing MLlib. The talk covers both the higher-level APIs (ML pipelines) that make MLlib easy to use and the lower-level optimizations that make MLlib scale to massive data sets.
In this special guest feature, Bobby Koritala, Chief Product Officer of Infogix, discusses data management best practices and why data quality without data integrity is no match for today’s business demands.
When one of my favorite independent tech book publishers, No Starch Press, notified me about their new title “Doing Math with Python,” I was energized to review what potentially could be a good new resource for budding data scientists.
OpsDataStore, the company delivering a solution to improve online service quality across heterogeneous and rapidly changing applications and IT infrastructures, announced the general availability of OpsDataStore 1.0, the industry’s first big data back end for all IT management data.
For those unfamiliar with Redis, it is an open source, in-memory data structure server. Redis was originally conceived to solve a problem that required speed and simplicity, but it soon became clear that it had applications far beyond its original intent. Redis has since grown to include many data structures that resolve very complex programming problems with simple commands executed within the data store.
In this special guest feature, Dr. Venkat Srinivasan, Chairman and CEO of Rage Frameworks, Inc., outlines how big data can help active investment managers see success in financial markets in pursuit of Alpha.
Apixio Inc., the data science company for healthcare, announced that its HCC Profiler solution is improving care delivery and chronic disease management with rich and accurate patient profiles.
WANdisco (LSE: WAND), a leading provider of continuous-availability software for global enterprises to meet the challenges of Big Data, announced major updates to its flagship WANdisco Fusion Platform.