Sign up for our newsletter and get the latest big data news and analysis.

STUDY: 69% of Consumers Will Own an In-Home IoT Device by 2019

IoT

Eighty percent of consumers have privacy concerns with wearable Internet of Things (IoT) connected technologies, according to the 2014 State of the Internet of Things Study. But half of those same consumers said they would be willing to share personal data collected by such devices with third-party retailers when presented with compensation such as a coupon or discount.

Designing a High Performance Lustre Storage System: A Case Study

case

Intel’s White Paper, “Architecting a High-Performance Storage System,” shows you the step-by-step process in the design of a Lustre file system. It is available for download at insideBIGDATA White Paper Library. “Although a good system is well-balanced, designing it is not straight forward. Fitting components together and making the adjustments needed for peak performance is challenging. The process begins with a requirement analysis followed by a design structure (a common structure was selected for the paper) and component choices.”

Gartner’s 2014 Hype Cycle for Emerging Technologies

Gartner_Hype_Cycle_2014

The Gartner Hype Cycle for Emerging Technologies just hit the streets! This guide to the industry’s pulse is a good way to balance the hype with reality. As you’ll note in the chart below, big data is entering the Trough of Disillusionment.

Leaving Data on the Table: Data Scientists Reveal Obstacles to Big Data

Paradigm4-data-scientist-survey-Infographic-FINAL

The huge volume of Big Data produced by sensors, genomic sequencers, electronic exchanges, and connected devices continues to generate headlines but it’s the diverse types of data, not the volume, that’s a bigger challenge to data scientists and is causing them to “leave data on the table.”

Revolution R Enterprise vs. SAS Performance Benchmark

RRE vs. SAS Benchmark Results

The debate over which statistical platform sits premiere over the others for data science applications rages on. The discussion often turns to the popular R and SAS environments. But to focus the dialog on performance only, a new benchmark study was just completed by commercial R provider Revolution Analytics.

Big Data Survey Finds 75% of Businesses Yet to Reach Production

Seventy-five percent of businesses have yet to successfully deploy big data analytics solutions to gain business-impacting insights, despite 65 percent increasing their investment in analytic services and technologies in 2014. These findings are part of “Analytics 2014,” Lavastorm’s second annual survey on analytic usage, trends, and future initiatives.

Interview: Survey from Dell Discovers Need for Big Data in Midmarket Companies

04ec7c4225e3d05d0ed8eddc74be3481

Big Data has mostly been considered the realm of big enterprise and not the midmarket segment. Dell launched a survey to study this notion and discovered that midmarket companies not only need Big Data to engender better, more competitive business practices, but many are already using data analysis. We caught up with Darin Bartik, Executive Director and GM of Database Management at Dell, to learn more about the survey and its findings.

insideBIGDATA Guide to Machine Learning

inisde big data guide to machine learning

As the primary facilitator of data science and big data, machine learning has garnered much interest by a broad range of industries as a way to increase value of enterprise data assets. In this article series we’ll examine the principles underlying machine learning based on the R statistical environment.

Machine Learning Identifies Fake Research Papers

Fake_paper_clusters

Unsupervised machine learning techniques have proven useful in identifying fake research papers submitted to the arXiv preprint server. Approximately 500 preprints are receiving daily by the automated repository arXiv, but are not pre-screened by humans. As a result, many nonsense papers generated by software such as SCIgen and Mathgen have been found in the most popular repository used by scientists to share research results.

Big Workflow – Beyond Intelligent Workflow Management

Big-Data-funding

Big data applications represent a fast-growing category of high-value applications that are increasingly employed by business and technical computing users. However, they have exposed an inconvenient dichotomy in the way resources are utilized in data centers. A new white paper that focuses on these issues is available here on insideBIGDATA.