Sign up for our newsletter and get the latest big data news and analysis.

Cloudera Enhances Hadoop Usability and Accessibility for Data Scientists

Cloudera_logo_7212015

Cloudera, a leader in enterprise analytic data management powered by Apache Hadoop™, announced a number of new initiatives to enable data scientists to take advantage of big data and Hadoop for data analysis with more complex workflows.

Huawei Opens Astro to Boost Spark

Huawei-logo

Huawei announced that the Spark SQL on HBase package is now open source and available. Dubbed Astro, the end-to-end package combines the capabilities of Spark, Spark SQL and HBase, helps drive Spark adoption in broad a NoSQL customer base and provides powerful online query and analytics capabilities for large scale data processing in vertical enterprises.

Talend Releases Update to Integration Cloud

talend logo_NEW

Talend, a global leader in big data integration software, today announced the availability of Talend Integration Cloud – Summer ‘15, a secure and hosted platform for connecting all cloud and on-premises data and applications.

Spark 101: Spark Streaming and GraphX at Netflix

Spark_logo_feature

The Bay Area Spark Meetup recently was hosted at Netflix to feature talks by Netflix engineers about their use of Spark Streaming and GraphX, as well as a Q&A session with the Netflix folks plus the lead engineer of Spark Streaming. The presentation is provided here with the abstracts of the two talks below.

Talend Joins Salesforce Analytics Cloud Partner Ecosystem

talend logo_NEW

Talend, a global leader in big data integration software, announced the company has joined the Salesforce Analytics Cloud Partner Ecosystem and will provide companies with an easy, flexible and cost-effective way to cleanse, transform and move big data.

Big Data in Biosciences and Health Care is Focus of New UCLA Research Center

BigData_science

A new research institute at UCLA may eventually provide doctors with tools to more accurately tailor medicines for individual patients, which could both improve quality of care and minimize the side effects associated with today’s medicine.

Spark 101: Running Spark and MapReduce together in Production

Spark_logo_feature

Clusters must be tuned properly to run memory-intensive systems like Spark, H2O, and Impala alongside traditional MapReduce jobs. This Hadoop Summit 2015 talk describes Altiscale’s experience running the new memory-intensive systems in production for its customers.

Strategic Big Data Pivot Leads Webtrends to Success

Peter_Crossley_Webtrends

The interview that follows is with Peter Crossley, Director of Product Architecture at Webtrends to discuss his company’s data platform centered around Hadoop and Spark.

Xplenty Rolls Out Analytics-as-a-Service for Amazon S3

xplenty-logo

Xplenty, the data integration platform that makes it easy to process more data more quickly, announced its newest offering, Analytics-as-a-Service or “A3S.” This latest addition to Xplenty’s extract, transform and load (ETL) software enables users to query and analyze data directly on Amazon Simple Storage Service (Amazon S3™).

Teradata Launches Configurable Appliance for Big Data with Choice of Hadoop Distributions

teradata_logo_mi

Teradata (NYSE: TDC), the big data analytics and marketing applications company, launched the next-generation Teradata Appliance for Hadoop®, version 5, which is configurable, ready-to-run and offers a choice of the latest version of Hadoop from Hortonworks® (HDP™ 2.3), and for the first time, Cloudera (Cloudera Enterprise 5.4).