Sign up for our newsletter and get the latest big data news and analysis.

Data Science 101: Expressing Yourself in R

Brought to you by our friends over at the Stanford Center for Professional Development is this compelling data science education resource: “Expressing yourself in R” – by Hadley Wickham, Rice University.

What’s Hot & What’s Not in Data Science 2015

Big-Data-Trends-2015

In addition to analyzing data, our friends over at CrowdFlower — a people-powered data enrichment platform — has an affinity for lists. This year, they asked their team of data scientists to come up with what they think will be “hot” or “not” in their world for 2015.

Qubole Secures $13 Million in New Financing to Expand Big Data-as-a-Service Platform

Qubole_logo

Qubole™, a leading provider of Big Data in the cloud, announced today it has secured $13 million in new financing led by Norwest Venture Partners. The funding builds on recent milestones from the company, including a partnership with Microsoft Azure to integrate Qubole Data Service (QDS) to Azure customers, and will be used to expand Qubole’s mission to help every company to use the cloud to turn data into business growth.

Ask a Data Scientist: Confounding Variables

datascientist2_featured

Welcome back to our series of articles sponsored by Intel – “Ask a Data Scientist.” This week’s question is from a reader who asks for an explanation of confounding variables and why they’re important in data science projects.

Ask a Data Scientist: Storytelling With Data

datascientist2_featured

Welcome back to our series of articles sponsored by Intel – “Ask a Data Scientist.” This week’s question is from a reader who asks about the importance of “storytelling” in data science.

CrowdFlower’s Data Enrichment Platform Supports Eight New Languages

crowdflower_logo

CrowdFlower, a leading data enrichment platform to help data scientists collect, clean and label data to make it useful, recently announced support for eight new languages: Hindi, Arabic, Indonesian, Turkish, Italian, Russian, Vietnamese, and Chinese as well as enhanced support for French, German, Portuguese and Spanish. Businesses can now tap into these new Language Crowds to enrich data that requires language proficiency.

Data Science 101: Cassandra Tutorial for Beginners

Provided by our friends over at Edureka, Module 1 of their Apache Cassandra course below discusses the fundamental concepts of using a highly-scalable, column-oriented database to implement appropriate use cases.

The Major Roadblocks Facing the Smart City

Cristian_Borcea

In this special guest feature, Cristian Borcear of NJIT reflects on the evolution of technology and public policy in support of so-called “smart cities. ” Cristian Borcea is an Associate Professor and the Associate Chair of the Department of Computer Science at New Jersey Institute of Technology.

Data Science 101: Support Vector Machines

Support Vector Machines (SVM) is an important and widely used machine learning algorithm. In order to fully understand SVMs, you need to have a fundamental understanding of how the statistical learning method functions. Here is a useful lecture on SVM coming from MIT OpenCourseware.

Glassbeam Unveils Machine Learning and Real Time Analytics Capabilities

Glassbeam, Inc., the machine data analytics company, today announced a new version of Glassbeam SCALAR tightly integrated with Apache Spark™, enhancing its cutting-edge Internet of Things (IoT) analytics platform with new capabilities around advanced machine learning and real-time analytics.