At the recent Hadoop Summit 2015 in San Jose, I had the opportunity to sit down with Russell Foltz-Smith, VP of Data Platform at TrueCar, Inc., to discuss his company’s use of data in general and Hadoop specifically.
Qubole, the big data-as-a-service company, announced that Station X, a leading developer of technologies that make large-scale human genome management and analysis easier, is using Presto on Qubole’s cloud-based big data platform to power GenePool™, a powerful software-as-a-service solution for real-time analytics of genomic and medical information.
Cyber security startup Niara emerged from stealth to unveil its Security Intelligence solution, the first of its kind to combine advanced security analytics and forensics to help security teams quickly surface sophisticated cyber threats within their organization. Built on a big data architecture, the Niara Security Intelligence solution analyzes security data from disparate sources.
Dickey’s Barbecue Pit Gains Operational Insight across 500 Stores with Advanced Big Data Analytics in the Cloud
Business intelligence (BI) and data warehouse solutions provider iOLAP Inc. has combined Syncsort’s DMX data integration software with Yellowfin’s BI platform and Amazon Redshift for data warehousing to provide one of America’s most successful food chains, Dickey’s Barbecue Pit, with deep operational insight across its 500 U.S. restaurants.
Data lakes are enterprise-wide data management platforms designed for storing and analyzing vast amounts of information from disparate data sources in its native format. The idea is to place data into a data lake in its native structure rather than loading it into a repository built for a specific purpose, such as a data warehouse or data mart.
Today's platforms include open source solutions for big data (like Hadoop), GIS mapping solutions (like ESRI), SAP HANA, and BigTable, a highly scalable distributed storage system used in Google Maps. None of these solutions, however, scales to the requirements of data with space and time attributes or is designed to enable real-time decision-making using this data.