The video presentation below, “Big Data: A Data Driven Society?” by Roberto Zicari of Goethe University, is provided courtesy of the Stanford University Center for Professional Development – Colloquium on Computer Systems Seminar Series (EE380).
Welcome back to our series of articles sponsored by Intel – “Ask a Data Scientist.” This week’s question is from a reader who asks about ensemble methods and how you use them.
Hortonworks®, a leading contributor to and provider of enterprise Apache™ Hadoop®, announced an expansion of the Hortonworks Certified Technology Program. There has been tremendous momentum with 100’s of vendors in the certification program. Additionally, since its introduction just six months ago we’ve seen more than 70 technologies become YARN Ready.
This week I attended an event sponsored by my favorite Meetup group: LA Machine Learning. The topic was “Machine learning with Apache Mahout” presented by Ted Dunning, Chief Application Architect for MapR Technologies. The event was booked to capacity with 100 in attendance. [Read More...]
Most everyone has heard of “big data” – the popular term for data so massive it’s difficult to manage. Today, the volume of search engine queries, online retail sales and Twitter messages regularly exceeds the capabilities of traditional databases. There’s a complement to big data that we call “big schema”. Modern data can not only have vast quantities and fast rates, but can also have diverse structure. Big schema can arise with enterprise data models, large data warehouses and scientific data. [Read More...]