Sign up for our newsletter and get the latest big data news and analysis.

TOP 10 insideBIGDATA Articles for July 2020

In this continuing regular feature, we give all our valued readers a monthly heads-up for the top 10 most viewed articles appearing on insideBIGDATA. Over the past several months, we’ve heard from many of our followers that this feature will enable them to catch up with important news and features flowing across our many channels.

Book Review: Deep Learning with TensorFlow 2 and Keras

If you’re a data scientist who has been wanting to break into the deep learning realm, here is a great learning resource that can guide you through this journey. It’s pretty much an all-inclusive resource that includes all the popular methodologies upon which deep learning depends: CNNs, RNNs, RL, GANs, and much more. The glue that makes it all work is represented by the two most popular frameworks for deep learning pratcitioners, TensorFlow and Keras.

“Above the Trend Line” – Your Industry Rumor Central for 7/29/2020

Above the Trend Line: your industry rumor central is a recurring feature of insideBIGDATA. In this column, we present a variety of short time-critical news items grouped by category such as M&A activity, people movements, funding news, financial results, industry alignments, customer wins, rumors and general scuttlebutt floating around the big data, data science and machine learning industries including behind-the-scenes anecdotes and curious buzz.

Book Review: Bayesian Statistics the Fun Way by Will Kurt

“Bayesian Statistics the Fun Way: Understanding Statistics and Probability with Star Wars, Lego, and Rubber Ducks,” by Will Kurt (2019 No Starch Press) is an excellent introduction to subjects critical to all data scientists. Will Kurt, in fact, is a data scientist! I always advise my data science classes at UCLA to engage these important subjects in order to obtain a well-rounded exposure to disciplines upon which data science is based. I’ve already added this title to my official bibliography of learning resources given to my students.

“Above the Trend Line” – Your Industry Rumor Central for 7/17/2020

Above the Trend Line: your industry rumor central is a recurring feature of insideBIGDATA. In this column, we present a variety of short time-critical news items grouped by category such as M&A activity, people movements, funding news, financial results, industry alignments, customer wins, rumors and general scuttlebutt floating around the big data, data science and machine learning industries including behind-the-scenes anecdotes and curious buzz.

Best of arXiv.org for AI, Machine Learning, and Deep Learning – June 2020

In this recurring monthly feature, we will filter all the recent research papers appearing in the arXiv.org preprint server for subjects relating to AI, machine learning and deep learning – from disciplines including statistics, mathematics and computer science – and provide you with a useful “best of” list for the month.

insideBIGDATA Latest News – 7/14/2020

In this regular column, we’ll bring you all the latest industry news centered around our main topics of focus: big data, data science, machine learning, AI, and deep learning. Our industry is constantly accelerating with new products and services being announced everyday. Fortunately, we’re in close touch with vendors from this vast ecosystem, so we’re in a unique position to inform you about all that’s new and exciting. Our massive industry database is growing all the time so stay tuned for the latest news items describing technology that may make you and your organization more competitive.

The insideBIGDATA IMPACT 50 List for Q3 2020

The team here at insideBIGDATA is deeply entrenched in following the big data ecosystem of companies from around the globe. We’re in close contact with most of the firms making waves in the technology areas of big data, data science, machine learning, AI and deep learning. Our in-box is filled each day with new announcements, commentaries, and insights about what’s driving the success of our industry so we’re in a unique position to publish our quarterly IMPACT 50 List of the most important movers and shakers in our industry. These companies have proven their relevance by the way they’re impacting the enterprise through leading edge products and services. We’re happy to publish this evolving list of the industry’s most impactful companies!

Book Review: The Art of Statistics – How to Learn from Data by David Spiegelhalter

This recent title, “The Art of Statistics – How to Learn from Data,” by University of Cambridge statistician David Spiegalhalter, is an important book on a number of fronts. I particularly appreciated the topics covered in the book that touch on important parts of the Data Science Process: data visualization, linear regression, logarithmic scales, Pierson correlation coefficient, data distributions, logistic regression, ROC curves, classification trees, over-fitting, bootstrap, probability theory, probability distributions, Bayes theory, and much more. I think new data scientists should engage a gentle introduction of these topics before diving into mathematical theory and code.

Video Highlights: Maching Learning in R – Three Reasons to Use tidymodels

The following video presentation comes from my favorite Meetup group “LA R users group”, a 2,200+ member group that puts on some amazing virtual presentations. This talk centers around that fact that modeling and machine learning in R involve a bewildering array of heterogeneous packages, and establishing good statistical practice is challenging in any language. The tidymodels collection of packages offers a consistent, flexible framework for your modeling and machine learning work to address these problems.