Sign up for our newsletter and get the latest big data news and analysis.

Video Highlights: How to Set Up a Remote Data Science Team

The talk below was part of a joint webinar with Appsilon and RStudio on July 28, 2020. In the presentation, Appsilon Senior Data Scientist Olga Mierzwa-Sulima explains best practices for data science teams – whether they are working in the office together or fully remote.

Video Highlights: BigQuery + Notebooks: Building an Analytics Pipeline on Kaggle

Your architecture choices impact how efficiently you’re able to use your data. In this “Snapshots” video produced by Kaggle, Data Scientist Wendy Kan demonstrates how she incorporates BigQuery and Kaggle Notebooks into her workflow. Watch her create an interactive network analysis graph that explores the most commonly installed Python packages!

Video Highlights: The Soul of a New AI

Joseph Blue is a Customer-Facing Data Scientist at DataRobot and has 20 years of hands-on experience including concentrations in financial services & healthcare. Prior to DataRobot, he built solutions for United Healthcare, HNC Software, ID Analytics and MapR. Currently helping to democratize data science with Automated Machine Learning (AutoML) – one organization, indivisible, with insights and value for all. Joseph Blue was speaking at ODSC Europe 2019 in the presentation below.

Research Highlights: ExBERT

In the insideBIGDATA Research Highlights column we take a look at new and upcoming results from the research community for data science, machine learning, AI and deep learning. Our readers need to get a glimpse for technology coming down the pipeline that will make their efforts more strategic and competitive. In this installment we review a new paper: EXBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models by researchers from the MIT-IBM Watson AI Lab and Harvard.

Video Highlights: Maching Learning in R – Three Reasons to Use tidymodels

The following video presentation comes from my favorite Meetup group “LA R users group”, a 2,200+ member group that puts on some amazing virtual presentations. This talk centers around that fact that modeling and machine learning in R involve a bewildering array of heterogeneous packages, and establishing good statistical practice is challenging in any language. The tidymodels collection of packages offers a consistent, flexible framework for your modeling and machine learning work to address these problems.

Video Highlights: Delivering the Enterprise Data Cloud

In the video presentation below from the O’Reilly Strata Data Conference, Arun Murthy, co-founder of Hortonworks and current CPO of Cloudera, discusses how enterprises can extract and act on big data.

UCLA DataFest Winners Announced, Presentations Posted

For the annual UCLA DataFest, student worked hard with data pertaining to the monumental challenge we are all facing: COVID-19. This year’s virtual version of ASA DataFest at UCLA brought forth unforeseen challenges and wonderful opportunities. This beloved tradition is generally a competition wherein groups of three to five students have just 48 hours to make sense of a huge data set and present their findings in five minutes, using just two slides.

Video Highlights: Machine Learning and Causal Inference

For our current edition of “Video Highlights” I’d like to offer this talk that will review a series of recent papers that develop new methods based on machine learning methods to approach problems of causal inference, including estimation of conditional average treatment effects and personalized treatment assignment policies. Approaches for randomized experiments, environments with unconfoundedness, instrumental variables, and panel data will be considered.

Video Highlights: Pi and The Art of Computer Programming

For our current edition of “Video Highlights” I’d like to offer Donald Knuth’s 25th Annual Christmas Lecture (2019): “Pi and The Art of Computer Programming.” Professor Knuth is the Professor Emeritus of the Art of Computer Programming at Stanford University. Dr. Knuth’s classic programming texts include his seminal work The Art of Computer Programming, Volumes 1-3, widely considered to be among the best scientific writings of the century.

AI Under the Hood: DecorMatters

In this installment of “AI Under the Hood” I introduce Silicon Valley-based DecorMatters, a compelling creativity-sharing ecosystem that brings together interior designers and furniture shoppers to make any home renovation project easier and more affordable. Founded in 2016, DecorMatters is powered by augmented reality and AI technology, and is redefining the interior design and furniture shopping experience. The design platform is a place to create and share, as well as socialize and shop.