Book Review: Hands-On Exploratory Data Analysis with Python
The new data science title “Hands-On Exploratory Data Analysis with Python,” by Suresh Kumar Mukhiya and Usman Ahmed from Packt Publishing, is a welcome addition to the growing list of books aimed at helping newbie data scientists improve their skills. I’m always on the lookout for texts that can help my students find their way along the challenging path toward becoming a data scientist. I think this book fills a void for Exploratory Data Analysis (EDA) learning resources.
Featured Stories
How AI and Machine Learning Will Shape Software Testing
In this special guest feature, Erik Fogg, Chief Operating Officer at ProdPerfect, covers some of the main benefits of adding AI to the software testing process, and why you should consider adding it to yours if you haven’t already. ProdPerfect is an autonomous E2E regression testing solution that leverages live user behavior data.
Open Source Innovations to Be Unveiled at Subsurface LIVE Winter 2021 Cloud Data Lake Conference
Dremio, the innovation leader in data lake transformation, announces the speaker lineup and full agenda for Subsurface LIVE Winter 2021, a two-day, live conference about the future of the cloud data lake industry. The virtual event takes place January 27-28, 2021, and features keynotes from senior executives of AWS, Tableau and Dremio, as well as 30+ technical sessions on the open source innovations, trends and strategies driving cloud data lake transformation and architectures.
insideBIGDATA Latest News – 1/26/2021
In this regular column, we’ll bring you all the latest industry news centered around our main topics of focus: big data, data science, machine learning, AI, and deep learning. Our industry is constantly accelerating, with new products and services being announced every day. Fortunately, we’re in close touch with vendors from this vast ecosystem, so we’re in a unique position to inform you about all that’s new and exciting. Our massive industry database is growing all the time, so stay tuned for the latest news items describing technology that may make you and your organization more competitive.
Featured Resource

Real-Time Analytics from Your Data Lake: Teaching the Elephant to Dance
This whitepaper from Imply Data Inc. introduces Apache Druid and explains why delivering real-time analytics on a data lake is so hard, approaches companies have taken to accelerate their data lakes, and how they leveraged the same technology to create end-to-end real-time analytics architectures.
All Recent News
- How AI and Machine Learning Will Shape Software Testing
- Book Review: Hands-On Exploratory Data Analysis with Python
- Open Source Innovations to Be Unveiled at Subsurface LIVE Winter 2021 Cloud Data Lake Conference
- insideBIGDATA Latest News – 1/26/2021
- How Enterprises Can Extract Meaningful Insights from Unstructured Data
- Data Science Salon for Healthcare, Finance & Technology
- Video Highlights: Content Driven Advertising using First Party Data
- Genesis of a Model Intelligence Platform – Truera
- XAI: Are We Looking Before We Leap?
- Best Practices for Data Search, Aggregation and Security in a Global Pandemic
- Panel Discussion: Needed Data Skills for 2021
- NoSQL vs SQL: Key Differences
- 5 Tips for Making Data Work for Your Business
- 2021 Trends in Blockchain: Mainstream Adoption at Last
- Using AI for Contract Management
- Molecula Secures $17.6 Million in Series A Funding to Democratize Machine-Scale Analytics and AI
- GridGain 8.8 Advances Its Multi-Tier Database Engine to Scale Beyond Available Memory Capacity and Meet Growing Customer Demand
- When Big Data Collides with Intellectual Property Law
- How AI Will Shape the Future of Customer Communications
- Driving with Data: How AI is Personalizing the Auto Insurance Industry and Saving Lives
- AI-driven Platform Identifies and Remediates Biases in Data
- Interview: Kathy Baxter, Architect of Ethical AI Practice at Salesforce
- “Above the Trend Line” – Your Industry Rumor Central for 1/11/2021
- Feature Stores are Critical for Scaling ML Initiatives and Accelerating both Top-line and Bottom-line Impact
- We All Know about AI in Medicine By Now. Here’s Why It Really Matters.
- Why 2021 is The Year of Low-Code
- Kyligence CEO Identifies Top Big Data, Cloud, and Data Analytics Predictions for 2021
- 2020’s Biggest Stories in AI
- MLOps Brings Best Practices to Developing Machine Learning
- Best of arXiv.org for AI, Machine Learning, and Deep Learning – December 2020
Industry Perspectives
Best Practices for Data Search, Aggregation and Security in a Global Pandemic
In this special guest feature, Kelly Griswold, Chief Operating Officer at Onna, takes a look at what would happen if we were able to access data and use it to create meaningful conclusions about teams and other workflows throughout the organization.
Using AI for Contract Management
In this special guest feature, Sunu Engineer, Principal Architect at Icertis, discusses using AI for Contract Lifecycle Management. Done right, AI for contract management has the potential to empower organizations to stay out front by turning repositories of contracts into indispensable strategic advantages.
Featured from insideHPC
- HPE to Build $35M+ NCAR Supercomputer for Extreme Weather Research: Hewlett Packard Enterprise (HPE) this morning said it has won a $35+ million contract to build a supercomputer for the National Center for Atmospheric Research (NCAR), a federal geoscience R&D center for meteorology, climate change and solar activity. HPE said the CPU/GPU-powered system, funded by the National Science Foundation, is expected to deliver 3.5x the […]
News from insideHPC
- VNET Lends Joins I4DI
- UF’s HiPerGator AI: Nvidia Supercomputer Offered to Students and Researchers across State University System
- Classiq Secures $14.5M Series A Funding for Development of Quantum Computing Software
- Verne Global and Sensa Join to Offer Certified Nvidia DGX System-Based Data Center Solutions
- HPE to Build $35M+ NCAR Supercomputer for Extreme Weather Research
- Western Digital and Qumulo Partner on Capacity and Scale for IHME COVID-19 Health Analytics and Vaccine Roll Out
- Exascale Computing Project: Researchers Accelerate I/O with Novel Processing Method
Editor’s Choice
The insideBIGDATA IMPACT 50 List for Q1 2021
The team here at insideBIGDATA is deeply entrenched in following the big data ecosystem of companies from around the globe. We’re in close contact with most of the firms making waves in the technology areas of big data, data science, machine learning, AI and deep learning. Our inbox is filled each day with new announcements, commentaries, and insights about what’s driving the success of our industry, so we’re in a unique position to publish our quarterly IMPACT 50 List of the most important movers and shakers in our industry. These companies have proven their relevance by the way they’re impacting […]
Big Data Industry Predictions for 2021
2020 has been a year for the ages, with so many domestic and global challenges. But the big data industry has significant inertia moving into 2021. In order to give our valued readers a pulse on important new trends leading into next year, we here at insideBIGDATA heard from all our friends across the vendor ecosystem to get their insights, reflections and predictions for what may be coming.
Hats Over Hearts
It is with great sadness that we announce the death of Rich Brueckner. His passing is an unexpected and enormous blow to both his family and the HPC Community. Rich was an institution in the HPC community. You couldn’t go to an event without seeing his red hat bobbing in the crowd, usually trailed by a fast-moving video crew. He’d be darting into booths, conducting interviews, and then speeding away to his next appointment.
What is the Difference Between Business Intelligence, Data Warehousing and Data Analytics
In this contributed article, Christopher Rafter, President and COO at Inzata, writes that in the age of Big Data, you’ll hear a lot of terms tossed around. Three of the most commonly used are “business intelligence,” “data warehousing” and “data analytics.” You may wonder, however, what distinguishes these three concepts from each other, so let’s take a look.
The Difference Between Data Science and Data Analytics
In this contributed article, tech writer Rick Delgado examines the differences between the terms data science and data analytics. People working in the tech field or related industries probably hear these terms all the time, often used interchangeably. Although they may sound similar, the terms are often quite different and have differing implications for business.