The latest big data news and articles

Why Accelerating Data Engineering Across Public Clouds and Private Data Centers is a Game Changer

In this contributed article, Rob Gibbon, Product Manager at Canonical, suggests that data engineers typically know what they need to get done. The problem is that their environment doesn’t always make it easy. If you’re working on premise, it can be hard to get data-intensive solutions off the ground quickly. However, cloud solutions come with lock-in and unpredictable pricing. The game-changer in this scenario is a hybrid solution that will allow you to accelerate data engineering.

Video Highlights: Deep Reinforcement Learning for Maximizing Profits — with Prof. Barrett Thomas

In this video presentation, our good friend Jon Krohn, Co-Founder and Chief Data Scientist at the machine learning company Nebula, is joined by Dr. Barrett Thomas, an esteemed Research Professor in at the University of Iowa’s College of Business, to delve deep into Markov decision processes and how they relate to Deep Reinforcement Learning.

The Power of Data Visualization: Techniques and Best Practices

In this contributed article, freelance writer Ainsley Lawrence discusses how data visualization is a powerful tool that can help viewers quickly analyze and assess the status or results of an analysis. Good visualization can make even the largest and most complex datasets relatively straightforward to interpret.

Data Insights are Illuminating the Future of the Power Sector

In this contributed article, David Thomason, Industry Principal – Power Generation at AVEVA, believes that the power sector has more data than ever on nearly every process in its value chain. Now, new technologies are helping make sense of all those details to provide competitive advantages – and it’s not a moment too soon.

Heard on the Street – 4/11/2024

Welcome to insideBIGDATA’s “Heard on the Street” round-up column! In this regular feature, we highlight thought-leadership commentaries from members of the big data ecosystem. Each edition covers the trends of the day with compelling perspectives that can provide important insights to give you a competitive advantage in the marketplace.

How Can Companies Protect their Data from Misuse by LLMs? 

In this contributed article, Jan Chorowski, CTO at AI-firm Pathway, highlights why LLM safety begins at the model build and input stage, rather than the output stage – and what this means in practice; how LLM models can be engineered with safety at the forefront, and the role that a structured LLM Ops model plays; and the role of data chosen to train models, and how businesses can appropriately select the right data to feed into LLMs

Conversational Internet is Digitizing the Other Half of the World

In this contributed article, Beerud Sheth, co-founder and CEO of Gupshup, discusses the future of Generative AI beyond ChatGPT. In this new era of Conversational Internet, the chatbot is the new website, and the messaging app the new browser

Bringing DAG and IGA Together for Improved Security and Compliance

In this contributed article, Ronald Zierikzee, senior solutions consultant for Benelux, Omada, examines crucial tools that work in tandem to ensure that remote workers, employees and contractors are able to access the information they need – and only that information – in a secure and successful manner. Identity governance and administration (IGA) helps manage user identities and access across an enterprise, helping improve visibility into access privileges and helping to implement the necessary controls to prevent inappropriate or risky access. Data access governance (DAG) is the process of managing and controlling access to an organization’s data resources.

Opaque Systems Extends Confidential Computing to Augmented Language Model Implementations 

In this contributed article, editorial consultant Jelani Harper discusses how Opaque Systems recently unveiled Opaque Gateway, a software offering that broadens the utility of confidential computing to include augmented prompt applications of language models. One of the chief use cases of the gateway technology is to protect the data privacy, data sovereignty, and data security of organizations’ data that frequently augments language model prompts with enterprise data sources.

Video Highlights: Gradient Boosting: XGBoost, LightGBM and CatBoost — with Kirill Eremenko

In this video presentation, our good friend Jon Krohn, Co-Founder and Chief Data Scientist at the machine learning company Nebula, is joined by Kirill Eremenko to walk listeners through why decision trees and random forests are fruitful for businesses, and he offers hands-on walkthroughs for the three leading gradient-boosting algorithms today: XGBoost, LightGBM, and CatBoost.