insideBIGDATA Latest News – 5/4/2020

In this regular column, we’ll bring you all the latest industry news centered around our main topics of focus: big data, data science, machine learning, AI, and deep learning. Our industry is constantly accelerating with new products and services being announced every day. Fortunately, we’re in close touch with vendors from this vast ecosystem, so we’re in a unique position to inform you about all that’s new and exciting. Our massive industry database is growing all the time, so stay tuned for the latest news items describing technology that may make you and your organization more competitive.

dotData Launches AI-FastStart Program to Make it Easy for BI Teams to Adopt AI/ML through AutoML 2.0

dotData, focused on delivering full-cycle data science automation and operationalization for the enterprise, announced dotData AI-FastStart™, a new all-inclusive bundle of technology and services that includes a one-year license to a fully-hosted version of dotData’s AutoML 2.0 platform, plus training and support.

“We are seeing a huge demand for AI and ML capabilities in the market, but finding that many companies either do not have the internal resources to launch a data science program, or don’t know how to get one started,” said Ryohei Fujimaki, founder and CEO of dotData. “The AI-FastStart™ program was created as an all-inclusive bundle to help enterprises fast-track AI/ML deployments, and immediately realize value from their data.”

Pure Storage Delivers the Native Unified, Fast File and Object Platform

Pure Storage (NYSE: PSTG), the IT pioneer that delivers storage as a service, announced the availability of FlashBlade Purity 3.0, the next generation of the advanced storage software for unstructured data. FlashBlade delivers a truly unified, scale-out fast file and object storage platform purpose-built to support modern application development, modern analytics and next-generation data protection. 

FlashBlade, Pure’s scale-out file and object storage solution, is unique in its ability to accelerate applications and enable data to be shared across applications and workloads. The latest updates to FlashBlade enable real-time performance that allows application workflows to take center stage over infrastructure – resulting in a Modern Data Experience that replaces complexities and antiquated systems with fast, flexible and agile solutions to fit modern needs.

“FlashBlade is so unique because it is designed from the ground up for tomorrow’s data pipelines and real-time analytics applications, with simplicity, ease of use and performance in mind, something that legacy storage vendors struggle with,” said Matt Burr, Vice President and General Manager of FlashBlade. “It is exceptional as a storage platform, and provides an unparalleled customer experience in the scale-out, unified file and object storage market.”

Hewlett Packard Enterprise Delivers the Cloud Experience Everywhere with General Availability of HPE GreenLake Central

Hewlett Packard Enterprise (HPE) announced that HPE GreenLake Central, unveiled in December 2019, is now generally available for HPE GreenLake customers. This advanced software platform provides customers with a consistent cloud experience for all their applications and data, through an online portal and operations console that runs, manages and optimizes their entire hybrid cloud estate. In addition, HPE is introducing enhancements to its as-a-Service portfolio with support for data management and file storage, and an enhanced co-location offering through partnerships with Cohesity, Qumulo, and CyrusOne, respectively.

Today, applications and data live everywhere – in the cloud, in the datacenter, and at the edge. Now more than ever, organizations need flexibility and choice to run, manage and optimize their applications and data where it best fits their needs. HPE GreenLake Central gives customers the agility of the cloud, visibility and control for all applications and data everywhere, and the financial flexibility to pay for only what they use so they can focus on innovation. This way businesses can use their resources effectively in order to drive efficiencies and create new experiences, while mitigating risk, compliance, and cost issues.

With the introduction of HPE GreenLake Central, HPE is bringing the self-service experience to the on-premises environment and providing powerful capabilities and benefits that allow each role in an organization to achieve better business outcomes. Customers can use the platform to monitor and take action on a range of KPIs including security, capacity, compliance, and resource utilization. They can also access their own consumption metrics and analytics in order to optimize the placement of workloads based on the factors most important to them – such as cost, governance, or dependence on other systems.

“HPE GreenLake Central is an important leap forward in the hybrid cloud customer experience market,” said Keith White, Senior Vice President and General Manager of HPE GreenLake. “HPE uniquely enables customers to gain spending, performance, governance and security visibility with a true one-pane-of-glass management for their entire environment – from the edge to the core to multiple clouds – and enables developers and line of business to create and innovate at the pace required to compete in today’s market.”

On the Right Track: AIMS API Finds Similar Songs via AI Developed by Production Music Experts

Somewhere among those 100,000 tracks in your production music catalog is the right one for that TV, film, video, or game. Trouble is, you or your client have to find it. “It’s common for people to be surprised at what they have in their catalog,” says longtime production music professional Martin Nedvěd. “People can know a few thousand tracks at most, but never the entire catalog.”

The composers and production music experts at AIMS API set out to find tracks efficiently. They harnessed the power of machine learning to find the right music fast, using a reference track as a guide. And they did it from a perspective grounded in how the production music economy works.

AIMS API was built to eliminate these challenges. It recommends and discovers great music options for these professionals in ways that surpass manual search by tags. It can detect similarity quickly at a massive scale and captures the high level of nuance required to serve music supervisors and other sync teams. Its algorithms were trained specifically for production music, focusing the company’s efforts on surfacing results that fit briefs.

“Production music has increased in quality in recent years to match and often surpass commercially released tracks,” explains Nedvěd. “Now, the biggest problem for music-for-picture professionals is how to find the right track in an endless supply of music when time is always of the essence. We think we have a solution.”

Diveplane Announces Iron Bow Technologies as GEMINAI™ Distribution Partner

Diveplane, the company keeping the humanity in artificial intelligence (AI), announced a new strategic partnership with IT solution provider Iron Bow Technologies for the distribution of the company’s proprietary GEMINAI solution.

Diveplane’s GEMINAI, which produces fully realistic and applicable synthetic data sets, will allow Iron Bow to achieve greater research depth and increased performance within its suite of technology capabilities for healthcare clients. Implementing this technology gives Iron Bow the enhancement and support it needs to continue providing customers with the most tailored and highly-efficient IT solutions.

Going far beyond basic data masking and differential privacy, Diveplane’s GEMINAI creates completely new and statistically accurate data for analysis and modeling purposes. It uses patented techniques that produce a verifiable synthetic ‘twin’ data set, which contains the same statistical properties as the original data, but without the danger of exposing any confidential or sensitive patient information.

“We are extremely proud to partner with Iron Bow Technologies, as they are true pioneers of healthcare technology,” said Mike Capps, CEO and co-founder of Diveplane. “By combining Iron Bow’s proven success in delivering best-in-class healthcare solutions with Diveplane’s ability to allow even highly-regulated industries to safely share data, this partnership opens the floodgates to massive innovations that will benefit healthcare providers and patients alike.”

VAST Data Brings The Enterprise Into The Universal Storage Era

VAST Data, a storage company breaking decades-old trade-offs, launched Version 3 of its Universal Storage architecture, which has more than 20 new features – including support for Windows and MacOS applications, cloud data replication and native encryption. These innovations, which quickly follow the company’s $100 million Series C funding round at a $1.2 billion valuation in mid-April, introduce new capabilities for mission-critical enterprise and media customers who are eager to evolve beyond the hard drive and realize the full potential of VAST’s Disaggregated, Shared-Everything (DASE) approach.

These latest Universal Storage updates allow enterprises to now marry all-flash performance with archive economics and scale to enable mission-critical and data-intensive enterprise production environments to consolidate their workflows and bring the power of flash storage and fast access to all of their data.

“VAST Data’s Universal Storage has the promise to revolutionize customer workflows and dramatically simplify the cost, capacity, scale and resilience problems that have plagued scale-out storage for over a decade,” said Aaron Cardenas, Founder and CEO of P1 Technologies – a leading media solutions reseller. “We’re delighted to partner with VAST Data after witnessing the excitement and anticipation across their customer community. Version 3 is the beginning of helping our media customers see the benefits of consolidating workflows onto Universal Storage and accelerating and simplifying the creative process.”

Nines Receives FDA Approval Enabling Democratized Access To AI-Enabled Radiology In The Telehealth Era

Nines announced the NinesAI™ medical device making world-class artificial intelligence available to the rapidly-growing telehealth industry. NinesAI is FDA-cleared and supports the automated radiological review of CT Head images for the possible presence of two time-critical, life-threatening indications – intracranial hemorrhage and mass effect – to aid radiologists in triaging cases. Nines is the first company to receive U.S. Food and Drug Administration clearance for artificial intelligence technology that triages mass effect conditions, and to its knowledge is the first company to receive simultaneous FDA clearance on multiple indications. NinesAI will be deployed to radiologists in Nines’ teleradiology practice and will also be available to Nines’ customers for in-house use at no added cost.

Radiologists suffer from high rates of burnout, according to MedScape’s Radiologist Lifestyle 2020 report, due to ever-increasing workload and suboptimal working conditions. Nines’ team includes world-class radiologists and top Silicon Valley engineers devoted to relieving those burdens via more efficient workflows that positively impact patient care. Together, they have developed the Nines Navigator™ worklist and the Nines Reading Assistant, which are administrative, non-medical device programs to improve radiologist focus. NinesAI assists radiologists by alerting them to the possible presence of intracranial hemorrhage and mass effect on head CT scans, life-threatening conditions that can be prioritized for review and consultation with hospital physicians treating patients.

“At Nines, we believe the application of advanced technology can address complex and pressing challenges in healthcare, and in particular for radiology, solve for a higher rate of burnout among radiologists,” said Nines co-founder and CEO, David Stavens. “With clearance from FDA, we’re proud to offer transformative AI innovation supporting the prioritization and triage of emergent conditions to complement radiologists’ work and ultimately improve the quality of patient care. We’re excited to partner with customers who seek cutting edge tools to deal with the conditions that matter most for patients.”

Moogsoft Expands AIOps Platform to Deliver Customers the “Virtual NOC”

Moogsoft, a leading provider of artificial intelligence for IT Operations (AIOps), announced the release of Moogsoft Enterprise 8.0, a complete AIOps platform that enables IT Ops and DevOps teams to build a virtual Network Operations Center (NOC) and work more effectively from anywhere.

Moogsoft Enterprise consolidates visibility and control of monitoring tools to help entire IT Ops and DevOps teams reduce noise, prioritize incidents, reduce escalations and ensure uptime. Working from anywhere, the IT operator can now easily analyze alerts, logs, metrics and traces to find and resolve the root cause of incidents before they become outages.

With version 8.0, customers can create a Virtual NOC using the Moogsoft Situation Room to collaborate throughout the incident management process, and diagnose and resolve problems quickly, regardless of team members’ physical location. The platform also provides IT Ops teams a single-pane-of-glass to replace multiple screens each dedicated to different monitoring tools.

“In today’s business environment, AIOps is critical to continually assuring uptime of the digital services and experiences that connect companies with their customers,” said Moogsoft Founder and CEO Phil Tee. “Moogsoft Enterprise 8.0 allows IT Ops and DevOps teams working from anywhere to regain visibility and control over the scale and complexity of those services.”

GigaSpaces Announces Version 15.2, Doubling Performance, Simplifying Operations and Improving Security 

GigaSpaces, the provider of InsightEdge, the fast in-memory real-time analytics processing platform, announced the release of GigaSpaces 15.2, which doubles performance and introduces the GigaOps Stack for unprecedented DataOps functionality that simplifies and streamlines modern application development and deployment. Version 15.2 increases the velocity, reliability and security of data processing and analytics in alignment with business goals, while freeing enterprises to develop and deploy modern time-sensitive and data-driven applications that support their digital transformation initiatives.

“Now, more than ever, the ability to effectively unlock business insights through the enterprises’ data that is dispersed among different platforms is imperative to create and maintain a competitive edge,” said Yoav Einav, Vice President Product at GigaSpaces. “With Version 15.2 we continue to deliver on our goal to provide extreme performance and simplify the complexities of data lifecycle and management with a modern and cloud-native DataOps approach.”

Franz AllegroGraph 7 Powers First Distributed Semantic Knowledge Graph Solution with Federated-Sharding

Franz Inc., an early innovator in Artificial Intelligence (AI) and leading supplier of Semantic Graph Database technology for Knowledge Graph Solutions, announced AllegroGraph 7, a breakthrough solution that allows infinite data integration through a patented approach unifying all data and siloed knowledge into an Entity-Event Knowledge Graph solution that can support massive big data analytics. AllegroGraph 7 utilizes unique federated sharding capabilities that drive 360-degree insights and enable complex reasoning across a distributed Knowledge Graph. Hidden connections in data are revealed to AllegroGraph 7 users through a new browser-based version of Gruff, an advanced visualization and graphical query builder.

“Large enterprises have Knowledge Graphs that are so big that no amount of vertical scaling will work,” said Jans Aasman, CEO of Franz Inc. “When these organizations want to conduct new big data analytics, it requires a new effort by the IT department to gather semi-usable data for the data scientists, which can cost millions of dollars, waste valuable time and still not provide a holistic data architecture for querying across all data. ETL, Data Lakes and Property Graphs only exacerbate the problem by creating new data silos. AllegroGraph 7 takes a holistic approach to mixed data, unifying all enterprise data with domain knowledge, including taxonomies, ontologies and industry knowledge – making queries across all data possible, while simplifying and accelerating feature extraction for machine learning.”

Teradata Announces General Availability of Vantage Customer Experience

Teradata (NYSE: TDC), the cloud data and analytics company, announced that Vantage Customer Experience (CX) is now generally commercially available. Vantage CX helps brands deliver relevant, personalized experiences in real time, across all customer interactions and touch points. By partnering with Celebrus and integrating real-time interaction data into Vantage CX, Teradata customers can instantly access the highest-quality customer behavior data from across digital channels to create personalized, optimized and orchestrated customer experiences at scale.

“Delivering optimized experiences at the speed customers expect is a top priority for brands across every industry. But with the growing complexity of a fractured digital landscape, this has become increasingly challenging,” said Martyn Etherington, CMO at Teradata. “With Vantage CX — underpinned by Celebrus’ customer interaction data — we are able to address this challenge by giving enterprises the best possible data, in real time, from across all lines of business and customer touchpoints. This allows marketers to seamlessly operationalize insights across all their customer channels, delivering more value to their customers, and ultimately, increasing their bottom line.”

ChaosSearch 2.0, Industry’s First Data Lake Engine for Scalable Log Analysis

ChaosSearch announced its next-generation platform, ChaosSearch 2.0, the industry’s first data lake engine for scalable log analysis. ChaosSearch 2.0 instantly turns a company’s own cloud data lake into a hot, robust, streamlined analytics engine that speeds time-to-insights and cuts log analysis costs by up to 80 percent. It uniquely enables companies to analyze petabytes of data without adding compute or performing complex, labor-intensive processes, and without limiting data retention.

Being able to analyze hundreds of terabytes to petabytes of log data is critical to keeping applications, IT systems, and cloud environments running. But conventional log and data analytics solutions, such as those built on open source toolsets like Elasticsearch, have fixed architectures that make it impossible to cost-effectively handle very large data volumes and retain them for long periods. Legacy “big data” solutions like Hadoop are equally expensive and complex.

“ChaosSearch 2.0 takes a completely different, entirely new approach. Built from the ground up to achieve the true promise of cloud data lakes, ChaosSearch makes it as easy for customers to get insights out of their lake as it is to dump data into it,” said Thomas Hazel, Founder and CTO of ChaosSearch. “While other solutions require DBAs and data engineers to set up new workloads, extract data from storage, manually transform it, and then load it into a vendor’s analytic database, ChaosSearch 2.0 customers simply stream any amount of data into their own Amazon S3 data lake, where our solution automatically transforms and analyzes it. Our distributed architecture and proprietary indexing and compression technologies enable businesses to gain new and better insights, quickly and at a fraction of the cost.”

IBM announces Elyra AI Toolkit, a set of AI-centric extensions to Jupyter Notebooks

Jupyter Notebooks are now the open standard for data science and artificial intelligence (AI) model development. In keeping with our commitment to open source and the Jupyter community, in particular, IBM is proud to announce Elyra, a set of open source AI-centric extensions to Jupyter Notebooks, and, more specifically, the new JupyterLab user interface.

“I’m excited to see IBM engaging with the Jupyter and scientific open source communities with their ongoing contributions in this area,” said Fernando Pérez, Project Jupyter co-Founder and co-Director. “As much as AI tools rely on vast amounts of data and computational resources, the human in the loop remains the critical element for both asking the right questions and making decisions responsibly. That’s where Jupyter plays a role, and I’m delighted to see new tools like Elyra that will help AI workflows within the Jupyter ecosystem.”

Igneous launches new capabilities for DataDiscover and DataFlow

Igneous, the SaaS data management company for file-intensive environments, announced enhancements to DataDiscover and DataFlow.

Igneous DataFlow now enables data-heavy organizations to move data in native format from where it lives to where it needs to be, on-premises or cloud. Igneous DataDiscover catalogs all data where it lives. Now organizations can enable end users to see their data their way and find what’s important to them.

“From the beginning, Igneous was working to manage unstructured data at scale through a suite of services: DataProtect, DataDiscover, and DataFlow,” said Kiran Bhageshpur, CEO, Igneous. “With these new enhancements, organizations can take control of their data footprint to reduce complexity, increase productivity through search, move data fast to where it’s needed, and leverage the cloud to reduce operational costs.”

Couchbase Allows Organizations to Minimize Operational Costs, Accelerate Time to Market, and Embrace Hybrid Cloud Strategy with Autonomous Operator for Kubernetes 2.0 

Couchbase, the creator of the enterprise-class, multicloud to edge NoSQL database, announced Version 2.0 of the Couchbase Autonomous Operator for Kubernetes (“Autonomous Operator”). As enterprises embrace the potential of the cloud, DevOps, and microservices, Autonomous Operator Version 2.0:

  • Minimizes the operational cost of developing in Kubernetes by automating the deployment and management of hundreds of clusters across multiple setups, regions, private and public clouds
  • Reduces risk by giving users greater visibility of and control over the Autonomous Operator
  • Accelerates services’ time to market by eliminating silos in microservices infrastructure
  • Makes adopting a hybrid or multi-cloud strategy easier than ever by standardizing performance across any cloud – preventing vendor lock-in

Alluxio Launches Enhanced Hybrid Cloud Solution Based on Intel Optane Persistent Memory

Alluxio, the developer of open source cloud data orchestration software, announced a go-to-market solution in collaboration with Intel to offer an in-memory acceleration layer with 2nd Gen Intel Xeon Scalable processors and Intel Optane persistent memory (PMem). The solution eliminates performance degradation of analytics clusters that are increasingly built on disaggregated compute and storage architecture.

“Today’s disaggregated cloud storage lacks efficient file system semantics support like ‘rename’. Additionally, disaggregated cloud storage typically can’t leverage compute side storage media such as DRAM and SSD for use as buffers and page caches,” said Haoyuan Li, founder and CEO, Alluxio. “Adding Alluxio Data Orchestration System and Intel’s Optane persistent memory solves both issues, enabling maximum benefit for cloud storage and achieving competitive and even better performance than traditional on-premises configurations. This is particularly helpful for hybrid cloud environments when data is remote.”
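The "rename" point in the quote above can be illustrated with a toy model. The sketch below is not Alluxio's implementation and does not call any real storage API; the class and function names (`FlatObjectStore`, `TreeFileSystem`, `compare`) are hypothetical. It assumes an object store that keeps a flat key-to-data map, so renaming a "directory" means copying and deleting every object under that prefix, while a file system moves a single directory entry.

```python
from typing import Dict, Tuple


class FlatObjectStore:
    """Toy model of an object store: a flat map from full key to bytes."""

    def __init__(self) -> None:
        self.objects: Dict[str, bytes] = {}
        self.operations = 0  # counts per-object copy and delete requests

    def put(self, key: str, data: bytes) -> None:
        self.objects[key] = data

    def rename_prefix(self, old: str, new: str) -> None:
        # There is no directory entry to move, so every matching object
        # is copied to its new key and the old key is deleted.
        for key in [k for k in self.objects if k.startswith(old)]:
            self.objects[new + key[len(old):]] = self.objects[key]
            del self.objects[key]
            self.operations += 2  # one copy plus one delete


class TreeFileSystem:
    """Toy model of a file system: named directories that hold files."""

    def __init__(self) -> None:
        self.dirs: Dict[str, Dict[str, bytes]] = {}
        self.operations = 0

    def put(self, directory: str, name: str, data: bytes) -> None:
        self.dirs.setdefault(directory, {})[name] = data

    def rename_dir(self, old: str, new: str) -> None:
        # Only the directory entry moves; file contents are untouched.
        self.dirs[new] = self.dirs.pop(old)
        self.operations += 1


def compare(n_files: int) -> Tuple[int, int]:
    """Return (object-store operations, file-system operations) for one rename."""
    store, fs = FlatObjectStore(), TreeFileSystem()
    for i in range(n_files):
        store.put(f"tmp/part-{i}", b"x")
        fs.put("tmp", f"part-{i}", b"x")
    store.rename_prefix("tmp/", "final/")
    fs.rename_dir("tmp", "final")
    return store.operations, fs.operations


if __name__ == "__main__":
    print(compare(1000))
```

In this model the object-store cost grows with the number of files under the prefix, while the file-system cost stays constant. Analytics jobs that write output to a temporary directory and then rename it on commit are one place where that difference tends to show up.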

Sigma Computing Adds New Features to Maximize Data’s Value Across the Enterprise and Empower More Teams

Sigma Computing, an innovator in cloud-native analytics and business intelligence (A&BI), has extended the power of Sigma to be used throughout the cloud data analytics stack, solidifying it as the single source of truth for data across entire organizations. With this feature, Sigma is the first to provide non-technical users with the ability to create a data set and write it back to the cloud data warehouse (CDW) for use across the organization without needing to write code. The company is also jump-starting analysis and expediting time to insights with the launch of its first template for customers on Snowflake.

“The proliferation of SaaS tools has not only resulted in mountains of data but also a number of applications that you need to be able to access all that data in,” said Rob Woollen, CEO and co-founder, Sigma Computing. “With Dataset Warehouse Views, organizations can now rely on Sigma for datasets and analyses wherever they need them. IT and data teams will also no longer have to make the false choice between a portfolio of best-in-class data tools and settling for less performance in a single vendor solution to aid data management because Sigma can easily sit at the center of an organization’s cloud data ecosystem, connecting all the dots and maximizing data’s value.”

AWS Announces General Availability of Amazon Keyspaces (for Apache Cassandra)

Amazon Web Services, Inc. (AWS), an Amazon.com company (NASDAQ: AMZN), announced the general availability of Amazon Keyspaces (for Apache Cassandra), a scalable, highly available, and fully managed database service for Cassandra workloads. Amazon Keyspaces supports the same application code, Apache 2.0 licensed drivers, and developer tools that customers running Cassandra workloads use today. Customers can easily migrate on-premises Cassandra workloads to the cloud, without the worry of managing underlying infrastructure, while realizing superior scalability, availability, and manageability. With Amazon Keyspaces, there are no servers to manage, no need to provision, configure, and operate large Cassandra clusters, no need to manually add or remove nodes, and no need to rebalance partitions as traffic scales up or down. There are no up-front investments required to use Amazon Keyspaces, and customers only pay for the capacity they use.

“Many customers have self-managed Cassandra on Amazon EC2 or on-premises for some time, and these customers tell us that managing large Cassandra clusters is difficult because it requires specialized expertise to set up, configure, and maintain the underlying infrastructure, and necessitates a deep understanding of the entire application stack, including the Apache Cassandra open source software,” said Shawn Bice, Vice President, Databases, AWS. “Amazon Keyspaces gives customers the ability to run Cassandra without having to worry about managing the underlying hardware, and because it’s also serverless, customers can stand up Cassandra clusters in minutes and scale their database up and down with ease based on the needs of their application.”

Fairwinds Insights Now Generally Available: Software Helps Enterprises Run Kubernetes Securely and Efficiently at Scale

Fairwinds, the Kubernetes enablement company, introduced the general availability (GA) of Fairwinds Insights, a configuration validation platform that enables engineering and DevOps teams to run applications on Kubernetes securely, efficiently, and reliably. Now GA, the platform has been tried and tested during a five-month public beta, helping dozens of organizations prioritize and remediate configuration security, efficiency and reliability risks.

Fairwinds Insights combines trusted open source tools, toolchain integrations, and Fairwinds’ Site Reliability Engineer (SRE) expertise based on hundreds of successful Kubernetes deployments. While traditional infrastructure monitoring tools only react to system problems, Fairwinds Insights provides continuous visibility, prioritized recommendations, and collaboration tools. It enables teams to apply Kubernetes configuration best practices while moving applications from development to production.

“Kubernetes is maturing and while the difficult issues are no longer just standing up a cluster, companies still want to be sure they’re adopting it securely and efficiently,” said Kendall Miller, President, Fairwinds. “This is especially problematic as engineering organizations expand Kubernetes to multiple teams and rely on Kubernetes infrastructure for mission-critical applications. It’s not uncommon for DevOps teams to be under-staffed and lack the bandwidth to manually inspect every change introduced by a variety of engineering teams. That’s why we’ve built Fairwinds Insights to proactively monitor Kubernetes and container configurations and suggest improvements, helping DevOps teams avoid problems before they arise.”

Sourcegraph Releases Free Digital Children’s Book Titled “Our ABCs: Always Be Coding”

Sourcegraph, a leader in Universal Code Search, announced the release of a digital children’s book titled Our ABCs: Always Be Coding. With most people working from home during national Take Our Daughters and Sons to Work Day, the book provides a unique opportunity for software developers to teach their children about their work. Developers with young children know the struggle of keeping kids busy and learning during this time, but could be running out of ways to do so. Our ABCs: Always Be Coding is a fun way to keep kids engaged while teaching them the core concepts of software development.

“Dinosaurs, llamas, and dump trucks are cool—but so is coding,” said Quinn Slack, CEO and co-founder of Sourcegraph. “We love software development and wrote this book as a fun way to share our passion for coding with our kids, so they know what we’re doing all day working on the computer.”

New PostgreSQL Acceleration Software from Swarm64 Boosts Open Source Database Performance

Swarm64, a leader in database acceleration solutions for the PostgreSQL open source database, announced the availability of Swarm64 DA 4.0, database acceleration software that extends PostgreSQL with the ability to analyze data orders of magnitude faster than usual, even as data streams into the database. This enables more people to choose free open source PostgreSQL over costly proprietary databases for new analytic and data warehouse modernization projects.

“PostgreSQL is immensely popular and battle-hardened as a primary database, yet, for performance reasons, it’s historically less popular in data warehousing and analytics. We’re on a mission to change that,” said Thomas Richter, CEO and co-founder of Swarm64. “Now, when extended with the Swarm64 DA accelerator, PostgreSQL runs orders of magnitude faster because we’ve tripled the acceleration effects in release 4.0. This enables people to cut database costs by 75% or more by using free, open source PostgreSQL in a greater number of projects and meet performance requirements that, until now, only costly, proprietary data warehouse and analytic databases could satisfy.”
