The insideBIGDATA IMPACT 50 List for Q2 2021

Print Friendly, PDF & Email

The team here at insideBIGDATA is deeply entrenched in following the big data ecosystem of companies from around the globe. We’re in close contact with most of the firms making waves in the technology areas of big data, data science, machine learning, AI and deep learning. Our in-box is filled each day with new announcements, commentaries, and insights about what’s driving the success of our industry so we’re in a unique position to publish our quarterly IMPACT 50 List of the most important movers and shakers in our industry. These companies have proven their relevance by the way they’re impacting the enterprise through leading edge products and services. We’re happy to publish this evolving list of the industry’s most impactful companies!

The selected companies come from our massive data set of vendors and industry metrics. Yes, we use machine learning to analyze the industry in a detailed manner to determine a ranking for this list. We’re using a custom RankBoost algorithm adapted specifically for the big data community along with a plethora of proprietary data sources. The rankings include an indicator for upward movement in the list and also for companies newly added.

If you’re part of a company that you feel is “impactful” in some critical way please contact us immediately to be added to our database in order to be considered for this list. Companies on the list exhibit technology leadership, strength of offering, proven innovation, positivity of message, quality perception in the enterprise, intensity and frequency of social media buzz, high profile of members of the C-suite, and in the case of public companies: positive financial indicators and stock price, and so much more!

IMPACT 50 LIST for Q2 2021 (in order of the most impactful)

#1 Google AI [NASDAQ: GOOGL]

#2 NVIDIA – Inventor of the GPU for AI workloads [NASD: NVDA]

#3 Amazon Web Services – Cloud based machine learning, database, container, and storage [+3]

#4 Intel AI – Harnessing silicon designed specifically for AI [NASDAQ: INTC]

#5 Microsoft AI [NASDAQ: MSFT] [+2]

#6 Dell EMC [NYSE: DELL] [ +3]

#7 – Open Source Data Science and Machine Learning Platform

#8 Snowflake – Cloud Enterprise Data Warehouse

#9 DataRobot – Automated Machine Learning

#10 HPE [NYSE: HPE] [+1]

#11 Domino Data Lab – Data Science Platform

#12 Teradata [NYSE: TDC] [+1]

#13 TigerGraph – Graph database and analytics platform

#14 Qlik – Data Analytics and Data Integration [+1]

#15 Databricks – Unified analytics platform [+1]

#16 Kinetica – GPU Database [+1]

#17 SAS – Analytics, BI, and data management

#18 DataDirect Networks – AI and Deep Learning Storage

#19 Anaconda – Python Data Science Platform

#20 OmniSci– Massively accelerated analytics and data science

#21 Pure Storage [NYSE: PSTG] [+1]

#22 Neo4j – Graph database [+1]

#23 Salesforce Einstein AI – Smart CRM assistant [+1]

#24 Cloudera– Enterprise scale analytics platform [NYSE: CLDR] [+1]

#25 TIBCO – Integration, analytics and event-processing software [+4]

#26 Dremio – Data-as-a-Service platform

#27 StreamSets – Where DevOps meets data integration

#28 Guavus – Real-time big data analytics [+2]

#29 – AutoML

#30 MongoDB – Cross-platform document-oriented NoSQL database [NASDAQ: MDB] [+4]

#31 SQream – SQL GPU data warehouse [+2]

#32 – Operating system for ML and AI

#33 Fiddler Labs – Explainable AI

#34 Kaskada – End-to-end platform for feature engineering and feature serving [+1]

#35 Rulex – Explainable AI platform [+1]

#36 Cazena – Instant AWS data lake [+6]

#37 Diveplane – AI explainable, auditable, editable [+1]

#38 Neural Magic – No hardware AI, GPU speeds without GPUs

#39 Striim – Real-time data integration [+1]

#40 – Explainable AI for data scientists [+1]

#41 Kyndi – Explainable AI platform

#42 Couchbase – NoSQL cloud database service NEW

#43 Verta – AL and ML deployment & operations for data science teams [+3]

#44 Coursera – Massive open online course (MOOC) provider NEW

#45 Lightmatter – Photonic computing

#46 causaLens – A machine that predicts the global economy in real-time

#47 Run:AI – Platform for AI virtualization and orchestration [+2]

#48 Trifacta – Data wrangling tools [+3]

#49 Tecton – Data platform for machine learning [+1]

#50 WANdisco – Distributed computing specialists NEW

HONORABLE MENTION (in alphabetical order):

Actable AI – No-code data analytics with deep learning

Ahana – Ahana cloud for Presto NEW

Alegion – Data labeling platform

Allegro AI – DL/ML open source platform

Alluxio – Open source data orchestration for the cloud

AnotherBrain – Pioneers of organic AI

Arize AI – Real-time observability for AI [+6]

AtScale – Cloud OLAP for enterprise analytics

Beyond Limits – Explainable AI

Brytlyt – GPU accelerated analytics platform

Chatterbox Labs – AI model insights for trustworthy & fair AI

Comet– Self-hosted and cloud-based meta machine learning platform

Confluent – Stream processing with Apache Kafka

Cubonacci – Machine learning lifecycle management

DataKitchen – Enterprise DataOps platform NEW

DarwinAI – “AI Building AI” technology

Deci – Deep learning platform

DeepCube – Deep learning deployment on any device

Deepgram – Speech recognition – AI solutions for business

Determined AI – Deep Learning training platform

Digitate – AIOps

Exasol– In-memory analytic database

Faktion – Operationalize AI

Gigaspaces – In-memory computing platform

GridGain – In-memory computing platform

Hailo – Specialized deep learning processor for edge devices

Iguazio– Data science platform

InAccel – Application acceleration FPGA orchestrator

Intuition Machines – Deep learning and visual domain machine learning at scale

Lenses – DataOps platform for Apache Kafka and Kubernetes

Logical Clocks AB – Hopsworks, an enterprise platform for AI

Loop AI Labs – Cognitive computing

Megagon Labs – Experimental analytics and data management

Mipsology – FPGA machine learning NEW

ModelOp – Scale and govern enterprise AI

Molecula – Feature store NEW

Monte Carlo – Data reliability

Natural Intelligence – AI toolkit NEW

NetApp – Accelerated AI data pipelines

NtechLab – Augmenting intelligence

Obviously AI – Data science without code

OctoML – Deploy machine learning models

OpenAI – Ensures AGI benefits all of humanity

Ople – Predictive analytics for business users

Peltarion – Deep learning cloud platform

Perceive – Ergo edge inference processor NEW

Pinecone – Vector database for ML applications NEW

PredictHQ – Demand intelligence

Prophecy – Low code data engineering NEW

Qeexo – AutoML at the Edge

Spell – ML and DL platform

Streamlit – Open-source app framework

Superwise – AI assurance to achieve ML model monitoring

Toucan Taco – Data storytelling platform NEW

Truera – Model intelligence platform

Yellowbrick – Data warehouse for hybrid and multi-cloud environments

Contributed by Daniel D. Gutierrez, Editor-in-Chief and Resident Data Scientist of insideBIGDATA. In addition to being a tech journalist, Daniel also is a practicing data scientist, author, educator and sits on a number of advisory boards for various start-up companies. 

Sign up for the free insideBIGDATA newsletter.

Join us on Twitter: @InsideBigData1 –

Speak Your Mind