The insideBIGDATA IMPACT 50 List for Q4 2021

Print Friendly, PDF & Email

The team here at insideBIGDATA is deeply entrenched in following the big data ecosystem of companies from around the globe. We’re in close contact with most of the firms making waves in the technology areas of big data, data science, machine learning, AI and deep learning. Our in-box is filled each day with new announcements, commentaries, and insights about what’s driving the success of our industry so we’re in a unique position to publish our quarterly IMPACT 50 List of the most important movers and shakers in our industry. These companies have proven their relevance by the way they’re impacting the enterprise through leading edge products and services. We’re happy to publish this evolving list of the industry’s most impactful companies!

The selected companies come from our massive data set of vendors and industry metrics. Yes, we use machine learning to analyze the industry in a detailed manner to determine a ranking for this list. We’re using a custom RankBoost algorithm adapted specifically for the big data community along with a plethora of proprietary data sources. The rankings include an indicator for upward movement in the list and also for companies newly added.

If you’re part of a company that you feel is “impactful” in some critical way please contact us immediately to be added to our database in order to be considered for this list. Companies on the list exhibit technology leadership, strength of offering, proven innovation, positivity of message, quality perception in the enterprise, intensity and frequency of social media buzz, high profile of members of the C-suite, and in the case of public companies: positive financial indicators and stock price, and so much more!

IMPACT 50 LIST for Q4 2021 (in order of the most impactful)

#1 Amazon Web Services – Cloud based machine learning, database, container, and storage [NASDAQ: AMZN] [+2]

#2 NVIDIA – Inventor of the GPU for AI workloads [NASDAQ: NVDA]

#3 Google AI [NASDAQ: GOOGL]

#4 Microsoft AI [NASDAQ: MSFT]

#5 HPE [NYSE: HPE] [+1]

#6 DataRobot – Automated Machine Learning [+1]

#7 Dell Technologies [NYSE: DELL] [+1]

#8 Intel AI – Harnessing silicon designed specifically for AI [NASDAQ: INTC]

#9 Domino Data Lab – Data Science Platform

#10 Databricks – Unified analytics platform [+1]

#11 Teradata [NYSE: TDC] [+1]

#12 – Open Source Data Science and Machine Learning Platform

#13 TigerGraph – Graph database and analytics platform [+1]

#14 OpenAI – AI research laboratory [+6]

#15 Snowflake – Cloud Enterprise Data Warehouse [NYSE: SNOW]

#16 Kinetica – GPU Database

#17 SAS – Analytics, BI, and data management

#18 Neo4j – Graph database [+3]

#19 Qlik – Data Analytics and Data Integration

#20 Anaconda – Python Data Science Platform

#21 Salesforce Einstein AI – Smart CRM assistant [NYSE: CRM]

#22 OmniSci– Massively accelerated analytics and data science

#23 Cloudera– Enterprise scale analytics platform [+1]

#24 Dremio – Data-as-a-Service platform [+1]

#25 TIBCO – Integration, analytics and event-processing software [+1]

#26 MongoDB – Cross-platform document-oriented NoSQL database [NASDAQ: MDB] [+1]

#27 Diveplane – AI explainable, auditable, editable [+4]

#28 StreamSets – Where DevOps meets data integration

#29 SQream – SQL GPU data warehouse [+1]

#30 Neural Magic – No hardware AI, GPU speeds without GPUs [+4]

#31 DataDirect Networks – AI and Deep Learning Storage [+4]

#32 Kaskada – End-to-end platform for feature engineering and feature serving [+4]

#33 – Explainable AI for data scientists [+4]

#34 Fiddler Labs – Explainable AI

#35 – Operating system for ML and AI [+4]

#36 Esri – GIS mapping software NEW

#37 Truera – Model intelligence platform NEW

#38 Verta – AL and ML deployment & operations for data science teams [+3]

#39 Couchbase – NoSQL cloud database service [+1]

#40 Cazena – Instant AWS data lake [+4]

#41 – AutoML [+1]

#42 Trifacta – Data wrangling tools [+3]

#43 Tecton – Data platform for machine learning [+3]

#44 Pinecone – Vector database for ML applications NEW

#45 ModelOp – Scale and govern enterprise AI [+3]

#46 WANdisco – Distributed computing specialists [+1]

#47 Matillion – ETL software for cloud data warehouses [+3]

#48 Plotly – The front-end for ML and data science models NEW

#49 Comet– Self-hosted and cloud-based meta machine learning platform

#50 MariaDB – Enterprise open source database NEW

HONORABLE MENTION – 65 companies (in alphabetical order):

Actable AI – No-code data analytics with deep learning

Ahana – Ahana cloud for Presto

Alegion – Data labeling platform

Allegro AI – DL/ML open source platform

Alluxio – Open source data orchestration for the cloud

AnotherBrain – Pioneers of organic AI

Arize AI – Real-time observability for AI

AtScale – Cloud OLAP for enterprise analytics

Beyond Limits – Explainable AI

Brytlyt – GPU accelerated analytics platform

causaLens – A machine that predicts the global economy in real-time

Chatterbox Labs – AI model insights for trustworthy & fair AI

Cockroach Labs – Builders of CockroachDB

Confluent [NASDAQ: CFLT] – Stream processing with Apache Kafka – Natural language understanding

Coursera – Massive open online course (MOOC) provider

Cubonacci – Machine learning lifecycle management

DataKitchen – Enterprise DataOps platform

Dataloop – Image and video annotation

DarwinAI – “AI Building AI” technology

Deci – Deep learning platform

Deepgram – Speech recognition

Deeplite – AI-driven DNN optimizer – AI solutions for business

Digitate – AIOps

Exasol– In-memory analytic database

Faktion – Operationalize AI

Fero Labs – Using machine learning to optimize factory production NEW

Gigaspaces – In-memory computing platform

Graviti – Data platform to accelerate AI and ML NEW

GridGain – In-memory computing platform

Guavus – Real-time big data analytics

Gurobi Optimization – Mathematical optimization solver NEW

Hailo – Specialized deep learning processor for edge devices

Iguazio– Data science platform

InAccel – Application acceleration FPGA orchestrator

Intuition Machines – Deep learning and visual domain machine learning at scale

Kyligence – Intelligent data cloud NEW

Kyndi – Explainable AI platform

Lightmatter – Photonic computing

Logical Clocks AB – Hopsworks, an enterprise platform for AI

Loop AI Labs – Cognitive computing

Mipsology – FPGA machine learning

Molecula – Feature store

Monte Carlo – Data reliability

Natural Intelligence – AI toolkit

NetApp – Accelerated AI data pipelines

Neuton – AutoML – no-code AI NEW

Obviously AI – Data science without code

OctoML – Deploy machine learning models

Peltarion – Deep learning cloud platform

Perceive – Ergo edge inference processor

Prophecy – Low code data engineering

Pure Storage [NYSE: PSTG] Data storage solutions

Qeexo – AutoML at the Edge

Qumulo – Scale-out storage NEW

Rulex – Explainable AI platform

Run:AI – Platform for AI virtualization and orchestration

Sigmoid – Real-time data analytics

Spell – ML and DL platform

Starburst – Analytics engine for data mesh

Striim – Real-time data integration

Toucan Taco – Data storytelling platform

Tredence – Big data analytics

Yellowbrick – Data warehouse for hybrid and multi-cloud environments

Contributed by Daniel D. Gutierrez, Editor-in-Chief and Resident Data Scientist of insideBIGDATA. In addition to being a tech journalist, Daniel also is a practicing data scientist, author, educator and sits on a number of advisory boards for various start-up companies. 

Sign up for the free insideBIGDATA newsletter.

Join us on Twitter: @InsideBigData1 –

Speak Your Mind



  1. It is not only great information about the industry but list of companies in the relevant sector.