Infrastructure
Storage
Cohesity
United StatesCohesity is a late-stage technology firm that develops an intelligent data platform that converges secondary storage workflows to transform disparate silos of data into business insight. Cohesity provides secondary storage services for data protection, development, and analytics by applying a distributed, web-scale architecture to break down these data silos. The firm serves a wide variety of industries including healthcare, telecom, technology, and IT. The firm was founded in 2013 and is based in San Jose, California.
InfrastructureStorageQumulo
United StatesQumulo, headquartered in Seattle, has developed data-aware scale-out NAS, which enables enterprises to manage and store enormous numbers of digital assets by building real-time analytics directly into the file system itself. Qumulo Core is a software-only solution designed to leverage the price/performance of commodity hardware coupled with the modern technologies of flash, virtualization, and cloud.
InfrastructureStorageNetApp
United StatesNetApp (NASDAQ: NTAP) provides data storage and data management solutions. It offers a variety of cloud data services, including cloud storage, cloud backup, cloud disaster recovery, and more. It serves industries such as healthcare, financial services, manufacturing, and more. It was founded in 1992 and is based in San Jose, California.
Founded1992InfrastructureStorageHPE Nimble Storage
United StatesHewlett Packard Enterprise (NYSE: HPE) provides information technology solutions. The Company offers enterprise security, information technology infrastructure, analytics and data management, applications development and testing, data center care, cloud consulting, and business process services. Hewlett Packard Enterprise serves customers worldwide.
Founded2015InfrastructureStorageMinIO
United StatesMinio provides open source object storage for cloud-native and containerized applications.
InfrastructureStorage- New
Tigris
United StatesTigris is a provider of S3-compatible object storage services within the cloud computing industry. The company offers a platform for developers to store and access any amount of data with low latency, catering to a variety of use cases such as cloud-native and mobile applications. It primarily serves sectors that require real-time data access and storage, such as cloud-native applications and mobile app development. It was founded in 2021 and is based in Sunnyvale, California.
InfrastructureStorage Azure Storage
United StatesMicrosoft Azure is a cloud computing service created by Microsoft for building, testing, deploying, and managing applications and services through Microsoft-managed data centers.
Founded1975InfrastructureStorageClumio
United StatesClumio is a secure, backup as a service that consolidates the protection of an enterprise data center and any remote sites with no hardware or software to size, configure, manage - or even buy at all.
InfrastructureStorage- New
Egnyte
United StatesEgnyte provides software for enterprise file synchronization and sharing. The technology offers to store files in a company's existing data repository, as well as cloud computing storage. It was founded in 2008 and is based in Mountain View, California.
InfrastructureStorage Google Cloud Storage
United StatesGoogle Cloud Platform, offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products.
Founded2008InfrastructureStorageWasabi
United StatesWasabi provides a cloud storage platform that supports hot data, active archive cool data, and inactive archive cool data, with integrations for gateways, apps, and third-party platforms. It enables organizations to store and instantly access an unlimited amount of data with no tiers or unpredictable egress fees. It was formerly known as BlueArchive. The firm was founded in 2015 and is based in Boston, Massachusetts.
InfrastructureStorageIBM Storage
United StatesIBM (NYSE: IBM) manufactures and sells computer hardware and software, and offers infrastructure services, hosting services, and consulting services in areas ranging from mainframe computers to nanotechnology. The company was founded in 1911 and is based in Armonk, Newyork.
Founded1911InfrastructureStorageBackblaze
United StatesBackblaze (NASDAQ: BLZE) is a data storage provider. It offers two products: B2 Cloud Storage - An object storage service similar to Amazon's S3. Computer Backup - An online backup tool that allows Windows and macOS users to back up their data to offsite data centers.
InfrastructureStorageAlluxio
United StatesAlluxio is a memory-centric distributed storage system enabling reliable data sharing at memory-speed across cluster frameworks. Alluxio leverages lineage information and using memory aggressively. Tachyon caches working set files in memory, thereby avoiding disk to load datasets that are frequently read. This enables different jobs/queries and frameworks to access cached files at memory speed.
InfrastructureStorageCloudflare
United StatesCloudflare offers services to help companies reduce latencies, including distributed denial of service (DDoS) attack mitigation and a content delivery network (CDN). The company uses technology to solve problems over the internet. Cloudflare was founded in 2009 and is based in San Franciso, California.
InfrastructureStorageWeka
United StatesWeka offers a shared parallel file system, WekaFS, which leapfrogs legacy storage infrastructure. It helps tackle demanding storage performance challenges in data-intensive technical computing environments, so customers can solve problems. Its WekaFS accelerates time-to-insight from data and helps customers get high-powered IT investments. It was formerly known as WekaIO. The company was founded in 2013 and is based in Campbell, California.
InfrastructureStorageVast Data
United StatesVAST Data is an IT infrastructure company. Its exabyte-scale Universal Storage system is built entirely from high-performance flash media and has several features that result in a total cost of acquisition that is equivalent to hard drive-based archive systems.
InfrastructureStorageDigital Ocean Spaces
United StatesDigitalOcean (NYSE: DOCN) offers a software-as-a-service (SaaS) based platform. It offers a range of services such as website hosting, cloud virtual private network (VPN), big data computing, gaming development, and blockchain development. It provides solutions for digital marketing agencies, e-commerce businesses, and more. The company was founded in 2012 and is based in New York, New York.
InfrastructureStoragePure Storage
United StatesPure Storage, the all-flash enterprise storage company, enables the broad deployment of flash in the data center. When compared to traditional disk-centric arrays, Pure Storage all-flash enterprise arrays are 10x faster and 10x more space and power efficient at a price point that is less than performance disk per gigabyte stored. The Pure Storage FlashArray is ideal for high performance workloads, including server virtualization, desktop virtualization (VDI), database (OLTP, real-time analytics) and cloud computing. The company was founded in 2009 and is based in Mountain View, California.
InfrastructureStorageAmazon S3
United StatesAmazon Web Services (AWS) is a business unit within Amazon.com that provides an infrastructure platform for businesses in the form of cloud computing.
Founded2006InfrastructureStoragePanasas
United StatesPanasas, a provider in high performance parallel storage for technical computing applications and big data workloads, enables customers to rapidly solve complex computing problems, speed innovation and accelerate new product introduction. All Panasas storage products leverage the PanFS storage operating system to deliver superior performance, data protection, scalability and manageability. Panasas systems are optimized for demanding storage environments in the bioscience, energy, finance, government, manufacturing, and university markets.
InfrastructureStorage
MPP DBs
Actian
United StatesActian is a computer software company that enables organizations to transform big data into business value with data management solutions to transact, connect, analyze and act on data. Actian helps customers worldwide take action on their big data with Vectorwise analytics database, RushAnalytics Hadoop accelerator, DataCloud for cloud and on-premises data integration, Action Apps, as well as Ingres, Versant and PSQL transactional mission-critical databases.
InfrastructureMPP DBsExasol
GermanyExasol develops an in-memory database for analytics and data warehousing, and offers expertise in data insight and analytics. The in-memory analytic database combines in-memory, columnar compression and massively parallel processing. The company was founded in 2000 and is based in Nuremberg, Germany.
Founded2000InfrastructureMPP DBsTeradata
United StatesTeradata provides analytic data platforms, marketing and analytic applications, and consulting services. Teradata helps organizations collect, integrate, and analyze all of their data so they can know more about their customers and business.
Founded1979InfrastructureMPP DBsGreenplum Database
United StatesGreenplum Database is a company focused on providing a massively parallel data platform (MPP) for analytics, machine learning, and artificial intelligence (AI) within the database software industry. The company offers an open-source database that scales analytics to large datasets in the petabytes, supports complex applications, and integrates with various data sources and storage options. Greenplum's platform is designed to serve sectors that require large-scale data workloads and advanced analytical capabilities. It was founded in 2003 and is based in San Mateo, California.
InfrastructureMPP DBsIBM Data Warehouse Solutions
United StatesIBM (NYSE: IBM) manufactures and sells computer hardware and software, and offers infrastructure services, hosting services, and consulting services in areas ranging from mainframe computers to nanotechnology. The company was founded in 1911 and is based in Armonk, Newyork.
Founded1911InfrastructureMPP DBsVertica, An OpenText Company
United StatesVertica Systems is developing an analytics database management software company. Vertica makes SQL database management software designed to derive business intelligence from complex and large sets of data, in real-time.
InfrastructureMPP DBs
Data Lakes/Lakehouses
ChaosSearch
United StatesChaosSearch is a cloud-based log management and analytics service that extends the power of ELK directly onto AWS S3, providing access to long term data.
InfrastructureData Lakes/LakehousesOneHouse
United StatesOnehouse provides a cloud-native managed lakehouse service that helps to build data lakes, process data, and own data in open-source formats. The company provides data, through a cloud-managed lakehouse service built. Onehouse was founded in 2021 and is based in Menlo Park, California.
InfrastructureData Lakes/LakehousesHPE Ezmeral Data Fabric
United StatesHewlett Packard Enterprise (NYSE: HPE) provides information technology solutions. The Company offers enterprise security, information technology infrastructure, analytics and data management, applications development and testing, data center care, cloud consulting, and business process services. Hewlett Packard Enterprise serves customers worldwide.
Founded2015InfrastructureData Lakes/LakehousesCloudera
United StatesCloudera provides enterprise data management solutions that offer a unified platform for data, an enterprise data hub built on Apache Hadoop. It provides enterprises one place to store, process, and analyze all their data, empowering them to extend the value of existing investments while enabling fundamental new ways to derive value from their data.
InfrastructureData Lakes/Lakehouses- New
Qubole
United StatesQubole operates a cloud-native data platform for analytics and machine learning. Qubole's intelligent automation and self-service improve productivity, while workload-aware auto-scaling and real-time spot buying drive down compute costs. On October 14, 2020 Qubole was acquired by Idera. The terms of the transaction were not disclosed.
InfrastructureData Lakes/Lakehouses Google Cloud Dataproc
United StatesGoogle Cloud Platform, offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products.
Founded2008InfrastructureData Lakes/LakehousesStarburst
United StatesStarburst is an analytics anywhere company. It provides a distributed SQL query engine. With the ability to connect to a wide variety of data sources, companies use Presto to power large-scale, interactive analytics without the need to move their data. Starburst was previously known as Project Flex. The company was founded in 2017 and is based in Boston, Massachusetts.
InfrastructureData Lakes/LakehousesDremio
United StatesDremio provides a data analytics platform that allows business users to curate precisely the data they need, from any data source, then accelerate analytical processing for BI tools, machine learning, data science, and SQL clients. The company was founded in 2015 and is based in Santa Clara, California.
InfrastructureData Lakes/LakehousesDatabricks
United StatesDatabricks provides a data platform that aims to simplify data integration and offers data analytic services. Databricks Lakehouse Platform serves corporations in various industries worldwide. It was founded in 2013 and is based in San Francisco, California.
InfrastructureData Lakes/Lakehouses- New
Cloudian
United StatesCloudian is a cloud storage systems software. It offers solutions for data protection, data lakehouse, ransomware protection, data storage security, public cloud storage, data lifecycle management, and file services. It serves the federal government, financial services, manufacturing, media, education, healthcare, and life sciences. Cloudian was formerly known as Gemini Mobile Technologies. The company was founded in 2001 and is based in San Mateo, California.
InfrastructureData Lakes/Lakehouses IBM Data Lake Solutions
United StatesIBM (NYSE: IBM) manufactures and sells computer hardware and software, and offers infrastructure services, hosting services, and consulting services in areas ranging from mainframe computers to nanotechnology. The company was founded in 1911 and is based in Armonk, Newyork.
Founded1911InfrastructureData Lakes/LakehousesGoogle Cloud BigLake
United StatesGoogle Cloud Platform, offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products.
Founded2008InfrastructureData Lakes/LakehousesAzure Data Lake Storage
United StatesMicrosoft Azure is a cloud computing service created by Microsoft for building, testing, deploying, and managing applications and services through Microsoft-managed data centers.
Founded1975InfrastructureData Lakes/LakehousesSnowflake
United StatesSnowflake (NYSE: SNOW) provides a cloud data warehouse enabling enterprises to access structured and semi-structured data.
InfrastructureData Lakes/LakehousesAzure HD Insight
United StatesMicrosoft Azure is a cloud computing service created by Microsoft for building, testing, deploying, and managing applications and services through Microsoft-managed data centers.
Founded1975InfrastructureData Lakes/LakehousesAWS Amazon EMR
United StatesAmazon Web Services (AWS) is a business unit within Amazon.com that provides an infrastructure platform for businesses in the form of cloud computing.
Founded2006InfrastructureData Lakes/LakehousesAWS Lake Formation
United StatesAmazon Web Services (AWS) is a business unit within Amazon.com that provides an infrastructure platform for businesses in the form of cloud computing.
Founded2006InfrastructureData Lakes/Lakehouses- New
Iomete
United StatesIOMETE is a data management company specializing in modern data lakehouse solutions. Their offerings include a platform that integrates various data processing tools and services, such as Apache Iceberg and Apache Spark, to provide scalable, secure, and efficient data management for AI and analytics. IOMETE's solutions cater to a diverse range of sectors including financial institutions, healthcare, public sector, and data resellers. It was founded in 2020 and is based in Mountain View, California
InfrastructureData Lakes/Lakehouses
Data Warehouses
Yellowbrick
United StatesYellowbrick Data Warehouse is a modern, elastic data warehouse with separate storage and compute that runs in the cloud and on-premises. Yellowbrick enables large-scale enterprises to eliminate complexity, reduce risk, and predict and control costs by running all their data anywhere, across multi-cloud instances. Yellowbrick allows enterprises to run complex queries on live data at a petabyte scale in their own cloud account while supporting high concurrency with fast, interactive query responses to customers' most challenging business questions. Yellowbrick Data was founded in 2014 and is headquartered in Mountain View, California.
InfrastructureData WarehousesSnowflake
United StatesSnowflake (NYSE: SNOW) provides a cloud data warehouse enabling enterprises to access structured and semi-structured data.
InfrastructureData WarehousesAWS Amazon RedShift
United StatesAmazon Web Services (AWS) is a business unit within Amazon.com that provides an infrastructure platform for businesses in the form of cloud computing.
Founded2006InfrastructureData WarehousesFirebolt
IsraelFirebolt aims to redesign the concept of a data warehouse to work more efficiently and at a lower cost.
InfrastructureData WarehousesOcient
United StatesOcient enables organizations to explore and interact with hyperscale data sets to deliver meaningful insights and drive customer innovation. The company offers a hyperscale enterprise data warehouse platform that enables rapid transformation and analysis of massive (petabyte)-scale data. The company was founded in 2016 and is based in Chicago, Illinois.
InfrastructureData WarehousesOracle Exadata Cloud Service
United StatesOracle operates as an enterprise software company. It offers a variety of software products, including enterprise resource planning (ERP), customer relationship management (CRM), human capital management (HCM), and more. It serves industries such as automotive, consumer goods, and energy among others. It was founded in 1977 and is based in Austin, Texas.
Founded1977InfrastructureData Warehouses- New
SelectDB
ChinaSelectDB focuses on modern data warehousing for real-time analysis, operating in the data management and analytics industry. The company offers fully managed cloud-native services and self-managed private deployment services for real-time data warehousing, providing rapid query analysis on large-scale real-time data. It primarily serves sectors such as the financial industry, internet companies, manufacturing, enterprise services, and telecommunications. It was founded in 2021 and is based in Haidian, China.
Founded2021InfrastructureData Warehouses Kyligence
United StatesKyligence is an enterprise-level data warehouse product based on a big data platform. The company empowers sub-second query latency on big data and simplifies data analytics for business users, analysts and engineers.
InfrastructureData Warehouses- New
Hydra
United StatesHydra is an open source Snowflake alternative that transforms Postgres into a fast, scalable data warehouse. With Hydra, engineers can standardize both transactional and analytics workloads around the familiar Postgres syntax, tools, and ecosystem without app code changes. Hydra offers a fully managed cloud instance with columnar storage with high throughput writes. Hydra was founded in 2021 and is based in San Francisco, California.
InfrastructureData Warehouses Tabular
United StatesTabular is a cloud-native data platform designed to automate maintenance and optimization of data. It was founded in 2021 and is based in San Jose, California.
InfrastructureData WarehousesIBM Db2 Warehouse
United StatesIBM (NYSE: IBM) manufactures and sells computer hardware and software, and offers infrastructure services, hosting services, and consulting services in areas ranging from mainframe computers to nanotechnology. The company was founded in 1911 and is based in Armonk, Newyork.
Founded1911InfrastructureData Warehouses- New
Panoply by SQream
IsraelPanoply develops a cloud data platform enabling users to synchronize, store, and access data. It facilitates solution-unlocking analysis without data engineering. Its platform offers business data synchronization, managed extract, transform, and load connectors, cloud data storage, and built-in analytics features. It was founded in 2015 and is based in Tel Aviv, Israel. In December 2021, Panoply was acquired by SQream Technologies.
InfrastructureData Warehouses VMWare Greenplum
United StatesVMware (NYSE: VMW) is a cloud computing and virtualization technology company that enables organizations to build, run, manage and secure their apps across clouds. It serves the banking, healthcare, government, retail, telecommunications, manufacturing, and transportation industries. The company was founded in 1998 and is based in Palo Alto, California.
InfrastructureData WarehousesStarRocks
United StatesStarRocks offers real-time SQL engines for enterprise-scale analytics. It was founded in 2020 and is based in San Francisco, California.
Founded2020InfrastructureData WarehousesGoogle Cloud BigQuery
United StatesGoogle Cloud Platform, offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products.
Founded2008InfrastructureData WarehousesAzure Synapse Analytics
United StatesMicrosoft Azure is a cloud computing service created by Microsoft for building, testing, deploying, and managing applications and services through Microsoft-managed data centers.
Founded1975InfrastructureData Warehouses
Streaming/In-Memory
MotherDuck
United StatesMother Duck delivers a serverless data analytics platform for data. The platform offers a serverless analytics service built for lightweight use cases. It is built on the open-source platform DuckDB, an in-process online analytical processing database. The company was founded in 2022 and is based in San Fransico, California.
InfrastructureStreaming/In-MemorySAP HANA Cloud
GermanySAP (FWB: SAP) (NYSE: SAP) is a multinational software corporation that makes enterprise software to manage business operations and customer relations.
Founded1972InfrastructureStreaming/In-MemoryGigaspaces
United StatesGigaSpaces Technologies provides software middleware for deployment, management and scaling of mission-critical applications on cloud environments through two main product lines, XAP In-Memory Computing and Cloudify. Hundreds of Tier-1 organizations worldwide are leveraging GigaSpaces' technology to enhance IT efficiency and performance, from top financial firms, e-commerce companies, online gaming providers, healthcare organizations and telecom carriers.
InfrastructureStreaming/In-MemoryGridGain
United StatesGridGain is focused on real-time data access and processing by offering the enterprise-grade In-Memory Data Fabric built on Apache Ignite. The solution is used by global enterprises in financial, tech, retail, healthcare and other major sectors. GridGain solutions connect traditional and emerging data stores (SQL, NoSQL, and Hadoop) with cloud-scale applications and enable massive data throughput and ultra-low latencies across a scalable cluster of commodity servers. A converged data platform, GridGain In-Memory Data Fabric offers a comprehensive, enterprise-grade in-memory computing solution for high-volume transactions, real-time analytics and hybrid data processing.
InfrastructureStreaming/In-Memory- New
InfinyOn
United StatesInfinyOn is a company focused on data streaming, operating in the technology and data analytics industry. The company offers a platform that enables engineers to build robust event-driven data pipelines for various functions such as log processing, data enrichment, IoT edge processing, clickstream analytics, and real-time microservices. The platform is primarily used in sectors such as ecommerce, real estate tech, and cloud computing. It was founded in 2019 and is based in Saratoga, California
InfrastructureStreaming/In-Memory Hazelcast
United StatesHazelcast develops, distributes, and supports open source In-Memory Data Grid. Hazelcast's computing platform is comprised of two core products: Hazelcast IMDG, an in-memory data grid, and Hazelcast Jet, an application embeddable, stream and batch processing engine capable of supporting real-time streaming data.
InfrastructureStreaming/In-MemoryBytewax
United StatesBytewax, founded in 2020, develops a serverless machine learning platform o let data scientists and machine learning engineers build and scale ML-powered features. The company's platform allows users to build custom pipelines to transform data and make predictions in real-time, run the software locally to speed up development, and build complex workflows. It is based in Santa Cruz, California.
InfrastructureStreaming/In-MemoryDatabricks
United StatesDatabricks provides a data platform that aims to simplify data integration and offers data analytic services. Databricks Lakehouse Platform serves corporations in various industries worldwide. It was founded in 2013 and is based in San Francisco, California.
InfrastructureStreaming/In-MemoryUpsolver
IsraelUpsolver develops a cloud architecture for NoSQL databases. Upsolver NoSQL DB stores data in the cloud instead of local servers, and its platform works on data streams and offers unique algorithms and aggregation functions.
InfrastructureStreaming/In-MemoryGoogle Cloud Dataflow
United StatesGoogle Cloud Platform, offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products.
Founded2008InfrastructureStreaming/In-MemoryStreamNative
United StatesStreamNative develops an event streaming platform that aims to enable companies to leverage enterprise data as real-time event streams to develop and launch new products and services.
InfrastructureStreaming/In-MemoryAbly
United KingdomAbly is a realtime data delivery platform that allows users to add realtime messaging and streaming data to applications.
InfrastructureStreaming/In-MemoryDeltaStream
United StatesDeltaStream provides a serverless database to manage, secure and process streams. The firm's product allows users to build real-time streaming applications and pipelines with SQL in minutes. The firm was founded in 2020 and is based in San Mateo, California.
InfrastructureStreaming/In-MemoryEstuary
United StatesEstuary is a database software company that specializes in linking to both internal and external systems, integrating real-time batch operations, and allowing clients to stream data, including internal services, apps, or external SaaS. It is based in New York, New York.
InfrastructureStreaming/In-MemoryQuix
United KingdomQuix is a company that focuses on data engineering platforms, specifically in the event streaming domain. The company offers a platform that enables developers to build, test, and deploy real-time models and services directly on Kafka, simplifying the process of working with live data. Quix primarily serves sectors such as video game, media, manufacturing, automotive, and telco industries. It was founded in 2020 and is based in London, England.
InfrastructureStreaming/In-MemoryVoltron Data
United StatesVoltron Data is a remote company that connects data, hardware, and developers, as well as focuses on improving the Apache Arrow Ecosystem. The company is based in Mountain View, California.
InfrastructureStreaming/In-MemoryOracle Coherence
United StatesOracle operates as an enterprise software company. It offers a variety of software products, including enterprise resource planning (ERP), customer relationship management (CRM), human capital management (HCM), and more. It serves industries such as automotive, consumer goods, and energy among others. It was founded in 1977 and is based in Austin, Texas.
Founded1977InfrastructureStreaming/In-MemoryTinybird
SpainTinybird helps data teams build real-time data products at scale through SQL-based API endpoints. It ingests millions of rows per second and serves low latency, high concurrency analytical queries over any amount of data. It was founded in 2019 and is based in Madrid, Spain.
InfrastructureStreaming/In-MemoryRedPanda
United StatesRedpanda, fka Vectorized, delivers enterprises with a modern data-streaming platform for mission-critical applications. The company is building a family of products designed to reliably transform data streams into data products by unifying historical and real-time data, enabling inline Lambda transformations, all exposed under a drop-in Kafka-API replacement.
InfrastructureStreaming/In-MemoryConfluent
United StatesConfluent (NASDAQ: CFLT) provides an Apache Kafka-based streaming platform for enterprises in industries such as retail, logistics, manufacturing, financial services, technology, and media to maximize the value of their data. Confluent Platform lets enterprises move data from isolated systems into a real-time data pipeline where they can act on it immediately.
InfrastructureStreaming/In-MemoryRisingWave Labs
United StatesRisingWave Labs provides a cloud-native streaming database that maintains storage and allows users to access data efficiently. It uses SQL as the interface, consumes streaming data, and performs incremental computations. The company was founded in 2021 and is based in San Francisco, California.
InfrastructureStreaming/In-MemoryDecodable
United StatesDecodable, founded in 2021, specializes in the development of a data engineering platform. The platform enables applications and data engineers to build data pipelines to process and deliver data to offline and online systems. The company is based in San Francisco.
InfrastructureStreaming/In-MemoryMeroxa
United StatesMeroxa makes real-time data infrastructure accessible to any company regardless of expertise or available resource.
InfrastructureStreaming/In-MemoryVerverica
GermanyVerverica is an enterprise stream processing platform that provides multi-tenancy, authentication, role-based access control, and auto-scaling for Apache Flink. The company offers solutions such as event-driven applications, fraud detection, machine learning, and more. The company was founded in 2014 and is based in Berlin, Germany.
InfrastructureStreaming/In-Memory- New
Artie
United StatesArtie specializes in real-time data replication, focusing on the synchronization of databases to data warehouses. The company offers a platform that enables high-volume data transfer with low latency, using change data capture (CDC) and stream processing to replicate data. Its solutions cater to data and engineering teams, providing tools for real-time analytics. The company was founded in 2023 and is based in San Francisco, California.
InfrastructureStreaming/In-Memory Arcion
United StatesArcion offers a cloud-native, real-time data mobility platform. It enables data teams to build zero-maintenance data pipelines in minutes.
InfrastructureStreaming/In-MemoryStriim
United StatesStriim is an end-to-end streaming integration and intelligence platform. Striim specializes in multi-stream data integration and real-time Change Data Capture (CDC) across a wide variety of data sources including transaction/change data, events, log files, application and IoT sensor data. With data pipelines in-place, the Striim platform makes streaming analytics easy. Enterprises can detect anomalies, identify and visualize events of interest, and trigger alerts and workflows all in-time and in-context.
InfrastructureStreaming/In-MemoryAWS Amazon Kinesis
United StatesAmazon Web Services (AWS) is a business unit within Amazon.com that provides an infrastructure platform for businesses in the form of cloud computing.
Founded2006InfrastructureStreaming/In-MemoryConduktor
United StatesConduktor is a computer software company. It helps companies operate their data using Apache Kafka. It is based in Jackson, Wyoming.
InfrastructureStreaming/In-MemoryAiven
FinlandAiven offers a managed cloud service that hosts software infrastructure services. It provides managed open-source data technologies, like PostgreSQL, Kafka, and OpenSearch, on all major clouds, allowing developers to create applications. The company was founded in 2016 and is based in Helsinki, Finland.
InfrastructureStreaming/In-Memory
RDBMs
Oracle
United StatesOracle operates as an enterprise software company. It offers a variety of software products, including enterprise resource planning (ERP), customer relationship management (CRM), human capital management (HCM), and more. It serves industries such as automotive, consumer goods, and energy among others. It was founded in 1977 and is based in Austin, Texas.
Founded1977InfrastructureRDBMs- New
Altibase
United StatesAltibase is an enterprise-grade, high-performance relational database company. It offers a hybrid database system that combines in-memory and on-disk data storage in a unified engine, catering to the need for high throughput and economical storage. Altibase primarily serves sectors such as telecommunications, finance, manufacturing, and public services. It was founded in 1999 and is based in New York, New York.
Founded1999InfrastructureRDBMs Microsoft SQL Server
United StatesMicrosoft (NASDAQ: MSFT) provides, develops, and licenses consumer and enterprise software. The company develops, manufactures, and sells computer software, consumer electronics, and personal computers and services. It is known for its Windows operating systems, Office productivity suite, cloud-based Azure platforms, and many more. The company was founded in 1975 and is based in Redmond, Washington.
Founded1975InfrastructureRDBMsAWS Amazon RDS
United StatesAmazon Web Services (AWS) is a business unit within Amazon.com that provides an infrastructure platform for businesses in the form of cloud computing.
Founded2006InfrastructureRDBMsSAP ASE
GermanySAP (FWB: SAP) (NYSE: SAP) is a multinational software corporation that makes enterprise software to manage business operations and customer relations.
Founded1972InfrastructureRDBMsIBM DB2
United StatesIBM (NYSE: IBM) manufactures and sells computer hardware and software, and offers infrastructure services, hosting services, and consulting services in areas ranging from mainframe computers to nanotechnology. The company was founded in 1911 and is based in Armonk, Newyork.
Founded1911InfrastructureRDBMsMicrosoft Access
United StatesMicrosoft (NASDAQ: MSFT) provides, develops, and licenses consumer and enterprise software. The company develops, manufactures, and sells computer software, consumer electronics, and personal computers and services. It is known for its Windows operating systems, Office productivity suite, cloud-based Azure platforms, and many more. The company was founded in 1975 and is based in Redmond, Washington.
Founded1975InfrastructureRDBMs
NoSQL Databases
Speedb
IsraelSpeedb provides a drop-in replacement for RocksDB embedded storage engine for hyperscale data operations. Speedb Data Engine removes the capacity, scale, and performance limitations of using existing KVS solutions. It redesigned the RocksDB internal data structure to fit the performance and scalability requirements of modern, hyperscale data operations. By redesigning parts of RocksDB, it was able to develop an embedded KVS. Speedb supports petabyte scaling of datasets with billions of objects.
InfrastructureNoSQL DatabasesCouchbase
United StatesCouchbase (NASDAQ:BASE) provides a scalable NoSQL database. The solution includes a shared nothing architecture, a single node-type, a built in caching layer, true auto-sharding and a NoSQL mobile offering: Couchbase Mobile, a NoSQL mobile solution comprised of Couchbase Server, Couchbase Sync Gateway and Couchbase Lite. Couchbase Server and all Couchbase Mobile products are open source projects.
InfrastructureNoSQL DatabasesProgress MarkLogic
United StatesProgress is a company that focuses on the development, deployment, and management of high-impact business applications in the technology sector. The company offers a range of products that facilitate the creation of applications, automate processes for configuration, deployment, and scaling of apps, and enhance the accessibility and security of critical data. Progress primarily serves the technology industry, with a particular emphasis on software companies and developers. Progress was formerly known as Data Language Corporation. It was founded in 1981 and is based in Bedford, Massachusetts.
Founded2001InfrastructureNoSQL DatabasesAzure CosmosDB
United StatesMicrosoft Azure is a cloud computing service created by Microsoft for building, testing, deploying, and managing applications and services through Microsoft-managed data centers.
Founded1975InfrastructureNoSQL DatabasesAerospike
United StatesAerospike specializes in real-time NoSQL data solutions for any scale. Aerospike enterprises overcome seemingly impossible data bottlenecks to compete and win with a fraction of the infrastructure complexity and cost of legacy NoSQL databases. Aerospike empowers customers to instantly fight fraud; dramatically increase shopping cart size; deploy global digital payment networks; and deliver instant, one-to-one personalization for millions of customers.
InfrastructureNoSQL Databases- InfrastructureNoSQL Databases
MongoDB
United StatesMongoDB is a general purpose, document-based, distributed database built for modern application developers and for the cloud era.
InfrastructureNoSQL DatabasesAmazon DocumentDB
United StatesAmazon Web Services (AWS) is a business unit within Amazon.com that provides an infrastructure platform for businesses in the form of cloud computing.
Founded2006InfrastructureNoSQL DatabasesAWS Amazon DynamoDB
United StatesAmazon Web Services (AWS) is a business unit within Amazon.com that provides an infrastructure platform for businesses in the form of cloud computing.
Founded2006InfrastructureNoSQL Databases- New
Crunchydata
United StatesCrunchy Data specializes in enterprise PostgreStructured Query Language (SQL) solutions and support. The company offers fully managed cloud Postgres services, automated high-availability PostgreSQL solutions, and secure, hardened PostgreSQL for advanced security needs. Crunchy Data primarily serves sectors such as government, healthcare, finance, software as a service (SaaS), automotive, and blockchain. It was founded in 2012 and is based in Daniel Island, South Carolina.
Founded2012InfrastructureNoSQL Databases Oracle NoSQL
United StatesOracle operates as an enterprise software company. It offers a variety of software products, including enterprise resource planning (ERP), customer relationship management (CRM), human capital management (HCM), and more. It serves industries such as automotive, consumer goods, and energy among others. It was founded in 1977 and is based in Austin, Texas.
Founded1977InfrastructureNoSQL DatabasesScyllaDB
United StatesScyllaDB provides a real-time big data database compatible with Apache Cassandra and used for a variety of business-critical purposes, including as a key value store, time-series database, large blob store, and graph database backend.
InfrastructureNoSQL Databases- New
Nile
United StatesNile specializes in serverless Postgres database solutions for the modern Software as a Service (SaaS) industry. The company offers a multi-tenant database platform that provides built-in tenant and user management, global data placement, and instant customer dashboards. Its services cater to SaaS applications requiring scalable, secure, and efficient data management solutions. It is based in San Francisco, California.
InfrastructureNoSQL Databases - New
pgEdge
United StatespgEdge specializes in distributed PostgreSQL solutions optimized for the network edge within the database and cloud computing industries. The company offers products designed to reduce data latency and provide ultra-high availability for databases, leveraging a multi-master, multi-region, and multi-cloud architecture. pgEdge's services cater to various sectors, including financial services, SaaS media analytics, and companies requiring data residency compliance. It was founded in 2022 and is based in Alexandria, Virginia.
InfrastructureNoSQL Databases - New
RavenDB
IsraelRavenDB is a NoSQL Document Database company that operates in the database industry. The company offers a fully transactional database system designed for applications operating on the cloud or on-premise, providing high performance and data integrity. It is primarily used by organizations ranging from startups to large enterprises. It is based in Haifa, Israel.
InfrastructureNoSQL Databases VMware Tanzu GemFire
United StatesVMware operates as a company focused on providing multi-cloud services for all applications. The company's main offerings include services that enable digital innovation with enterprise control, such as solutions for modern applications, multi-cloud, digital workspace, security, and networking. It was founded in 1998 and is based in Palo Alto, California. In November 2023, VMware was acquired by Broadcom at a valuation of $61B.
InfrastructureNoSQL DatabasesCrate.io
United StatesCrate.io is a scalable real-time database for the machine data era, such as information from the Internet of Things that stores massive amounts of data over the cloud and provides it via SQL connections.
InfrastructureNoSQL DatabasesRiak
Riak is a decentralized datastore from Basho Technologies.
InfrastructureNoSQL DatabasesDatastax
United StatesDataStax is a real-time data company allowing enterprises to mobilize real-time data and quickly build the smart, highly scalable applications required to be a data-driven business. With Astra DB and Astra Streaming, DataStax delivers the power of Apache Cassandra, a scalable database, with the Apache Pulsar streaming technology in an open data stack that is available on any cloud. Astra DB is a multi-cloud DBaaS built on Apache Cassandra, while Astra Streaming is a multi-cloud Streaming-as-a-Service built on Apache Pulsar. The company also offers Luna for Apache Cassandra, a Subscription-to-Success product with open-source Apache Cassandra; Luna Streaming, a Subscription-to-Success product with open-source Apache Pulsar; and DataStax Enterprise, a scale-out, cloud-native NoSQL built on Ap
InfrastructureNoSQL DatabasesGoogle Cloud Bigtable
United StatesGoogle Cloud Platform, offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products.
Founded2008InfrastructureNoSQL Databases
NewSQL Databases
Supabase
SingaporeSupabase develops an open-source alternative to Google's Firebase. It helps developers by providing a Postgres database with a self-documenting application programming interface, edge functions, real-time subscriptions, storage, and vector embeddings. The company was founded in 2020 and is based in Singapore.
InfrastructureNewSQL DatabasesTimescale
United StatesTimescale is an open-source time-series database specifically designed for ease of use and complex queries that is fully compatible with Postgres. The company focuses on developers and businesses working with machine data and IoT. The company was founded in 2015 and is based in New York, New York.
InfrastructureNewSQL DatabasesGoogle Cloud Spanner
United StatesGoogle Cloud Platform, offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products.
Founded2008InfrastructureNewSQL DatabasesPlanetScale
United StatesPlanetScale offers Vitess, an open source sharding middleware system for MySQL that lets applications view a sharded MySQL cluster as a giant monolithic database.
InfrastructureNewSQL DatabasesYugabyteDB
United StatesYugaByte is the company behind YugaByte DB, the cloud-native, transactional, high-performance database for planet-scale cloud applications. YugaByte DB converges the operational database needs of mission-critical applications into an easy-to-manage, unified platform, allowing enterprises to focus on growing their business rather than managing complex infrastructure. The company was founded in 2016 and is based in Sunnyvale, California.
InfrastructureNewSQL DatabasesVolt Active Data
United StatesVolt Active Data develops a data platform designed to augment existing big data investments. It replaces the various layers typically required to make contextual decisions on streaming data with a single, unified layer that can handle ingest to action in milliseconds. The platform features include real-time business decisions; zero-lag streaming aggregation; and monetized streaming data. Its customers include Openet, Huawei, Connexity, Nokia, Mitsubishi Electric, and others. Volt Active Data, formerly VoltDB, was founded in 2010 and is based in Bedford, Massachusetts.
InfrastructureNewSQL DatabasesInfluxData
United StatesInfluxData is a time series, metrics, and analytics database. It is targeted at use cases for DevOps, metrics, sensor data, and analytics. It offers an open-source server agent named Telegraf that helps customers collect metrics from your stacks, sensors, and systems. The company was founded in 2012 and is based in San Francisco, California.
InfrastructureNewSQL Databases- New
Neon
United StatesNeon develops a multi-cloud Serverless Postgres provider which is purpose-built to address modern workloads oriented around edge computing and artificial intelligence (AI). It builds open-source cloud-native PostgreSQL specializing in separation of storage and compute, branching, and serverless architecture. It offers auto-scaling, branching, and bottomless storage solutions allowing for stateless and serverless Postgres. It was founded in 2021 and is based in San Francisco, California.
InfrastructureNewSQL Databases Google Cloud AlloyDB
United StatesGoogle Cloud Platform, offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products.
Founded2008InfrastructureNewSQL Databases- New
Altibase
United StatesAltibase is an enterprise-grade, high-performance relational database company. It offers a hybrid database system that combines in-memory and on-disk data storage in a unified engine, catering to the need for high throughput and economical storage. Altibase primarily serves sectors such as telecommunications, finance, manufacturing, and public services. It was founded in 1999 and is based in New York, New York.
Founded1999InfrastructureNewSQL Databases CockroachDB
United StatesCockroach Labs enables developers to build scalable applications that can survive datacenter-scale outages. With strong consistency and transactions, Cockroach Labs frees developers to focus on what matters, instead of engineering solutions to database shortcomings. The company was founded in 2015 and is based in New York, New York.
InfrastructureNewSQL DatabasesPingCAP
ChinaPingCAP is a NewSQL database provider that develops TiDB, a popular open-source distributed Hybrid Transactional/Analytical Processing (HTAP) database.
InfrastructureNewSQL DatabasesDassault Systemes NuoDB
FranceDassault Systemes (Euronext: DSY) (OTC Pink: DASTY) develops 3D design, 3D digital mock-up, and product lifecycle management (PLM) software. It provides solutions for a wide range of industries including aerospace and defense, architecture, engineering, construction, consumer goods, retail, and more. It was founded in 1983 and is based in Velizy-Villacoublay, France.
Founded1983InfrastructureNewSQL DatabasesMariaDB Xpand
United StatesMariaDB (NYSE: MRDB) provides open-source database solutions for SaaS, cloud, and on-premises applications that require availability, scalability, and performance. Its software uses pluggable, purpose-built storage engines to support workloads that previously required a variety of specialized databases delivering operational agility for businesses to focus on operations and reduce costs, constraints, and complexity related to proprietary databases. MariaDB was formerly known as SkySQI. The company was founded in 2009 and is based in Redwood City, California.
InfrastructureNewSQL DatabasesAWS Amazon Aurora
United StatesAmazon Web Services (AWS) is a business unit within Amazon.com that provides an infrastructure platform for businesses in the form of cloud computing.
Founded2006InfrastructureNewSQL Databases
Real Time Databases
ReadySet
United StatesReadySet provides a drop-in solution to the database performance problems that often arise when a company is in a phase of rapid growth, such as when dealing with large datasets, complicated queries, or high request volumes. To mitigate this problem today, developers must build in-house solutions based on custom query caching systems and read replica hierarchies. The resulting systems are complex, expensive, and prone to failures and outages. ReadySet was founded in 2020 and is based in Boston Massachusetts.
InfrastructureReal Time DatabasesMaterialize
United StatesMaterialize is a SQL streaming database for building internal tools, interactive dashboards, and customer-facing experiences. The company was founded in 2019 and is based in New York, New York.
InfrastructureReal Time DatabasesConvex
United StatesConvex offers a global state management platform for web developers. Its platform allows users to create their own projects, store and sync the app's shared state, and watch the app update with automatic subscriptions. It was founded in 2021 and is based in San Francisco, California.
InfrastructureReal Time Databases- New
RethinkDB
United StatesRethinkDB explores how databases can be intelligently designed to benefit from solid-state disks and flash storage. RethinkDB is built to store JSON documents, and scale to multiple machines with very little effort. It has a pleasant query language that supports really useful queries like table joins and group by, and is easy to setup and learn.
InfrastructureReal Time Databases FeatureBase
United StatesFeatureBase offers a distributed, highly-scalable, real-time database available via open source or cloud that is designed to execute analytical queries with low latency regardless of throughput or query volumes. It is designed for workloads that require real-time analytics on data that is continuously updated through inserts, updates, or deletes. FeatureBase was formerly known as Molecula. The company was founded in 2017 and is based in Austin, Texas.
InfrastructureReal Time DatabasesRedis
United StatesRedis provides a real-time data platform. It offers solutions for enterprise caching, session management, leaderboards, fraud detection, deduplication, and more. It serves retail, gaming, healthcare, and other sectors. It was founded in 2011 and is based in Mountain View, California.
Founded2011InfrastructureReal Time DatabasesAltinity
United KingdomAltinity is the commercial company behind the open-source ClickHouse data warehouse is an open-source SQL data warehouse to match the performance, maturity, and scalability of proprietary databases like Sybase IQ, Vertica, and Snowflake.
InfrastructureReal Time DatabasesStartree
United StatesStarTree operates a real-time analytics platform to power both enterprise and user-facing analytics applications. Its platform helps its users manage their real-time analytics and allows them to onboard more data and applications across additional use cases. It was formerly known as CortexData. The firm was founded in 2019 and is based in Mountain View, California.
InfrastructureReal Time Databases- InfrastructureReal Time Databases
- New
QuestDB
United KingdomQuestDB develops a time-series database for high throughput ingestion and fast structured query language (SQL) queries with operational simplicity. It supports schema-agnostic ingestion using the influx database line protocol (Influx DB), wire protocol (PostgreSQL), and a representational state transfer application programming interface (REST API) for imports and exports. The company was founded in 2019 and is based in London, United Kingdom.
InfrastructureReal Time Databases KX
United KingdomKx Systems provides software solutions that offer tools for the processing of real-time and historical data for trading, research, and smart meter data analysis. Kx offers kdb+, a column-store database with a built-in expressive query and programming language, q.
Founded1993InfrastructureReal Time DatabasesMacrometa
United StatesMacrometa provides a managed service that helps developers create geo-distributed applications and application programming interface (API), with a distributed server-less, database management system. It computes runtime for event driven applications across several edge data centers. Marcometa caters to the IoT, retail, e-commerce, SaaS, telecommunications, gaming and more sectors. The company was founded in 2017 and is based in San Mateo, California.
InfrastructureReal Time DatabasesClickhouse
United StatesClickHouse provides an open-source, column-oriented Online analytical processing (OLAP) database management system that allows users to generate analytical reports using SQL queries. The company was founded in 2021 and is based in Redwood City, California.
InfrastructureReal Time DatabasesRockset
United StatesRockset is a data platform designed to simplify much of the processing to get to querying and application building faster.
InfrastructureReal Time DatabasesImply
United StatesImply is a real-time analytics solution to store, query, and visualize event-driven data. It is built around Apache Druid, a widely-adopted open-source real-time analytics database architected to support streaming ingest and sub-second ad-hoc queries at scale. The company is based in Burlingame, California, and was founded in 2015.
InfrastructureReal Time DatabasesSingleStore
United StatesSingleStore offers a scalable in-memory database that delivers speed and throughput advantages while retaining SQL and ACID compliance. By offering a familiar relational interface to an in-memory data tier, SingleStore empowers developers with the technology web-scale companies use to cope with massive traffic and growth. SingleStore offers orders of magnitude improvements in write and read performance and simplifies application development and maintenance. It was formerly known as MemSQL. The company was founded in 2011 and is based in San Francisco, California.
InfrastructureReal Time Databases
Graph DBs
TigerGraph
United StatesTigerGraph is a provider of a native complete, distributed and parallel graph database platform for enterprise applications, which powers real-time deep link analytics for enterprises with complex and colossal amounts of data, a cloud service and GraphStudio, a visual software development kit (SDK) designed for technical and non-technical users to create, explore and query graphs visually. The company was founded in 2012 and is based in Redwood City, California.
InfrastructureGraph DBsArangoDB
United StatesArangoDB is an open-source database with a flexible data model for documents, graphs, and key-values. The company allows users to build high performance applications using a convenient SQL-like query language or JavaScript/Ruby extensions.
InfrastructureGraph DBsAWS Amazon Neptune
United StatesAmazon Web Services (AWS) is a business unit within Amazon.com that provides an infrastructure platform for businesses in the form of cloud computing.
Founded2006InfrastructureGraph DBsObjectivity
United StatesObjectivity is a provider of distributed, real-time, SOA-enabled, service and embedded database management solutions for mission-critical applications. The company's flagship product, Objectivity/DB, is used by government, security, complex manufacturing, commercial services, science, and engineering organizations to increase speed, precision and productivity.
Raised$15.2MInfrastructureGraph DBsIBM Graph
United StatesIBM (NYSE: IBM) manufactures and sells computer hardware and software, and offers infrastructure services, hosting services, and consulting services in areas ranging from mainframe computers to nanotechnology. The company was founded in 1911 and is based in Armonk, Newyork.
Founded1911InfrastructureGraph DBsDgraph
United StatesDGRaph Labs offers an open source, native, and distributed graph database, developed for real-time, low-latency, and high throughput query flow. Its data distribution is designed to minimize the number of network calls, keeping them directly proportional to the complexity of a query, not the number of results. DGraph, which allows scaling a database from a single laptop to serving terabytes of structured data via commodity hardware, has been built to survive machine failures and partial data center collapses. It also provides joins, the most common operation for a graph database. The company was founded in 2016 and is based in Palo Alto, California.
InfrastructureGraph DBsneo4j
United StatesNeo4j provides websites, telcos, and bioinformatics research organizations a graph database to model and query connected data. The company, which also has a large ecosystem of partners and developers, operates offices in Germany, UK, France, Belgium, Sweden, and Malaysia.
InfrastructureGraph DBs- New
Grafbase
United StatesGrafbase focuses on providing application programming interface (API) solutions for developers. The company offers a service allowing developers to connect their APIs, databases, and microservices, and deploy a high-performance, scalable GraphQL API in a matter of minutes. Grafbase primarily caters to the technology and software development industry. It was founded in 2021 and is based in San Francisco, California.
InfrastructureGraph DBs TerminusDB
IrelandTerminusDB is an open source model driven graph database for knowledge graph representation designed specifically for the web-age.
Raised$4.29MInfrastructureGraph DBsHasura
United StatesHasura is an API gateway with auth middleware. It features auto-renewing SSL, auth APIs and UI kit, and extensible to add providers. It allows users to create their own applications. The company was founded in 2017 and is based in San Francisco, California.
InfrastructureGraph DBs- New
NebulaGraph
United StatesNebulaGraph focuses on providing enterprise-level open-source distributed graph database solutions. The company's main service includes offering a graph database solution capable of handling large datasets with billions of nodes and trillions of edges while maintaining millisecond-level query latency. The company primarily sells to sectors such as financial risk control, real-time recommendation, knowledge graph, and other business scenarios. It was founded in 2018 and is based in Cupertino, California.
InfrastructureGraph DBs Stardog
United StatesStardog uses smart graph technology to unify heterogeneous, disparate data across the enterprise. With Stardog, companies can unify data into a coherent graph-based model that allows business logic and data sources to evolve independently.
InfrastructureGraph DBsApollo
United StatesApollo Graph operates an open-source platform. It provides an application programming interface (APIs), microservices, and databases into a graph that customers can query with GraphQL. The company was founded in 2016 and is based in San Francisco, California.
InfrastructureGraph DBsMemgraph
United KingdomMemgraph is a scalable and enterprise-ready graph database. The memory optimized storage engine provides lock-free data structure and native query compiling. Bundled with import, data processing and visualization tools, it provides a complete ecosystem to tackle the future challenges of big data.
InfrastructureGraph DBsOracle Graph
United StatesOracle operates as an enterprise software company. It offers a variety of software products, including enterprise resource planning (ERP), customer relationship management (CRM), human capital management (HCM), and more. It serves industries such as automotive, consumer goods, and energy among others. It was founded in 1977 and is based in Austin, Texas.
Founded1977InfrastructureGraph DBs
GPU Databases
Heavy.ai
United StatesHEAVY.AI develops GPU-powered data analytics and visualization software platform that enables data analysts to interactively explore large datasets at high speed. The company's database is using Graphics Processing Units (GPUs) to allow SQL queries to be executed in parallel, yielding massive speedups over in-memory databases.
InfrastructureGPU DatabasesSQream
United StatesSQream Technologies provides organizations with a big data analytics SQL database. The GPU-powered platform enables enterprises to rapidly ingest and analyze their growing data.
InfrastructureGPU DatabasesHeteroDB
United StatesHetero DB specializes in database acceleration through a Postgre structured query language (SQL) extension module. It focuses on enhancing database systems with graphics processing unit (GPU) and nonvolatile memory express-solid-state drives (NVME-SSD) technologies. The company offers solutions for processing large-scale data, leveraging parallel computing to optimize searches, aggregations, and transformations. Hetero DB's products aim to maximize hardware capabilities for data handling and real-time analytics in various sectors. It was founded in 2017 and is based in Shinagawa-Ku, Japan.
InfrastructureGPU DatabasesBrytlyt
United KingdomBrytlyt provides large telcos, retailers, and financial institutions with the SpotLyt tool to make sense of their data through analysis and visualization technology. Its GPU database works with patent-pending software built on PostgreSQL for high performance, high speed, high-quality visual analytics. The company can handle complex queries on billions of rows of data, delivering results in milliseconds in the form of user-friendly graphics, maps, and charts. With clear visual analysis, customers can interpret their end-users behavior in essential areas such as fraud prevention, attracting and retaining customers, network performance optimization and risk management, supporting decision-making, and improved outcomes.
InfrastructureGPU Databases- New
IBM Spark GPU
United StatesIBM (NYSE: IBM) is a multinational information technology company. It manufactures and sells computer hardware and software products. It also offers a range of solutions such as cloud cost management, business automation, data management, data warehouse, and more. It serves industries such as automotive, insurance, retail, and more. The company was founded in 1911 and is based in Armonk, New York.
Founded1911InfrastructureGPU Databases Kinetica
United StatesKinetica, formerly GPUdb, provides a distributed, in-memory database accelerated by GPUs, which delivers real-time actionable intelligence on large, complex and streaming data sets from IoT, transactions and other sources and allows organizations to ingest, explore, analyze and visualize streaming data within milliseconds.
InfrastructureGPU Databases
Multi-Model Databases & Abstraction
Xata
United StatesXata is a software company that offers a serverless database service for Jamstack applications. Its service is a combination of a document database with relations, an analytics engine, and a free-text search engine. It is based in Delaware, United States.
InfrastructureMulti-Model Databases & AbstractionSurrealDB
United KingdomSurrealDB is a powerful multi-model database, built for the cloud, and designed to improve the development process of traditional and modern applications. It offers an SQL-style query language, real-time queries with highly-efficient related data retrieval, advanced security permissions for multi-tenant access, and support for performant analytical workloads. The company was founded in 2021 and is based in London, United Kingdom.
InfrastructureMulti-Model Databases & AbstractionEdgeDB
CanadaEdgeDB operates as an open-source relational database built on top of Postgre Structured Query Language (SQL) that aims to enable users to build software with less effort. The platform features strict, strongly typed schema, powerful and expressive query language, a rich standard library, built-in support for schema migrations, and native GraphQL support. The company was founded in 2019 and is based in Toronto, Canada.
InfrastructureMulti-Model Databases & AbstractionTileDB
United StatesTileDB focuses on data management and analytics, operating within the technology and data science sectors. It offers a database that structures complex data for optimized cloud computing and analytics, and a product called TileDB Cloud that provides performance at any degree of dimensionality and scale. The company primarily serves industries such as financial services, healthcare, telecommunications, and oil and gas. It was founded in 2017 and is based in Cambridge, Massachusetts.
InfrastructureMulti-Model Databases & AbstractionPrisma
GermanyPrisma is an open-source ORM for Node.js and TypeScript that helps developers build faster and make fewer errors. It supports JavaScript and TypeScript and has supported databases including PostgreSQL, MySQL, MariaDB, SQL Server, SQLite, and more recently, MongoDB. The company was founded in 2016 and is based in Berlin, Germany.
InfrastructureMulti-Model Databases & AbstractionFauna
United StatesFauna is a database for social and mobile applications. FaunaDB is a modern, adaptive operational database built from the ground up to help digital business scale, without compromising productivity or agility.
InfrastructureMulti-Model Databases & Abstraction
Vector Databases
Pinecone
United StatesPinecone is a fully managed vector database that provides performance without compromises. It lets engineers quickly build accurate, real-time ML applications and then launch or scale them with less overhead, dramatically cutting the time, cost, risk and effort.
InfrastructureVector Databases- InfrastructureVector Databases
Chroma
United StatesChroma is the AI native open-source embeddings database. Using embeddings, Chroma lets developers add state and memory to their AI-enabled applications.
InfrastructureVector DatabasesZilliz
United StatesZilliz provides an enterprise-ready vector database for AI applications. The company builds database technologies to help organizations rapidly create AI/ML applications, and unlock the potential of unstructured data. The company was founded in 2017 and is based in San Francisco, California.
InfrastructureVector DatabasesQdrant
GermanyQdrant is an open-source vector similarity search engine. The company deploys it as an API service, providing a search for the nearest high-dimensional vectors. With Qdrant, embeddings or neural network encoders can be turned into full-fledged applications for matching, searching, recommending, and more. It was founded in 2021 and is based in Berlin, Germany.
Founded2021InfrastructureVector Databases- New
Marqo
AustraliaMarqo is an end-to-end vector search engine. It allows users to store and query unstructured data such as text, images, and code through an easy-to-use API. The company was founded in 2022 and is based in Melbourne, Australia.
InfrastructureVector Databases - New
LanceDB
United StatesLanceDB provides serverless vector database for artificial intelligence(AI) applications. The company build applications for generative artificial intelligence(AI), recsys, search engines, content moderation, and more. It was founded in 2022 and is based in San Francisco, California.
InfrastructureVector Databases Weaviate
NetherlandsWeaviate develops and manages a cloud-native search engine that allows users to bring machine learning models to scale. It offers an open-source vector search engine that stores both objects and vectors and allows combining vector search with structured filtering with the fault-tolerance and scalability of a cloud-native database, all accessible through GraphQL, REST, and various language clients. It was formerly known as SeMi Technologies and changed its name to Weaviate in January 2023. The company was founded in 2019 and is based in Amsterdam, Netherlands.
InfrastructureVector Databases- New
Vespa
NorwayVespa is a company that focuses on big data and artificial intelligence in the technology sector. The company offers a data-serving engine that allows users to apply AI to their data at any scale, with features such as search, recommendation and personalization, conversational AI, and semi-structured navigation. It was founded in 2023 and is based in Trondheim, Norway.
Founded2023InfrastructureVector Databases
ETL/ELT/Data Transformation
Hevo Data
United StatesHevo Data is a fully automated, no-code data pipeline platform that helps organizations leverage data effortlessly. Hevo's end-to-end data pipeline platform enables users to pull data from all sources to the warehouse and run transformations for analytics to generate real-time data-driven business insights. The platform supports ready-to-use integrations across databases, SaaS applications, cloud storage, SDKs, and streaming services. The company was founded in 2017 and is based in San Francisco, California.
InfrastructureETL/ELT/Data TransformationDataddo
United StatesDataddo is a data integration platform for marketers, business owners, data analysts, sales representatives, finance officers, data engineers, and data scientists. It specializes in data integration, extraction, and analytics, data fabric, data management, cloud computing services and transforms data into actionable insights. The company was founded in 2015 and is based in Mountain View, California.
InfrastructureETL/ELT/Data TransformationProphecy
United StatesProphecy.io is a low-code data engineering platform. Prophecy.io democratizes the development and deployment of high-quality data pipelines, combining visual development with agile software engineering best practices.
InfrastructureETL/ELT/Data TransformationPortable
United StatesPortable offers an ELT platform to build connectors on-demand for data teams. The platform enables data teams and companies to access fully-managed connectors and custom development in a no-code solution. The company was founded in 2020 and is based New York, New York.
InfrastructureETL/ELT/Data TransformationMeltano
MexicoMeltano is ELT (Extract, Load, Transform) for the DataOps era: open-source, self-hosted, CLI-first, debuggable, and extensible. Pipelines are code, ready to be version controlled, containerized and deployed continuously. Meltano lets users develop and test locally, then deploy in production along with the built-in airflow integration. Meltano was founded in 2021 and is based in Mexico.
InfrastructureETL/ELT/Data TransformationCoalesce
United StatesCoalesce Automation specializes in data transformation solutions within the technology sector. The company offers a platform that enables the visual development of data pipelines, standardizes data transformations, and integrates specifically with Snowflake. Coalesce Automation primarily serves industries such as manufacturing, retail, financial services, advertising, healthcare, and technology. It was founded in 2020 and is based in San Francisco, California.
InfrastructureETL/ELT/Data TransformationCdata
United StatesCData Software is a company that focuses on data access and connectivity solutions within the technology sector. The company offers a comprehensive connectivity platform that provides real-time data access across enterprise applications and infrastructure, enabling the elimination of data silos and facilitating effective collaboration. It also offers data virtualization for the cloud, universal ETL/ELT for integrating cloud, on-premise, and hybrid data, and hundreds of real-time data connectors for various applications and data sources. It was founded in 2016 and is based in Chapel Hill, North Carolina
InfrastructureETL/ELT/Data Transformationtamr
United StatesTamr connects and enriches the vast reserves of underutilized internal and external data, so enterprises can use all their data for analytics. Tamr combines machine learning algorithms with human insight to identify sources, understand relationships and curate the massive variety of siloed data. The company was formerly known as DataTamer and changed its name to Tamr. The company was founded in 2013 and is based in Cambridge, Massachusetts.
InfrastructureETL/ELT/Data TransformationDatorios
IsraelDatorios is a unified data operations platform that aims to let users rapidly and dynamically build, adjust, and deploy their business and operational pipelines, using all data sources, reducing the dependency on data engineering, DevOps, and outside contractors leveraging the analyst and data privacy. Using no-code visual dev tools, the Datorios product seeks to assist businesses in building and deploying dataflow infrastructures. Customers can build their own data pipelines to handle a variety of transformations, combining on-premises and cloud solutions with current processes and databases.
InfrastructureETL/ELT/Data TransformationGoogle Cloud Data Fusion
United StatesGoogle Cloud Platform, offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products.
Founded2008InfrastructureETL/ELT/Data Transformation- New
Supermetrics
FinlandSupermetrics specializes in data management and marketing intelligence in the technology sector. The company's main services include connecting, transforming, analyzing, and making predictions from data, with a focus on marketing data. Supermetrics primarily sells to sectors such as e-commerce, agencies, and business-to-business (B2B) software-as-a-service (SaaS). It was founded in 2013 and is based in Helsinki, Finland.
InfrastructureETL/ELT/Data Transformation Integrate.io
United StatesIntegrate.io is a data warehouse integration platform that provides a complete integration layer for turning a data warehouse into a data platform. It offers data analytics that requires no coding or deployment. It provides data security, data ingestion, reverse ETL, business intelligence, and more. It serves the e-commerce, media, travel, and healthcare industries. It was founded in 2012 and is based in San Francisco, California.
InfrastructureETL/ELT/Data TransformationAlteryx
United StatesAlteryx (NYSE: AYX) serves the self-service data analytics movement with a platform that can discover, prep, and analyze data, then deploy and share analytics at scale for deeper insights. The company was founded in 2010 and is based in Irvine, California.
InfrastructureETL/ELT/Data TransformationTalend
United StatesTalend provides integration that truly scales. From small projects to enterprise-wide implementations, Talend's highly-scalable data, application, and business process integration platform maximizes the value of an organization's information assets and optimizes return on investment through a usage-based subscription model.
InfrastructureETL/ELT/Data TransformationAirbyte
United StatesAirbyte is an open-source data integration platform that syncs data from applications, APIs, and databases to data warehouses, lakes, and other destinations. The company was founded in 2020 and is based in San Francisco, California.
InfrastructureETL/ELT/Data Transformation- New
Osmos
United StatesOsmos focuses on data ingestion. The company offers services such as automating the cleaning and importing of data into operational systems, providing smart data uploaders for a self-serve experience, and facilitating data migration and onboarding without the need for coding. Osmos primarily serves implementation and operations teams, helping them to streamline data ingestion processes. It was founded in 2019 and is based in Seattle, Washington.
Founded2019InfrastructureETL/ELT/Data Transformation Estuary
United StatesEstuary is a database software company that specializes in linking to both internal and external systems, integrating real-time batch operations, and allowing clients to stream data, including internal services, apps, or external SaaS. It is based in New York, New York.
InfrastructureETL/ELT/Data TransformationAzure Data Factory
United StatesMicrosoft Azure is a cloud computing service created by Microsoft for building, testing, deploying, and managing applications and services through Microsoft-managed data centers.
Founded1975InfrastructureETL/ELT/Data Transformationdbt Labs
United Statesdbt Labs helps data teams work directly within the warehouse to produce trusted datasets for reporting, ML modeling, and operational workflows.
InfrastructureETL/ELT/Data TransformationStitch
United StatesStitch is an ETL service built for developers. It connects to tools like Salesforce and Facebook Ads, and consolidates that data into a central location where it's ready for analysis.
Founded2016InfrastructureETL/ELT/Data TransformationAWS Glue
United StatesAmazon Web Services (AWS) is a business unit within Amazon.com that provides an infrastructure platform for businesses in the form of cloud computing.
Founded2006InfrastructureETL/ELT/Data TransformationKleene
United KingdomKleene's user interface allows data transformations and modelling to be created and orchestrated with no data engineering required.
InfrastructureETL/ELT/Data Transformation- New
Unstructured.io
United StatesUnstructured specializes in data extraction and transformation, focusing on the technology sector. The company provides services that capture unstructured data from various documents and convert it into AI-friendly formats, such as JSON, facilitating the integration with large language models (LLMs). It was founded in 2022 and is based in Rocklin, California.
InfrastructureETL/ELT/Data Transformation - New
DataChannel
IndiaDataChannel is a cloud-based data integration and reverse ETL platform to automate the data collection, preparation, and management processes. It offers digital marketing and analytics, data engineers and analysts, data integrity, data activation, and more. It was founded in 2020 and is based in Gurgaon, India.
Founded2020InfrastructureETL/ELT/Data Transformation Hitachi Vantara Pentaho
United StatesHitachi Vantara, a wholly owned subsidiary of Hitachi, combines technology, intellectual property and industry knowledge to deliver data-managing solutions that help enterprises improve their customers' experiences, develop new revenue streams, and lower the costs of business. The firm's products include enterprise storage, industrial data & IoT management, Lumada data integration, object storage, virtual storage platforms, and EverFlex. It was founded in 2017 and is based in Santa Clara, California.
Founded2017InfrastructureETL/ELT/Data TransformationRivery
United StatesRivery offers a data ETL pipeline and integration platform service that allows businesses to aggregate, transform and automate their data systems in the cloud. Rivery provides a single end-to-end ELT (extract, load, transform) solution which covers key processes to create the optimal data stack: Ingestion, Transformation, Orchestration, and Reverse ETL. Teams can choose from the different modules to build their ideal data infrastructure. Rivery was founded in 2017 and is based in New York, New York.
InfrastructureETL/ELT/Data TransformationFivetran
United StatesFivetran fully automated connectors sync data from cloud applications, databases, event logs and more into the data warehouse.
InfrastructureETL/ELT/Data TransformationMatillion
United KingdomMatillion provides data integration and transformation solution for cloud and cloud data warehouses. Its solutions include data transformation, data analytics, data integration, data lakes, data preparation, and data governance. The company serves enterprises and small and midsize businesses (SMBs). Matillion was founded in 2011 and is based in Altrincham, U.K.
InfrastructureETL/ELT/Data Transformation
Reverse ETL
Hightouch
United StatesHightouch is a data platform that helps businesses sync their customer data from their data warehouse to their CRM, marketing, and support tools. Per the company, it aims to help clients get their data out of a data warehouse and into applications via a process it dubs reverse ETL. The company was founded in 2018 and is based in San Francisco, California.
InfrastructureReverse ETLRudderstack
United StatesRudderStack is a customer data infrastructure for routing and processing event data from apps and websites to data warehouses and cloud apps.
InfrastructureReverse ETLCensus
United StatesCensus is a data automation platform that syncs data warehouses with relevant apps.
InfrastructureReverse ETLMessageGears
United StatesMessageGears offers cloud-based email delivery and tracking services to businesses. Its technology runs partly on-premises, connects an enterprise company's database, and gives customers secure access to marketing data. It offers solutions such as segmentation, messaging, personalization, and more services. It caters its services to retail, technology, finance, and more sectors. The company was founded in 2007 and is based in Atlanta, Georgia.
InfrastructureReverse ETLPolytomic
United StatesPolytomic helps to sync internal data to your business systems.
InfrastructureReverse ETLOctolis
FranceOctolis develops a customer data management platform. Its platform offers software to unify, score, and synchronize customer data that allows members to understand the customer base, normalize data, manage the data life cycle, and create a calculated field for better targeting. The company was founded in 2020 and is based in Saint-Mande, France.
Founded2020InfrastructureReverse ETL
Data Integration
SnapLogic
United StatesSnapLogic specializes in self-service integration. The company's cloud-based platform makes it fast and easy to connect data, applications, and devices, eliminating business silos and technology bottlenecks to accelerate digital business.
InfrastructureData Integration- New
Jitterbit
United StatesJitterbit offers an application programming interface (API) integration platform. It enables companies to rapidly connect software-as-a-service (SaaS), on-premise, and cloud applications and instantly infuse artificial intelligence into any business process. The company was founded in 2004 and is based in Alameda, California.
InfrastructureData Integration - New
Cinchy
CanadaCinchy provides data management services. The company offers solutions for application-specific databases. It was founded in 2017 and is based in Toronto, Canada.
InfrastructureData Integration - New
Make
Czech RepublicMake is a company that focuses on automation software, operating within the technology and software industry. The company offers a visual platform that allows users to design, build, and automate tasks, workflows, apps, and systems without the need for coding. Its services are primarily utilized by individuals, teams, and enterprises across various sectors. It was founded in 2021 and is based in Prague, Czech Republic.
Founded2021InfrastructureData Integration - New
Parabola
United StatesParabola is a data analytics tool that makes complex ad-hoc analysis easy to create and consume. The company's dashboard pulls in data from third-party sources and custom data sources in a way that decision-makers can understand, and play with. The company was founded in 2015 and is based in San Francisco, California.
InfrastructureData Integration StreamSets
United StatesStreamSets provides data ingest technology for big data applications. Its enterprise-grade infrastructure accelerates data analysis and decision-making by bringing unprecedented transparency and event processing to data in motion. On February 28th, 2022, StreamSets was acquired by Software AG. The terms of the transaction were not disclosed.
InfrastructureData Integration- New
Whalesync
United StatesWhalesync offers a no-code data tool and builds multiple search engine optimization (SEO) pages, internal tools, and full-blown applications. It develops no-code data syncing and automation tools. The company was founded in 2021 and is based in Redmond, Washington.
InfrastructureData Integration Tealium
United StatesTealium develops a customer data platform, comprised of an enterprise tag management solution, omnichannel customer segmentation and action engine, and suite of rich data services, creating a vendor-neutral data foundation that spans web, mobile, offline, and IoT. The firm primarily serves financial services, healthcare, retail, sports & entertainment, and hospitality industries. It was founded in 2008 and is based in San Diego, California.
InfrastructureData IntegrationBoomi
United StatesBoomi AtomSphere is an integration service that is fully on-demand and connects any combination of Software-as-a-Service (SaaS), cloud, and on-premise applications without the burden of installing and maintaining software packages or appliances.
InfrastructureData IntegrationFlatfile
United StatesFlatfile plugs into any web app and provides data importing solutions. It uses AI to map and resolve schema with files such as spreadsheets and CSVs. When the algorithms encounter an anomaly or a data type they cant process automatically, they prompt customers to make a decision and then add that scenario to a database for future reference. It develops a software development kit that allows developers to build on top of Flatfiles components to access import, match, merge and export functions. The company was founded in 2018 and is based in Denver, Colorado.
InfrastructureData IntegrationSnowplow
United KingdomSnowplow has created a fully managed platform for creating behavioral data thats used by companies to power artificial intelligence, machine learning and advanced analytics applications. With the Snowplow Behavioral Data Platform, data teams can access behavioral data thats created, modeled and customized for the specific application theyre building. Snowplow was founded in 2012 and is based in London, England.
InfrastructureData IntegrationBobsled
United StatesBobsled is a developer of a data-sharing platform intended for a no-code experience connecting any cloud, platform, or region. The platform manages all aspects such as distribution, updates, versioning, entitlements, telemetry, and more, from any cloud to any destination, enabling clients to choose the right data, update it at the right time, and deliver it to the right place, all from an intuitive web interface. It was founded in 2021 and is based in Los Angeles, California.
InfrastructureData Integrationimport.io
United StatesImport.io is a cloud-based big data platform that specializes in the conversion of the web into a database. It allows companies, developers, and coders to extract, connect and fuse disparate sources of data from the web on sectors such as consumer goods and retail, travel, and hospitality, events and tickets, and marketplaces. It caters its solutions to analytics providers, brands, and retailers. The company was founded in 2012 and is based in Campbell, California.
InfrastructureData Integrationtray.io
United StatesTray.io offers a low-code automation platform that can turn unique business processes into repeatable and scalable workflows that evolve whenever business needs change. The self-service platform enables users to build integrations using any API and connect enterprise applications at scale without incremental costs. The platform has applications in areas such as eCommerce, IT, marketing, marketing ops, product, sales, and sales ops. The company was founded in 2012 and is based in San Francisco, California.
InfrastructureData IntegrationCrux Data
United StatesCrux Informatics offers data engineering and information supply chain operation services that aim to help companies reduce the effort and money spent acquiring, exploring, and processing data. It implements and operates the data processing pipelines they create for its customers' requirements. It delivers solutions such as cloud migration, supply chain, and data integration solutions. The company was founded in 2017 and is based in San Francisco, California.
InfrastructureData IntegrationTwilio Segment
United StatesSegment is a platform for collecting customer data wherever it's generated - website, mobile app, servers, and more - and sending it to third-party tools, internal systems, or SQL databases with the flip of a switch. By consolidating data tracking to a single API, Segment saves engineers' time integrating new tools, eliminates data discrepancies, and democratizes data access across organizations. On October 12th, 2020, Segment was acquired by Twilio for a valuation of $3.2B.
InfrastructureData IntegrationQlik
United StatesQlik specializes in data discovery and user-driven business intelligence, delivering solutions ranging from reporting and self-service visual analysis to guided, embedded, and custom analytics, regardless of where data is located.
InfrastructureData IntegrationSAP Data Services
GermanySAP (FWB: SAP) (NYSE: SAP) is a multinational software corporation that makes enterprise software to manage business operations and customer relations.
Founded1972InfrastructureData IntegrationInformatica
United StatesInformatica Corporation (NYSE: INFA) is a provider of enterprise data integration software and services. With Informatica, companies can gain greater business value by integrating all their information assets from across the enterprise. More than 2,900 companies worldwide rely on Informatica to reduce the cost and expedite the time to address data integration needs of any complexity and scale.
InfrastructureData IntegrationInfoWorks
United StatesInfoWorkshas developed a Dynamic Data Warehousing (DDW) platform on Hadoop. The Infoworks DDW platform provides a high-performance and shared data repository that supports all enterprise analytics on Hadoop through software automation and intelligent data organization.
InfrastructureData IntegrationDenodo
United StatesDenodo is the leader in data virtualization providing agile, high performance data integration and data abstraction across the broadest range of enterprise, cloud, big data, unstructured data sources and real-time data services at half the cost of traditional approaches. Denodo's customers across every major industry have gained significant business agility and ROI by enabling faster and easier access to unified business information for agile BI, big data analytics, Web and cloud integration, single-view applications, and enterprise data services.
InfrastructureData Integration- New
Immersa
United StatesImmersa is a company that focuses on data intelligence in the business sector. The company's main service involves connecting to any data source and structuring the data to answer business questions without the need for the user to worry about schemas or pipelines. Immersa primarily sells to the SaaS industry. It is based in Palo Alto, California.
Founded2021InfrastructureData Integration - New
YepCode
SpainYepCode is an all-in-one SaaS platform focused on enabling development teams to create and manage integrations and automations using source code. The platform provides a serverless environment where users can write, execute, and monitor code, primarily using JavaScript and Python, to connect services and APIs for complex enterprise solutions. YepCode primarily serves sectors that require advanced automation and integration capabilities beyond what no-code tools can offer. It was founded in 2021 and is based in A Coruna, Spain.
Founded2021InfrastructureData Integration MuleSoft
United StatesMuleSoft provides an integration platform for connecting SaaS and enterprise applications in the cloud and on-premise. MuleSoft's Anypoint technology eliminates costly, time-intensive point-to-point integration, enabling business agility. Delivered as a packaged integration experience, Mule iON and Mule ESB are built on open source technology for fast, reliable integration without vendor lock-in. In March 2018, MuleSoft was acquired by Salesforce for $6.86B.
InfrastructureData Integration- New
Sequin
United StatesSequin is a data platform that specializes in API integration and synchronization for developers. The company offers a service that syncs APIs to various tools in real-time, allowing for seamless data integration and management. Sequin primarily caters to the software development industry, providing solutions that replace the need for custom API and glue code development. It was founded in 2020 and is based in Venice, California.
Founded2020InfrastructureData Integration Celigo
United StatesCeligo offers iPaaS integration that can be used to easily and quickly create customized integrations using intuitive wizards to guide business and IT users. Celigo's Integration Apps (formerly SmartConnectors) are prebuilt, full integrations between multiple applications, such as NetSuite, Salesforce, Shopify, Amazon, Jira, Zendesk, and many other popular applications.
InfrastructureData Integration- New
n8n
Germanyn8n operates as a workflow automation tool. It helps users to automate tasks, integrate data, create custom workflows, and more. The company was founded in 2019 and is based in Berlin, Germany.
InfrastructureData Integration Freshpaint
United StatesFreshpaint is a data platform that connects and standardizes customer data from sites or apps to marketing and analytics tools. Its auto-track system collects all pageviews and clicks across a user's site to allow them to push data into analytical tools to eliminate inefficiencies in manually tracking metrics. Its solutions include auto track, precision track, Application Programming Interface (API) identity resolution, time machine, and replay solutions. The company was founded in 2019 and is based in San Francisco.
InfrastructureData IntegrationNextData
United StatesNextdata provides various data-related services such as data product containers, analytic data product APIs, embedded computational policies, and data product discovery. The company was founded in 2022 and is based in California.
InfrastructureData Integration
Data Governance & Catalog
Modern Data Company
United StatesThe Modern Data Company develops a data operating system to remove complexity the data ecosystem. It simplifies how organizations manage, access, and interact with data. The company provides data operations for retail and consumer packaged goods, distribution, environmental, social, governance, technology companies, and more. It was founded in 2018 and is based in Palo Alto, California.
Founded2018InfrastructureData Governance & CatalogSailPoint
United StatesSailPoint (SV7:FRA) provides an identity security cloud platform that discovers, manages, and secures identities and access. Through its AI-driven intelligence, SaaS-based security, cloud access management, file access management, password management, and real-time access risk analysis it enables the enterprises to gain complete access visibility to all their systems. The company was founded in 2005 and is based in Austin, Texas. In August 2022, SailPoint was acquired by Thoma Bravo at a valuation of $6.9B.
InfrastructureData Governance & CatalogALTR
United StatesALTR provides a data-security platform that unleashes the cybersecurity benefits of blockchain. Built on ALTRchain, a high-performance, enterprise-grade blockchain technology for ultra-secure data access and storage, the ALTR platform allows organizations to monitor, access and store critical information.
InfrastructureData Governance & Catalog- New
OvalEdge
United StatesOvalEdge centralizes all of a company's data into a single repository, or catalog, enabling data users to quickly find data files, tables, social media content, and all other company data, no matter where it is stored.
InfrastructureData Governance & Catalog ObservePoint
United StatesObservePoint provides an enterprise data quality assurance platform that maximizes return on marketing technology by identifying, analyzing, and testing Javascript-based technologies deployed on websites. Organizations leverage ObservePoint technology to improve data quality, provide transparency into data collection methods, and monitor and systematically protect vital business metrics.
InfrastructureData Governance & CatalogRaito
BelgiumRaito is a software development company. It develops a data protection platform to manage data access across all databases and dashboards. It simplifies access management without sacrificing performance with the automated policy manager and open-source connectors, enabling users to share their data without having to worry about unauthorized access. It was founded in 2021 and is based in Woluwe-Saint-Pierre, Belgium.
InfrastructureData Governance & CatalogImmuta
United StatesImmuta helps organizations unlock value from their cloud data by providing an integrated Data Security Platform for sensitive data discovery and classification, security and access control, and activity monitoring. The company was founded in 2015 and is based in Boston, Massachusetts
InfrastructureData Governance & CatalogStemma
United StatesStemma is a fully managed data catalog powered by Amundsen, an open-source data catalog.
InfrastructureData Governance & CatalogOrion
United StatesOrion Governance offers an automated data governance platform. The company's software and metadata harvester automatically merge structured, unstructured, and legacy data into a single data center and use it to compile an end-to-end data lineage map on a real-time basis. The company also provides consulting and implementation support and services in the industries of financial services, retail, software, healthcare, airlines, and more. It was founded in 2017 and is based in San Mateo, California.
Founded2017InfrastructureData Governance & CatalogSecuriti
United StatesSecuriti offers a data privacy, security, governance, and compliance platform that creates a layer of unified data intelligence and controls across all major public clouds, data clouds, SaaS, and private clouds. The company serves organizations globally. The company was founded in 2019 and is based in San Jose, California.
InfrastructureData Governance & CatalogAlation
United StatesAlation is a company that specializes in enterprise data intelligence solutions, including data search & discovery, data governance, data stewardship, analytics, and digital transformation. With its Behavioral Analysis Engine, inbuilt collaboration capabilities, and open interfaces, Alation combines machine learning with human insight to tackle demanding challenges in data and metadata management. Alation serves enterprises across a wide range of industries. It was founded in 2012 and is based in Redwood City, California.
InfrastructureData Governance & CatalogInformatica
United StatesInformatica Corporation (NYSE: INFA) is a provider of enterprise data integration software and services. With Informatica, companies can gain greater business value by integrating all their information assets from across the enterprise. More than 2,900 companies worldwide rely on Informatica to reduce the cost and expedite the time to address data integration needs of any complexity and scale.
InfrastructureData Governance & Catalog- New
Alvin
EstoniaAlvin is a company that focuses on operationalizing data lineage in the data management industry. The company offers a platform that automatically builds and maintains a highly accurate dataset representing the connection between various data elements, such as columns, tables, dashboards, jobs, and ML models. This platform addresses key issues such as impact analysis, data discovery, problem tracing, and usage analytics. It was founded in 2018 and is based in Tallinn, Estonia
InfrastructureData Governance & Catalog Collibra
BelgiumCollibra is a data governance and intelligence platform that helps businesses unlock insights from disparate data sources. The company offers various products that allow both technical and business users to collaborate and combine data silos to find hidden meaning in their wealth of information.
InfrastructureData Governance & CatalogSelect Star
United StatesSelect Star builds an automated, and intelligent data discovery platform. It offers a full-text search that allows users to find data and identify which columns of a dataset are most used by applications within a company and are referenced in queries. The company was founded in 2020 and is based in San Franciso, California.
InfrastructureData Governance & Catalog- New
Octopai
IsraelOctopai focuses on data intelligence, operating within the data management and analytics domain. The company offers a metadata management solution that provides services such as data lineage, data discovery, and an automated data catalog, enabling businesses to map, track, and understand their data assets across complex, multi-vendor data ecosystems. It primarily serves sectors such as finance, healthcare, and insurance. The company was founded in 2015 and is based in Kfar Saba, Israel.
InfrastructureData Governance & Catalog Acryl Data
United StatesAcryl Datas vision is to empower data teams with extreme productivity, using trusted, compliant data through a metadata platform.
InfrastructureData Governance & CatalogPrecisely
United StatesPrecisely specializes in data integrity, providing accuracy and consistency in data for customers in more than 100 countries. Preciselys data integration, data quality, location intelligence, and data enrichment products power better business decisions to create better outcomes. The company was founded in 1992 and is based in Burlington, Massachusetts.
InfrastructureData Governance & CatalogHitachi Vantara
United StatesHitachi Vantara, a wholly owned subsidiary of Hitachi, combines technology, intellectual property and industry knowledge to deliver data-managing solutions that help enterprises improve their customers' experiences, develop new revenue streams, and lower the costs of business. The firm's products include enterprise storage, industrial data & IoT management, Lumada data integration, object storage, virtual storage platforms, and EverFlex. It was founded in 2017 and is based in Santa Clara, California.
Founded2017InfrastructureData Governance & CatalogData.world
United StatesData.World provides a platform that allows people to solve complex, academic, commercial, and societal problems. Users can find relevant data from a wide range of sources, manage numerous file formats, understand the data's meaning in ways that can be enhanced and shared, and contribute to and discuss data to trigger collaboration. Data.World was founded in 2016 and is based in Austin, Texas.
InfrastructureData Governance & CatalogStratio
SpainStratio offers data intelligence solutions for the banking, energy, healthcare, insurance, and retail sectors. It offers cloud adoption-assisted mapping, ontologies and knowledge graphs, data marketplace, augmented analytics AI and ML suite, and an on-demand application. The company was founded in 2014 and is based in Madrid, Spain.
InfrastructureData Governance & CatalogOkera
United StatesOkera is a software provider that enables the management of data access and governance at scale for the modern heterogeneous data environment. Okera's Active Data Access Platform allows agility and governance to co-exist and gives data producers, consumers, and stewards the confidence to unlock the power of their data for growth. This unique, enterprise-wide platform facilitates the provisioning, accessing, governing, and auditing of data in the multi-cloud, multi-data format, and multi-tool world.
InfrastructureData Governance & CatalogAtaccama
CanadaAtaccama is a unified data management platform provider. Combining data governance, data catalog, data quality, and master data management into a single, AI-powered fabric across hybrid and cloud environments, Ataccama gives businesses and data teams the ability to scale and accelerate business outcomes while maintaining the trust, security, and governance of data. Financial, commercial, and government organizations use Ataccama's solutions to execute and deliver business benefits. The company was founded in 2007 and is based in Toronto, Ontario.
InfrastructureData Governance & CatalogMetaphor
United StatesMetaphor provides a system of record for users' data ecosystems unifying data assets into a single searchable catalog.
InfrastructureData Governance & CatalogIBM
United StatesIBM (NYSE: IBM) manufactures and sells computer hardware and software, and offers infrastructure services, hosting services, and consulting services in areas ranging from mainframe computers to nanotechnology. The company was founded in 1911 and is based in Armonk, Newyork.
Founded1911InfrastructureData Governance & CatalogAtlan
United StatesAtlan is a modern data collaboration workspace similar to Github for engineering or Figma for design. By acting as a virtual hub for data assets ranging from tables and dashboards to models & code, Atlan enables teams to create a single source of truth for all their data assets and collaborate across the modern data stack through deep integrations with tools like Slack, BI tools, data science tools, and more.
InfrastructureData Governance & CatalogSecoda
CanadaSecoda offers digital collaborative knowledge management tools specializing in metadata, queries, charts and documentation sharing.
InfrastructureData Governance & CatalogBigID
United StatesBigID operates as a security company that offers risk management, privacy of customer data and data intelligence. The solutions allow organizations to find, classify and catalog sensitive data from the cloud to data centers. BigID serves enterprises across multi industries. The company was founded in 2015 and is based in New York, New York.
InfrastructureData Governance & CatalogCastor
FranceCastor operates a data catalog platform. It provides enterprise users with tools to collect, discover, understand and use datasets according to business needs. The solution automatically documents datasets and provides an overview of the entire data environment, giving employees the ability to search through all available data assets in the company. It was founded in 2020 and is based in Paris, France.
InfrastructureData Governance & CatalogSolidatus
United KingdomSolidatus is a data lineage and visualization tool application that allows organizations to rapidly discover, visualize, and understand how data flows through their systems. The company was founded in 2017 and is based in London, England.
InfrastructureData Governance & CatalogSkyhigh Security
United StatesSkyhigh Security is a name for the McAfee Enterprise security service edge business. McAfee is a security technology company, headquartered in Santa Clara, California, that delivers proactive and proven solutions and services that secure systems and networks.
Founded2022InfrastructureData Governance & CatalogDemyst
United StatesDemystData helps financial institutions optimize customer interactions through improved access to information. The company brings together online, social, and internal company data to create more comprehensive profiles and refined customer predictions.
InfrastructureData Governance & Catalog
Infrastructure - Orchestration
Elementl
United StatesElementl is a software solution company. The company offers a system for building modern data applications. It is based in San Francisco, California.
InfrastructureInfrastructure - OrchestrationAWS Step Functions
United StatesAmazon Web Services (AWS) is a business unit within Amazon.com that provides an infrastructure platform for businesses in the form of cloud computing.
Founded2006InfrastructureInfrastructure - OrchestrationMicrosoft Azure Data Factory
United StatesMicrosoft Azure is a cloud computing service created by Microsoft for building, testing, deploying, and managing applications and services through Microsoft-managed data centers.
Founded1975InfrastructureInfrastructure - OrchestrationGoogle Cloud Workflows
United StatesGoogle Cloud Platform, offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products.
Founded2008InfrastructureInfrastructure - OrchestrationPrefect
United StatesPrefect Technologies is a workflow management system designed for modern infrastructure and powered by the open-source Prefect Core workflow engine.
InfrastructureInfrastructure - OrchestrationMage
United StatesMage offers digital AI solutions to build, improve, and integrate AI models into apps.
InfrastructureInfrastructure - OrchestrationSeqera
SpainSeqera Labs provides open-source workflow orchestration software for data pipeline processing, cloud infrastructure, and secure collaboration. The company's products include Nextflow, Wave Containers, and Tower Enterprise, Tower Cloud. It was founded in 2018 and is based in Barcelona, Spain.
InfrastructureInfrastructure - OrchestrationAkuity
United StatesAkuity provides solutions that empower DevOps engineers to deliver applications more simply, more safely, and faster. It is powered by Argo, the open-source suite of Kubernetes-native application delivery software. The company was founded in 2021 and is based in Sunnyvale, California.
InfrastructureInfrastructure - OrchestrationTrocco
InfrastructureInfrastructure - OrchestrationUnion.ai
United StatesUnion.ai is the developer of Flyte, a platform for programming and processing concurrent AI and data analytics workflows. It was founded in 2021 and is based in Seattle, Washington.
InfrastructureInfrastructure - OrchestrationAstronomer
United StatesAstronomer develops data orchestration solutions based on Apache Airflow. The company's suite of products and services include Astronomer Cloud, a multi-tenant, multi-cloud Airflow as a Service and Astronomer Enterprise, a Kubernetes-native platform to easily deploy, manage, and scale distributed Airflow services.
InfrastructureInfrastructure - OrchestrationNeurelo
United StatesNeurelo provides a data access platform. The company offers a platform that abstracts database complexities, providing auto-generated application program interfaces (APIs) and custom query APIs with AI assistance, enabling developers to build and run applications more efficiently with PostgreSQL, MongoDB, and MySQL. It primarily serves the software development industry. It was founded in 2022 and is based in Los Altos, California.
InfrastructureInfrastructure - OrchestrationOuterbounds
United StatesProvider of full service application hosting and professional services specializing in complete end-to-end solutions. The company offers complete IT outsourcing, ISV hosting solutions, PC-lifecycle management, and electronic software distribution. The company develops and implements enterprise-wide solutions for server-based application deployment, thin-client, network security, wireless connectivity, and other networking solutions. The professional services encompass consulting, system design and integration, certified training, and support.
InfrastructureInfrastructure - Orchestration
Data Quality & Observability
Validio
SwedenValidio is a SaaS company that offers a cloud-based solution for enterprise customers for real-time validation, monitoring and cleaning of big data. The platform aims to help data teams eliminate bad data through monitoring, validation and cleaning of data in real time streams and batches. It was founded in 2019 and is based in Stockholm, Sweden.
InfrastructureData Quality & ObservabilityTalend
United StatesTalend provides integration that truly scales. From small projects to enterprise-wide implementations, Talend's highly-scalable data, application, and business process integration platform maximizes the value of an organization's information assets and optimizes return on investment through a usage-based subscription model.
InfrastructureData Quality & ObservabilityManta
United StatesManta is a tool for the automatic analysis of information flows in business intelligence, enabling companies to automate data flow mapping in complex environments. The tool can optimize systems, reduce development costs, perform impact analyses, and comply with regulatory requirements such as GDPR, CCAR, or HIPAA. It caters to companies across all industries, such as healthcare, banking, and more. Manta was founded in 2015 and is based in New York, New York.
InfrastructureData Quality & ObservabilityAcceldata
United StatesAcceldata provides a big data APM (Application performance management) solution assisting enterprises to optimize data lake operations, predicting failures, and troubleshooting. It provides insights into application performance, detects anomalies and alerts which can be used by data engineers, architects, and cluster operators. Acceldata analyzes performance bottlenecks and generates recommendations for remediation. The company was founded in 2018 and is based in San Jose, California.
InfrastructureData Quality & ObservabilityGreat Expectations
United StatesGreat Expectations focuses on data quality and collaboration in the technology sector. The company offers a platform that provides data testing, documentation, and profiling services, enabling users to maintain data integrity and accelerate data discovery. It primarily serves data teams across various sectors. It was founded in 2017 and is based in Midvale, Utah.
InfrastructureData Quality & ObservabilitySynq
United KingdomSynq is a platform helping data practitioners build reliability throughout the entire development life-cycle, from planning to production.
InfrastructureData Quality & ObservabilitySifflet
FranceSifflet is a cloud-native solution aimed at helping organizations make the most of their data. The company's platform handles the complex process of connecting to different data sources and adapts instantly to the data volumes, enabling every type and size of business to make data management more accessible. It was founded in 2021 and is based in Paris, France.
InfrastructureData Quality & ObservabilityDataband
IsraelDataband develops a software platform for agile machine learning development. The company is developing a framework that aims to offer cross-team standardization and complete visibility for project stakeholders. The company was founded in 2019 and is based in Tel Aviv, Israel. As of July 6th, 2022, Databand was acquired by IBM. The terms of the transaction were not disclosed.
InfrastructureData Quality & Observability- New
Telmai
United StatesTelmai develops a no-code data quality analysis and monitoring platform. Its product automatically detects data quality issues as data is being ingested and enables data teams to detect and investigate data quality issues. The company was founded in 2020 and is based in San Francisco, California.
InfrastructureData Quality & Observability