Apache Flink lets you process and analyze data streams in real time, offering scalable, stateful computations for data-driven applications.
Process data streams at any scale in real time
Apache Flink is an open-source framework designed for real-time data stream processing. Whether you’re dealing with unbounded or bounded data, Flink helps you perform complex computations at high speed and across distributed clusters, making it a go-to solution for data-driven applications.
With Flink, you can build scalable, stateful applications that analyze and react to data as it arrives. The platform supports integration with popular tools and environments, and it’s built to handle workloads of any size, from small deployments to massive enterprise clusters.
If you’re interested in learning more, the site provides clear documentation, a supportive community, and practical guides to help you get started. Whether you’re a developer, data engineer, or researcher, Flink empowers you to harness the power of real-time data processing.
Discover websites similar to Flink.apache.org based on shared categories, topics, and features.
Dask is an open-source Python library that helps you run data analysis and machine learning tasks faster by scaling your existing Python tools.
Apache Hive is a distributed data warehouse system for scalable analytics, letting you read, write, and manage big data using SQL on various storage systems.
Apache Kafka is an open-source platform for building distributed streaming and messaging applications, trusted by major companies worldwide.
Apache Mesos lets you manage datacenter resources as a single pool, making it easy to build and run scalable, fault-tolerant distributed systems.
MonetDB is a high-performance database system designed for fast analytics and data management using standard SQL. Open source and easy to use.
Access cross-national microdata for research and analysis with remote tools from the LIS Data Center in Luxembourg. Ideal for social science studies.
Access, visualize, and analyze economic data from hundreds of sources with easy-to-use tools and charts. Ideal for exploring trends and making comparisons.
Global Forest Watch provides real-time data and tools to monitor forests worldwide, helping you track deforestation and land use trends for better protection.
DBpedia lets you explore and use structured data from Wikipedia, offering tools, datasets, and knowledge graphs for research, analysis, and development.
Jupyter offers a web-based tool for writing and running code, sharing data, and creating interactive notebooks for science, data analysis, and education.
Matplotlib is a Python library for creating static, animated, and interactive data visualizations, with extensive guides, examples, and documentation.
Seaborn is a Python library for creating beautiful, informative statistical data visualizations. Explore guides, tutorials, and API docs for easy plotting.
Browse and share humanitarian crisis data from around the world to support relief efforts, with thousands of datasets from trusted organizations.
Explore and visualize open data from multiple sources on topics like health, economy, and environment. Find trends, charts, and insights in one place.
Apache Hadoop offers open-source tools for scalable, distributed computing and data analysis, letting you process big data efficiently and reliably.
NSF NEON offers open ecological data and resources to help you explore, analyze, and understand ecosystems across the United States.
Access global trade, export, import, and tariff data with WITS. Analyze trade competitiveness and download detailed statistics for any country.
CodaLab hosts data science and machine learning competitions where you can join, track progress, and collaborate with others worldwide.
Explore, compare, and download U.S. census data with easy-to-use tables, maps, and charts for places across America. Visualize and embed census info.
Genboree offers tools and databases for biomedical researchers to manage, analyze, and share genomics and biological data in a collaborative environment.
Hazelcast is a unified real-time data platform that lets you process streaming data instantly, combining stream processing and fast data storage in the cloud.
Cloudera offers a secure hybrid data platform for managing, analyzing, and moving data across clouds and on-premises, with built-in AI and analytics tools.
WEKA offers a high-performance data platform for storing, processing, and managing data across cloud and on-premises, powering AI and machine learning workloads.
CARTO lets you analyze, visualize, and build apps with spatial data on the cloud, making advanced location analytics easy for businesses and developers.
ClickHouse is a fast, open-source database for real-time analytics and reporting using SQL, ideal for business intelligence, ML, and big data tasks.
ScyllaDB offers a fast, scalable NoSQL database for data-intensive apps, delivering high performance and low latency for businesses and developers.
Protect and manage your data across hybrid and multi-cloud environments with Veeam’s self-managed backup and recovery solutions.
ArcGIS Hub helps you organize people, data, and tools in one cloud platform to support initiatives, share insights, and achieve community goals.
Qdrant is an open-source vector database and search engine that helps you build fast, scalable AI-powered search and recommendation systems.
3DEXPERIENCE is a cloud platform for businesses to collaborate, manage projects, and streamline workflows in one secure, unified space.
Dassault Systèmes offers a collaborative platform using virtual twin technology to help businesses design, innovate, and create sustainable solutions.
Jakarta EE is an open source platform for building cloud-native, enterprise Java applications, offering guides, specs, and community support for developers.
Azul delivers high-performance, secure Java platforms and tools for modern cloud enterprises, helping you optimize Java applications and runtime environments.
Printix lets you manage and secure all your company’s printing from the cloud, so you can print anywhere without print servers or complicated setup.
Explore tailored Microsoft Cloud solutions for industries like healthcare, finance, and government to help your organization streamline and innovate.
Project Atomic offered tools and resources for deploying and managing containers on next-gen operating systems, now guiding users to Fedora CoreOS.
Reown Cloud is an all-in-one platform for creators to build, manage, and grow their online presence with modern tools and seamless account access.
Prometheus is an open-source tool for monitoring systems and analyzing time series data with powerful metrics, alerts, and flexible querying.
Mintel offers up-to-date market research, consumer insights, and industry data to help businesses make informed decisions and spot new opportunities.
Securcube provides digital forensic tools for analyzing phone records and cell site data, helping professionals uncover critical evidence efficiently.
Virtuoso lets you connect, manage, and analyze data from multiple sources using open standards, with flexible AI-powered tools for individuals and businesses.
Datomic lets you build flexible, distributed systems that store and query all your data history, on your own infrastructure or in the cloud.
Knative offers tools for building, deploying, and managing serverless workloads on Kubernetes, helping developers create scalable cloud-native apps.
Gnuplot is a free, cross-platform graphing tool for creating plots and charts from data or mathematical functions, supporting both interactive and scripted use.
Unlock business insights with AI-powered data analysis tools and solutions, available in Korean. Make smarter decisions with innovative data technology.
Generate knowledge graphs easily with RML.io tools for Windows, Mac, and Linux. Use simple rules to turn your data into structured, connected insights.
Workspot unifies virtual desktops, enterprise browsing, and analytics to help businesses securely manage and scale remote work on any device.
GlobalDots helps businesses optimize cloud costs, boost efficiency, and enhance security with innovative cloud and web solutions tailored to your needs.
Microsoft Azure is a cloud platform offering tools to build, deploy, and manage apps and services securely from anywhere, for businesses of all sizes.
Istio is an open platform that helps you manage, secure, and monitor microservices across your cloud-native apps. Try it to simplify service management.
CockroachDB offers a cloud-native, distributed SQL database for building always-on, scalable applications with control over data and zero downtime.
BellSoft offers secure Java runtimes, tools, and cloud-native platforms to help you run, optimize, and manage modern Java applications efficiently.
Veeva offers cloud-based software solutions for the life sciences industry, helping pharmaceutical and biotech companies manage data, processes, and compliance.
Gaia-X is a European initiative building a secure, federated cloud and data infrastructure to enable trusted, decentralized digital ecosystems.
Explore and access genomics data, resources, and tools at the National Genomics Data Center—supporting research in life and health sciences worldwide.
Ontotext helps enterprises use AI and knowledge graphs to unify data, boost analytics, and improve collaboration across platforms like Microsoft 365 Copilot.
QUODD delivers flexible, on-demand market data solutions for financial institutions and startups, offering timely insights and customizable data products.
Explore and analyze large-scale networks with SNAP, Stanford's platform for efficient graph mining, available in C++ and Python for research and development.
Explore research, software, and resources from the UW Interactive Data Lab, focused on data visualization and interactive analysis tools.
Polygraph creates engaging data visualizations that turn complex information into clear, interactive stories for readers and learners.