Dask is an open-source Python library that helps you run data analysis and machine learning tasks faster by scaling your existing Python tools.
Scale your Python data workflows easily
Dask is an open-source platform designed to help you scale your Python data analysis and machine learning workflows. Whether you’re working with large datasets or need to speed up your computations, Dask lets you use familiar Python tools like pandas with the power of parallel computing.
You don’t need to change much in your existing code—Dask works seamlessly with popular libraries, making it a great fit for data scientists and engineers looking to boost performance. The site offers clear documentation, community support, and helpful resources to get started quickly.
If you want to process big data or accelerate your analytics in Python, Dask provides the flexibility and performance you need, all while staying open-source and community-driven.
Discover websites similar to Dask.org based on shared categories, topics, and features.
Apache Hive is a distributed data warehouse system for scalable analytics, letting you read, write, and manage big data using SQL on various storage systems.
Apache Spark is an open-source engine for large-scale data analytics, supporting data engineering, science, and machine learning in multiple languages.
Explore and visualize high-dimensional data or machine learning embeddings interactively in your browser with TensorFlow’s easy-to-use projector tool.
Apache Flink lets you process and analyze data streams in real time, offering scalable, stateful computations for data-driven applications.
Kubeflow helps you build, deploy, and manage machine learning workflows easily on Kubernetes, making AI projects simple and scalable.
OpenSearch is an open source search and analytics suite for finding, visualizing, and analyzing data, with AI and machine learning tools included.
OpenRefine lets you clean, transform, and organize messy data for free. Easily format, enrich, and prepare datasets using this open source tool.
Tidyverse offers a collection of R packages for data science, making data analysis, visualization, and manipulation in R simpler and more consistent.
DVC is an open-source tool for version control in data science and machine learning, helping you track data, models, and experiments like with Git.
Apache Pinot is an open source platform for real-time data analytics, letting you quickly analyze and visualize large datasets for instant insights.
Analyze life science data online with a collaborative platform designed for research and community-driven workflows in bioinformatics and genomics.
Apache Pig lets you analyze large data sets using a simple high-level language, making it easier to process and manage big data efficiently.
Apache Arrow offers a universal columnar data format and tools for fast, multi-language data analytics and seamless data interchange between systems.
Apache Zeppelin is a web-based notebook for interactive data analytics, letting you create collaborative documents using SQL, Scala, Python, R, and more.
Explore pandas, the open source Python library for fast, flexible data analysis and manipulation. Get started with guides, docs, and a helpful community.
Open-source tool for analyzing and visualizing data across sciences and engineering, supporting everything from large-scale simulations to desktop use.
Galaxy Europe is an open-source platform for accessible, FAIR data analysis with tools, resources, and a strong community for scientific collaboration.
Apache Druid is a high-performance analytics database for fast, real-time querying of streaming and batch data at any scale.
Manage and analyze massive multidimensional data cubes for science and research with flexible, scalable tools supporting open standards.
Query Wikipedia and related databases using SQL right in your browser. Explore, analyze, and share data easily—no software installation needed.
WEKA offers a high-performance data platform for storing, processing, and managing data across cloud and on-premises, powering AI and machine learning workloads.
Join a global data science and machine learning community, access datasets, enter competitions, and use collaborative tools to grow your skills.
Cloudera offers a secure hybrid data platform for managing, analyzing, and moving data across clouds and on-premises, with built-in AI and analytics tools.
Find benchmark datasets, data loaders, and evaluators for graph machine learning research, all designed to work with PyTorch models and tools.
Domino Data Lab is an enterprise AI platform that helps data science teams accelerate research, deploy models, and collaborate using trusted tools.
CARTO lets you analyze, visualize, and build apps with spatial data on the cloud, making advanced location analytics easy for businesses and developers.
ClickHouse is a fast, open-source database for real-time analytics and reporting using SQL, ideal for business intelligence, ML, and big data tasks.
Element 84 delivers geospatial data processing and software solutions to help organizations analyze, visualize, and use earth data for positive impact.
Dataiku is a platform to build, deploy, and manage AI and analytics projects, helping teams turn data into business insights and smarter decisions.
ScyllaDB offers a fast, scalable NoSQL database for data-intensive apps, delivering high performance and low latency for businesses and developers.
Protect and manage your data across hybrid and multi-cloud environments with Veeam’s self-managed backup and recovery solutions.
A community Q&A site for statistics, data analysis, and machine learning where you can ask questions, share knowledge, and discuss data topics.
ArcGIS Hub helps you organize people, data, and tools in one cloud platform to support initiatives, share insights, and achieve community goals.
Qdrant is an open-source vector database and search engine that helps you build fast, scalable AI-powered search and recommendation systems.
LAION is a nonprofit sharing open machine learning datasets, tools, and models to support research, education, and accessible AI development for everyone.
Weights & Biases helps AI developers track experiments, manage models, and streamline machine learning workflows from training to production.
StarRocks is an open-source database for fast, real-time analytics using SQL, designed to help businesses handle large-scale data easily and efficiently.
Polars offers a modern DataFrame platform for fast, scalable data analysis, letting you write queries and handle big data without managing servers.
Galaxy offers web-based tools for life science research, letting you analyze data, collaborate, and share results—no programming required.
Juice Analytics helps you turn complex data into clear, actionable insights with easy-to-use tools designed for businesses and technology teams.
Virtuoso lets you connect, manage, and analyze data from multiple sources using open standards, with flexible AI-powered tools for individuals and businesses.
Datomic lets you build flexible, distributed systems that store and query all your data history, on your own infrastructure or in the cloud.
JMP offers powerful tools for data analysis, visualization, and sharing, making it easy for scientists, engineers, and anyone to explore and understand data.
MAXQDA is a software platform for qualitative and mixed methods data analysis, helping you code, analyze, and present research data with AI-powered tools.
Finaeon offers in-depth financial data and analytics to help investment professionals and researchers make informed decisions using historical market insights.
Spotfire is a visual data science platform for businesses, offering easy data analysis, AI-driven insights, and interactive dashboards for smarter decisions.
HEAVY.AI offers fast, GPU-accelerated analytics for businesses and government to visualize and analyze massive geospatial and time-based data in real time.
Lizeo helps businesses make better decisions with data-driven insights, offering tools for price intelligence, product analysis, and market trends.
Create interactive dashboards and reports to visualize your data, helping you make smarter business decisions. Free and easy to use for everyone.
Power BI lets you visualize data, create interactive dashboards, and analyze information to gain insights and make better business decisions.
Collaborate on data analysis and create interactive charts and dashboards together in real time with Observable's online data visualization platform.
Graph Commons lets you map, analyze, and share complex data networks easily, helping you find insights and collaborate with others online.
data.world helps you organize, find, and use business data easily with a searchable catalog and tools for analytics, collaboration, and data governance.
Cuebiq offers location intelligence tools for brands, agencies, and researchers to analyze real-world movement, measure foot traffic, and target audiences.
KNIME is a free, open-source platform for building visual workflows to analyze data, automate tasks, and deploy AI solutions in your organization.
LSEG Data & Analytics offers global financial data, analytics, and AI-powered tools to help professionals gain insights and make informed decisions.
Dremio is a cloud-based data lakehouse platform offering fast SQL analytics, self-service data exploration, and AI-ready data management for businesses.
VSNi provides data analysis software and consultancy for plant, animal, aquaculture, and forestry breeding, supporting research in agri-science.
Trino is a fast, distributed SQL query engine that lets you analyze big data from multiple sources, helping you explore and understand your data easily.
Alteryx is a platform for automating analytics and preparing AI-ready data, letting you analyze, visualize, and make smarter decisions without coding.