OpenML lets you share datasets, algorithms, and experiments so you can collaborate on machine learning research and analysis.
Share and explore machine learning datasets
OpenML is a collaborative platform where you can discover, share, and analyze machine learning datasets, algorithms, and experiments. Whether you’re a researcher, student, or data enthusiast, you’ll find a rich collection of resources to support your projects and deepen your understanding of machine learning.
You can upload your own datasets and code, explore others’ contributions, and compare experimental results. OpenML makes it easy to collaborate with peers, track progress, and learn from shared experiments. It’s designed to help you work with the global scientific community to advance machine learning research in an open, transparent way.
Discover websites similar to Openml.org based on shared categories, topics, and features.
Movebank lets you explore, manage, and share animal tracking data for research and collaboration in wildlife movement and ecology studies worldwide.
Sage Bionetworks helps researchers share, analyze, and reuse biomedical data, accelerating scientific discovery with AI-powered tools and a collaborative platform.
Analyze life science data online with a collaborative platform designed for research and community-driven workflows in bioinformatics and genomics.
The HDF Group offers tools, libraries, and support for managing, sharing, and preserving scientific and engineering data across platforms and environments.
OPeNDAP offers free, open-source tools to help researchers and data providers access, share, and manage distributed scientific datasets easily.
CAS offers scientific research platforms and data solutions to help researchers accelerate discoveries, manage information, and drive innovation across fields.
Apache Spark is an open-source engine for large-scale data analytics, supporting data engineering, science, and machine learning in multiple languages.
Galaxy is a community-driven data analysis platform offering tools, workflows, and free tutorials for researchers, scientists, and learners worldwide.
CodaLab Worksheets lets you run, share, and reproduce data experiments and research code online, making collaboration and transparency simple.
AnVIL Portal helps you migrate and analyze genomic research data in the cloud, offering collaborative tools and access to large scientific datasets.
MDAnalysis offers open-source tools for analyzing molecular simulation data, helping researchers explore molecular structures and dynamics easily.
Explore biological networks with tools, resources, and training for researchers to analyze genes, proteins, and interactions in biomedical science.
Dask is an open-source Python library that helps you run data analysis and machine learning tasks faster by scaling your existing Python tools.
Explore and visualize high-dimensional data or machine learning embeddings interactively in your browser with TensorFlow’s easy-to-use projector tool.
Stan is an open-source platform for Bayesian data analysis and statistical modeling, offering tools, documentation, and a supportive user community.
Access and share genomic data on viruses like influenza and COVID-19. GISAID supports global research and public health collaboration.
PhysioNet offers free access to complex physiologic signal data and tools, supporting research and collaboration in biomedical and health science fields.
Explore global bird conservation data, tools, and insights to support biodiversity protection, scientific research, and informed environmental decisions.
OBIS is a global, open-access database for marine biodiversity, offering data and resources to support ocean science, conservation, and sustainability.
DataONE connects you to a vast network of Earth and environmental data, offering tools and training to help researchers access, share, and manage data.
Access, manage, and analyze seismological and earth science data with NSF SAGE Data Services, supporting the global geoscience and research community.
Explore NIH's data science resources and initiatives supporting biomedical research, including tools, training, and strategic plans for scientific advancement.
Find benchmark datasets, data loaders, and evaluators for graph machine learning research, all designed to work with PyTorch models and tools.
Benchling is a cloud platform for biotech R&D, helping scientists plan, record, and share experiments for better collaboration and scientific insights.
Galaxy offers web-based tools for life science research, letting you analyze data, collaborate, and share results—no programming required.
OpenText Analytics Database offers fast data analysis, machine learning, and AI-powered insights for businesses, with flexible deployment options.
Join a global data science and machine learning community, access datasets, enter competitions, and use collaborative tools to grow your skills.
Dataiku is a platform to build, deploy, and manage AI and analytics projects, helping teams turn data into business insights and smarter decisions.
DataChain offers tools for data management, preprocessing, experiment tracking, and ML model versioning to streamline large-scale AI data workflows.
Explore and access genomics data, resources, and tools at the National Genomics Data Center—supporting research in life and health sciences worldwide.
Explore research, software, and resources from the UW Interactive Data Lab, focused on data visualization and interactive analysis tools.
Explore biomedical informatics research, health data standards, and tools for analyzing large health databases at Lister Hill National Center (NLM/NIH).
Explore cancer genomics data, tools, and resources to support cancer research and discovery, provided by the NCI Genomic Data Commons.
Explore Tableau Research for insights on data visualization and analysis, featuring publications and innovations from Tableau’s industrial research team.
Manage, analyze, and visualize scientific data with AI-powered tools for smarter workflows and collaboration. Ideal for researchers and scientists.
Explore innovative data analysis, simulation, and research tools at RPI's IDEA, supporting scientists and engineers with advanced computational resources.
Mobilize Center offers open-source tools and resources to help researchers easily integrate advanced methods into biomedical research projects.
OpenMS is an open-source platform for mass spectrometry data analysis and visualization, offering tools and workflows for researchers and developers.
Domino Data Lab is an enterprise AI platform that helps data science teams accelerate research, deploy models, and collaborate using trusted tools.
NERSC provides powerful computing resources and data storage for scientific research, supporting thousands of scientists with advanced tools and expertise.
Access seismic data, tools, and resources for earth science research and education through this collaborative seismology and geoscience platform.
Daylight offers cheminformatics tools for chemical data analysis, structure searching, and knowledge management for scientific discovery and research.
Explore space weather models, run simulations, and access tools for ionosphere-thermosphere research at NASA's CCMC platform for the science community.
Find and explore rat genomic, genetic, and disease data, plus analysis tools and resources for researchers in genetics and biomedical science.
Explore data science resources, events, and community programs from the University of Washington's eScience Institute. Learn, connect, and advance discovery.
Access standardized greenhouse gas data and resources from monitoring stations across Europe, supporting climate research and environmental understanding.
Access NASA's geospatial datasets and climate data through a unified interface. Explore, analyze, and download scientific data for research and education.
HDR UK connects health data across the UK, supporting research and discoveries to improve health and wellbeing through collaborative data science.
Discover chemistry software for R&D, knowledge management, and data analysis to streamline research, improve collaboration, and accelerate innovation.
Explore human gene expression and regulation across tissues with open-access data, visualizations, and resources from the Genotype-Tissue Expression (GTEx) project.
Explore research, data, and resources on how human activity impacts the environment and climate at the Center for Sustainability and the Global Environment.
Tecton helps teams build, manage, and serve machine learning data features, making it easier to get AI models into production quickly and reliably.
Open Source Brain shares open data, models, and code for neuroscience research, letting you explore and analyze brain science projects interactively online.
Scilab is an open-source platform for numerical analysis, data visualization, and algorithm development, with tools for modeling and simulation.
RAPIDS offers open-source, GPU-accelerated data science libraries, helping you analyze and process data faster with familiar Python APIs.
Element 84 delivers geospatial data processing and software solutions to help organizations analyze, visualize, and use earth data for positive impact.
Access free, open biodiversity data from around the world. Explore species, occurrences, and datasets to support research and conservation efforts.
Explore Australia's earth sciences with maps, data, news, and education resources from Geoscience Australia. Access earthquake info and scientific insights.
Research group at LMU Munich focused on computer vision and machine learning, exploring image and video understanding, generative models, and AI applications.
Jupyter offers a web-based tool for writing and running code, sharing data, and creating interactive notebooks for science, data analysis, and education.