Arvados is an open source platform for managing, analyzing, and sharing large-scale genomic and biomedical data for research and collaboration.
Process and share big scientific data with ease
Arvados is an open source platform built for researchers, scientists, and organizations who work with massive genomic and biomedical datasets. With Arvados, you can efficiently manage, process, and share your scientific data, making collaboration and discovery easier than ever.
The platform provides robust tools for handling next-generation sequencing, biomedical imaging, and other large data sets, supporting everything from drug discovery to clinical testing. It’s designed to help you stay organized, maintain regulatory compliance, and streamline your research workflows.
Whether you're a developer, system administrator, or researcher, Arvados offers comprehensive documentation and a supportive community to help you get started. If you need a powerful, open source solution for scientific data analysis and collaboration, Arvados has you covered.
Discover websites similar to Arvados.org based on shared categories, topics, and features.
OPeNDAP offers free, open-source tools to help researchers and data providers access, share, and manage distributed scientific datasets easily.
AnVIL Portal helps you migrate and analyze genomic research data in the cloud, offering collaborative tools and access to large scientific datasets.
Galaxy is a community-driven data analysis platform offering tools, workflows, and free tutorials for researchers, scientists, and learners worldwide.
Explore and visualize multi-dimensional data with interactive scatter plots, histograms, and images using glue's linked-data analysis tools.
Analyze life science data online with a collaborative platform designed for research and community-driven workflows in bioinformatics and genomics.
Open-source tool for analyzing and visualizing data across sciences and engineering, supporting everything from large-scale simulations to desktop use.
Manage and analyze massive multidimensional data cubes for science and research with flexible, scalable tools supporting open standards.
Apache Spark is an open-source engine for large-scale data analytics, supporting data engineering, science, and machine learning in multiple languages.
The HDF Group offers tools, libraries, and support for managing, sharing, and preserving scientific and engineering data across platforms and environments.
CAS offers scientific research platforms and data solutions to help researchers accelerate discoveries, manage information, and drive innovation across fields.
Movebank lets you explore, manage, and share animal tracking data for research and collaboration in wildlife movement and ecology studies worldwide.
CodaLab Worksheets lets you run, share, and reproduce data experiments and research code online, making collaboration and transparency simple.
OpenML lets you share datasets, algorithms, and experiments to collaborate and advance machine learning research and analysis together.
Sage Bionetworks helps researchers share, analyze, and reuse biomedical data, accelerating scientific discovery with AI-powered tools and a collaborative platform.
SemanticKITTI offers a large, annotated LiDAR dataset for research in 3D scene understanding, supporting computer vision and autonomous driving projects.
GÉANT connects European research and education networks, offering advanced infrastructure and services to support collaboration and innovation worldwide.
Apache Hadoop offers open-source tools for scalable, distributed computing and data analysis, letting you process big data efficiently and reliably.
Development Seed creates geospatial tools and data solutions to help you better understand our planet and make smarter decisions about a changing world.
CyVerse is an open science workspace for secure data storage, sharing, and collaborative analysis, supporting research and AI projects in a cloud environment.
Genboree offers tools and databases for biomedical researchers to manage, analyze, and share genomics and biological data in a collaborative environment.
Code Ocean offers a collaborative cloud platform for reproducible computational research, letting you manage code, data, and results in one secure place.
Access high-performance supercomputing resources for open scientific research and collaboration at the Argonne Leadership Computing Facility.
Access and analyze large-scale scientific climate data worldwide with this open-source platform designed for researchers and the scientific community.
NERSC provides powerful computing resources and data storage for scientific research, supporting thousands of scientists with advanced tools and expertise.
Firebolt is a cloud data warehouse built for fast analytics and AI apps, letting you analyze large datasets quickly and scale with ease.
Dremio is a cloud-based data lakehouse platform offering fast SQL analytics, self-service data exploration, and AI-ready data management for businesses.
MotherDuck is a cloud-based data warehouse built on DuckDB, letting you analyze big data quickly and easily with instant SQL and seamless collaboration.
Open-source data analysis framework for scientific research, designed to handle and analyze large datasets in high energy physics and related fields.
Polars offers a modern DataFrame platform for fast, scalable data analysis, letting you write queries and handle big data without managing servers.
Manage, analyze, and visualize scientific data with AI-powered tools for smarter workflows and collaboration. Ideal for researchers and scientists.
AFNI offers open-source tools for analyzing and visualizing MRI data, supporting researchers with software for functional and anatomical brain imaging studies.
Trino is a fast, distributed SQL query engine that lets you analyze big data from multiple sources, helping you explore and understand your data easily.
StarTree offers a managed real-time analytics platform for fast, large-scale OLAP, helping businesses gain continuous insights from their data.
Galaxy offers web-based tools for life science research, letting you analyze data, collaborate, and share results—no programming required.
Aviz is a research team focused on advancing data analysis and visualization methods, blending analytics with interactive visual tools. (French/English)
Explore supercomputing resources, events, and support at Oak Ridge Leadership Computing Facility for scientific research and high-performance computing projects.
Explore NASA's Advanced Supercomputing Division, where powerful computers support scientific research, simulations, and mission-critical analysis.
Energy Data eXchange (EDX) is a U.S. Department of Energy platform for sharing, analyzing, and collaborating on energy-related scientific data and resources.
Quantinuum offers advanced quantum computers and software, giving users cloud access to powerful tools for solving complex scientific and industry problems.
Access, manage, and analyze seismological and earth science data with NSF SAGE Data Services, supporting the global geoscience and research community.
IT4Innovations offers access to advanced supercomputing, data analysis, and AI resources for research teams in the Czech Republic and abroad.
Analyze genetic sequences, build phylogenetic trees, and explore evolutionary biology with MEGA’s integrated research and AI-powered tools.
Petrolink offers cloud-based data analysis tools for the energy sector, helping companies manage, visualize, and optimize their drilling operations.
Access seismic data, tools, and resources for earth science research and education through this collaborative seismology and geoscience platform.
Explore Tableau Research for insights on data visualization and analysis, featuring publications and innovations from Tableau’s industrial research team.
Explore innovative data analysis, simulation, and research tools at RPI's IDEA, supporting scientists and engineers with advanced computational resources.
Explore NIH's data science resources and initiatives supporting biomedical research, including tools, training, and strategic plans for scientific advancement.
Teradata offers a cloud-based analytics and data platform that helps businesses scale trusted AI, analyze data, and drive innovation for better results.
ESnet is a high-speed science network connecting researchers and labs, supporting data sharing and collaboration for the U.S. Department of Energy.
UC Berkeley's Econometrics Lab provides advanced computing resources, user support, and specialized software for statistical and econometric research.
Explore and access genomics data, resources, and tools at the National Genomics Data Center—supporting research in life and health sciences worldwide.
Explore cancer genomics data, tools, and resources to support cancer research and discovery, provided by the NCI Genomic Data Commons.
Benchling is a cloud platform for biotech R&D, helping scientists plan, record, and share experiments for better collaboration and scientific insights.
FAIRplus helps scientists manage and share life science data using FAIR principles, offering tools and guidelines to make research data more accessible.
Track satellites, space debris, and space weather in real time. Get detailed data, visualizations, and tools for exploring everything in Earth orbit.
ODISSEI connects social science researchers in the Netherlands with secure data, advanced analysis tools, and collaborative research support.
Duke Rhodes iiD connects students and experts across fields to explore big data through hands-on projects, research, and educational programs.
Pittsburgh Supercomputing Center offers advanced computing resources, research support, and educational programs for science, AI, and big data projects.
San Diego Supercomputer Center provides advanced computing, cloud, and data services for research, science, and innovation in high-performance environments.
The Econometrics Laboratory at UC Berkeley offers powerful computing resources and support for statistical, mathematical, and econometric research.