SNAP offers powerful network analysis tools and graph mining libraries for large-scale data, available in C++ and Python, ideal for research and development.
Analyze massive networks with ease
SNAP, the Stanford Network Analysis Project, is a platform designed for anyone interested in exploring and analyzing large-scale networks and graphs. Whether you’re a researcher, developer, or student, you can use SNAP’s powerful tools to study complex networks with millions of nodes and billions of edges. The library is available in both C++ and Python, making it accessible for various programming backgrounds.
With SNAP, you can efficiently manipulate big graphs, calculate important network properties, and generate different kinds of graphs for your projects. The platform also provides tutorials, datasets, and a helpful community, making it a great resource for learning and advancing network analysis. If you’re working with network data or interested in graph mining, SNAP gives you the tools and support you need to get started and go deeper.
Discover websites similar to Mmds.org. Optimized for ultra-fast loading.
Apache Arrow offers a universal columnar data format and tools for fast, multi-language data analytics and seamless data interchange between systems.
Explore pandas, the open source Python library for fast, flexible data analysis and manipulation. Get started with guides, docs, and a helpful community.
Apache Pig lets you analyze large data sets using a simple high-level language, making it easier to process and manage big data efficiently.
Open-source tool for analyzing and visualizing data across sciences and engineering, supporting everything from large-scale simulations to desktop use.
Manage and analyze massive multidimensional data cubes for science and research with flexible, scalable tools supporting open standards.
Apache Calcite is an open-source framework for building high-performance databases and data management systems with dynamic query processing.
Arvados is an open source platform for managing, analyzing, and sharing large-scale genomic and biomedical data for research and collaboration.
Explore and visualize multi-dimensional data with interactive scatter plots, histograms, and images using glue's linked-data analysis tools.
Analyze life science data online with a collaborative platform designed for research and community-driven workflows in bioinformatics and genomics.
Galaxy is a community-driven data analysis platform offering tools, workflows, and free tutorials for researchers, scientists, and learners worldwide.
Scrapy is an open-source Python framework that helps you efficiently scrape and extract data from websites for research, analysis, or automation projects.
Dask provides Python tools for parallel and distributed computing, helping you work with large data and accelerate analytics using familiar workflows.
NSF NEON offers open ecological data and resources to help you explore, analyze, and understand ecosystems across the United States.
Explore geoscience data, interactive tools, and educational resources focused on measuring Earth changes, GNSS, and geophysical research. English language.
DesignSafe-CI is an online hub for natural hazards engineering research, offering data tools, community resources, and learning for scientists and engineers.
Expasy offers bioinformatics resources and AI tools to help you explore, analyze, and retrieve complex biological data for research and learning.
CAIDA offers network research, curated datasets, and tools for scientists and academics studying internet infrastructure and data analysis.
OpenRefine lets you clean, transform, and organize messy data for free. Easily format, enrich, and prepare datasets using this open source tool.
Tidyverse offers a collection of R packages for data science, making data analysis, visualization, and manipulation in R simpler and more consistent.
Apache Pinot is an open source platform for real-time data analytics, letting you quickly analyze and visualize large datasets for instant insights.
dplyr offers tools and clear documentation for fast, consistent data manipulation in R, making it easy to work with data frames in memory or remotely.
Apache Zeppelin is a web-based notebook for interactive data analytics, letting you create collaborative documents using SQL, Scala, Python, R, and more.
Query and analyze data from Hadoop, NoSQL, and cloud storage using familiar SQL—no schema setup or data loading required.
Apache Spark is an open-source engine for large-scale data analytics, supporting data engineering, science, and machine learning in multiple languages.
Apache Hive is a distributed data warehouse system for scalable analytics, letting you read, write, and manage big data using SQL on various storage systems.
Apache Druid is a high-performance analytics database for fast, real-time querying of streaming and batch data at any scale.
GDELT monitors global news in 100+ languages, analyzing events, people, and trends worldwide. Access open data and insights on how our world unfolds.
Apache Superset lets you easily explore, analyze, and visualize your data with interactive dashboards and no-code or SQL-based tools.
Apache Kylin is an open-source platform for fast, scalable data analytics with high concurrency, offering intelligent OLAP solutions for big data.
RQDA is a free, open-source R package for qualitative data analysis, helping you code, organize, and examine textual data on Windows, Linux, or Mac.
TileDB helps you organize, structure, and analyze large-scale data in a secure research environment, making collaboration and discovery easier.
Open-source data analysis framework for scientific research, designed to handle and analyze large datasets in high energy physics and related fields.
Manage, analyze, and visualize scientific data with AI-powered tools for smarter workflows and collaboration. Ideal for researchers and scientists.
AFNI offers open-source tools for analyzing and visualizing MRI data, supporting researchers with software for functional and anatomical brain imaging studies.
Geneious offers bioinformatics software for scientists to analyze molecular sequence data, manage workflows, and streamline antibody discovery in the cloud.
Galaxy offers web-based tools for life science research, letting you analyze data, collaborate, and share results—no programming required.
Explore advanced research in computing sciences, data analysis, and mathematical modeling at Berkeley Lab, supporting breakthroughs in science and technology.
SIB Swiss Institute of Bioinformatics connects life science experts, resources, and training to advance biomedical data science and open research in biology.
JMP offers powerful tools for data analysis, visualization, and sharing, making it easy for scientists, engineers, and anyone to explore and understand data.
StarRocks is an open-source database for fast, real-time analytics using SQL, designed to help businesses handle large-scale data easily and efficiently.