lakeFS is an open-source tool that brings Git-like version control to your data, helping you manage and track changes in cloud object storage easily.
Git-style version control for your data
lakeFS is designed to make managing large datasets as easy as managing code repositories. With its open-source platform, you can create, track, and roll back data versions in your cloud storage, just like you would with Git for your code.
Whether you're working on data engineering, analytics, or machine learning projects, lakeFS helps you experiment and collaborate safely by letting you branch and merge datasets. The platform is highly scalable, so it’s a great fit for teams dealing with massive or rapidly changing data.
If you’ve ever wanted better control, reproducibility, or collaboration for your data, lakeFS brings familiar versioning workflows to the world of big data. You’ll find resources, documentation, and a welcoming community to help you get started.
Discover websites similar to Lakefs.io based on shared categories, topics, and features.
Prometheus is an open-source tool for monitoring systems and analyzing time series data with powerful metrics, alerts, and flexible querying.
Generate knowledge graphs easily with RML.io tools for Windows, Mac, and Linux. Use simple rules to turn your data into structured, connected insights.
Find detailed, accurate IP address data with IPinfo.io. Access geolocation, privacy, and company info by API or database for secure, reliable results.
Clarify lets industrial businesses connect, analyze, and automate their operational data for better insights and smarter decision-making.
Explore and visualize U.S. public data with interactive charts, maps, and reports. Find insights on industries, locations, education, and more.
Vega lets you create, edit, and share interactive data visualizations using a simple JSON format, perfect for exploring and presenting your data visually.
Holistics is a self-service analytics platform that lets teams explore, visualize, and share data insights using modern BI and DevOps best practices.
Frictionless Data offers open-source tools and standards to simplify working with complex data, making integration and management easier for teams and individuals.
OpenActive provides open data tools and resources to help people access sport and physical activity opportunities, making it easier to get active.
BIDS is a community-driven platform that provides a standard for organizing and sharing neuroimaging and behavioral data to simplify research collaboration.
Get real-time price insights for retail and industry in Brazil, powered by crowdsourced data from both physical stores and e-commerce, even in remote regions.
Snowplow helps organizations collect, manage, and use customer behavioral data to power AI, analytics, marketing, and digital experiences.
Explore interactive data visualizations and visual explanations that make complex topics easy to understand for learners and curious minds.
Explore and create interactive data visualizations in Python with Vega-Altair's easy-to-use, declarative charting library and helpful documentation.
GoAccess lets you analyze and visualize web server logs in real time, offering clear insights into your website traffic right from your terminal or browser.
Offers real-time modelling and forecasts of infectious disease trends, helping users understand and track the spread and impact of outbreaks worldwide.
Gogs lets you easily run your own Git service, offering a simple way to host and manage code repositories on your own server.
Discover, publish, and share quality datasets with DataHub. Access thousands of free and premium data resources, updated and ready for your projects.
Webz.io provides structured data and insights from the open, deep, and dark web to help you monitor risks, track trends, and make informed decisions.
Oxylabs offers high-quality proxy services and web scraping tools, letting you collect public web data easily using a large, ethical proxy network.
DVC is an open-source tool for version control in data science and machine learning, helping you track data, models, and experiments like with Git.
Zarr is an open community project for storing, sharing, and working with large multi-dimensional arrays, supporting efficient cloud and parallel computing.
Globus lets you securely transfer, share, and manage large research data across systems, making it easier for researchers to focus on their work.
InfluxDB is a platform for managing and analyzing time series data, offering fast, flexible database solutions for cloud, on-premises, or edge environments.
The HDF Group offers tools, libraries, and support for managing, sharing, and preserving scientific and engineering data across platforms and environments.
ThingSpeak lets you collect, analyze, and visualize IoT device data in the cloud using MATLAB tools. Easily manage and act on your connected devices' data.
EUDAT offers advanced tools to store, share, and preserve research data across disciplines and countries, supporting collaboration and data management.
Quantum offers end-to-end data management, storage, and analysis solutions for unstructured data, supporting AI workloads and secure, flexible data workflows.
Explore, store, and share geospatial data in the cloud with easy tools for mapping, managing, and publishing—no IT setup needed.
Resilio moves and syncs files fast between devices, teams, and locations—helping you access, share, and protect data anywhere, anytime.
Skyvia helps you easily integrate, back up, manage, and access cloud data without coding, offering secure data connections and automation in one platform.
TortoiseSVN is a free Windows tool for managing code versions with Subversion, offering an easy way to track and control project changes.
Mercurial is a free, distributed version control tool that helps you manage code and track changes for projects of any size with a simple interface.
Monotone is a free, distributed version control system that helps you manage code changes, collaborate, and sync projects across platforms.
Collaborate on Sketch designs in real time with version control, letting your team work together on the same file without losing track of changes.
Git is a free, open-source version control system for tracking code changes and collaboration, with downloads, documentation, and community resources.
Plastic SCM offers distributed version control and DevOps tools designed for large projects, helping teams manage code, automate builds, and collaborate easily.
Tower is a Git client for Mac and Windows that helps developers and designers manage code changes, collaborate, and streamline version control tasks easily.
VisualSVN offers easy-to-use Subversion version control for Windows, providing enterprise-ready tools for code management and secure collaboration.
Free public Git hosting for open source projects, offering easy repository management and collaboration tools for developers worldwide.
Mendeley Data is a free, secure online repository for sharing, storing, and citing research data, helping you easily access and collaborate worldwide.
Apache Subversion is an open-source version control system for managing and tracking changes to code, documents, and files in collaborative projects.
Darcs is a free, open source version control system for managing code changes, offering a simple interface and offline support across platforms.
Fossil is a distributed software configuration management tool with version control, bug tracking, wiki, and a built-in web interface for developers.
Browse and explore the OpenBSD source code online, view file histories, download revisions, and compare changes using this web-based CVS repository.
Cloudian offers scalable, S3-compatible object storage solutions for large data workloads across on-prem, hybrid, and multi-cloud environments.
Explore AI-powered analytics with Tableau's augmented analytics tools, making data insights easy to discover and understand for smarter business decisions.
Mintel offers up-to-date market research, consumer insights, and industry data to help businesses make informed decisions and spot new opportunities.
Gnuplot is a free, cross-platform graphing tool for creating plots and charts from data or mathematical functions, supporting both interactive and scripted use.
Securcube provides digital forensic tools for analyzing phone records and cell site data, helping professionals uncover critical evidence efficiently.
MonetDB is a high-performance database system designed for fast analytics and data management using standard SQL. Open source and easy to use.
Unlock business insights with AI-powered data analysis tools and solutions, available in Korean. Make smarter decisions with innovative data technology.
Access cross-national microdata for research and analysis with remote tools from the LIS Data Center in Luxembourg. Ideal for social science studies.
Bissantz offers business intelligence tools for easy analysis, planning, and reporting, helping companies make informed decisions with integrated data insights.
Apache Atlas helps you manage and govern your data in Hadoop, offering tools for metadata management, data lineage, and compliance tracking.
VersionPress is a free WordPress plugin that lets you use Git to track changes to both your site files and database, making website management easier.
Gitea is a lightweight, self-hosted Git service that lets you manage code repositories easily across platforms. Simple setup with open source flexibility.
DataCore delivers advanced cloud storage solutions for IT, helping you boost data performance, efficiency, and always-on access for your business needs.
Semantic Web Company offers AI-powered tools for data analysis, text mining, and recommendations, helping businesses make informed decisions.
Explore and access genomics data, resources, and tools at the National Genomics Data Center—supporting research in life and health sciences worldwide.