OpenRefine lets you clean, transform, and organize messy data for free. Easily format, enrich, and prepare datasets using this open source tool.
Clean and transform messy data with ease
OpenRefine is a free, open source platform designed to help you manage, clean, and transform messy datasets. Whether you’re dealing with spreadsheets, databases, or data from the web, OpenRefine gives you the tools to organize, reformat, and enrich your information quickly and efficiently.
You can use OpenRefine to spot inconsistencies, convert data between formats, and even connect with external web services to pull in more information. It’s especially handy for anyone working with large or complicated datasets who needs to tidy things up before analysis or sharing. The site offers downloads, documentation, and a friendly community to support your data projects.
From researchers and data scientists to journalists and librarians, OpenRefine is built for anyone who wants to make sense of their data without complex coding. Its user-friendly interface and active support channels make it easy to get started and stay productive.
Discover websites similar to Openrefine.org based on shared categories, topics, and features.
Access, analyze, and visualize global development data with interactive charts, tables, and maps from the World Bank's extensive databases.
Tidyverse offers a collection of R packages for data science, making data analysis, visualization, and manipulation in R simpler and more consistent.
Apache Pinot is an open source platform for real-time data analytics, letting you quickly analyze and visualize large datasets for instant insights.
Analyze life science data online with a collaborative platform designed for research and community-driven workflows in bioinformatics and genomics.
Apache Pig lets you analyze large data sets using a simple high-level language, making it easier to process and manage big data efficiently.
Apache Arrow offers a universal columnar data format and tools for fast, multi-language data analytics and seamless data interchange between systems.
Apache Zeppelin is a web-based notebook for interactive data analytics, letting you create collaborative documents using SQL, Scala, Python, R, and more.
Explore pandas, the open source Python library for fast, flexible data analysis and manipulation. Get started with guides, docs, and a helpful community.
Apache Spark is an open-source engine for large-scale data analytics, supporting data engineering, science, and machine learning in multiple languages.
Open-source tool for analyzing and visualizing data across sciences and engineering, supporting everything from large-scale simulations to desktop use.
Galaxy Europe is an open-source platform for accessible, FAIR data analysis with tools, resources, and a strong community for scientific collaboration.
Apache Hive is a distributed data warehouse system for scalable analytics, letting you read, write, and manage big data using SQL on various storage systems.
Apache Druid is a high-performance analytics database for fast, real-time querying of streaming and batch data at any scale.
Dask is an open-source Python library that helps you run data analysis and machine learning tasks faster by scaling your existing Python tools.
Manage and analyze massive multidimensional data cubes for science and research with flexible, scalable tools supporting open standards.
Query Wikipedia and related databases using SQL right in your browser. Explore, analyze, and share data easily—no software installation needed.
Voyant Tools is a web-based platform for analyzing and visualizing texts, making it easy to explore word patterns and trends in documents.
Create custom data visualizations in JavaScript with D3. Flexible tools for interactive charts and graphics, perfect for developers and data storytellers.
Explore interactive visuals to analyze global health data, track disease trends, and compare risk factors across countries and time periods.
Gephi is a free, open-source platform to visualize and explore graphs and network data, making complex connections easy to see and understand.
data.world helps you organize, find, and use business data easily with a searchable catalog and tools for analytics, collaboration, and data governance.
Opendatasoft helps you centralize, share, and interact with your data easily and securely, offering a user-friendly data marketplace and visualization tools.
TopQuadrant helps you connect and manage all your business data with AI-ready tools, making it easier to organize, govern, and use information smartly.
Qlik helps you integrate, manage, and analyze your business data with powerful tools for insights, data quality, and organization-wide analytics.
Pentaho offers a platform for data integration, analytics, and management to help organizations handle and analyze data efficiently in an AI-driven world.
Data Axle provides business and consumer data, analytics, and marketing services to help companies grow customer relationships and make smarter decisions.
Domo connects and analyzes data from any source, using AI to deliver real-time business insights, custom dashboards, and easy-to-use analytics tools.
Materialize lets you create real-time data views and transform fast-changing data using familiar SQL, making it easy to power live data products.
Frictionless Data offers open-source tools and standards to simplify working with complex data, making integration and management easier for teams and individuals.
TIBCO Platform connects data, apps, and systems to deliver real-time insights and integration for businesses in any environment.
Talend offers a cloud-independent platform for data integration, quality, and governance, helping businesses manage and trust their data easily.
StarRocks is an open-source database for fast, real-time analytics using SQL, designed to help businesses handle large-scale data easily and efficiently.
Polars offers a modern DataFrame platform for fast, scalable data analysis, letting you write queries and handle big data without managing servers.
Galaxy offers web-based tools for life science research, letting you analyze data, collaborate, and share results—no programming required.
Juice Analytics helps you turn complex data into clear, actionable insights with easy-to-use tools designed for businesses and technology teams.
MAXQDA is a software platform for qualitative and mixed methods data analysis, helping you code, analyze, and present research data with AI-powered tools.
Finaeon offers in-depth financial data and analytics to help investment professionals and researchers make informed decisions using historical market insights.
Spotfire is a visual data science platform for businesses, offering easy data analysis, AI-driven insights, and interactive dashboards for smarter decisions.
HEAVY.AI offers fast, GPU-accelerated analytics for businesses and government to visualize and analyze massive geospatial and time-based data in real time.
Lizeo helps businesses make better decisions with data-driven insights, offering tools for price intelligence, product analysis, and market trends.
Virtuoso lets you connect, manage, and analyze data from multiple sources using open standards, with flexible AI-powered tools for individuals and businesses.
JMP offers powerful tools for data analysis, visualization, and sharing, making it easy for scientists, engineers, and anyone to explore and understand data.
Create interactive dashboards and reports to visualize your data, helping you make smarter business decisions. Free and easy to use for everyone.
Power BI lets you visualize data, create interactive dashboards, and analyze information to gain insights and make better business decisions.
Collaborate on data analysis and create interactive charts and dashboards together in real time with Observable's online data visualization platform.
Graph Commons lets you map, analyze, and share complex data networks easily, helping you find insights and collaborate with others online.
Cuebiq offers location intelligence tools for brands, agencies, and researchers to analyze real-world movement, measure foot traffic, and target audiences.
KNIME is a free, open-source platform for building visual workflows to analyze data, automate tasks, and deploy AI solutions in your organization.
LSEG Data & Analytics offers global financial data, analytics, and AI-powered tools to help professionals gain insights and make informed decisions.
Dremio is a cloud-based data lakehouse platform offering fast SQL analytics, self-service data exploration, and AI-ready data management for businesses.
VSNi provides data analysis software and consultancy for plant, animal, aquaculture, and forestry breeding, supporting research in agri-science.
Trino is a fast, distributed SQL query engine that lets you analyze big data from multiple sources, helping you explore and understand your data easily.
Alteryx is a platform for automating analytics and preparing AI-ready data, letting you analyze, visualize, and make smarter decisions without coding.
Knoema lets you discover, visualize, and manage global data easily, helping individuals and businesses make informed decisions without coding skills.
Presto is a free, open-source SQL query engine that lets you run fast, interactive data analytics across data lakes, databases, and lakehouses.
Build and deploy interactive data and AI apps for your business with Plotly. Create powerful analytics dashboards with scalable, production-ready tools.
Premise gathers real-time local data worldwide to help businesses and organizations make informed decisions through actionable insights and analytics.
Manage, analyze, and visualize scientific data with AI-powered tools for smarter workflows and collaboration. Ideal for researchers and scientists.
Dynata offers a leading platform for first-party data collection, insights, and audience activation to help businesses make informed decisions and measure results.
Bloomberg Second Measure offers transaction data analytics for deep insights into company performance and consumer trends, helping you make informed decisions.