Find guides and reference material for using Databricks on AWS, helping data teams with analytics, AI, and collaboration in the data lakehouse environment.
Step-by-step guides for Databricks on AWS
The Databricks documentation site is your go-to resource for learning how to use Databricks on AWS. Here, you’ll find detailed guides, how-tos, and reference materials designed for data analysts, data scientists, and data engineers. Whether you’re just getting started or looking for advanced information, the documentation covers Databricks Data Science & Engineering, Mosaic AI, and Databricks SQL environments.
With a focus on real-world problem solving in analytics and AI, the site helps you collaborate with your team using the Databricks Data Intelligence Platform and the data lakehouse approach. You can easily access resources, developer tools, release notes, and community support, making it simple to find the answers and best practices you need to get the most out of Databricks on AWS.
The documentation is organized for easy navigation, offering sections for beginners and experts alike, as well as multilingual options. Whether you’re integrating new tools, troubleshooting, or exploring new features, this site helps you unlock the full power of your data projects.
Discover websites similar to Docs.databricks.com based on shared categories, topics, and features.
Dremio is a cloud-based data lakehouse platform offering fast SQL analytics, self-service data exploration, and AI-ready data management for businesses.
Snowflake is a secure cloud platform for building AI-powered data apps, collaborating, and analyzing data at scale, with built-in privacy and compliance.
Find Snowflake guides, tutorials, and reference docs to help you learn, build, and troubleshoot with the Snowflake cloud data platform.
Neo4j is a graph database platform for connecting and analyzing complex data, enabling advanced queries, analytics, and AI-powered business solutions.
Progress MarkLogic is a robust database platform designed to help you manage, integrate, and analyze complex data for smarter, AI-powered applications.
GoodData lets you build custom data apps and add AI-powered analytics to your platforms, making it easy to turn data into useful insights for your users.
Sisense offers AI-powered data analytics software with pro-code, low-code, and no-code tools to help you gain insights and turn data into business growth.
Climate FieldView is a digital farming platform that helps growers collect, analyze, and use field data to make smarter decisions and boost crop yields.
Access and analyze satellite imagery and geospatial data easily with interactive dashboards and powerful cloud-based tools for custom insights.
Virtuoso lets you connect, manage, and analyze data from multiple sources using open standards, with flexible AI-powered tools for individuals and businesses.
Ontotext helps enterprises use AI and knowledge graphs to unify data, boost analytics, and improve collaboration across platforms like Microsoft 365 Copilot.
Gurobi offers advanced mathematical optimization tools and decision intelligence solutions for businesses, researchers, and developers worldwide.
KNIME is a free, open-source platform for building visual workflows to analyze data, automate tasks, and deploy AI solutions in your organization.
BigML is an easy-to-use machine learning platform for building models, making predictions, and analyzing data without complex setup or coding.
SingleStore is a real-time data platform for building intelligent apps, enabling fast analytics, data processing, and AI on large-scale datasets.
Wolfram offers a unified platform for computation, data analysis, and AI development, featuring tools like Mathematica, Wolfram Language, and Wolfram|Alpha.
Pearl Organisation offers digital transformation, IT, and internet-related services for businesses worldwide, including AI, cloud, and cybersecurity solutions.
VACAN lets you check real-time crowd levels at restaurants, hotels, toilets, and public spaces in Japan using AI and IoT technology. Japanese language.
Antmicro builds custom hardware and software platforms for robotics, AI, IoT, and cloud solutions, offering open source tech for innovative projects.
Celonis helps businesses analyze and optimize their processes using AI and process mining, making workflows more efficient across departments and systems.
Generate knowledge graphs easily with RML.io tools for Windows, Mac, and Linux. Use simple rules to turn your data into structured, connected insights.
WEKA offers a high-performance data platform for storing, processing, and managing data across cloud and on-premises, powering AI and machine learning workloads.
Seaborn is a Python library for creating beautiful, informative statistical data visualizations. Explore guides, tutorials, and API docs for easy plotting.
AMPLab at UC Berkeley shares research, software, and resources focused on machine learning, cloud computing, and big data analytics innovations.
Development Seed creates geospatial tools and data solutions to help you better understand our planet and make smarter decisions about a changing world.
Explore and create interactive data visualizations in Python with Vega-Altair's easy-to-use, declarative charting library and helpful documentation.
Japanese platform offering life log apps, data analysis using AI and big data, and tools to help enrich daily living through digital records.
Toloka provides expertly crafted data for training and evaluating AI models, offering access to skilled experts across domains and languages for scalable solutions.
Apache Mahout is a distributed linear algebra and machine learning platform for building custom algorithms, designed for data scientists and developers.
Data Council 2025 is a no-nonsense data and AI conference in Oakland, featuring expert talks, networking, and the latest trends in data engineering and AI.
OPeNDAP offers free, open-source tools to help researchers and data providers access, share, and manage distributed scientific datasets easily.
Posit offers open-source tools for data science, letting you code, share, and manage R and Python projects in the cloud or on your own servers.
Linaro helps you develop, test, and deploy Arm-based products quickly with collaborative tools, expert support, and solutions for embedded and AI projects.
Elastic offers an AI-powered search and analytics platform for businesses to find, analyze, and visualize data quickly across multiple environments.
IoTeX helps you build decentralized IoT and AI projects with tools and real-time data, making it easy to create secure and interoperable DePIN solutions.
Build Circle offers expert technology consulting to help businesses unlock value from data, engineering, and AI for improved efficiency and growth.
Matplotlib is a Python library for creating static, animated, and interactive data visualizations, with extensive guides, examples, and documentation.
RDF HDT offers a compact binary format and tools for storing, managing, and sharing RDF data efficiently. Find documentation, downloads, and tech resources.
Anaconda offers a secure, unified AI and data science platform built on open source, empowering data scientists and enterprises to develop and deploy AI solutions.
Oxford Semantic Technologies offers an advanced AI platform for building fast, rules-based knowledge graphs and answering complex data questions.
Explore and analyze large-scale networks with SNAP, Stanford's platform for efficient graph mining, available in C++ and Python for research and development.
Norkart offers advanced geographic IT solutions to improve workflows, streamline processes, and enable smarter collaboration for Norwegian communities.
Explore C-DAC, India’s hub for advanced computing, research, and innovation in AI, high performance computing, and technology-driven solutions.
Scale AI provides high-quality training data and tools to help companies build, evaluate, and scale AI applications across industries like automotive and government.
Quansight helps organizations solve complex data problems using open source software, specializing in AI, machine learning, and data engineering services.
Ocean Protocol lets you monetize AI models and data securely, using blockchain to protect privacy and enable trading and earning with data assets.
Radim Řehůrek offers machine learning consulting and AI solutions for intelligent data processing, helping businesses automate and optimize workflows.
Falling Rain connects you to innovative solutions for accessing, moving, and simplifying complex information through advanced engineering and data tools.
Solvd is an AI-driven advisory and digital engineering firm helping brands transform with custom AI solutions, app development, and cloud services.
Dotsquares offers custom web, app, and software development, providing AI-powered, scalable solutions for businesses seeking digital transformation.
SLB offers global energy technology solutions, helping experts collaborate from exploration to production with digital tools and sustainability in mind.
Build and deploy interactive data and AI apps for your business with Plotly. Create powerful analytics dashboards with scalable, production-ready tools.
KX offers a high-performance vector database and analytics platform for real-time data analysis, helping organizations make faster, data-driven decisions.
Kdan offers digital workflow tools like PDF editing, eSignatures, and AI-powered solutions to help businesses work smarter and boost efficiency.
Apache Spark is an open-source engine for large-scale data analytics, supporting data engineering, science, and machine learning in multiple languages.
Polars offers a modern DataFrame platform for fast, scalable data analysis, letting you write queries and handle big data without managing servers.
Access IBM's cloud platform to program, test, and run real quantum systems for research or development, with tools for quantum computing projects.
Appen offers high-quality data and tools for building, training, and improving AI models, supporting innovation for businesses and developers worldwide.
Dataiku is a platform to build, deploy, and manage AI and analytics projects, helping teams turn data into business insights and smarter decisions.
Apache Hadoop offers open-source tools for scalable, distributed computing and data analysis, letting you process big data efficiently and reliably.