GitHub - sodadata/soda-core: :zap: Data quality testing for the modern . . . An open-source CLI tool and Python library for data quality testing. Compatible with the Soda Checks Language (SodaCL). Enables data quality testing both in and out of your data pipelines and development workflows. Integrated to allow a Soda scan in a data pipeline, or programmatic scans on a time-based schedule.
GitHub - cleanlab/cleanlab: The standard data-centric AI package for . . . While this open-source package finds data issues, its utility depends on you having a good existing ML model and an interface to efficiently fix these issues in your dataset. Providing all these pieces, Cleanlab Studio is a Data Curation platform to find and fix problems in any {text, image, tabular} dataset.
The premier open source Data Quality solution - GitHub. DataCleaner is a Data Quality toolkit that allows you to profile, correct and enrich your data. People use it for ad-hoc analysis and recurring cleansing, and as a Swiss Army knife in matching and Master Data Management solutions.
data-quality · GitHub Topics · GitHub. An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality and model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety and robustness.
GitHub - kwanUm/awesome-data-quality: Curated list of tools and . . . mobydq - tool for data engineering teams to run and automate data quality checks on their data pipeline; ydata-quality - Python library for assessing data quality throughout the stages of data pipeline development; great-expectations - tool for data testing, documentation, and profiling.
datachecks/dcs-core: Open Source Data Quality Monitoring. - GitHub. Datachecks is an open-source tool for monitoring the data quality of databases and data pipelines. It identifies potential issues in both, helps to find the root cause of data quality problems, and helps to improve data quality.
GitHub - opendatadiscovery/awesome-data-catalogs: Awesome Data . . . DataKitchen's Open Source Data Observability Products are full-featured with an Apache 2.0 license. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution.
GitHub - ydataai/ydata-quality: Data Quality assessment with one line . . . ydata_quality is an open-source Python library for assessing Data Quality throughout the multiple stages of data pipeline development. A holistic view of the data can only be captured by looking at it from multiple dimensions, and ydata_quality evaluates it in a modular way wrapped into a single Data Quality engine. This repository . . .
Cloud Data Quality Engine - GitHub. CloudDQ is a cloud-native, declarative, and scalable Data Quality validation Command-Line Interface (CLI) application for Google BigQuery. CloudDQ allows users to define and schedule custom Data Quality checks across their BigQuery tables. Data Quality validation results will be available in another BigQuery table of their choice.
GitHub - awesome-mlops/awesome-ml-monitoring: A curated list of awesome . . . Whylogs: The open source standard for data logging. Enables ML monitoring and observability. ydata-quality: Data Quality assessment with one line of code. Yellowbrick: Visual analysis and diagnostic tools to facilitate machine learning model selection. Soda Core: Data profiling, testing, and monitoring for SQL-accessible data.
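
Several of the tools listed above describe defining data quality checks and running them against a dataset. As a rough illustration of that general idea only, here is a minimal sketch in plain Python. It does not use or imitate the API of any tool named above; the function names (missing_fraction, duplicate_count, run_checks), the sample rows, and the 30% threshold are all hypothetical.

```python
from typing import Any, Callable

Row = dict[str, Any]


def missing_fraction(rows: list[Row], column: str) -> float:
    """Fraction of rows where `column` is absent or None (0.0 for no rows)."""
    if not rows:
        return 0.0
    missing = sum(1 for row in rows if row.get(column) is None)
    return missing / len(rows)


def duplicate_count(rows: list[Row], column: str) -> int:
    """Number of values in `column` that repeat an earlier value.

    Assumes the column values are hashable.
    """
    values = [row.get(column) for row in rows]
    return len(values) - len(set(values))


def run_checks(
    rows: list[Row], checks: dict[str, Callable[[list[Row]], bool]]
) -> dict[str, bool]:
    """Evaluate every named check and report pass (True) or fail (False)."""
    return {name: check(rows) for name, check in checks.items()}


# Hypothetical sample data: one missing email, one repeated id.
rows = [
    {"id": 1, "email": "a@example.com"},
    {"id": 2, "email": None},
    {"id": 2, "email": "c@example.com"},
    {"id": 4, "email": "d@example.com"},
]

# Checks are declared as data (name -> predicate), separate from the runner.
checks = {
    "email_missing_at_most_30_percent": lambda rs: missing_fraction(rs, "email") <= 0.30,
    "id_is_unique": lambda rs: duplicate_count(rs, "id") == 0,
}

results = run_checks(rows, checks)
for name, passed in results.items():
    print(f"{name}: {'PASS' if passed else 'FAIL'}")
```

Keeping the checks in a dictionary separate from the runner is one way to get a declarative feel: new checks can be added without touching run_checks. The real tools above each have their own check languages and configuration formats, which this sketch does not attempt to reproduce.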