,

|

DataRecce | Explore, validate, and share data impact before merging


DataRecce
Recce

Introduction

Recce is a data change management and validation platform designed to help teams review, understand, and confidently ship data changes—particularly in dbt environments. It provides contextual diffing, lineage tracing, and production-safe validation workflows.

Use Cases

  • dbt PR Review
    Compare development and production environments to catch data issues before merging.
  • Impact Analysis
    Visualize downstream effects of schema or data changes using column-level lineage and diffs.
  • Continuous Integration (CI) Integration
    Automate data validation in CI/CD pipelines with GitHub Actions support.
  • Data Quality Checks
    Spot irregularities by diffing row counts, distributions, histograms, and value profiles.
  • Stakeholder Collaboration
    Share contextual validation reports and comment-ready PR insights for business team alignment.

Features & Benefits

  • Column‑Level Lineage Traceability
    Understand exactly how each column is derived and transformed across changes.
  • Multi‑Modal Diffs
    Compare schema, row counts, value distributions, top-k categories, and query outputs.
  • Live PR Integration
    Embed summaries and validation results directly into pull requests.
  • CI/CD Automation
    Seamlessly incorporate Recce into GitHub workflows for continuous validation.
  • SOC 2 Certified & Scalable
    Enterprise-grade security and support for shared checklists and team collaboration.

Pros

  • Faster Reviews
    Reduces PR review time by up to 90% by providing automated insights.
  • Contextual Understanding
    Enables stakeholders to grasp data change implications via rich lineage views.
  • Enhanced Productivity
    Automates validation, freeing data teams from manual checks and SQL queries.
  • Robust CI Compatibility
    Supports GitHub Actions and dbt pipelines for production-grade integration.

Cons

  • Requires dbt or Similar Setup
    Best suited for teams already using dbt or comparable data modeling tools.
  • Learning Curve
    Users need to configure environments, artifacts, and validation workflows.
  • Cloud Tier Constraints
    Free tier limits include 500 minutes/month and a single concurrent session.

Tutorial

None

Pricing